Algorithm of Scheduling for Data-intensive Computing Operations onto GPU Cluster

doi:10.13328/j.cnki.jos.006362

微信服务号

微信订阅号

Home > Archive>Volume 33, Issue 12, 2022 >4429-4451. DOI:10.13328/j.cnki.jos.006362

PDF HTML XML Export Cite reminder

Algorithm of Scheduling for Data-intensive Computing Operations onto GPU Cluster
DOI:
                        10.13328/j.cnki.jos.006362
                    
Author:
                        
                        
                    
Affiliation:
Clc Number:TP301
Fund Project:National Key Research & Development Program of China(2018YFB1003400)

Article

Figures

Metrics

Reference

Cited by

Materials

Comments

Abstract:

Data-intensive tasks include a large number of tasks. Using GPU devices to improve the performance of tasks is the main method currently. However, in the case of solving the fair sharing of GPU resources between data-intensive tasks and reducing the cost of data network transmission, the existing research methods do not comprehensively consider the contradiction between resource fairness and data transmission costs. The study analyzes the characteristics of GPU cluster resource scheduling, and proposes an algorithm based on the minimum cost and the maximum number of tasks in GPU cluster resource scheduling. The method can solve the contradiction between the fair allocation of GPU resources and the high cost of data transmission. The scheduling process is divided into two stages. In the first stage, each job gives its own optimal plan according to the data transmission costs, and in the second stage, the resource allocator merges the plan of each job. Firstly, the study gives the overall structure of the framework, and the source allocator works globally after each job giving its own optimal plan. Secondly, the network bandwidth estimation strategy and the method of computing the data transmission cost of the task are given. Thirdly, the basic algorithm for the fair allocation of resources based on the number of GPUs is given. Fourthly, the scheduling algorithm with the smallest cost and the largest number of tasks is proposed, which describing the implementation strategies of resource non-grabbing, robbing and resource fairness strategies. Finally, six data-intensive computing tasks are designed, and the algorithm proposed in the study is tested, and the experiments verifies that the scheduling algorithm can achieve about 90% of resource fairness, while also ensuring that the parallel operation time of jobs is minimized.

Reference

Cited by

Get Citation

汤小春,朱紫钰,毛安琪,符莹,李战怀.数据密集作业在GPU集群上的调度算法研究.软件学报,2022,33(12):4429-4451

Copy

Article Metrics

Abstract:
PDF:
HTML:
Cited by:

History

Received:May 10,2020
Revised:November 30,2020
Adopted:
Online: May 21,2021
Published: December 06,2022

You are the firstVisitors
Copyright: Institute of Software, Chinese Academy of Sciences Beijing ICP No. 05046678-4
Address：4# South Fourth Street, Zhong Guan Cun, Beijing 100190,Postal Code：100190
Phone：010-62562563 Fax：010-62562533 Email：jos@iscas.ac.cn
Technical Support：Beijing Qinyun Technology Development Co., Ltd.

Beijing Public Network Security No. 11040202500063

微信服务号

微信订阅号

Get Citation

Share

Article Metrics

History