首页 | 本学科首页   官方微博 | 高级检索  
     

一种基于两级DAG模型的MapReduce工作流异构调度算法
引用本文:王宇新,王飞,王冠,郭禾. 一种基于两级DAG模型的MapReduce工作流异构调度算法[J]. 计算机工程与科学, 2019, 41(8): 1353-1359
作者姓名:王宇新  王飞  王冠  郭禾
作者单位:(大连理工大学计算机科学与技术学院,辽宁 大连 116023)
基金项目:国家自然科学基金(11372067,61772112)
摘    要:MapReduce编程模型被广泛应用于大数据处理平台,而一个有效的任务调度算法对模型的运行效率至关重要。将MapReduce工作流的Map和Reduce阶段分别拆解为若干个有先后序限定关系的作业,每个作业再拆解为多个任务。之后基于计算集群的可用资源和任务异构性,构建面向作业和任务的2级有向无环图(DAG)模型,同时提出基于2级优先级排序的异构调度算法2-MRHS。算法的第1阶段进行优先级排序,即对作业和任务分别进行优先权值计算,再汇总得到任务的调度队列;第2阶段进行任务分配,即基于最快完成时间将每个任务所包含的数据块子任务分配给最适合的计算结点。采用大批量随机生成的DAG模型进行实验,结果表明与其他相关算法相比,本文算法有更短的调度长度(makespan)且更加稳定。

关 键 词:MapReduce  工作流  异构计算  任务调度  
收稿时间:2018-12-01
修稿时间:2019-08-25

A MapReduce workflow heterogeneous schedulingalgorithm based on two-level DAG model
WANG Yu-xin,WANG Fei,WANG Guan,GUO He. A MapReduce workflow heterogeneous schedulingalgorithm based on two-level DAG model[J]. Computer Engineering & Science, 2019, 41(8): 1353-1359
Authors:WANG Yu-xin  WANG Fei  WANG Guan  GUO He
Affiliation:(School of Computer Science and Technology,Dalian University of Technology,Dalian 116023,China) 
Abstract:The MapReduce programming model is widely applied in big data processing platforms, and an effective task scheduling algorithm is critical to the efficiency of the model. In our approach, a MapReduce workflow is decomposed as a number of jobs with successive qualifying relationships and each job has a Map phase and a Reduce phase that both contain multiple tasks. Based on the available resources and task heterogeneity of computing cluster, we construct a two-level directed acyclic graph (DAG) model for job and tasks, and propose a MapReduce workflow heterogeneous scheduling algorithm based on two level priority ordering (2-MRHS). In the first stage of the algorithm, the priority ordering is performed: the priority weights of the job level and task level are calculated respectively to form the scheduling queue of tasks. In task assignment stage, the data block subtasks of each task are assigned to the appropriate computing node according to the tasks' earliest finish time (EFT). A large number of randomly generated DAG models are used to conduct experiments and the results show that our algorithm has shorter scheduling length (makespan) and better stability than those of others.
Keywords:MapReduce  workflow  heterogeneous computing  task scheduling  
点击此处可从《计算机工程与科学》浏览原始摘要信息
点击此处可从《计算机工程与科学》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号