首页 | 本学科首页   官方微博 | 高级检索  
     

同构Hadoop集群环境下改进的延迟调度算法
引用本文:柯何杨,杨 群,王立松,段 汐.同构Hadoop集群环境下改进的延迟调度算法[J].计算机应用研究,2013,30(5):1397-1401.
作者姓名:柯何杨  杨 群  王立松  段 汐
作者单位:南京航空航天大学 计算机科学与技术学院, 南京 210016
摘    要:在Hadoop框架下计算资源和数据资源可以在不同物理位置的特点产生本地化问题。延迟调度算法的产生旨在解决本地化问题, 此算法根据任务待处理数据的物理位置作为作业的计算节点, 调度任务至目标节点。但是可能出现同一作业中若干任务集中运行在某一计算节点, 导致作业达不到理想的并行效果。针对原有的延迟调度算法, 提出延迟一容量调度算法, 允许部分任务选择非本地化节点作为原延迟调度算法中任务的目标计算节点, 以提高作业的响应时间与增加作业的并行程度。最后通过实验对比分析, 改进后的算法在执行效率和并行效果明显优于原延迟调度算法。

关 键 词:本地化    延迟调度    延迟—容量调度

Improved delay-scheduler algorithm in homogeneous Hadoop cluster
KE He-yang,YANG Qun,WANG Li-song,DUAN Xi.Improved delay-scheduler algorithm in homogeneous Hadoop cluster[J].Application Research of Computers,2013,30(5):1397-1401.
Authors:KE He-yang  YANG Qun  WANG Li-song  DUAN Xi
Affiliation:College of Computer Science & Technology, Nanjing University of Aeronautics & Astronautics, Nanjing 210016, China
Abstract:Locality problem is caused by the physical location inconsistency between computing resource and data resource in Hadoop. Delay scheduling algorithm to solve locality problem which taking the physical location of task data to be processed as computing nodes and migrating task to the target nodes. However, it may appear with a work tasks focus on running in one computing node, resulting non-ideal parallelling effect in operation. To solver this problem, this paper proposed delay-capacity scheduler algorithm on the basis of delay scheduler algorithm, which allowed some task run on a node that did not contain its input data, so that decrease the job response time and improve the degree of job parallelization. Finally, through experimental analysis, the improved algorithm in efficiency and parallelization effect is obviously superior to the original delay scheduling algorithm.
Keywords:locality  delay scheduling  delay-capacity scheduling
点击此处可从《计算机应用研究》浏览原始摘要信息
点击此处可从《计算机应用研究》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号