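MapReduce Job Scheduling in Hybrid Storage Modes
Cite this article:YANG Zhen-Yu, NIU Tian-Yang, LYU Min. MapReduce Job Scheduling in Hybrid Storage Modes[J]. Computer Systems & Applications, 2023, 32(3): 70-85.
Authors:YANG Zhen-Yu  NIU Tian-Yang  LYU Min
Affiliation:School of Computer Science and Technology, University of Science and Technology of China, Hefei 230022
Funding:Key Program of the National Natural Science Foundation of China (61832011)
Abstract:In heterogeneous Hadoop clusters, MapReduce jobs are processed inefficiently because erasure-coded and replicated storage are used together and because server nodes differ in real-time computing capacity. To mitigate this problem, this paper implements a scheduling strategy that dynamically adjusts how MapReduce tasks are assigned under concurrent multi-job workloads, based on where data is stored and on the real-time load of each node. The strategy modifies the data placement policy of the current Hadoop framework and dynamically controls the number of concurrent tasks on each node, which yields a more balanced allocation of resources among jobs that run concurrently. Experimental results show that, compared with Hadoop's two default job scheduling strategies, the proposed scheduling approach shortens job completion time by about 17% and effectively prevents starvation of some jobs.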

Keywords:MapReduce  job scheduling  erasure code  heterogeneous cluster  hybrid storage  cloud computing  load balancing  big data
Received:2022/8/9
Revised:2022/9/15

MapReduce Job Scheduling in Hybrid Storage Modes
YANG Zhen-Yu, NIU Tian-Yang, LYU Min. MapReduce Job Scheduling in Hybrid Storage Modes[J]. Computer Systems & Applications, 2023, 32(3): 70-85.
Authors:YANG Zhen-Yu  NIU Tian-Yang  LYU Min
Affiliation:School of Computer Science and Technology, University of Science and Technology of China, Hefei 230022, China
Abstract:In heterogeneous Hadoop clusters, the mixed use of erasure-coded and replicated storage, together with differences in the real-time computing capacity of server nodes, lowers the efficiency of MapReduce job processing. To mitigate this problem, this study implements a scheduling strategy that dynamically adjusts the assignment of MapReduce tasks under concurrent multi-job workloads according to where data is stored and the real-time load of each node. The strategy modifies the data placement policy of the current Hadoop framework and dynamically controls the number of concurrent tasks on each node, so as to achieve a more balanced allocation of resources among jobs that run concurrently. Experimental results show that, compared with the two default job scheduling strategies of Hadoop, the proposed scheduling approach shortens job completion time by about 17% and effectively prevents starvation of some jobs.
Keywords:MapReduce  job scheduling  erasure code  heterogeneous cluster  hybrid storage  cloud computing  load balance  big data
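
The abstract names two ingredients: task assignment that takes data location into account, and a per-node concurrency limit that follows real-time load. The sketch below only illustrates that general idea and is not the authors' algorithm. The `Node` class, its fields, the linear load-to-limit rule, and the `pick_node` tie-breaking are all assumptions made for this example, and nothing here uses the Hadoop API.

```python
from dataclasses import dataclass


@dataclass
class Node:
    name: str
    max_slots: int    # hypothetical upper bound on concurrent tasks
    load: float       # hypothetical real-time load, in [0, 1]
    running: int = 0  # tasks currently assigned to this node

    def slot_limit(self) -> int:
        # Assumed rule: shrink the concurrency cap linearly as load rises,
        # but always allow at least one task so no node is shut out.
        return max(1, int(self.max_slots * (1.0 - self.load)))

    def free_slots(self) -> int:
        return max(0, self.slot_limit() - self.running)


def pick_node(nodes, holders):
    """Pick a node for a task whose input is stored on the nodes in `holders`.

    Returns None when every node has reached its current limit.
    """
    candidates = [n for n in nodes if n.free_slots() > 0]
    if not candidates:
        return None
    # Prefer a node that holds the data, then the node with the most free slots.
    best = max(candidates, key=lambda n: (n.name in holders, n.free_slots()))
    best.running += 1
    return best
```

With a heavily loaded node `a` and an idle node `b`, a task whose data sits on `a` is placed there once, after which `a` is at its reduced limit and later tasks fall through to `b`. A real scheduler would also have to account for erasure-coded blocks being striped across several nodes, which this sketch ignores.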