首页 | 本学科首页   官方微博 | 高级检索  
     

基于Storm平台的多任务分组调度策略与实现
引用本文:王中华,柴小丽.基于Storm平台的多任务分组调度策略与实现[J].计算机系统应用,2021,30(2):250-254.
作者姓名:王中华  柴小丽
作者单位:中国电子科技集团公司第三十二研究所,上海201808;中国电子科技集团公司第三十二研究所,上海201808
摘    要:随着大数据与人工智能技术的飞速发展,高性能,实时性的流式计算系统逐渐取代传统基于数据仓库的批量计算系统.Apache storm作为一款开源,高容错,实时处理的分布式大数据流式计算平台,支持任务平均分配策略,单机任务指定策略等多种任务分配方案.当任务拓扑结构中存在多个任务时,且集群中只有某些机器支持某一任务执行时,传统的任务调度方法只能实现将单一的任务分配给单一指定的机器,使得整个集群的资源没有充分的利用.通过调整任务调度策略,获得满足条件的机器队列,查看机器队列中可用工作节点,将指定任务均匀分配给可用工作节点,其他任务仍通过默认策略分配给集群中的剩余机器,实现多任务的分组调度策略.

关 键 词:Apache  storm平台  分布式  流式计算  拓扑  多任务分组调度策略
收稿时间:2020/6/9 0:00:00
修稿时间:2020/7/10 0:00:00

Multi-Task Group Scheduling Strategy and Implementation Based on Storm Platform
WANG Zhong-Hu,CHAI Xiao-Li.Multi-Task Group Scheduling Strategy and Implementation Based on Storm Platform[J].Computer Systems& Applications,2021,30(2):250-254.
Authors:WANG Zhong-Hu  CHAI Xiao-Li
Affiliation:The 32nd Research Institute of China Electronics Technology Group Corporation, Shanghai 201808, China
Abstract:As big data and artificial intelligence technologies are booming, high-performance, real-time streaming computing systems are gradually replacing traditional batch computing systems based on data warehouses. As an open-source distributed big-data streaming computing platform that is highly fault-tolerant and can realize real-time processing, Apache storm supports a variety of task distribution schemes such as average task distribution strategy and single-machine task assignment strategy. When there are multiple tasks in the task topology and only certain machines in the cluster support the execution of a certain task, the traditional task scheduling method can only allocate a single task to a single designated machine, failing to make best use of resources in the entire cluster. By the adjustment to the task scheduling strategy, the eligible machine queue is obtained. Then, the assigned tasks are evenly distributed to available work nodes in the machine queue, and other tasks are distributed to the remaining machines in the cluster through the default strategy. In this way, multi-task group scheduling strategy can be achieved.
Keywords:Apache storm platform  distributed  stream computing  topology  multi-task group scheduling strategy
本文献已被 维普 万方数据 等数据库收录!
点击此处可从《计算机系统应用》浏览原始摘要信息
点击此处可从《计算机系统应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号