首页 | 本学科首页   官方微博 | 高级检索  
     

基于任务负载监测的高性能集群节点启停机制*
引用本文:曹宗雁,曹荣强,戴志辉,朱鹏,迟学斌.基于任务负载监测的高性能集群节点启停机制*[J].计算机应用研究,2011,28(12):4663-4665.
作者姓名:曹宗雁  曹荣强  戴志辉  朱鹏  迟学斌
作者单位:1. 中国科学院计算机网络信息中心超级计算中心,北京100190;中国科学院研究生院,北京100049
2. 中国科学院计算机网络信息中心超级计算中心,北京,100190
基金项目:国家“863”计划重点资助项目(2006AA01A116,2006AA01A117);中国科学院“十一五”信息化专项资助项目(INFO-115-B01)
摘    要:对高性能计算集群在运行过程中如何通过关闭闲置节点来实现有效节能的问题进行了研究和探讨,设计和实现了基于任务负载量统计监测的节点启停机制.根据对系统中作业运行和排队情况的记录和分析,通过参数估计设计了反映队列任务情况的负载因子,并围绕负载因子制定具体策略,结合作业系统的队列设置和资源分配规则,对集群中的空闲节点进行自动启停控制.模拟实验表明,基于任务负载监测的节点启停机制能够有效地自动启停系统中闲置的节点,从而降低系统功耗,并且对系统中作业的整体完成时间基本不造成影响.

关 键 词:高性能计算机  集群  任务负载  节点控制  参数估计  节能

Nodes start/stop mechanism for high-performance computing clusters based on task load monitoring
CAO Zong-yan,CAO Rong-qiang,DAI Zhi-hui,ZHU Peng,CHI Xue-bin.Nodes start/stop mechanism for high-performance computing clusters based on task load monitoring[J].Application Research of Computers,2011,28(12):4663-4665.
Authors:CAO Zong-yan  CAO Rong-qiang  DAI Zhi-hui  ZHU Peng  CHI Xue-bin
Affiliation:CAO Zong-yan1,2,CAO Rong-qiang1,DAI Zhi-hui1,ZHU Peng1,CHI Xue-bin1(1.Supercomputing Center,Computer Network Information Center of Chinese Academy of Sciences,Beijing 100190,China,2.Graduate School of Chinese Academy of Sciences,Beijing 100049,China)
Abstract:This paper discussed the method of closing idle nodes to save power in high-performance computing clusters. It proposed a mechanism for nodes start and stop control based on task load monitoring and statistics and designed task load indicator using parameter estimation. It set up detail strategies around this indicator to automatically control the idle nodes starting and stopping. It also considered queue configuration and resource allocation of job manage system in the strategies. Simulation tests indicate that the nodes start/stop mechanism can effectively control the idle nodes in the system, so that the power consumption can be reduced; moreover, the mechanism impacts very little on the system overall job scheduling and running.
Keywords:high-performance computer  cluster  task load  node control  parameter estimation  power saving
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《计算机应用研究》浏览原始摘要信息
点击此处可从《计算机应用研究》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号