首页 | 本学科首页   官方微博 | 高级检索  
     

存储中的副本分级存储调度策略
引用本文:杨冬菊,李青. 存储中的副本分级存储调度策略[J]. 计算机科学, 2017, 44(4): 85-89
作者姓名:杨冬菊  李青
作者单位:北方工业大学云计算研究中心 北京100144大规模流数据集成与分析技术北京市重点实验室 北京100144,北方工业大学云计算研究中心 北京100144大规模流数据集成与分析技术北京市重点实验室 北京100144
基金项目:本文受北京市教育委员会科技计划重点项目:支持数据资源联动的云服务社区研究(KZ201310009009),北京市属高等学校创新团队建设与教师职业发展计划基金资助
摘    要:当集群中的部分节点是廉价主机时,采用HDFS的随机存储策略可能使访问频率高的数据存储在廉价节点上,受到廉价节点的性能影响,访问时间过长,降低了集群效率。为改善以上问题,提出一种改进的副本分级存储调度策略。为减少副本调度的次数,先根据节点的CPU、内存、网络、存储负载以及网络距离来评价节点的性能,再从中选取高性能节点进行存储。副本调度以节点中副本的访问频率为依据,结合硬件配置,把访问频率高的副本尽可能存储在高性能、高配置的节点中,以加快集群响应速度。实验结果表明,改进后的策略可以在异构集群中提高副本的访问效率,优化负载均衡。

关 键 词:云存储  HDFS  分级存储  副本调度
收稿时间:2015-11-30
修稿时间:2016-03-05

Scheduling Strategy of Hierarchical Storage about Replication in Cloud Storage
YANG Dong-ju and LI Qing. Scheduling Strategy of Hierarchical Storage about Replication in Cloud Storage[J]. Computer Science, 2017, 44(4): 85-89
Authors:YANG Dong-ju and LI Qing
Affiliation:Research Center for Cloud Computing,North China University of Technology,Beijing 100144,China Beijing Key Laboratory on Integration and Analysis of Large-scale Stream Data,Beijing 100144,China and Research Center for Cloud Computing,North China University of Technology,Beijing 100144,China Beijing Key Laboratory on Integration and Analysis of Large-scale Stream Data,Beijing 100144,China
Abstract:HDFS takes random storage strategy,if cluster has some cheap nodes,it is possible to make high frequency data store in the low processing performance nodes,causing a long time access and poor efficiency.To solve these problems,an improved scheduling strategy of hierarchical storage about replication was proposed.In order to reduce the number of replication scheduling,firstly,the information of data node from CPU load,memory load,network load,sto-rage load and network distance are used to evaluate node availability.Secondly,the optimal one is selected.Accessing frequency and hardware configuration are used to realize the replication scheduling.The response rate of cluster is improved by making high frequency data store on the high processing performance and high configuration node.The experimental results show that the strategy can improve access efficiency of replicas and local balancing for data storage in the heterogeneous clusters.
Keywords:Cloud storage  HDFS  Hierarchical storage  Replication scheduling
点击此处可从《计算机科学》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号