
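Improved HDFS system based on erasure codes and a dynamic replication strategy
Cite this article: LI Xiao-kai, DAI Xiang, LI Wen-jie, CUI Zhe. Improved HDFS system based on erasure codes and a dynamic replication strategy [J]. Journal of Computer Applications, 2012, 32(8): 2150-2158.
Authors: LI Xiao-kai, DAI Xiang, LI Wen-jie, CUI Zhe
Affiliations: 1. Chengdu Institute of Computer Application, Chinese Academy of Sciences, Chengdu 610041; 2. Graduate University of Chinese Academy of Sciences, Beijing 100049
Abstract: To give the Hadoop Distributed File System (HDFS) higher storage efficiency and better load balancing, this paper proposes Noah, an improvement on the multi-replica storage technique of HDFS. Noah introduces encoding and decoding modules that encode and split each HDFS block into a larger number of data pieces (sections), which are scattered randomly across the cluster and replace the multi-replica disaster-recovery strategy of the original system. When nodes in the cluster fail, the original data is recovered by collecting any roughly 70% of the sections belonging to the affected block. In addition, a dynamic replication strategy is applied according to the running state of the distributed cluster and the differing demand for replicas. Cluster experiments show that Noah improves on HDFS in disaster-recovery efficiency, load balancing, storage cost, and security.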

Keywords: Hadoop Distributed File System; distributed storage; data disaster recovery; load balancing; dynamic replication
Received: 2012-02-22
Revised: 2012-03-26

Improved HDFS scheme based on erasure code and dynamical-replication system
LI Xiao-kai, DAI Xiang, LI Wen-jie, CUI Zhe. Improved HDFS scheme based on erasure code and dynamical-replication system [J]. Journal of Computer Applications, 2012, 32(8): 2150-2158.
Authors: LI Xiao-kai, DAI Xiang, LI Wen-jie, CUI Zhe
Affiliation: 1. Chengdu Institute of Computer Application, Chinese Academy of Sciences, Chengdu Sichuan 610041, China; 2. Graduate University of Chinese Academy of Sciences, Beijing 100049, China
Abstract: To improve the storage efficiency and load-balancing ability of the Hadoop Distributed File System (HDFS), this paper presents an improved solution named Noah that replaces the original multiple-replication strategy. Noah adds encoding and decoding modules to HDFS. Instead of keeping multiple replicas as the original system does, it encodes every HDFS data block into a larger number of data pieces (sections) and stores them randomly dispersed across the cluster. When nodes in the cluster fail, the original data is recovered by collecting and decoding any roughly 70% of the sections of the affected block. A dynamic replication strategy works alongside the coding scheme, changing the number of copies with the state of the cluster and the demand for replicas. Cluster experiments show the feasibility and advantages of the proposed solution.
Keywords: Hadoop Distributed File System (HDFS); distributed storage; data disaster recovery; load balancing; dynamic replication
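
The recovery property described in the abstract (any roughly 70% of the sections suffice) can be illustrated with a small erasure-code sketch. The abstract does not say which code Noah uses, so everything here is an assumption made for illustration: a Reed-Solomon-style polynomial code over the prime field GF(257), with hypothetical parameters k = 7 data symbols and n = 10 sections, and hypothetical function names `encode` and `decode`. It is not the authors' implementation.

```python
# Illustrative (n, k) erasure code; NOT the code used by Noah, which the
# abstract does not specify. Assumed parameters: k = 7, n = 10, so any 7 of
# the 10 sections (70%) recover the data.
P = 257  # smallest prime above 255, so every byte value is a field element


def _interpolate_at(points, x):
    """Evaluate at x the unique polynomial of degree < len(points)
    passing through the given (x_i, y_i) points, with arithmetic mod P."""
    total = 0
    for i, (xi, yi) in enumerate(points):
        num, den = 1, 1
        for j, (xj, _) in enumerate(points):
            if i != j:
                num = num * (x - xj) % P
                den = den * (xi - xj) % P
        # pow(den, -1, P) is the modular inverse (Python 3.8+).
        total = (total + yi * num * pow(den, -1, P)) % P
    return total


def encode(data, n):
    """Turn k data bytes into n sections {index: value}.
    Sections 1..k carry the data itself; k+1..n are parity values,
    which may equal 256 and therefore need more than one byte."""
    k = len(data)
    if not 0 < k <= n < P:
        raise ValueError("need 0 < k <= n < 257")
    points = [(i + 1, b) for i, b in enumerate(data)]
    return {x: data[x - 1] if x <= k else _interpolate_at(points, x)
            for x in range(1, n + 1)}


def decode(sections, k):
    """Rebuild the k data bytes from any k surviving sections."""
    points = list(sections.items())[:k]
    if len(points) < k:
        raise ValueError("not enough sections to recover the data")
    return bytes(_interpolate_at(points, x) for x in range(1, k + 1))


if __name__ == "__main__":
    block = b"HDFSblk"                      # k = 7 symbols
    sections = encode(block, 10)            # n = 10 sections
    survivors = {x: v for x, v in sections.items() if x not in (1, 4, 9)}
    print(decode(survivors, len(block)) == block)
```

Under these assumed parameters the stored volume is n/k = 10/7, about 1.43 times the original data, compared with 3 times under the default HDFS replication factor of 3. A real block would be striped into many such k-symbol groups, and the code actually used in the paper may differ.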
This article is indexed in databases including CNKI and Wanfang Data.