
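Differential Backup Method Based on Duplicated Data Elimination
Cite this article: WU Xiao-yong, YANG Pin, HU Xiao-qin, ZANG Wen-juan. Differential Backup Method Based on Duplicated Data Elimination[J]. Computer Engineering, 2010, 36(21): 251-253.
Authors: WU Xiao-yong  YANG Pin  HU Xiao-qin  ZANG Wen-juan
Affiliation: (College of Computer Science, Sichuan University, Chengdu 610065)
Funding: National Natural Science Foundation of China; Ministry of Education Innovation Project Major Project Cultivation Fund; Ministry of Education Doctoral Program Fund
Abstract: To eliminate the impact of duplicated data on data transmission and storage, this paper proposes a differential backup method based on duplicated data elimination. File blocks are given a fixed size determined by interval, and a Hash table uniquely identifies each block, so that the Rsync algorithm can detect duplicated data across different files. The Hash table is partitioned so that blocks are matched locally, and checksum files are used to transfer only the differences between versions of a file. Experimental results show that, compared with the Rsync algorithm, the method effectively reduces the amount of data transmitted, lowers the storage required at the backup center, and improves block lookup efficiency.

Keywords: Rsync algorithm  duplicated data  regional block length  grouped Hash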

Differential Backup Method Based on Duplicated Data Elimination
WU Xiao-yong, YANG Pin, HU Xiao-qin, ZANG Wen-juan. Differential Backup Method Based on Duplicated Data Elimination[J]. Computer Engineering, 2010, 36(21): 251-253.
Authors: WU Xiao-yong  YANG Pin  HU Xiao-qin  ZANG Wen-juan
Affiliation: (College of Computer Science, Sichuan University, Chengdu 610065, China)
Abstract: To eliminate the impact of duplicated data on transmission and storage, this paper proposes a differential backup method based on duplicated data elimination. By partitioning files into blocks of a fixed size chosen per interval and using a Hash table to identify each block uniquely, the Rsync algorithm becomes able to detect duplicated data across different files. Partitioning the Hash table allows blocks to be matched locally, and a local checksum file is used to transfer only the differences between versions of a file. Experimental results show that, compared with the Rsync algorithm, the method reduces the amount of data transmitted, lowers the storage required at the backup center, and improves block search efficiency.
Keywords: Rsync algorithm  duplicated data  length of regional block  group Hash
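
The idea of identifying blocks by hash so that duplicates are stored once, with the hash table split into groups so that each lookup touches only one partition, can be illustrated with a minimal Python sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the block size, the number of groups, the use of SHA-1, the rule of picking a group by the first digest byte, and the names `GroupedBlockStore`, `backup`, and `restore`. The sketch also cuts blocks at fixed offsets only; it does not implement Rsync's rolling-checksum matching or the paper's interval-based choice of block length.

```python
import hashlib

BLOCK_SIZE = 4   # hypothetical; kept tiny so the example is easy to follow
NUM_GROUPS = 16  # hypothetical number of hash-table partitions


def split_blocks(data: bytes, size: int = BLOCK_SIZE):
    """Yield consecutive fixed-size blocks (the last one may be shorter)."""
    for i in range(0, len(data), size):
        yield data[i:i + size]


class GroupedBlockStore:
    """Toy backup store that keeps each distinct block exactly once."""

    def __init__(self, num_groups: int = NUM_GROUPS):
        # One small dict per group instead of a single large table.
        self.groups = [dict() for _ in range(num_groups)]
        self.stored_bytes = 0

    def _group_for(self, digest: bytes) -> dict:
        # Assumed grouping rule: select the partition by the first digest byte.
        return self.groups[digest[0] % len(self.groups)]

    def backup(self, data: bytes) -> list:
        """Store unseen blocks and return the digest list describing the file."""
        recipe = []
        for block in split_blocks(data):
            digest = hashlib.sha1(block).digest()
            group = self._group_for(digest)
            if digest not in group:  # block not seen in any earlier file
                group[digest] = block
                self.stored_bytes += len(block)
            recipe.append(digest)
        return recipe

    def restore(self, recipe: list) -> bytes:
        """Rebuild a file from its digest list."""
        return b"".join(self._group_for(d)[d] for d in recipe)


store = GroupedBlockStore()
recipe_a = store.backup(b"AAAABBBBCCCC")
recipe_b = store.backup(b"BBBBCCCCDDDD")  # shares two blocks with the first file
print(store.stored_bytes)                 # 16: only four distinct blocks are kept
print(store.restore(recipe_b))
```

Two 12-byte files are backed up, but only 16 bytes are stored because the blocks `BBBB` and `CCCC` appear in both. A real implementation would also need to handle data inserted at non-aligned offsets, which is the case Rsync's rolling checksum addresses.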
This article is indexed in VIP, Wanfang Data, and other databases.