Auditable versioned data storage outsourcing
Affiliation:1. Department of Diagnostic and Interventional Neuroradiology, Inselspital, Bern University Hospital and University of Bern, 3010 Bern, Switzerland;2. Department of Neurology, Inselspital, Bern University Hospital and University of Bern, 3010 Bern, Switzerland;1. Heilongjiang Key Laboratory for Animal Disease Control and Pharmaceutical Development, Department of Preventive Veterinary, College of Veterinary, Northeast Agricultural University, Harbin, China;2. Department of Radiological and Protection, Heilongjiang Province Center for Disease Control and Prevention, Harbin, China;1. Information Engineer College, Hangzhou Dianzi University, Hangzhou 310018, PR China;2. Shanghai Key Laboratory of Trustworthy Computing, East China Normal University, Shanghai 200062, PR China
Abstract: Auditability is crucial for data outsourcing: it facilitates accountability and allows data loss or corruption incidents to be identified in a timely manner, which in turn reduces the risks from such losses. In recent years, in sync with the growing trend of outsourcing, considerable progress has been made in designing probabilistic (for efficiency) provable data possession (PDP) schemes. However, even the recent, advanced PDP solutions that deal with dynamic data do so in a limited manner, and only for the latest version of the data. A naive solution that treats different versions in isolation would work, but it leads to tremendous overheads and is undesirable. In this paper, we present algorithms to achieve full persistence (all intermediate configurations are preserved and are modifiable) for an optimized skip list (known as FlexList) so that versioned data can be audited. The proposed scheme provides deduplication at the level of logical, variable-sized blocks, such that only the altered parts of the different versions are kept. The persistent data structure facilitates access (read) of any arbitrary version with the same storage and processing efficiency that state-of-the-art dynamic PDP solutions provide for only the current version, while commit (write) operations incur around 5% additional time. Furthermore, the time overhead for auditing arbitrary versions in addition to the latest version is imperceptible even on a low-end server. Additionally, our approach opens up the possibility of naturally supporting block-level deduplication. While a naive solution to audit versions would copy the whole data and the data structure for each version, our solution uses storage space very close to that of the most efficient delta-based solutions.
Accordingly, we explore how the proposed data structure benefits the system with block-level deduplication in addition to auditability, and how it can be integrated with a state-of-the-art versioning system (Git). In the process, it improves the storage efficiency of Git, and thus helps scale the size of data that can be stored in Git, without compromising the retrieval efficiency of arbitrary versions.
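As a rough illustration of the block-level deduplication idea described above, the following minimal Python sketch stores each distinct block once and lets every version reference blocks by digest. It is not the FlexList structure or a PDP scheme: the class `VersionedBlockStore` and its methods are hypothetical names introduced here, the per-version index is a flat list rather than a persistent skip list, and the "audit" is a plain local hash recomputation rather than a probabilistic possession proof.

```python
import hashlib


class VersionedBlockStore:
    """Hypothetical sketch: content-addressed blocks shared across versions."""

    def __init__(self):
        self.blocks = {}    # digest -> block bytes, each distinct block stored once
        self.versions = []  # version id -> tuple of block digests

    def commit(self, blocks):
        """Record a new version; only blocks not seen before consume storage."""
        digests = []
        for block in blocks:
            digest = hashlib.sha256(block).hexdigest()
            self.blocks.setdefault(digest, block)
            digests.append(digest)
        self.versions.append(tuple(digests))
        return len(self.versions) - 1

    def read(self, version):
        """Reassemble any committed version from its block digests."""
        return b"".join(self.blocks[d] for d in self.versions[version])

    def audit(self, version):
        """Simplified check: every block of the version still matches its digest."""
        return all(
            hashlib.sha256(self.blocks[d]).hexdigest() == d
            for d in self.versions[version]
        )


store = VersionedBlockStore()
v0 = store.commit([b"alpha", b"beta", b"gamma"])
v1 = store.commit([b"alpha", b"BETA!", b"gamma"])  # only the middle block changed
```

After the two commits, the store holds four distinct blocks instead of six, and both versions remain readable. A flat digest list per version still costs space proportional to the number of blocks in each version; sharing the index structure itself across versions is what a persistent data structure would add on top of this sketch.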
Keywords: Version; Skip list; FlexList; File systems
This article is indexed in ScienceDirect and other databases.