首页 | 本学科首页   官方微博 | 高级检索  
     

海量小文件系统的可移植操作系统接口兼容技术
引用本文:陈博,何连跃,严巍巍,徐照淼,徐俊.海量小文件系统的可移植操作系统接口兼容技术[J].计算机应用,2018,38(5):1389-1392.
作者姓名:陈博  何连跃  严巍巍  徐照淼  徐俊
作者单位:1. 国防科技大学 计算机学院, 长沙 410073;2. 北京网云飞信息技术有限公司, 北京 100067
摘    要:基于Hadoop分布式文件系统(HDFS)研发的海量小文件系统(SMDFS)遗留了HDFS不兼容可移植操作系统接口(POSIX)约束的问题,为解决SMDFS的这一问题,提出基于本地缓存的POSIX兼容技术和基于数据暂存区的元数据高效管理技术。首先,通过设置数据暂存区来实现读写模式文件流的重定向,然后建立异步线程池模型,实现数据暂存区镜像文件的同步,从而完成用户层到存储层的所有POSIX相关的文件操作。此外,借助跳表结构的元数据缓存实现List目录等元数据操作效率优化。测试表明,相较于HDFS的Linux客户端,基于技术成果实现的SMDFS3.0的随机读性能有10倍以上的性能提升,顺序读和顺序写性能有约3~4倍的提升,随机写性能可以达到本地文件系统的20%,基于目录的元数据缓存的设计使目录的List操作效率提升近10倍。但是,由于用户空间文件系统(FUSE)挂载的客户端会引入额外的内核态和用户态切换等带来的开销,因此SMDFS3.0的Linux客户端相对于系统的Java接口会有大约50%的性能损耗。

关 键 词:海量小文件系统  分布式文件系统  可移植操作系统接口兼容  元数据缓存  云存储  
收稿时间:2017-12-13
修稿时间:2017-12-14

Portable operating system interface of UNIX compatibility technology in mass small distributed file system
CHEN Bo,HE Lianyue,YAN Weiwei,XU Zhaomiao,XU Jun.Portable operating system interface of UNIX compatibility technology in mass small distributed file system[J].journal of Computer Applications,2018,38(5):1389-1392.
Authors:CHEN Bo  HE Lianyue  YAN Weiwei  XU Zhaomiao  XU Jun
Affiliation:1. College of Computer, National University of Defense Technology, Changsha Hunan 410073, China;2. Beijing Netclouds Information Technology Corporation Limited, Beijing 100070, China
Abstract:Focused on the issue that the mass small file system developed based on HDFS (Hadoop Distributed File System), SMDFS (Mass Small Distributed File System), is not compatible with POSIX (Portable Operating System Interface of UNIX) constraints, a POSIX compatible technology based on local cache and an efficient metadata management technology based on temporary data cache were proposed. Firstly, the data storage area was set to realize the redirection of the file flow in the read-write mode, and then an asynchronous thread pool model was established to synchronize the data in temporary cache, thereby completing all POSIX-related file operations from the user layer to the storage layer. In addition, with the help of the metadata cache of the skip list structure, the efficiency of metadata operations such as the List directory was optimized. The test results show that, compared to the Linux client of HDFS, the performance of random read improves ten times more, the sequential read and sequential write improves about three to four times. The performance of random write can reach 20% of the local file system. Besides, the List operation efficiency of the directory improves about 10 times. However, due to the additional switching of kernel-mode and user-mode introduced by FUSE (Filesystem in Userspace), the Linux client of SMDFS3.0 has a performance penalty of about 50% compared to Java interface.
Keywords:mass small file system  distributed file system  Portable Operating System Interface of UNIX (POSIX) compatibility  metadata cache  cloud storage  
点击此处可从《计算机应用》浏览原始摘要信息
点击此处可从《计算机应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号