
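Research on Distributed File System Load Balancing in the Cloud Environment
Cite this article: WU Yaoyao (吴瑶瑶), YANG Geng (杨庚). Research on Distributed File System Load Balancing in the Cloud Environment[J]. Computer Engineering and Applications (计算机工程与应用), 2019, 55(10): 67-72.
Authors: WU Yaoyao (吴瑶瑶)  YANG Geng (杨庚)
Affiliations: College of Computer Science, Nanjing University of Posts and Telecommunications, Nanjing 210023; Jiangsu Key Laboratory of Big Data Security & Intelligent Processing, Nanjing 210023
Funding: National Natural Science Foundation of China (three entries); Natural Science Research Project of Jiangsu Higher Education Institutions; China Postdoctoral Science Foundation; Natural Science Foundation of Jiangsu Province (General Program)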
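Abstract: The Hadoop Distributed File System (HDFS) is a low-cost, highly fault-tolerant distributed file system designed to run on commodity hardware. It provides high-throughput data access and suits applications that work on large datasets. However, HDFS still faces performance optimization problems, such as insufficient load balancing. The balancer that ships with Hadoop can rebalance a cluster, but it requires the user to supply a static threshold in advance. To address the fixed and subjective nature of that threshold, the paper analyzes, evaluates, and optimizes parameters including disk space utilization, CPU utilization, memory utilization, disk I/O occupancy, and network bandwidth occupancy to derive an expression for computing the threshold, and it verifies the threshold computation and the resulting load balancing through theoretical analysis and simulation experiments. The experimental results show that, compared with Hadoop's algorithm based on a static input threshold, the method achieves better balance and improves the utilization of computing resources.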

Keywords: cloud environment  Hadoop Distributed File System (HDFS)  load balancing  dynamic threshold

Distributed File System Load Balancing in Cloud Environment
WU Yaoyao, YANG Geng. Distributed File System Load Balancing in Cloud Environment[J]. Computer Engineering and Applications, 2019, 55(10): 67-72.
Authors:WU Yaoyao  YANG Geng
Affiliation:1.College of Computer Science, Nanjing University of Posts and Telecommunications, Nanjing 210023, China 2.Jiangsu Key Laboratory of Big Data Security & Intelligent Processing, Nanjing 210023, China
Abstract: Hadoop Distributed File System (HDFS) is a low-cost, highly fault-tolerant distributed file system that is suitable for running on commodity hardware and offers high-throughput data access for applications on large datasets. However, HDFS still has performance optimization problems, such as insufficient load balancing. Although Hadoop ships with a load balancer that can rebalance the cluster, users must supply a static threshold in advance. To address the fixed and subjective nature of this threshold, this paper analyzes, evaluates, and optimizes parameters including disk space utilization, CPU utilization, memory utilization, disk I/O occupancy rate, and network bandwidth occupancy rate, and derives an expression for calculating the threshold. The threshold calculation and the resulting load balancing are verified through theoretical analysis and simulation experiments. The experimental results show that, compared with Hadoop's static input threshold algorithm, this method achieves a better balance and improves the utilization of computing resources.
Keywords:cloud environment  Hadoop Distributed File System(HDFS)  load balancing  dynamic threshold  
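
The abstract names the five metrics that feed the threshold but does not give the paper's actual expression, so the following Python sketch is only an illustration of the general idea, not the authors' method. It contrasts a fixed threshold (0.10, mirroring the 10 percent default of the HDFS balancer's `-threshold` option, which itself considers disk space only) with a threshold derived from the current spread of node loads. The equal weights in `WEIGHTS`, the use of the population standard deviation in `dynamic_threshold`, and the sample values in `CLUSTER` are all assumptions made for this example.

```python
from statistics import mean, pstdev

# Hypothetical equal weights for the five metrics listed in the abstract.
# The paper's real weighting is not given in this record.
WEIGHTS = {"disk": 0.2, "cpu": 0.2, "mem": 0.2, "io": 0.2, "net": 0.2}

# Made-up utilization ratios (each in [0, 1]) for three DataNodes.
CLUSTER = {
    "dn1": {"disk": 0.9, "cpu": 0.8, "mem": 0.7, "io": 0.6, "net": 0.5},
    "dn2": {"disk": 0.5, "cpu": 0.5, "mem": 0.5, "io": 0.5, "net": 0.5},
    "dn3": {"disk": 0.2, "cpu": 0.3, "mem": 0.4, "io": 0.3, "net": 0.3},
}

STATIC_THRESHOLD = 0.10  # user-supplied constant, as with a static balancer


def node_load(metrics):
    """Weighted sum of one node's utilization ratios."""
    return sum(WEIGHTS[name] * metrics[name] for name in WEIGHTS)


def dynamic_threshold(loads):
    """Assumed stand-in for the paper's expression: the population
    standard deviation of the current node loads."""
    return pstdev(loads.values())


def classify(loads, threshold):
    """Split nodes into over-loaded, under-loaded, and balanced lists,
    measured against the cluster's mean load."""
    avg = mean(loads.values())
    over = sorted(n for n, v in loads.items() if v - avg > threshold)
    under = sorted(n for n, v in loads.items() if avg - v > threshold)
    balanced = sorted(n for n in loads if n not in over and n not in under)
    return over, under, balanced


LOADS = {name: node_load(m) for name, m in CLUSTER.items()}
```

Calling `classify(LOADS, STATIC_THRESHOLD)` and `classify(LOADS, dynamic_threshold(LOADS))` on the same snapshot shows how the two thresholds partition the nodes; with a data-derived threshold, the cutoff moves as the cluster's load distribution changes instead of staying at a value chosen in advance.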
This article is indexed in Wanfang Data (万方数据) and other databases.