首页 | 本学科首页   官方微博 | 高级检索  
     

一种Mapreduce作业内存精确预测方法
引用本文:罗永刚,陈兴蜀,杨露.一种Mapreduce作业内存精确预测方法[J].电子科技大学学报(自然科学版),2016,45(6):986-991.
作者姓名:罗永刚  陈兴蜀  杨露
作者单位:四川大学网络空间安全研究院 成都 610065
基金项目:国家科技支撑计划2012BAH18B05
摘    要:针对准确预测mapreduce作业内存资源需求困难的问题,根据Java虚拟机(JVM)的分代(JVM将堆内存划分为年轻代和年长代)内存管理特点,该文提出一种分代内存预测方法。建立年轻代大小与垃圾回收时间的模型,将寻找合理年轻代大小的问题转换为一个受约束的非线性优化问题,并设计搜索算法求解该优化问题。建立mapreduce作业的map任务和reduce任务性能与内存的关系模型,求解最佳性能的内存需求,从而获得map任务和reduce任务的年长代内存大小。实验结果表明,本文提出的方法能准确预测作业的内存需求;与默认配置相比,能提供平均6倍的性能提升。

关 键 词:垃圾回收    Java虚拟机    mapreduce    资源管理
收稿时间:2015-11-03

An Innovative Memory Prediction Approach for Mapreduce Job
Affiliation:Cybersecurity Research Institutes, Sichuan University Chengdu 610065
Abstract:It is difficult to predict the amount of memory for a mapreduce job. Based on the fact that Java virtual machine (JVM) divides the heap space managed by the JVM garbage collector into young and old generations, a generational memory prediction method is put forward. We build up a function that models the relationship between the amount of young generation and the total garbage collection time, and then we use a constrained nonlinear optimization model to find the rational footprint of young generation. The memory model for the map phase is established, the phase of a mapreduce job is reduced, then a relationship between map/reduce tasks' performance (runtime of a task) and the amount of memory of the old generation is set up, and finally, the reasonable old generation memory size is obtained. The experimental results show that the proposed approach can accurately predict the memory size of map and reduce the tasks of a mapreduce job. In comparison with the default configuration, the proposed approach can give us 6 times performance improvement than default settings.
Keywords:
点击此处可从《电子科技大学学报(自然科学版)》浏览原始摘要信息
点击此处可从《电子科技大学学报(自然科学版)》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号