首页 | 本学科首页   官方微博 | 高级检索  
     

虚拟化环境下面向多目标优化的自适应SSD缓存系统
引用本文:唐震,吴恒,王伟,魏峻,黄涛.虚拟化环境下面向多目标优化的自适应SSD缓存系统[J].软件学报,2017,28(8):1982-1998.
作者姓名:唐震  吴恒  王伟  魏峻  黄涛
作者单位:中国科学院软件研究所软件工程技术研发中心, 北京 100190;计算机科学重点实验室(中国科学院软件研究所), 北京 100190;中国科学院大学, 北京 100049,中国科学院软件研究所软件工程技术研发中心, 北京 100190;计算机科学重点实验室(中国科学院软件研究所), 北京 100190,中国科学院软件研究所软件工程技术研发中心, 北京 100190;计算机科学重点实验室(中国科学院软件研究所), 北京 100190,中国科学院软件研究所软件工程技术研发中心, 北京 100190;计算机科学重点实验室(中国科学院软件研究所), 北京 100190,中国科学院软件研究所软件工程技术研发中心, 北京 100190;计算机科学重点实验室(中国科学院软件研究所), 北京 100190
基金项目:国家重点研发计划(2016YFB1000103);国家自然科学基金(61572480);中国科学院青年创新促进会(2015088)
摘    要:以SSD为代表的新型存储介质在虚拟化环境下得到了广泛的应用,通常作为虚拟机读写缓存,起到优化磁盘IO性能的作用.已有研究往往关注SSD缓存的容量规划,依据缓存读写命中率评价SSD缓存分配效果,未能充分考虑SSD的服务能力上限,难以适用于典型的分布式应用场景,存在虚拟机抢占SSD缓存资源,导致虚拟机中应用性能违约的可能.本文实现了虚拟化环境下面向多目标优化的自适应SSD缓存系统,考虑了SSD的服务能力上限.基于自适应闭环实现对虚拟机和应用状态的动态感知.动态检测局部SSD缓存抢占状态,基于聚类方法生成虚拟机的优化放置方案,依据全局SSD缓存供给能力确定虚拟机迁移顺序和时机.实验结果表明该方法在应对典型分布式应用场景时可以有效缓解SSD缓存资源的争用,同时满足应用对虚拟机放置的需求,提升应用的性能并兼顾应用的可靠性.在Hadoop应用场景下,平均降低了25%的任务执行时间,对IO密集型应用平均提升39%的吞吐率.在ZooKeeper应用场景下,以不到5%的性能损失为代价应对了虚拟化主机的单点失效带来的虚拟机宕机问题.

关 键 词:固态盘  缓存  虚拟化  动态迁移
收稿时间:2016/6/20 0:00:00
修稿时间:2016/12/1 0:00:00

Self-Adaptive SSD Caching System for Multiobjective Optimization in Virtualization Environment
TANG Zhen,WU Heng,WANG Wei,WEI Jun and HUANG Tao.Self-Adaptive SSD Caching System for Multiobjective Optimization in Virtualization Environment[J].Journal of Software,2017,28(8):1982-1998.
Authors:TANG Zhen  WU Heng  WANG Wei  WEI Jun and HUANG Tao
Affiliation:Technology Center of Software Engineering, Institute of Software, The Chinese Academy of Sciences, Beijing 100190, China;State Key Laboratory of Computer Science(Institute of Software, The Chinese Academy of Sciences), Beijing 100190, China;University of Chinese Academy of Sciences, Beijing 100049, China,Technology Center of Software Engineering, Institute of Software, The Chinese Academy of Sciences, Beijing 100190, China;State Key Laboratory of Computer Science(Institute of Software, The Chinese Academy of Sciences), Beijing 100190, China,Technology Center of Software Engineering, Institute of Software, The Chinese Academy of Sciences, Beijing 100190, China;State Key Laboratory of Computer Science(Institute of Software, The Chinese Academy of Sciences), Beijing 100190, China,Technology Center of Software Engineering, Institute of Software, The Chinese Academy of Sciences, Beijing 100190, China;State Key Laboratory of Computer Science(Institute of Software, The Chinese Academy of Sciences), Beijing 100190, China and Technology Center of Software Engineering, Institute of Software, The Chinese Academy of Sciences, Beijing 100190, China;State Key Laboratory of Computer Science(Institute of Software, The Chinese Academy of Sciences), Beijing 100190, China
Abstract:Solid state disk (SSD), as a new type of storage media, is widely used in the virtualization environment, and isusually used as the read and write cache of the virtual machine (VM) storage, to improve the disk IO performance of the VMs.Existing SSD caching schemesmostly focus on the capacity planning of the SSD cache anduse metrics such as cache hitrateto evaluate the effect of SSD cache allocation. They do not consider the limitations of service capabilities of SSD, which may lead to the contention of cache resource and the performance degradation and violations among VMs, so they are not suitable to be used with some typical distributed applications. We propose a self-adaptive SSD caching system for multiobjective optimization in the virtualization environment, to reduce the resource contention, and take the limitations of service capabilities of SSD into consideration. With the help of closed loop adaption, it can dynamically detect the status of VMs and applications. Moreover, it continuously detects the contention of SSD cache, generates the migration plan using clustering algorithm and decides the timing and order of the VM migrations according to the capabilities of SSDas well as the characteristics and requirements of applications. The evaluation shows that when facing the scenarios of using some typical distributed applications, the contention of SSD cache resource is reduced and the requirements of applications are considered, which lead to the improvement of performance and reliability of applications.For Hadoop applications, the execution time of jobsis reduced by 25%in average, and the throughput for IO sensitive applications is improved by 39%. For ZooKeeper applications, the service outage caused by the single point of fault of the hypervisor can be handled at the cost of less than 5% of performance degradation.
Keywords:SSD  caching  virtualization  live migration
点击此处可从《软件学报》浏览原始摘要信息
点击此处可从《软件学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号