首页 | 本学科首页   官方微博 | 高级检索  
     

大数据负载的体系结构特征分析
引用本文:罗建平,谢梦瑶,王华锋.大数据负载的体系结构特征分析[J].计算机科学,2015,42(11):48-52.
作者姓名:罗建平  谢梦瑶  王华锋
作者单位:中国科学院计算技术研究所先进计算机系统研究中心 北京100190;北京航空航天大学软件学院 北京10091,中国科学院计算技术研究所先进计算机系统研究中心 北京100190;郑州大学信息工程学院 郑州450001,北京航空航天大学软件学院 北京10091
基金项目:本文受国家重点基础研究发展规划项目(2014CB340402),国家自然科学基金(61303054)资助
摘    要:针对大数据离线分析类和交互式查询类负载,首先对这些负载的一些共性进行分析,提取出公共操作集,并对它们进行分组整理;然后在大数据平台上测试这些负载运行过程中的微体系结构特征,采用PCA和SimpleKMeans算法对这些体系结构特征参数进行降维和聚类处理。实验分析结果表明负载之间有公共的操作集,如Join和Cross Production;有些负载有相似的属性,如Difference和Projection共享相同的微体系结构特征。实验结果对于 处理器等硬件平台的设计以及应用程序的优化具有指导性的意义,并且为大数据基准测试平台的设计提供了参考。

关 键 词:大数据  大数据负载  体系结构特征
收稿时间:2014/10/17 0:00:00
修稿时间:2014/12/1 0:00:00

Analysis of Architecture Characteristics of Big Data Workloads
LUO Jian-ping,XIE Meng-yao and WANG Hua-feng.Analysis of Architecture Characteristics of Big Data Workloads[J].Computer Science,2015,42(11):48-52.
Authors:LUO Jian-ping  XIE Meng-yao and WANG Hua-feng
Affiliation:Advanced Computer Systems Laboratory,Institute of Computing Technology,Chinese Academy of Sciences,Beijing 100190,China;School of Software,Beihang University,Beijing 100191,China,Advanced Computer Systems Laboratory,Institute of Computing Technology,Chinese Academy of Sciences,Beijing 100190,China;School of Information Engineering,Zhengzhou University,Zhengzhou 450001,China and Advanced Computer Systems Laboratory,Institute of Computing Technology,Chinese Academy of Sciences,Beijing 100190,China
Abstract:Aiming at the big data workloads with off-line analysis and interactive queries,first we analyzed some common features of these workloads,extracted the common set of operations and arranged the workloads in groups.Then,we tested the big data workloads on the BigDataBench platform and got the micro-architecture characteristics using PCA and SimpleKMeans algorithm for dimensionality reduction and clustering analysis.Our study revealed that big data workloads share a common set of operations such as the Join and Cross Production.We also observed that some of the big data workloads have many similar features.For example,the Difference and Projection operations share micro-architectural characteristics.The result of our experiment has a guiding significance for the design of hardware platforms like processors and the optimization of applications.Meanwhile,it also provides valuable insights into the implementation of the big data benchmark platform.
Keywords:Big data  Big data workloads  Architecture characteristic
本文献已被 万方数据 等数据库收录!
点击此处可从《计算机科学》浏览原始摘要信息
点击此处可从《计算机科学》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号