首页 | 本学科首页   官方微博 | 高级检索  
     

集群上一种面向空间连接聚集的并行计算模型
引用本文:刘义,景宁,陈荦,熊伟.集群上一种面向空间连接聚集的并行计算模型[J].软件学报,2013,24(S2):99-109.
作者姓名:刘义  景宁  陈荦  熊伟
作者单位:国防科学技术大学 电子科学与工程学院, 湖南 长沙 410073;国防科学技术大学 电子科学与工程学院, 湖南 长沙 410073;国防科学技术大学 电子科学与工程学院, 湖南 长沙 410073;国防科学技术大学 电子科学与工程学院, 湖南 长沙 410073
基金项目:国家自然科学基金(61070035, 41271403);国家高技术研究发展计划(863)(2011AA120306, 2007AA120402);教育部高等学校博士学科点专项科研基金(20104307110017)
摘    要:单机运行环境难以满足海量空间数据的连接聚集操作对时空开销的需求,集群上的并行计算是高效处理海量空间数据的连接聚集操作的关键. Map-Reduce是云计算中一种应用于大规模集群进行大规模数据处理的分布式并行编程模型,分析发现,Map-Reduce并不直接支持以既高效又自然的方式来处理具有二次归约特征的并行空间连接聚集操作.因此,提出了一种并行计算模型——Map-Reduce-Combine(MRC)来有效地处理大规模空间数据的连接聚集操作.MRC在Map-Reduce 模型上增加一个Combine阶段,有效地合并分散在各个Reducer的部分聚集结果.针对并行任务划分中空间对象的单分配问题,提出了过滤优化算法,提高了MRC下处理空间连接聚集查询的效率.实验验证所提出的并行计算模型在处理空间连接聚集查询时具有良好的效率、有效性、可扩展性和简单性.

关 键 词:云计算  Map-Reduce  空间连接聚集  空间查询  二次归约
收稿时间:8/5/2012 12:00:00 AM
修稿时间:2013/7/22 0:00:00

Parallel Computing Model for Spatial Join Aggregate on Cluster
LIU Yi,JING Ning,CHEN Luo and WEI Xiong.Parallel Computing Model for Spatial Join Aggregate on Cluster[J].Journal of Software,2013,24(S2):99-109.
Authors:LIU Yi  JING Ning  CHEN Luo and WEI Xiong
Affiliation:College of Electronic Science and Engineering, National University of Defense Technology, Changsha 410073, China;College of Electronic Science and Engineering, National University of Defense Technology, Changsha 410073, China;College of Electronic Science and Engineering, National University of Defense Technology, Changsha 410073, China;College of Electronic Science and Engineering, National University of Defense Technology, Changsha 410073, China
Abstract:Since processing large-scale spatial join aggregate (SJA) is usually difficult to be implemented on a single machine, parallel computing on cluster has been the key to process large-scale SJA operation efficiently. Map-Reduce has been the mainstream parallel computing technique for massive data on cluster. However, Map-Reduce does not directly support processing parallel SJA with both high efficiency and straightforward way, for it needs to perform a second reduce operation. This paper proposes a novel parallel computing model, Map-Reduce-Combine (MRC), which is able to process large-scale SJA efficiently with a simple way on cluster. MRC adds to Map-Reduce a Combine phase that can efficiently combine partial aggregate results distributed among different Reducers, which is caused by the multiple assignment of spatial object. For the spatial object assigned only once, a filter optimization method has been proposed to pick up the result of single assignment object obtained in Reduce phase and further enhance the performance of processing SJA. Extensive experiments in large real spatial data have demonstrated the efficiency, effectiveness, scalability and simplicity of the proposed parallel computing model for processing SJA on massive spatial data.
Keywords:cloud computing  Map-Reduce  spatial join aggregate  spatial query  second reduce
点击此处可从《软件学报》浏览原始摘要信息
点击此处可从《软件学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号