首页 | 本学科首页   官方微博 | 高级检索  
     

MapReduce框架下的Skyline计算
引用本文:张波良,周水庚,关佶红.MapReduce框架下的Skyline计算[J].计算机科学与探索,2011,5(5):385-397.
作者姓名:张波良  周水庚  关佶红
作者单位:1. 复旦大学计算机科学技术学院,上海,200433;复旦大学上海市智能信息处理重点实验室,上海,200433
2. 同济大学计算机科学与技术系,上海,201804
摘    要:由于Skyline查询广泛应用于多目标决策、数据可视化等领域,近年来成为数据库领域的一个研究热点。针对云计算环境,在MapReduce框架下设计并实现了Skyline算法。MapReduce是一个运行在大型集群上处理海量数据的并行计算框架,其主要思想是任务的分解与结果的汇总。基于不同的数据划分思想,实施了三种Skyline并行算法,分别是基于MapReduce的块嵌套循环算法(MapReduce based block-nested-loops,MR-BNL)、基于MapReduce的排序过滤算法(MapReduce based sort-filter-skyline,MR-SFS)以及基于MapReduce的位图算法(MapReduce based bitmap,MR-Bitmap),并针对这三种算法进行了系统的实验比较,得出了不同数据分布、维数、缓存等因素对算法性能的影响结果。

关 键 词:Skyline计算  云计算  MapReduce  数据划分
修稿时间: 

Skyline Computation under MapReduce Framework
ZHANG Boliang,ZHOU Shuigeng,GUAN Jihong.Skyline Computation under MapReduce Framework[J].Journal of Frontier of Computer Science and Technology,2011,5(5):385-397.
Authors:ZHANG Boliang  ZHOU Shuigeng  GUAN Jihong
Affiliation:1. School of Computer Science, Fudan University, Shanghai 200433, China 2. Shanghai Key Laboratory of Intelligent Information Processing, Fudan University, Shanghai 200433, China 3. Department of Computer Science and Technology, Tongji University, Shanghai 201804, China
Abstract:Skyline computation, due to its wide applications in multi-objective decision making and data visualization, has attracted many research interests in database community recently. Aiming at cloud computing applications, this paper addresses the problem of Skyline computation under the MapReduce framework. As a parallel programming model for data-intensive computing applications, MapReduce runs on a cluster of commercial PCs with the main idea of task decomposition and solution reduction. Based on different data division strategies, this paper proposes three algorithms: MapReduce based block-nested-loops (MR-BNL), MapReduce based sort-filter-skyline (MR-SFS) and MapReduce based bitmap (MR-Bitmap). It conducts extensive experiments to evaluate and compare the three algorithms under different situations of different data distributions, dimensions and buffer sizes.
Keywords:Skyline computation  cloud computing  MapReduce  data division
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《计算机科学与探索》浏览原始摘要信息
点击此处可从《计算机科学与探索》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号