首页 | 本学科首页   官方微博 | 高级检索  
     

缓存敏感的封闭冰山立方体计算
引用本文:栾华,杜小勇,王珊.缓存敏感的封闭冰山立方体计算[J].软件学报,2010,21(4):620-631.
作者姓名:栾华  杜小勇  王珊
作者单位:数据工程与知识工程教育部重点实验室(中国人民大学),北京,100872;中国人民大学,信息学院,北京,100872
基金项目:Supported by the National Natural Science Foundation of China under Grant Nos.60496325, 60873017 (国家自然科学基金); the Grant from HP Labs China (惠普中国实验室资助项目)
摘    要:数据立方体计算通常会产生大量的输出结果,冰山立方体和封闭立方体是解决这个问题的比较流行的两种策略,二者可以结合使用.鉴于封闭冰山立方体(closed iceberg cube)的重要性和实用性,如何高效地计算封闭冰山立方体是一个值得研究的问题.提出一种缓存敏感(cache-conscious)的计算封闭冰山立方体的方法,在自底向上对数据进行聚集的同时,寻找覆盖聚集单元的封闭单元,将其输出,使用两种策略进行剪枝,去掉不必要的递归,同时使用Apriori剪枝技术,支持冰山立方体(iceberg cube)的计算.为了减少与内存相关的延迟,快速得到聚集结果,对多个维进行预排序,并将软件预取技术引入到数据扫描中.在模拟数据和真实数据上进行了详细而全面的实验研究,结果表明,封闭冰山立方体的计算方法是快速、有效的.

关 键 词:联机分析处理  封闭冰山立方体  缓存敏感  内存相关延迟
收稿时间:2008/5/14 0:00:00
修稿时间:2008/10/9 0:00:00

Cache-Conscious Computation of Closed Iceberg Cubes
LUAN Hu,DU Xiao-Yong and WANG Shan.Cache-Conscious Computation of Closed Iceberg Cubes[J].Journal of Software,2010,21(4):620-631.
Authors:LUAN Hu  DU Xiao-Yong and WANG Shan
Affiliation:LUAN Hua1,2,DU Xiao-Yong1,WANG Shan1,2 1(Key Laboratory of Data Engineering , Knowledge Engineering (Renmin University of China),Ministry of Education,Beijing 100872,China) 2(School of Information,Renmin University of China,China)
Abstract:The computation of data cubes usually produces huge outputs. There are two popular methods to solve this problem: Iceberg cube and closed cube, which can be combined together. Due to the importance and usability of closed iceberg cube, how to efficiently compute it becomes a key research issue. A cache-conscious computation method is proposed in this paper. The data are aggregated in a bottom-up manner. In the meantime, the closed cells covering the aggregate cells are discovered and output. Two pruning strategies are used to save unnecessary recursive calls. The Apriori pruning is utilized to support iceberg cube computation. To reduce the number of memory-related stalls and produce the aggregate results efficiently, multiple dimensions are pre-sorted and the software prefetching technology is introduced into data scans. A comprehensive and detailed performance study is conducted on both synthetic data and real data sets. The results show that the proposed closed iceberg cube computation method is efficient and effective.
Keywords:OLAP (on-line analytical processing)  closed iceberg cube  cache-conscious  memory-related stalls
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《软件学报》浏览原始摘要信息
点击此处可从《软件学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号