首页 | 本学科首页   官方微博 | 高级检索  
     

数据仓库中雪花模式的Skyline-Join查询
引用本文:陈玲,徐忠华,张剡,肖旭生,柏文阳.数据仓库中雪花模式的Skyline-Join查询[J].计算机研究与发展,2009,46(Z2).
作者姓名:陈玲  徐忠华  张剡  肖旭生  柏文阳
作者单位:南京大学计算机软件新技术国家重点实验室,南京,210093
摘    要:Skyline查询能够有效地实现多目标最优化,而数据仓库中的OLAP也是针对多维数据进行分析,因此,针对Skyline查询在数据仓库中的应用,提出了数据仓库中雪花模式的Skyline-Join查询算法.该算法首先将子维表M-Join父维表,然后渐进选择式地对事实表和父维表进行连接.每次连接之前都对事实表进行分组和组内Skyline计算,删除组内非Skyline元组,这样可以减少许多不必要的连接操作,使得查询效率大大提高.通过实验证明,在事实表元组数量逐渐变大和维表个数逐渐增多的情况下,提出的算法比先Join后Skyline计算的naive算法效率上有明显改善.

关 键 词:数据仓库  雪花模式  Skyline查询  多表连接

Skyline-Join in Snow Schema of Data Warehouse
Chen Ling,Xu Zhonghua,Zhang Yan,Xiao Xusheng,Bai Wenyang.Skyline-Join in Snow Schema of Data Warehouse[J].Journal of Computer Research and Development,2009,46(Z2).
Authors:Chen Ling  Xu Zhonghua  Zhang Yan  Xiao Xusheng  Bai Wenyang
Abstract:Skyline query is efficient in multi-criteria decision making systems,while OLAP in data warehouse mainly focuses on the analysis of multi-dimension data.Therefore,a progressive and selective skyline-join algorithm is proposed to apply skyline query into snow schema of data warehouse.This algorithm first M-Joins child dimension tables with parent dimension tables,then progressively and selectively joins fact table with each parent dimension table.Before each join,fact table will be grouped and skyline computation will start in each group;and then the non-skyline tuples in each group will be pruned to avoid many unnecessary join operations.That can greatly improve the algorithm's efficiency.The experiments demonstrate that when the size of fact table and the number of dimension tables increases,the proposed algorithm is much more efficient than the naive skylinejoin algorithm.
Keywords:data warehouse  snow schema  skyline query  multi-join
本文献已被 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号