Specification-based data reduction in dimensional data warehouses
Authors: Janne Skyt, Christian S. Jensen, Torben Bach Pedersen
Affiliation: Department of Computer Science, Aalborg University, Fredrik Bajers Vej 7E, 9220 Aalborg Øst, Denmark
Abstract: Many data warehouses contain massive amounts of data, accumulated over long periods of time. In some cases, it is necessary or desirable to either delete “old” data or to maintain the data at an aggregate level. This may be due to privacy concerns, in which case the data are aggregated to levels that ensure anonymity. Another reason is the desire to maintain a balance between the uses of data that change as the data age and the size of the data, thus avoiding overly large data warehouses. This paper presents effective techniques for data reduction that enable the gradual aggregation of detailed data as the data age. With these techniques, data may be aggregated to higher levels as they age, enabling the maintenance of more compact, consolidated data and compliance with privacy requirements. Special care is taken to avoid semantic problems in the aggregation process. The paper also describes the querying of the resulting data warehouses and an implementation strategy based on current database technology.
Keywords: Data reduction; Data warehousing; Multidimensional data; Data models; Physical deletion
Indexed in: ScienceDirect and other databases