首页 | 本学科首页   官方微博 | 高级检索  
     


Evaluating holistic aggregators efficiently for very large datasets
Authors:Lixin?Fu  author-information"  >  author-information__contact u-icon-before"  >  mailto:lfu@uncg.edu"   title="  lfu@uncg.edu"   itemprop="  email"   data-track="  click"   data-track-action="  Email author"   data-track-label="  "  >Email author,Sanguthevar?Rajasekaran
Affiliation:(1) Division of Computer Science, Department of Mathematical Sciences, University of North Carolina at Greensboro, Bryan 383, NC 27402-6170 Greensboro, USA;(2) CSE, University of Connecticut, 191 Auditorium Road, U-155, CT 06269-3155 Storrs, USA
Abstract:In data warehousing applications, numerous OLAP queries involve the processing of holistic aggregators such as computing the ldquotop n,rdquo median, quantiles, etc. In this paper, we present a novel approach called dynamic bucketing to efficiently evaluate these aggregators. We partition data into equiwidth buckets and further partition dense buckets into subbuckets as needed by allocating and reclaiming memory space. The bucketing process dynamically adapts to the input order and distribution of input datasets. The histograms of the buckets and subbuckets are stored in our new data structure called structure trees. A recent selection algorithm based on regular sampling is generalized and its analysis extended. We have also compared our new algorithms with this generalized algorithm and several other recent algorithms. Experimental results show that our new algorithms significantly outperform prior ones not only in the runtime but also in accuracy.Received: 20 December 2000, Published online: 4 March 2004Edited by: P. Scheuermann.Sanguthevar Rajasekaran: This authorrsquos work is supported by NSF Grant 9912395.
Keywords:Quantiles  Dynamic bucketing  Aggregation
本文献已被 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号