首页 | 本学科首页   官方微博 | 高级检索  
     


Discovery of maximum length frequent itemsets
Authors:Tianming Hu  Sam Yuan Sung  Qian Fu
Affiliation:a East China Normal University, DongGuan University of Technology, China
b Department of Computer Science, South Texas College, USA
c MSIS Department, Rutgers University, USA
d Department of Computer Science, National University of Singapore, Singapore
Abstract:The use of frequent itemsets has been limited by the high computational cost as well as the large number of resulting itemsets. In many real-world scenarios, however, it is often sufficient to mine a small representative subset of frequent itemsets with low computational cost. To that end, in this paper, we define a new problem of finding the frequent itemsets with a maximum length and present a novel algorithm to solve this problem. Indeed, maximum length frequent itemsets can be efficiently identified in very large data sets and are useful in many application domains. Our algorithm generates the maximum length frequent itemsets by adapting a pattern fragment growth methodology based on the FP-tree structure. Also, a number of optimization techniques have been exploited to prune the search space. Finally, extensive experiments on real-world data sets validate the proposed algorithm.
Keywords:Association analysis   Frequent itemsets   Maximum length frequent itemsets   FP-tree   Data mining
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号