首页 | 本学科首页   官方微博 | 高级检索  
     

Cherry:一种无须子集检查的闭合频繁集挖掘算法
引用本文:陶利民,黄林鹏.Cherry:一种无须子集检查的闭合频繁集挖掘算法[J].软件学报,2008,19(2):379-388.
作者姓名:陶利民  黄林鹏
作者单位:上海交通大学,计算机科学与工程系,上海,200032
基金项目:Supported by the National Natural Science Foundation of China under Grant No.60673116 (国家自然科学基金)
摘    要:通过对一些著名的闭合频繁集挖掘算法(如CLOSET ,FP-CLOSE,DCI-CLOSED和LCMv2等)的研究并结合挖掘理论分析,提出了一种新的挖掘算法Cherry,它基于FP-tree结构,并采用了新颖的CherryItem检测技术,无须在内存中保留闭合频繁集而直接检测出会导致重复的频繁项前缀,从而极大地提高了挖掘效率.性能实验的比较和测试表明,该Cherry算法在低支持度的测试中要优于目前的一些主流挖掘算法,如LCMv2,DCI-CLOSE和FP-CLOSE等.

关 键 词:关联规则  闭合频繁集
收稿时间:2006-02-21
修稿时间:2006-10-10

Cherry: An Algorithm for Mining Frequent Closed Itemsets without Subset Checking
TAO Li-Min and HUANG Lin-Peng.Cherry: An Algorithm for Mining Frequent Closed Itemsets without Subset Checking[J].Journal of Software,2008,19(2):379-388.
Authors:TAO Li-Min and HUANG Lin-Peng
Abstract:Through the theoretical analysis and research works on some famous mining algorithms, a new mining algorithm named Cherry is proposed in this paper. It bases on FP-tree technology and adopts a novel Cherry-Items-detecting technology. This novel technology can find those prefixes which result to the unclosed or redundant frequent itemsets without maintaining the frequent closed itemsets mined so far in the main memory. In the performance test, the Cherry algorithm is compared with other state of the art algorithms, such as FP-CLOSE, LCMv2 and DCI-CLOSE, in many synthetic and real data sets. The experimental results demonstrate that the Cherry algorithm outperforms them in low support.
Keywords:association rule  frequent closed itemset
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《软件学报》浏览原始摘要信息
点击此处可从《软件学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号