首页 | 本学科首页   官方微博 | 高级检索  
     

频繁闭项目集挖掘算法研究
引用本文:朱玉全,宋余庆.频繁闭项目集挖掘算法研究[J].计算机研究与发展,2007,44(7):1177-1183.
作者姓名:朱玉全  宋余庆
作者单位:江苏大学计算机科学与通信工程学院,镇江,212013
摘    要:目前已提出了许多基于Apriori算法思想的频繁项目集挖掘算法,这些算法可以有效地挖掘出事务数据库中的短频繁项目集,但对于长频繁项目集的挖掘而言,其性能将明显下降.为此,提出了一种频繁闭项目集挖掘算法MFCIA,该算法可以有效地挖掘出事务数据库中所有的频繁项目集,并对其更新问题进行了研究,提出了一种相应的频繁闭项目集增量式更新算法UMFCIA,该算法将充分利用先前的挖掘结果来节省发现新的频繁闭项目集的时间开销.实验结果表明算法MFCIA是有效可行的.

关 键 词:频繁项目集  频繁闭项目集  最小频繁闭项目集  最大频繁闭项目集  增量式更新  频繁闭项目集  挖掘算法  算法研究  Closed  Frequent  Mining  Algorithm  实验  时间开销  发现  结果  利用  更新算法  增量式  问题  性能  长频繁项目集  事务数据库  算法思想  Apriori
修稿时间:2006-11-24

Research on an Algorithm for Mining Frequent Closed Itemsets
Zhu Yuquan,Song Yuqing.Research on an Algorithm for Mining Frequent Closed Itemsets[J].Journal of Computer Research and Development,2007,44(7):1177-1183.
Authors:Zhu Yuquan  Song Yuqing
Abstract:Mining frequent itemsets is a fundamental and essential problem in data mining application.Most of the proposed mining algorithms are a variant of Apriori.These algorithms show good performance with spare datasets.However,with dense datasets such as telecommunications and medical image data,where there are many long frequent itemsets,the performance of these algorithms degrades incredibly.In order to solve this problem,an efficient algorithm MFCIA and its updating algorithm UMFCIA for mining frequent closed itemsets are proposed.The set of frequent closed itemsets uniquely determines the exact frequency of all frequent itemsets,yet it can be orders of magnitude smaller than the set of all frequent itemsets,thus lowering the algorithm computation cost.The algorithm UMFCIA makes use of the previous mining results to cut down the cost of finding new frequent closed itemsets.The experiments show that the algorithm MFCIA is efficient.
Keywords:frequent itemsets  frequent closed itemsets  minimum frequent closed itemsets  maximal frequent closed itemsets  incremental updating
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号