首页 | 本学科首页   官方微博 | 高级检索  
     

基于改进FP-Tree的最大频繁项集高效挖掘算法
引用本文:纪怀猛.基于改进FP-Tree的最大频繁项集高效挖掘算法[J].计算机与数字工程,2014(6):959-963.
作者姓名:纪怀猛
作者单位:福州大学阳光学院,福州350015
摘    要:基于FP-T ree的FP-M ax算法在挖掘最大频繁集时需多次递归建立条件模式树耗费大量存储空间,这大大降低了算法的挖掘效率。提出了一种基于改进FP-T ree的最大频繁集快速挖掘算法-FP-EM ax算法。该算法无需建立条件模式库大大减少了存储空间开销,采用预剪枝策略减少条件模式树的构造次数及子集检测次数,从而算法的挖掘效率大大提高。最后通过实验证明FP-EM ax算法在支持度较小的情况下较之于FP-M ax及同类算法具有更好的性能。

关 键 词:递归  最大频繁集  FP-Tree  条件模式树  索引

Efficient Algorithm for Mining Maximum Frequent Item Sets Based on Improved FP-Tree
JI Huaimeng.Efficient Algorithm for Mining Maximum Frequent Item Sets Based on Improved FP-Tree[J].Computer and Digital Engineering,2014(6):959-963.
Authors:JI Huaimeng
Affiliation:JI Huaimeng (Sunshine College, Fuzhou University, Fuzhou 350015)
Abstract:The efficiency of the algorithm for mining maximum frequent set is greatly reduced ,when the algorithm named FP-Max based on FP-Tree is used in mining maximum frequent item sets .Because it needs recursively to establish conditional pattern tree ,and take a lot of storage space .A new algorithm named FP-EMax based on improved FP-Tree is put forward for efficiently mining maximum frequent sets .The efficiency of the new algorithm is greatly increased ,because not only it doesn't need to establish conditional pattern library ,which greatly reduces the storage space overhead ;but also it uses pruning to reduce the number of establishing conditional pattern tree and testing subset .Finally ,the experiments show that the algorithm has a better performance than the FP-Max algorithm and other similar algorithms in the case of a relatively small support .
Keywords:recursive  maximum frequent item sets  FP-Tree  conditional pattern tree  index
本文献已被 维普 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号