首页 | 本学科首页   官方微博 | 高级检索  
     

基于改进FP-树的最大模式挖掘算法
引用本文:孟祥萍,王华金,王贤勇,任纪川,鞠传香.基于改进FP-树的最大模式挖掘算法[J].计算机工程与应用,2005,41(14):179-181,228.
作者姓名:孟祥萍  王华金  王贤勇  任纪川  鞠传香
作者单位:1. 长春工程学院电气工程系,长春,130012
2. 东北电力学院信息工程系,吉林,132012
基金项目:吉林省教育厅科研基金资助项目(编号:0230)
摘    要:频繁模式挖掘是数据挖掘领域中的一个非常重要的分支,但是由于其内在的计算复杂性,挖掘密集型数据的频繁模式完全集非常困难而且数量往往大得惊人,难以理解和应用。最大频繁模式(最大模式)压缩隐含了所有的频繁模式,存储所占用的空间远远小于完全集,因而最大模式挖掘具有十分重要的意义。该文改进了传统的FP-树结构并提出了一种有效的基于改进FP-树的最大模式挖掘算法IFP-M ax;通过引入后缀子树的概念,算法在挖掘过程中不用生成最大频繁模式候选集,从而大大提高了算法的时间效率和空间可伸缩性。实验表明,IFP-M ax的挖掘速度比M AFIA和GenM ax大约快一个数量级。

关 键 词:数据挖掘  关联规则  最大频繁模式  改进FP-树
文章编号:1002-8331-(2005)14-0179-03

Mining Maximal Frequent Patterns Based on Improved FP-tree
Meng Xiangping,Wang Huajin,Wang Xianyong,Ren Jichuan,Ju Chuanxiang.Mining Maximal Frequent Patterns Based on Improved FP-tree[J].Computer Engineering and Applications,2005,41(14):179-181,228.
Authors:Meng Xiangping  Wang Huajin  Wang Xianyong  Ren Jichuan  Ju Chuanxiang
Affiliation:Meng Xiangping2 Wang Huajin1 Wang Xianyong1 Ren Jichuan1 Ju Chuanxiang11
Abstract:Mining frequent patterns is an important part of data mining,however,because of the inherent complexity,mining complete set of frequent patterns from dense database could be impractical,and its quantity is usually very large,it is hard to understand and make use of them.Maximal frequent patterns contain and compress all frequent patterns,and its memory for saving is much smaller than that of complete patterns,thus it is greatly valuable to mine maximal frequent patterns.In this paper,the structure of a traditional FP-tree is improved and an efficient algorithm for mining maximal frequent patterns based on improved FP-tree,called IFP-Max,is proposed;By introducing the concept of postfix sub-tree,this algorithm needn't generate the candidate of maximal frequent patterns in mining process,therefore greatly improves the mining efficiency in time and space scalability.Experimental results show that IFP-Max is about an order of magnitude faster than MAFIA and GenMax.
Keywords:data mining  association rule  maximal frequent pattern  improved FP-tree  
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号