
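Algorithm for mining maximum frequent itemsets based on FP-tree
Cite this article: LIU Nai-li, LI Yu-chen, MA Lei. Algorithm for mining maximum frequent itemsets based on FP-tree[J]. Journal of Computer Applications, 2005, 25(5): 998-1000.
Authors: LIU Nai-li  LI Yu-chen  MA Lei
Affiliation: School of Computer Science and Technology, Shandong University, Jinan, Shandong 250061
Abstract: Mining association rules is an important research topic in data mining, and mining maximum frequent itemsets is one of its key problems. Many earlier algorithms for mining maximum frequent itemsets first generate candidates and then test them, but generating candidate itemsets is very costly, especially when many long patterns exist. This paper improves the FP-tree structure and proposes DMFIA-1, a fast FP-tree-based algorithm for mining maximum frequent itemsets. The algorithm does not need to generate maximum frequent candidate itemsets, and it mines maximum frequent itemsets more efficiently than the DMFIA algorithm. The improved FP-tree is one-way: each node keeps only a pointer to its parent node, which saves roughly one third of the tree's space.

Keywords: data mining  maximum frequent itemset  association rule  frequent pattern tree
Article ID: 1001-9081(2005)05-0998-03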

Algorithm for mining maximum frequent itemsets based on FP-tree
LIU Nai-li, LI Yu-chen, MA Lei. Algorithm for mining maximum frequent itemsets based on FP-tree[J]. Journal of Computer Applications, 2005, 25(5): 998-1000.
Authors: LIU Nai-li  LI Yu-chen  MA Lei
Abstract: Mining association rules is an important topic in data mining, and mining maximum frequent itemsets is one of its key problems. Many earlier algorithms mine maximum frequent itemsets by first generating candidate itemsets and then testing them. However, generating candidate itemsets is very costly, especially when many long patterns exist. This paper improves the FP-tree structure and proposes DMFIA-1, a fast FP-tree-based algorithm for mining maximum frequent itemsets that does not generate maximum frequent candidate itemsets and is more efficient than DMFIA. The new FP-tree is a one-way tree in which each node keeps only a pointer to its parent and no pointers to its children, which saves about one third of the tree's memory.
Keywords: data mining  maximum frequent itemset  association rule  FP-tree
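
The one-way tree described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' DMFIA-1 implementation, and the abstract does not say how the tree is built without child pointers. The sketch assumes one possible approach: transactions are sorted so that shared prefixes are adjacent, and each new transaction is merged against the previously inserted path. The names `Node`, `build_one_way_fp_tree`, and `prefix_paths` are hypothetical.

```python
from collections import Counter, defaultdict


class Node:
    """FP-tree node that stores only a parent pointer (no child links)."""
    __slots__ = ("item", "count", "parent")

    def __init__(self, item, parent, count=0):
        self.item = item
        self.count = count
        self.parent = parent


def build_one_way_fp_tree(transactions, min_support):
    """Build a parent-pointer-only FP-tree; returns (root, header).

    Assumption: sorted insertion replaces the usual child lookup.
    """
    freq = Counter(item for t in transactions for item in set(t))
    # Rank frequent items by descending support (ties broken by item name).
    ranked = sorted((i for i in freq if freq[i] >= min_support),
                    key=lambda i: (-freq[i], i))
    rank = {item: r for r, item in enumerate(ranked)}

    # Encode each transaction as an ascending list of ranks, then sort the
    # transactions so that those sharing a prefix end up next to each other.
    encoded = sorted(sorted(rank[i] for i in set(t) if i in rank)
                     for t in transactions)

    root = Node(None, None)
    header = defaultdict(list)  # item -> all nodes carrying that item
    path = []                   # nodes of the most recently inserted path
    for ranks in encoded:
        # Length of the prefix shared with the previously inserted path.
        k = 0
        while (k < len(path) and k < len(ranks)
               and path[k].item == ranked[ranks[k]]):
            k += 1
        del path[k:]
        for node in path:       # shared prefix: bump existing counts
            node.count += 1
        for r in ranks[k:]:     # remainder: create new nodes
            parent = path[-1] if path else root
            node = Node(ranked[r], parent, count=1)
            header[node.item].append(node)
            path.append(node)
    return root, header


def prefix_paths(header, item):
    """Conditional pattern base of `item`, found by walking parent pointers."""
    paths = []
    for node in header.get(item, []):
        prefix = []
        cur = node.parent
        while cur is not None and cur.item is not None:
            prefix.append(cur.item)
            cur = cur.parent
        paths.append((prefix[::-1], node.count))
    return paths
```

Because every traversal starts from the header lists and moves upward, no child links are needed once the tree is built. This is the property the abstract credits for the space saving.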
This article is indexed in CNKI, VIP, Wanfang Data, and other databases.