首页 | 本学科首页   官方微博 | 高级检索  
     

基于FP树的全局最大频繁项集挖掘算法
引用本文:王黎明,赵辉.基于FP树的全局最大频繁项集挖掘算法[J].计算机研究与发展,2007,44(3):445-451.
作者姓名:王黎明  赵辉
作者单位:郑州大学信息工程学院,郑州,450052
摘    要:挖掘最大频繁项集是多种数据挖掘应用了更新最大频繁候选项集集合,需要反复地扫描整个数据库,而且大部分算法是单机算法,全局最大频繁项集挖掘算法并不多见.为此提出MGMF算法,该算法利用FP-树结构,类似FP-树挖掘方法,一遍就可以挖掘出所有的最大频繁项集,并且超集检测非常简单、快捷.另外MGMF算法采用了分布式PDDM算法播报消息的思想,具有很好的拓展性和并行性.实验证明MGMF算法是有效可行的.

关 键 词:数据挖掘  FP-树  分布式数据库  最大频繁项集  频繁模式树  频繁项集挖掘算法  Based  Frequent  Itemsets  Maximum  Global  Mining  验证  并行  拓展性  思想  播报  分布式  超集检测  挖掘方法  树结构  算法利用  机算法  数据库  扫描  候选项集
修稿时间:06 12 2006 12:00AM

Algorithms of Mining Global Maximum Frequent Itemsets Based on FP-Tree
Wang Liming,Zhao Hui.Algorithms of Mining Global Maximum Frequent Itemsets Based on FP-Tree[J].Journal of Computer Research and Development,2007,44(3):445-451.
Authors:Wang Liming  Zhao Hui
Affiliation:School of Information Engineering, Zhengzhou University, Zhengzhou 450052
Abstract:Mining maximum frequent itemsets is a key problem in data mining field with numerous important applications. The present algorithms need scanning the database many times for updating the set of maximum frequent itemsets and are based on local databases. The algorithms of mining global maximum frequent itemsets are very few. Therefore, an algorithm for mining global maximum frequent itemsets is proposed, which can conveniently get all global maximum frequent itemsets using FP-tree structure by one time mining, and superset checking is very simple and speedy. FP-tree structure has provided a kind of convenient depth-first mining method. The algorithm combines FP-tree with restrained sub-tree for mining global maximum frequent itemsets and adopts an efficient distributed PDDM algorithm for broadcasting itemsets information and improves the expansibility and the concurrence. The PDDM algorithm is based on previous DDM algorithm and improves I?O problem and communication of previous distributed algorithms. Experimental results testify the feasibility and effectiveness of the algorithm.
Keywords:data mining  FP-tree  distributed database  maximum frequent itemset  frequent pattern tree
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号