FP-growth算法的实现方法研究 Research on Implementation of the FP-growth Algorithm期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

FP-growth算法的实现方法研究

引用本文：	王新宇,杜孝平,谢昆青. FP-growth算法的实现方法研究[J]. 计算机工程与应用, 2004, 40(9): 174-176

作者姓名：	王新宇杜孝平谢昆青

作者单位：	北京大学信息科学技术学院,北京,100871;北京航空航天大学软件学院,北京,100083

基金项目：	国家973重点基础研究发展规划项目(编号:G1999032705)，留学回国人员科研启动基金资助

摘要：	事务数据库中频繁模式的挖掘研究作为关联规则等许多数据挖掘问题的核心工作,已经研究了许多年。早期算法大都是Apriori型算法,即首先产生候选集,然后在候选集的基础上找出频繁模式,候选集的产生往往是耗时的,特别是挖掘富模式或长模式时。JianweiHan等人提出了一种新颖的数据结构FP-tree及基于其上的FP-growth算法,用于有效的富模式与长模式挖掘。由于不同的实现方法可能会导致不同的挖掘效率,该文在讨论FP-growth算法的基础上,采用了几种不同的方法来实现它,并用几个数据库对它们的性能进行了比较。
关键词：	频繁模式关联规则数据挖掘算法
文章编号：	1002-8331-(2004)09-0174-03
Research on Implementation of the FP-growth Algorithm

Wang Xinyu Du Xiaoping Xie Kunqing. Research on Implementation of the FP-growth Algorithm[J]. Computer Engineering and Applications, 2004, 40(9): 174-176

Authors:	Wang Xinyu Du Xiaoping Xie Kunqing

Affiliation:	Wang Xinyu 1 Du Xiaoping 2 Xie Kunqing 11

Abstract:	Mining frequent patterns in transaction databases,as an essential role in many data mining tasks such as the association rule mining,has been widely studied for many years.Most of the previous studies adopt an Apriori-like candidate set generation-and-test approach.However,candidate set generation is costly if there exist prolific patterns or long patterns.Jianwei Han et al propose a frequent pattern tree structure and a FP-growth algorithm based on this structure that can mine the frequent patterns by pattern fragment growth.Due to different methods will result in different performance,in this paper several methods to implement the FP-growth algorithm are discussed.The performance is studied,analyzed and compared on several canonical datasets.

Keywords:	Frequent Pattern Association Rule Data Mining Algorithm
本文献已被 CNKI 维普万方数据等数据库收录！

设为首页 | 免责声明 | 关于勤云 | 加入收藏