一种基于前缀树的频繁模式挖掘算法 A Mining Algorithm for Frequent Patterns Based on Prefix Tree期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

一种基于前缀树的频繁模式挖掘算法

引用本文：	朱光喜,吴伟民,阮幼林,刘干.一种基于前缀树的频繁模式挖掘算法[J].计算机科学,2005,32(4):34-36.

作者姓名：	朱光喜吴伟民阮幼林刘干

作者单位：	华中科技大学计算机科学与技术学院,武汉,430074

基金项目：	国家自然科学基金(603905405)，国家自然科学基金(60273075)，国家863计划课题(2001AA123014)

摘要：	挖掘频繁模式是许多数据挖掘任务的关键步骤。基于FP-Tree的挖掘算法由于无须生成候进项集效率明显高于Apriori类算法,但FP-Tree结构存在动态维护复杂、而且在挖掘过程中需要递归地创建大量的条件FP-Tree,时空效率不高。因此,本文提出一种基于前缀树的新算法。该算法通过引入一种新结构—前缀树(Prefix Tree)用来压缩存放数据所相关信息,并通过调整前缀树中节点信息和节点键直接在Prefix Tree上采用深度优先的策略挖掘频繁模式,而不需要任何附加的数据结构,从而大大提高了挖掘效率。
关键词：	频繁模式频繁项集 FP-Tree 前级树
A Mining Algorithm for Frequent Patterns Based on Prefix Tree

ZHU Guang-Xi,Wu Wei-Min,RUAN You-lin,LIU Gan.A Mining Algorithm for Frequent Patterns Based on Prefix Tree[J].Computer Science,2005,32(4):34-36.

Authors:	ZHU Guang-Xi Wu Wei-Min RUAN You-lin LIU Gan

Affiliation:	ZHU Guang-Xi,Wu Wei-Min,RUAN You-lin,LIU Gan College of Computer Science and Technology,Huazhong University of Science and Technolgoy,Wuhan 430074

Abstract:	Mining frequent patterns is a key problem in data mining research. Although mining based on FP-Tre achieves better performance and efficiency than Apriori-like algorithms because of avoiding costly candidate genera tion, it still suffers from creating conditional FP-Tree separately and recursively during the mining process. In this pa per, we propose a new method PTM that designs a new structure called Prefix Tree, which stores all of the informa tion in a highly compact form. PTM mines frequent patterns in depth-first order and directly in Prefix Tree by adjust ing node information and node links without using any additional data structures. Thus, it can improve performanc greatly.

Keywords:	Frequent pattern Frequent itemsets FP-tree Prefix tree
本文献已被 CNKI 维普万方数据等数据库收录！
	点击此处可从《计算机科学》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏