首页 | 本学科首页   官方微博 | 高级检索  
     

一种基于前缀树的频繁模式挖掘算法
引用本文:朱光喜,吴伟民,阮幼林,刘干.一种基于前缀树的频繁模式挖掘算法[J].计算机科学,2005,32(4):34-36.
作者姓名:朱光喜  吴伟民  阮幼林  刘干
作者单位:华中科技大学计算机科学与技术学院,武汉,430074
基金项目:国家自然科学基金(603905405),国家自然科学基金(60273075),国家863计划课题(2001AA123014)
摘    要:挖掘频繁模式是许多数据挖掘任务的关键步骤。基于FP-Tree的挖掘算法由于无须生成候进项集效率明显高于Apriori类算法,但FP-Tree结构存在动态维护复杂、而且在挖掘过程中需要递归地创建大量的条件FP-Tree,时空效率不高。因此,本文提出一种基于前缀树的新算法。该算法通过引入一种新结构—前缀树(Prefix Tree)用来压缩存放数据所相关信息,并通过调整前缀树中节点信息和节点键直接在Prefix Tree上采用深度优先的策略挖掘频繁模式,而不需要任何附加的数据结构,从而大大提高了挖掘效率。

关 键 词:频繁模式  频繁项集  FP-Tree  前级树

A Mining Algorithm for Frequent Patterns Based on Prefix Tree
ZHU Guang-Xi,Wu Wei-Min,RUAN You-lin,LIU Gan.A Mining Algorithm for Frequent Patterns Based on Prefix Tree[J].Computer Science,2005,32(4):34-36.
Authors:ZHU Guang-Xi  Wu Wei-Min  RUAN You-lin  LIU Gan
Affiliation:ZHU Guang-Xi,Wu Wei-Min,RUAN You-lin,LIU Gan College of Computer Science and Technology,Huazhong University of Science and Technolgoy,Wuhan 430074
Abstract:Mining frequent patterns is a key problem in data mining research. Although mining based on FP-Tre achieves better performance and efficiency than Apriori-like algorithms because of avoiding costly candidate genera tion, it still suffers from creating conditional FP-Tree separately and recursively during the mining process. In this pa per, we propose a new method PTM that designs a new structure called Prefix Tree, which stores all of the informa tion in a highly compact form. PTM mines frequent patterns in depth-first order and directly in Prefix Tree by adjust ing node information and node links without using any additional data structures. Thus, it can improve performanc greatly.
Keywords:Frequent pattern  Frequent itemsets  FP-tree  Prefix tree  
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《计算机科学》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号