首页 | 本学科首页   官方微博 | 高级检索  
     

一种无阈值的频繁模式生成算法
引用本文:神鹏飞,王希武,耿志广,王创伟,李国良.一种无阈值的频繁模式生成算法[J].计算机工程,2011,37(8):31-33.
作者姓名:神鹏飞  王希武  耿志广  王创伟  李国良
作者单位:军械工程学院计算机工程系,石家庄,050003
摘    要:在数据挖掘的关联规则挖掘算法中,传统的频繁模式挖掘算法需要用户指定项集的最小支持度。引入Top-k模式挖掘概念的改进算法虽然无需指定最小支持度,但仍需指定阈值k。针对上述问题,对传统挖掘算法进行改进,提出一种新的频繁模式挖掘算法(TNFP- growth)。该算法无需指定最小支持度或阈值,按照支持度降序排列进行模式挖掘,有序地返回频繁模式给用户。实验结果证明,该算法的执行效率更高,具有更强的伸缩性。

关 键 词:数据挖掘  关联规则  频繁项集  频繁模式  Top-k模式

Threshold Needless Frequent Pattern Generating Algorithm
SHEN Peng-fei,WANG Xi-wu,GENG Zhi-guang,WANG Chuang-wei,LI Guo-liang.Threshold Needless Frequent Pattern Generating Algorithm[J].Computer Engineering,2011,37(8):31-33.
Authors:SHEN Peng-fei  WANG Xi-wu  GENG Zhi-guang  WANG Chuang-wei  LI Guo-liang
Affiliation:(Department of Computer Engineering,Ordnance Engineering College,Shijiazhuang 050003,China)
Abstract:In association rule mining algorithm of data mining, conventional frequent pattern mining algorithms need the user to specify minimum support of itemsets. By using the Top-k pattern mining concept, some algorithms are improved tn need no minimum support. However, threshold k need be specified. Based on all above, a new algorithm is proposed, which is called TNFP growth. The algorithm need not specify minimum support or thresholds, and mines patterns with the descending order of their support values and returns frequent patterns to users sequentially. Experimental result proves that it has a high executing efficiency and good scalability.
Keywords:data mining  association rule  frequent itemsets  frequent pattern  Top-k pattern
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《计算机工程》浏览原始摘要信息
点击此处可从《计算机工程》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号