
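Association rule algorithm based on a chained storage method for frequent itemsets
Cite as:YIN Shi-shan, MA Zeng-qiang, MAO Wan-dui. Association rule algorithm based on a chained storage method for frequent itemsets[J]. Computer Engineering and Design, 2012, 33(3): 1002-1007.
Authors:YIN Shi-shan  MA Zeng-qiang  MAO Wan-dui
Affiliation:1. Dean's Office, Shijiazhuang Tiedao University, Shijiazhuang, Hebei 050043
2. College of Electrical and Electronics Engineering, Shijiazhuang Tiedao University, Shijiazhuang, Hebei 050043
3. Engineering Training Center, Shijiazhuang Tiedao University, Shijiazhuang, Hebei 050043
Abstract:To improve the mining efficiency of the classic Apriori association rule algorithm and address its bottleneck, an association rule algorithm is proposed that stores frequent itemsets in a chained structure and generates maximum frequent itemsets. The algorithm stores transactions as bit vectors. As each frequent itemset is generated, the transactions that contain it are linked to it as a list, and the maximum frequent itemsets are then generated. The algorithm reduces the number of scans of the transaction database and the number of candidate itemsets generated, and therefore the time needed to generate the maximum frequent itemsets. Experimental results show that the algorithm improves computational efficiency.

Keywords:data mining  Apriori algorithm  candidate itemsets  frequent itemsets  association rule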

Association rule algorithm based on chain storage method of frequent itemsets
YIN Shi-shan, MA Zeng-qiang, MAO Wan-dui. Association rule algorithm based on chain storage method of frequent itemsets[J]. Computer Engineering and Design, 2012, 33(3): 1002-1007.
Authors:YIN Shi-shan  MA Zeng-qiang  MAO Wan-dui
Affiliation:1. Dean's Office, Shijiazhuang Tiedao University, Shijiazhuang 050043, China; 2. College of Electrical and Electronics Engineering, Shijiazhuang Tiedao University, Shijiazhuang 050043, China; 3. Engineering Training Center, Shijiazhuang Tiedao University, Shijiazhuang 050043, China
Abstract:To improve the mining efficiency of the Apriori algorithm and address the bottleneck of the classic algorithm, a mining algorithm is presented that uses a chain structure to store frequent itemsets and generate maximum frequent itemsets. In this algorithm, transactions are stored as bit vectors. When a frequent itemset is generated, the transactions that contain it are attached to it as a linked list, and the maximum frequent itemsets are generated at the end. The algorithm reduces the number of scans of the transaction database and the number of candidate itemsets generated, and thus the time needed to generate the maximum frequent itemsets. Experimental results show that the algorithm improves computational efficiency.
Keywords:data mining  Apriori algorithm  candidate itemsets  frequent itemsets  association rule
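
The idea outlined in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation, since the abstract gives no procedural details. Several points are assumptions made for illustration: the function names `encode` and `maximal_frequent_itemsets`, the use of an absolute count for `min_support`, a plain Python list of transaction ids standing in for the linked list, and the reading of "maximum frequent itemsets" as frequent itemsets with no frequent proper superset. The sketch scans the encoded database once, then obtains the support of every larger candidate by intersecting the transaction lists attached to two smaller frequent itemsets.

```python
from itertools import combinations


def encode(transactions, items):
    """Encode each transaction as an int bit vector; bit i is set if items[i] occurs."""
    index = {item: i for i, item in enumerate(items)}
    return [sum(1 << index[x] for x in set(t)) for t in transactions]


def maximal_frequent_itemsets(transactions, min_support):
    """Return frequent itemsets that have no frequent proper superset.

    min_support is an absolute transaction count (an assumption of this sketch).
    """
    items = sorted({x for t in transactions for x in t})
    vectors = encode(transactions, items)

    # Single pass over the database: attach to each frequent item
    # the ids of the transactions that contain it.
    level = {}
    for i in range(len(items)):
        tids = [tid for tid, v in enumerate(vectors) if (v >> i) & 1]
        if len(tids) >= min_support:
            level[(i,)] = tids

    frequent = {}
    while level:
        frequent.update(level)
        next_level = {}
        for a, b in combinations(sorted(level), 2):
            # Join two k-itemsets that share their first k-1 items.
            if a[:-1] != b[:-1]:
                continue
            # Support comes from intersecting the attached transaction
            # lists, so the database is not rescanned.
            tids_b = set(level[b])
            tids = [t for t in level[a] if t in tids_b]
            if len(tids) >= min_support:
                next_level[a + (b[-1],)] = tids
        level = next_level

    # Keep only itemsets with no frequent proper superset.
    sets = [frozenset(k) for k in frequent]
    maximal = [s for s in sets if not any(s < other for other in sets)]
    return sorted(sorted(items[i] for i in s) for s in maximal)
```

For example, with the five transactions `abc`, `ab`, `ac`, `bc`, `abc` and `min_support=2`, the only itemset returned is `["a", "b", "c"]`. Raising `min_support` to 3 returns the three pairs instead. Because only itemsets that are already frequent are joined, infrequent candidates are never carried into the next level.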
This article is indexed in CNKI, Wanfang Data, and other databases.