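Three Optimization Methods for the Apriori Algorithm
Cite this article: Xu Zhangyan, Liu Meiling, Zhang Shichao, Lu Jingli, Ou Yuming. Three Optimization Methods for the Apriori Algorithm [J]. Computer Engineering and Applications (计算机工程与应用), 2004, 40(36): 190-192, 202.
Authors: Xu Zhangyan (徐章艳), Liu Meiling (刘美玲), Zhang Shichao (张师超), Lu Jingli (卢景丽), Ou Yuming (区玉明)
Affiliation: Department of Computer Science, Guangxi Normal University, Guilin 541004
Abstract: An analysis of the ideas behind the Apriori algorithm and of its performance shows three shortcomings: (1) when the candidate (K+1)-itemsets are generated from the frequent K-itemsets, the strategy for filtering infrequent itemsets out of the candidates leaves room for improvement; (2) the join procedure compares the same items too many times, so its efficiency can be improved; (3) when the database is rescanned, many items or transactions that need no comparison are compared repeatedly. For each of these shortcomings a corresponding optimization strategy is proposed, and together the three strategies yield an improved, more efficient Apriori algorithm.

Keywords: association rule; Apriori algorithm; frequent itemset; infrequent itemset
Article ID: 1002-8331-(2004)36-0190-03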

Three Optimized Methods of Apriori Algorithm
Xu Zhangyan, Liu Meiling, Zhang Shichao, Lu Jingli, Ou Yuming. Three Optimized Methods of Apriori Algorithm [J]. Computer Engineering and Applications, 2004, 40(36): 190-192, 202.
Authors:Xu Zhangyan  Liu Meiling  Zhang Shichao  Lu Jingli  Ou Yuming
Abstract: After analyzing the principle and efficiency of the Apriori algorithm, the authors identify three shortcomings. First, when candidate (K+1)-itemsets are generated from the frequent K-itemsets, the strategy for deleting infrequent itemsets from the candidates can be improved. Second, the join procedure is inefficient because the same items are compared too many times. Third, when the database is rescanned, many items or transactions that need not be compared are compared repeatedly. Three corresponding optimization strategies are proposed to address these shortcomings, yielding an improved Apriori algorithm that is more efficient than the original.
Keywords:association rule  Apriori algorithm  frequent itemset  infrequent itemset
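
For orientation, the three steps the abstract criticizes can be located in a minimal sketch of the textbook (unoptimized) Apriori algorithm. This is not the authors' improved algorithm, whose details the abstract does not give. The function names (generate_candidates, count_support, apriori) and the use of an absolute support count as the threshold are assumptions made for illustration only.

```python
from itertools import combinations


def generate_candidates(frequent_k):
    """Textbook join + prune: build candidate (k+1)-itemsets from frequent k-itemsets.

    frequent_k is a set of frozensets that all have the same size k.
    Items are assumed to be mutually comparable (e.g. all strings).
    """
    sorted_sets = sorted(tuple(sorted(s)) for s in frequent_k)
    candidates = set()
    for i, a in enumerate(sorted_sets):
        for b in sorted_sets[i + 1:]:
            # Join step (shortcoming 2): prefixes are compared item by item
            # for every pair, so the same items get compared many times.
            if a[:-1] != b[:-1]:
                break  # sorted order: no later itemset shares this prefix
            cand = frozenset(a) | frozenset(b)
            # Prune step (shortcoming 1): every k-subset of the candidate
            # is looked up among the frequent k-itemsets.
            if all(frozenset(sub) in frequent_k
                   for sub in combinations(cand, len(a))):
                candidates.add(cand)
    return candidates


def count_support(candidates, transactions):
    """Database scan (shortcoming 3): every candidate is tested against
    every transaction, including transactions that cannot contain it."""
    counts = {c: 0 for c in candidates}
    for t in transactions:
        for c in candidates:
            if c <= t:  # subset test
                counts[c] += 1
    return counts


def apriori(transactions, min_support):
    """Return {itemset: support count} for all frequent itemsets.

    min_support is an absolute count (an assumption of this sketch).
    """
    transactions = [frozenset(t) for t in transactions]
    current = {frozenset([item]) for t in transactions for item in t}
    frequent = {}
    while current:
        counts = count_support(current, transactions)
        level = {c: n for c, n in counts.items() if n >= min_support}
        frequent.update(level)
        current = generate_candidates(set(level))
    return frequent


if __name__ == "__main__":
    # Hypothetical toy database, not data from the paper.
    db = [{"a", "b", "c"}, {"a", "b"}, {"a", "c"}, {"b", "c"}, {"a", "b", "c"}]
    for itemset, n in sorted(apriori(db, 3).items(),
                             key=lambda kv: (len(kv[0]), sorted(kv[0]))):
        print(sorted(itemset), n)
```

Each of the three comments marks the place where, according to the abstract, the original algorithm does redundant work; an optimized variant would reduce the subset lookups in the prune step, the repeated prefix comparisons in the join, and the candidate-by-transaction tests in the scan.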
This article is indexed in CNKI, VIP (维普), Wanfang Data (万方数据), and other databases.