首页 | 本学科首页   官方微博 | 高级检索  
     

基于Aproiri算法的频繁项集挖掘优化方法
引用本文:吴学雁,莫赞.基于Aproiri算法的频繁项集挖掘优化方法[J].计算机系统应用,2014,23(6):124-129.
作者姓名:吴学雁  莫赞
作者单位:广东工业大学 管理学院, 广州 510520;广东工业大学 管理学院, 广州 510520
基金项目:国家自然科学基金(71171062);教育部人文社科青年基金(13YJCZH200);广东工业大学高教研究基金(2012ZY26)
摘    要:为了进一步降低扫描数据库的次数和减轻内存负担,从而更好地提高挖掘频繁项集的效率,一种基于Apriori的优化算法(M-Apriori)被提出. 该方法通过构建频繁状态矩阵来存放项集的频繁状态,构建事务布尔矩阵来存放事务与项集的关系,此算法只需在初始化阶段扫描一次数据库产生初始的频繁状态矩阵和事务布尔矩阵,并在此基础上直接递推产生所有的频繁项集. 实验证明,与Apriori算法相比,M-Apriori算法具有更好的性能与效率.

关 键 词:频繁项集挖掘  M-Apriori算法  关联规则挖掘
收稿时间:2013/11/5 0:00:00
修稿时间:2013/12/13 0:00:00

Frequent Itemsets Mining Optimization Methods Based on Aproiri Algorithm
WU Xue-Yan and MO Zan.Frequent Itemsets Mining Optimization Methods Based on Aproiri Algorithm[J].Computer Systems& Applications,2014,23(6):124-129.
Authors:WU Xue-Yan and MO Zan
Affiliation:School of Management, Guangdong University of Technology, Guangzhou 510520, China;School of Management, Guangdong University of Technology, Guangzhou 510520, China
Abstract:To reduce the number of database scanning and reduce the burden of memory further, also to improve the efficiency of mining frequent itemsets better, an Apriori-based optimization algorithm (M-Apriori) is proposed. The method stores frequent itemsets state by constructing the frequent state matrix and store the relationship between the transaction and itemsets by constructing the Boolean matrix. The algorithm scans the database only once and generates the initial frequent state matrix and the Boolean matrix during the initialization phase. On this basis, all frequent itemsets can be found directly without scanning the database repeatedly. Experiments show that M-Apriori algorithm has better performance and efficiency compared with the Apriori algorithm.
Keywords:frequent itemsets mining  M-Apriori algorithm  association rule mining
本文献已被 维普 等数据库收录!
点击此处可从《计算机系统应用》浏览原始摘要信息
点击此处可从《计算机系统应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号