首页 | 本学科首页   官方微博 | 高级检索  
     

一种基于位运算的频繁闭项集挖掘算法
引用本文:徐嘉莉,杨洪军,赵茂娟,樊 云.一种基于位运算的频繁闭项集挖掘算法[J].计算机应用研究,2013,30(11):3280-3282.
作者姓名:徐嘉莉  杨洪军  赵茂娟  樊 云
作者单位:成都大学 电子信息工程学院, 成都 610106
基金项目:四川省科技厅资助项目(2011JY0141); 四川省教育厅资助项目(12ZB171)
摘    要:针对相关算法在挖掘频繁闭项集时所存在的问题, 提出了一种基于位运算的频繁闭项集挖掘算法。该算法首先将数据集转换成布尔矩阵, 只需扫描数据集一次; 通过位运算计算支持度, 利用矩阵和数组存储辅助信息, 减少时间和空间消耗; 深度优先搜索产生频繁闭项集时利用剪枝策略进一步减少挖掘时间; 利用同生项集性质进行闭合性检测, 无须检查超集或子集。理论分析和实验结果验证了该算法的有效性。

关 键 词:数据挖掘  频繁闭项集  矩阵  位运算  同生项集

Algorithm based on bit operation for mining frequent closed itemsets
XU Jia-li,YANG Hong-jun,ZHAO Mao-juan,FAN Yun.Algorithm based on bit operation for mining frequent closed itemsets[J].Application Research of Computers,2013,30(11):3280-3282.
Authors:XU Jia-li  YANG Hong-jun  ZHAO Mao-juan  FAN Yun
Affiliation:School of Electronic & Information Engineering, Chengdu University, Chengdu 610106, China
Abstract:Aiming at the problems of mining frequent closed itemsets, this paper proposed an algorithm based on bit operation for mining frequent closed itemsets (MFCIS). Firstly, the algorithm used the vector to express items in database and scaned the database for only one time. Secondly it computed the support of itemsets through the bit operation and used the matrice and the array to store the ancillary information to reduce the time and memeory, and used pruning technology to improve the mining efficiency during creating the frequent closed itemsets by depth-first search. Finally, it used the nature of syngenetic itemsets to test frequent closed itemsets so as not to test superset or subset. Theoretical analysis and experimental results show that the algorithm is efficient.
Keywords:data mining  frequent closed itemsets  matrix  bit operation  syngenetic itemsets
点击此处可从《计算机应用研究》浏览原始摘要信息
点击此处可从《计算机应用研究》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号