首页 | 本学科首页   官方微博 | 高级检索  
     

基于位编码链表的快速频繁模式挖掘算法研究
引用本文:顾军华,苏鸣,张亚娟,张丹红.基于位编码链表的快速频繁模式挖掘算法研究[J].计算机工程与应用,2020,56(19):86-93.
作者姓名:顾军华  苏鸣  张亚娟  张丹红
作者单位:1.河北工业大学 人工智能与数据科学学院,天津 300401 2.河北省大数据计算重点实验室,天津 300401
基金项目:天津市自然科学基金重点项目
摘    要:多数基于FP-growth思想的频繁模式挖掘算法存在建树过程复杂、支持度计算繁琐的问题。针对这些问题,提出一种基于位编码链表(Bitmap-Code List,BC-List)的频繁项集挖掘算法(BC-List Frequent Itemsets Mining,BCLFIM)。该算法首先采用基于位图表示的节点编码模型生成位图树(BC-tree),以BC-tree的节点信息作为数据结构通过按位运算来快速获取BC-List的节点集,避免了复杂的交集运算,提高了连接效率;其次通过使用超集等价和支持度计数剪枝策略,缩小了挖掘频繁模式的搜索空间。实验结果证明,该算法相比于FIN算法和DFIN算法具有更快的挖掘速度。

关 键 词:频繁项集挖掘  关联规则  剪枝策略  位图编码  

Research on Fast Frequent Pattern Mining Algorithm Based on Bitmap-Code List
GU Junhua,SU Ming,ZHANG Yajuan,ZHANG Danhong.Research on Fast Frequent Pattern Mining Algorithm Based on Bitmap-Code List[J].Computer Engineering and Applications,2020,56(19):86-93.
Authors:GU Junhua  SU Ming  ZHANG Yajuan  ZHANG Danhong
Affiliation:1.School of Artificial Intelligence and Data Science, Hebei University of Technology, Tianjin 300401, China 2.Hebei Province Key Laboratory of Big Data Computing, Tianjin 300401, China
Abstract:Most of the frequent pattern mining algorithms based on the FP-growth idea have the disadvantages of complex construction rules and cumbersome support calculations. This paper proposes a Frequent Item set Mining algorithm(BCLFIM) based on Bitmap-Code List(BC-List) to improve this problem. Firstly, in this algorithm, a node coding model based on bitmap  representation is adopted to generate BC-tree, and the node information of BC-tree is used as the data structure to quickly obtain the node set of BC-List by bitwise operation, which can reduce complicated intersection operation and improve connection efficiency. Secondly, the search space for mining frequent patterns is reduced by using the superset equivalence and support count prune strategy. Experimental show that the algorithm has faster mining speed than FIN and DFIN algorithms.
Keywords:frequent item mining  association rules  pruning strategy  bitmap encoding  
本文献已被 万方数据 等数据库收录!
点击此处可从《计算机工程与应用》浏览原始摘要信息
点击此处可从《计算机工程与应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号