首页 | 本学科首页   官方微博 | 高级检索  
     

基于事务映射区间求交的高效频繁模式挖掘算法
引用本文:吴磊.基于事务映射区间求交的高效频繁模式挖掘算法[J].计算机应用研究,2019,36(4).
作者姓名:吴磊
作者单位:广东工业大学
基金项目:智能制造物联网的数据感知;传输和海量数据处理;国家基金广东省联合基金重点项目;(U1201251);面向船舶产品的智能制造集成平台研究及产业化;广东省省级科技计划项目;(2016B090918045)制造物联网协同感知的服务组合优化模型与寻优算法研究;国家自然科学基金青年科学基金项目(61502110)
摘    要:关联规则挖掘是数据挖掘重要研究课题,大数据处理对关联规则挖掘算法效率提出了更高要求,而关联规则挖掘的最耗时的步骤是频繁模式挖掘。针对当前频繁模式挖掘算法效率不高的问题,结合Apriori算法和FP-growth算法,提出一种基于事务映射区间求交的频繁模式挖掘算法IITM(interval interaction and transaction mapping),只需扫描数据集两次来生成FP树,然后扫描FP树将每个项的ID映射到区间中,通过区间求交来进行模式增长。该算法解决了Apriori算法需要多次扫描数据集,FP-growth算法需要迭代地生成条件FP树来进行模式增长而带来的效率下降的问题。在真实数据集上的实验显示,在不同的支持度下IITM算法都要要优于Apriori、FP-growth以及PIETM算法。

关 键 词:关键词:数据挖掘  频繁模式  事务映射  区间求交
收稿时间:2017/10/17 0:00:00
修稿时间:2019/2/26 0:00:00

Efficient frequent pattern mining algorithm based on interval interaction and transaction mapping
wu lei.Efficient frequent pattern mining algorithm based on interval interaction and transaction mapping[J].Application Research of Computers,2019,36(4).
Authors:wu lei
Affiliation:Guangdong University of Technology
Abstract:Association rules mining is an important research topic in data mining. Big data processing puts forward higher requirements for the efficiency of association rules mining algorithm, where the most time consuming step is frequent pattern mining. For the problem that the state of art frequent pattern mining algorithm is not efficient, a frequent pattern mining algorithm based on interval interaction and transaction mapping (IITM) is proposed, which combines Apriori algorithm and FP-growth algorithm. This algorithm just needs to scan the dataset twice to generate the FP tree, and then scan the FP tree to map the ID of each transaction to the interval. It growths the frequent pattern by interval interaction and solves the problem that the Apriori algorithm needs to scan the dataset multiple times, the FP-growth algorithm needs to iterate to generate the conditional FP tree, which reduce the efficiency of the frequent pattern mining. Experiments on real dataset show that the IITM algorithm is superior to Apriori, FP-growth, and PIETM algorithms at different support.
Keywords:data mining  frequent pattern  interval interaction  transaction mapping
点击此处可从《计算机应用研究》浏览原始摘要信息
点击此处可从《计算机应用研究》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号