首页 | 本学科首页   官方微博 | 高级检索  
     

数据流中频繁闭项集的近似挖掘算法
引用本文:刘旭,毛国君,孙岳,刘椿年.数据流中频繁闭项集的近似挖掘算法[J].电子学报,2007,35(5):900-905.
作者姓名:刘旭  毛国君  孙岳  刘椿年
作者单位:北京工业大学计算机学院,北京市多媒体与智能软件重点实验室,北京 100022
摘    要:在数据流中挖掘频繁项集得到了广泛的研究,传统的研究方法大多关注于在数据流中挖掘全部频繁项集.由于挖掘全部频繁项集存在数据和模式冗余问题,所以对算法的时间和空间效率都具有更大的挑战性.因此,近年来人们开始关注在数据流中挖掘频繁闭项集,其中一个典型的工作就是Moment算法.本文提出了一种数据流中频繁闭项集的近似挖掘算法A-Moment.它采用衰减窗口机制、近似计数估计方法和分布式更新信息策略来解决Moment算法中过度依赖于窗口和执行效率低等问题.实验表明,该算法在保证挖掘精度的前提下,可以比Moment获得更好的效率.

关 键 词:数据挖掘  数据流  频繁闭项集  
文章编号:0372-2112(2007)05-0900-06
收稿时间:2006-03-27
修稿时间:2006-03-272006-12-04

An Algorithm to Approximately Mine Frequent Closed Itemsets from Data Streams
LIU Xu,MAO Guo-Jun,SUN Yue,LIU Chun-Nian.An Algorithm to Approximately Mine Frequent Closed Itemsets from Data Streams[J].Acta Electronica Sinica,2007,35(5):900-905.
Authors:LIU Xu  MAO Guo-Jun  SUN Yue  LIU Chun-Nian
Affiliation:Beijing Municipal Key Laboratory of Multimedia and Intelligent Software Technology, School of Computer Science,Beijing University of Technology,Beijing 100022,China
Abstract:Mining frequent itemsets from data streams has extensively been studied, and most of them focus on finding complete set of frequent itemsets in a data stream. Because of numerous redundant data and patterns in main memory, they cannot get very good performance in time and space. Therefore,mining frequent closed itemsets in data streams becomes a new important problem in recent years, where algorithm Moment was regarded as a typical method of them. This paper presents an algorithm, called AMoment, which uses the damped window technique, approximate count method and distributed updating strategy to get higher mining efficiency. Experimental results show that our algorithm performs much better than the previous approaches.
Keywords:data mining  data stream  frequent closed itemset
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《电子学报》浏览原始摘要信息
点击此处可从《电子学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号