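Mining frequent patterns in data stream with transaction-sensitive sliding window
Cite this article: HU Yu, WANG Shun-ping. Mining frequent patterns in data stream with transaction-sensitive sliding window[J]. Computer Engineering and Applications, 2010, 46(22): 175-177. DOI: 10.3778/j.issn.1002-8331.2010.22.052
Authors: HU Yu  WANG Shun-ping
Affiliation: College of Computer Engineering and Software, Taiyuan University of Technology, Taiyuan 030024
Abstract: Mining frequent patterns over a sliding window is an important problem in data stream mining and has been widely studied and applied in recent years. Most existing algorithms process every data item in the stream, whereas in practice users often care about only certain aspects of the data. Drawing on the MFI-TransSW algorithm, this paper proposes BSW-Filter (Bit Sliding Window with Filter), an algorithm based on a transaction-sensitive sliding window. The algorithm uses bit sequences to implement the sliding-window operations and adds a filtering step for frequent items, which reduces the number of data items that must be stored and thereby lowers memory usage and speeds up processing. Its space complexity is independent of the sliding-window size and of the value range of the data stream, which makes it particularly suitable for mining data with long periods and a wide range of values. Analysis and experiments verify the feasibility and effectiveness of the algorithm.

Keywords: data stream  data mining  sliding window  frequent pattern
Received: 2009-01-13
Revised: 2009-03-26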

Mining frequent patterns in data stream with transaction-sensitive sliding window
HU Yu, WANG Shun-ping. Mining frequent patterns in data stream with transaction-sensitive sliding window[J]. Computer Engineering and Applications, 2010, 46(22): 175-177. DOI: 10.3778/j.issn.1002-8331.2010.22.052
Authors:HU Yu  WANG Shun-ping
Affiliation:College of Computer Engineering and Software, Taiyuan University of Technology, Taiyuan 030024, China
Abstract:As one of the most important problems in data stream mining, frequent pattern mining over a sliding window has been widely studied and applied in many fields. Existing algorithms need to process all elements in the data stream, whereas users often focus on only a few aspects of the data. Inspired by the MFI-TransSW algorithm, this paper proposes a new algorithm based on a transaction-sensitive sliding window, in which a sequence of bits is used to implement the sliding window operation. In addition, a mechanism for filtering frequent items reduces the number of items retained in memory, which decreases memory usage and improves processing efficiency. Furthermore, because the space complexity is independent of the size of the sliding window and of the value range of the elements, the method is especially applicable to mining data with a wide range of values over a long period. Analysis and experiments show the feasibility and effectiveness of the algorithm.
Keywords:data streams  data mining  sliding window  frequent pattern
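
The abstract names two ingredients: a bit sequence that implements the sliding-window operation, and a filter that limits which items are kept in memory. The Python sketch below illustrates one way these two ideas can fit together. It is not the authors' BSW-Filter implementation: the class name `BitSlidingWindow`, the `items_of_interest` filter (assumed here to be a user-supplied set of items), and the brute-force itemset enumeration are all assumptions made for illustration.

```python
from itertools import combinations


class BitSlidingWindow:
    """Illustrative transaction-sensitive sliding window, one bit sequence per item."""

    def __init__(self, window_size, items_of_interest):
        self.w = window_size
        self.mask = (1 << window_size) - 1  # keeps only the newest w bits
        self.keep = set(items_of_interest)  # assumed filter: items the user cares about
        self.bits = {}                      # item -> int used as a bit sequence
        self.count = 0                      # transactions currently in the window

    def add(self, transaction):
        # Slide: shift every sequence left by one; the mask drops the oldest bit.
        for item in list(self.bits):
            shifted = (self.bits[item] << 1) & self.mask
            if shifted:
                self.bits[item] = shifted
            else:
                del self.bits[item]         # item no longer occurs in the window
        # Append: set the newest bit, but only for items that pass the filter.
        for item in self.keep.intersection(transaction):
            self.bits[item] = self.bits.get(item, 0) | 1
        self.count = min(self.count + 1, self.w)

    def support(self, itemset):
        # AND the bit sequences; each remaining 1 bit is a transaction containing all items.
        acc = self.mask
        for item in itemset:
            acc &= self.bits.get(item, 0)
        return bin(acc).count("1")

    def frequent_itemsets(self, min_support):
        # Level-wise brute force over frequent single items; stops at the first
        # level with no frequent itemset, since no larger itemset can be frequent.
        threshold = min_support * self.count
        items = sorted(i for i in self.bits if self.support([i]) >= threshold)
        result = {}
        for k in range(1, len(items) + 1):
            found = False
            for combo in combinations(items, k):
                s = self.support(combo)
                if s >= threshold:
                    result[combo] = s
                    found = True
            if not found:
                break
        return result


window = BitSlidingWindow(window_size=3, items_of_interest={"a", "b", "c"})
for t in (["a", "b", "x"], ["a", "c"], ["a", "b"], ["b", "c"]):
    window.add(t)
print(window.frequent_itemsets(min_support=0.5))
```

In this sketch the item `"x"` is never stored because it is not in the filter set, and the first transaction leaves the window once the fourth arrives, so supports are counted over the last three transactions only.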
This article is indexed in VIP (维普), Wanfang Data (万方数据), and other databases.