首页 | 本学科首页   官方微博 | 高级检索  
     

界标窗口中数据流频繁模式挖掘算法研究
引用本文:张广路,雷景生,吴兴惠. 界标窗口中数据流频繁模式挖掘算法研究[J]. 计算机工程, 2012, 38(1): 55-58,61
作者姓名:张广路  雷景生  吴兴惠
作者单位:1. 海南师范大学数学与统计学院,海口,571158
2. 南京邮电大学计算机学院,南京,210046
基金项目:海南省自然科学基金资助项目(610221,109002,808155); 海南师范大学青年科研基金资助项目(QN0923)
摘    要:数据流的流量太大会无法被整个存储,或被多次扫描。为此,在研究已有挖掘算法的基础上,提出一种界标窗口中数据流频繁模式挖掘算法DSMFP_LW。利用扩展前缀模式树存储全局临界频繁模式,实现单遍扫描数据流和数据增量更新。实验结果表明,与Lossy Counting算法相比,DSMFP_LW算法具有更好的时空效率。

关 键 词:界标窗口  频繁模式  数据流  DSMFP_LW算法  滑动窗口
收稿时间:2011-06-28

Research on Data Stream Frequent Pattern Mining Algorithm in Landmark Window
ZHANG Guang-lu , LEI Jing-sheng , WU Xing-hui. Research on Data Stream Frequent Pattern Mining Algorithm in Landmark Window[J]. Computer Engineering, 2012, 38(1): 55-58,61
Authors:ZHANG Guang-lu    LEI Jing-sheng    WU Xing-hui
Affiliation:1.School of Mathematics and Statistics,Hainan Normal University,Haikou 571158,China;2.School of Computer Science & Technology,Nanjing University of Posts and Telecommunications,Nanjing 210046,China)
Abstract:For data traffic flow is too large to store the entire data stream or on its scan times and other issues,through the research of algorithms on mining frequent patterns that are proposed,this paper proposes an algorithm on mining frequent patterns over data stream based on Landmark window,named DSMFP_LW.DSMFP_LW has major features as follows: namely single streaming data scan for counting pattern's information,extended prefix-tree-based compact pattern representation,and incremental update of data.Experimental results show that DSMFP_LW algorithm has better utilization of time and space efficiency.In addition,it outperforms the well-known algorithm Lossy Counting in the same streaming environment.
Keywords:landmark window  frequent pattern  data stream  DSMFP_LW algorithm  sliding window
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《计算机工程》浏览原始摘要信息
点击此处可从《计算机工程》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号