首页 | 官方网站   微博 | 高级检索  
     

数据流频繁模式挖掘研究进展
引用本文:潘云鹤,王金龙,徐从富.数据流频繁模式挖掘研究进展[J].自动化学报,2006,32(4):594-602.
作者姓名:潘云鹤  王金龙  徐从富
作者单位:1.浙江大学人工智能研究所,杭州,310027
基金项目:国家自然科学基金;中国科学院资助项目
摘    要:现实世界和工程实践产生了大量的数据流,这种数据不同于传统的静态数据,对其进行有效处理和挖掘遇到了极大的挑战.如何使用有限存储空间进行快速和近似的频繁模式挖掘是数据流挖掘的基本问题,具有非常重要的研究价值和实践意义,已经引起了国内外研究者的广泛关注.本文深入分析数据流中的频繁模式挖掘,对其特点和算法进行较为全面的总结和分类论述,并讨论了存在的主要问题和未来的研究方向.

关 键 词:数据挖掘    数据流    频繁模式    近似算法
收稿时间:2004-09-01
修稿时间:2006-03-03

State-of-the-art on Frequent Pattern Mining in Data Streams
PAN Yun-He,WANG Jin-Long,XU Cong-Fu.State-of-the-art on Frequent Pattern Mining in Data Streams[J].Acta Automatica Sinica,2006,32(4):594-602.
Authors:PAN Yun-He  WANG Jin-Long  XU Cong-Fu
Affiliation:1.Institute of Artificial Intelligence, Zhejiang University, Hangzhou 310027
Abstract:Real-world applications often generate huge amount of data streams, which chal- lenges efficient processing and mining due to its special characteristics. As a fundamental problem in data stream mining, frequent pattern mining techniques employed in these applications should be efficient in terms of space usage and execution time while providing a high quality of yields. This has received considerable attention in the past few years due to its research value and increasing amount of importance in numerous applications. The purpose of this paper is to review the recent work in frequent pattern mining under data stream environments, and summarize its characteristics and algorithms in general. With taxonomy, we dissertate the existing algorithms from probabilistic and deterministic bounds on error, respectively. Throughout the detailed review, some comparisons and evaluations are performed. Finally, future directions in data stream mining research are discussed.
Keywords:Data mining  data streams  frequent pattern  approximate algorithm
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《自动化学报》浏览原始摘要信息
点击此处可从《自动化学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号