面向数据流的频繁项集挖掘研究 Research on mining frequent itemsets in data streams期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

面向数据流的频繁项集挖掘研究

引用本文：	孟彩霞.面向数据流的频繁项集挖掘研究[J].计算机工程与应用,2010,46(24):138-140.

作者姓名：	孟彩霞

作者单位：	西安邮电学院计算机科学系，西安 710061

基金项目：	国家自然科学基金，陕西省自然科学基金，西安市科技创新支撑—应用发展研究计划项目

摘要：	针对数据流的特点，对数据流中频繁模式挖掘问题进行了研究，提出了数据流频繁项集挖掘算法FP-SegCount。该算法将数据流分段并利用改进的FP-growth算法挖掘分段中的频繁项集。然后，利用Count Min Sketch进行项集计数。算法解决了压缩统计和计算快速高效的问题。通过和FP-DS算法的实验对比，FP-SegCount算法具有较好的时间效率。
关键词：	数据流数据挖掘数据流挖掘频繁项集
收稿时间：	2009-2-12
修稿时间：	2010-2-5
Research on mining frequent itemsets in data streams

MENG Cai-xia.Research on mining frequent itemsets in data streams[J].Computer Engineering and Applications,2010,46(24):138-140.

Authors:	MENG Cai-xia

Affiliation:	Department of Computer Science，Xi’an University of Posts & Telecommunications，Xi’an 710065，China

Abstract:	According to the characteristic of data streams,the paper proposes FP-SegCount algorithm for mining frequent itemsets from data streams.The algorithm partitions the data stream and uses modified FP-growth algorithm to mining frequent itemsets in every segment.And then,it counts itemsets in Count Min Sketch.The algorithm solves the problem of compressed statistic and effective computation.Through experimentation and comparision with FP-DS algorithm,FP-SegCount algorithm has a good time efficiency.

Keywords:	data streams data mining data stream mining frequent itemsets
本文献已被维普万方数据等数据库收录！
	点击此处可从《计算机工程与应用》浏览原始摘要信息
	点击此处可从《计算机工程与应用》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏