数据流中一种基于滑动窗口的前K个频繁项集挖掘算法 Alogrithm for mining top-K frequent itemsets over sliding window in data streams期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

数据流中一种基于滑动窗口的前K个频繁项集挖掘算法

引用本文：	张文煜,周满元.数据流中一种基于滑动窗口的前K个频繁项集挖掘算法[J].计算机应用研究,2011,28(7).

作者姓名：	张文煜周满元

作者单位：	桂林电子科技大学计算机科学与工程学院,广西桂林,541004

摘要：	数据流频繁项集挖掘是当今数据挖掘和知识学习领域重要的研究课题之一。数据流高速性、连续性、无界性、实时性对挖掘算法在时间和空间方面提出了更高的要求。传统的数据挖掘算法由于其存储结构需要频繁地维护,其挖掘方式的精度和速度较低,空间、时间效率不高。在基于粒计算和ECLAT算法的基础上提出一种挖掘数据流滑动窗口中top-K频繁项集算法,采用二进制方式存储项,利用位移运算实现增量更新,实施与运算计算项集支持度,同时利用二分查找法插入到项目序表中,输出前K个频繁项。实验结果表明,该算法在K取值不太高时具有较好的时空高效性。
关键词：	数据挖掘数据流频繁项集滑动窗口二进制二分法
Alogrithm for mining top-K frequent itemsets over sliding window in data streams

ZHANG Wen-yu,ZHOU Man-yuan.Alogrithm for mining top-K frequent itemsets over sliding window in data streams[J].Application Research of Computers,2011,28(7).

Authors:	ZHANG Wen-yu ZHOU Man-yuan

Affiliation:	ZHANG Wen-yu,ZHOU Man-yuan(School of Computer Science & Engineering,Guilin University of Electronic Technology,Guilin Guangxi 541004,China)

Abstract:	Frequent itemset mining over data streams is a hot topic in data mining and knowledge discovery.The features of data streams,such as consecution,unboundedness and real-time,raise requirements for higher time and space performance of mining algorithms.The storage structure of the present algorithms need to be revised continually.Otherwise,the accuracy and rate of the present algorithms are low.The above two reasons lead up to disadvantage impact on both time and space efficiency.This paper designed a frequen...

Keywords:	data mining data stream frequent itemset sliding window binary dichotomy
本文献已被 CNKI 万方数据等数据库收录！

设为首页 | 免责声明 | 关于勤云 | 加入收藏