一种改进的数据流最大频繁项集挖掘算法 An improved algorithm for mining maximal frequent itemsets over data streams期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

一种改进的数据流最大频繁项集挖掘算法

引用本文：	胡,健,吴毛毛.一种改进的数据流最大频繁项集挖掘算法[J].计算机工程与科学,2014,36(5):963-970.

作者姓名：	胡健吴毛毛

摘要：	提出了一种基于DSM MFI算法的改进算法DSMMFI DS算法，它首先将事务数据按一定的全序关系存入DSFI list列表中；然后按排序后的顺序存储到类似概要数据结构的树中；接着删除树中和DSFI list列表中的非频繁项，同时删除窗口衰退支持数大的事务项；最后采用自顶向下和自底向上的双向搜索策略来挖掘数据流的最大频繁项集。通过用例分析和实验表明，该算法比DSM MFI算法具有更好的执行效率。
关键词：	数据挖掘数据流界标窗口最大频繁项集窗口衰减支持数
收稿时间：	2012-12-03
修稿时间：	2014-05-25
An improved algorithm for mining maximal frequent itemsets over data streams

HU Jian,WU Mao mao.An improved algorithm for mining maximal frequent itemsets over data streams[J].Computer Engineering & Science,2014,36(5):963-970.

Authors:	HU Jian WU Mao mao

Affiliation:	(Institute of Information Engineering,Jiangxi University of Science and Technology,Ganzhou 341000,China)

Abstract:	Based on the algorithm of DSM MFI, an improved algorithm, named DSMMFI DS (Dictionary Sequence Mining Maximal Frequent Itemsets over Data Streams), is proposed. Firstly, it stores transaction data into DSFI list in alphabetical order. Secondly, the data are stored sequentially into the tree similar to the summary data structure. Thirdly, non frequent items in the tree and DSFI list are removed, and the transaction items with the maximum count of window attenuation supports are deleted. Finally, the strategy (top down and bottom up two way search) is used to mine maximal frequent itemsets over data streams, and case analysis and experiments prove that the algorithm DSMMFI DS has better performance than the algorithm DSM MFI.

Keywords:	data mining data stream landmark windows maximal frequent itemsets window attenuation support count
本文献已被 CNKI 等数据库收录！
	点击此处可从《计算机工程与科学》浏览原始摘要信息
	点击此处可从《计算机工程与科学》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏