首页 | 本学科首页   官方微博 | 高级检索  
     

一种改进的数据流最大频繁项集挖掘算法
引用本文:胡,健,吴毛毛.一种改进的数据流最大频繁项集挖掘算法[J].计算机工程与科学,2014,36(5):963-970.
作者姓名:    吴毛毛
摘    要:提出了一种基于DSM MFI算法的改进算法DSMMFI DS算法,它首先将事务数据按一定的全序关系存入DSFI list列表中;然后按排序后的顺序存储到类似概要数据结构的树中;接着删除树中和DSFI list列表中的非频繁项,同时删除窗口衰退支持数大的事务项;最后采用自顶向下和自底向上的双向搜索策略来挖掘数据流的最大频繁项集。通过用例分析和实验表明,该算法比DSM MFI算法具有更好的执行效率。

关 键 词:数据挖掘  数据流  界标窗口  最大频繁项集  窗口衰减支持数  
收稿时间:2012-12-03
修稿时间:2014-05-25

An improved algorithm for mining maximal frequent itemsets over data streams
HU Jian,WU Mao mao.An improved algorithm for mining maximal frequent itemsets over data streams[J].Computer Engineering & Science,2014,36(5):963-970.
Authors:HU Jian  WU Mao mao
Affiliation:(Institute of Information Engineering,Jiangxi University of Science and Technology,Ganzhou 341000,China)
Abstract:Based on the algorithm of DSM MFI, an improved algorithm, named DSMMFI DS (Dictionary Sequence Mining Maximal Frequent Itemsets over Data Streams), is proposed. Firstly, it stores transaction data into DSFI list in alphabetical order. Secondly, the data are stored sequentially into the tree similar to the summary data structure. Thirdly, non frequent items in the tree and DSFI list are removed, and the transaction items with the maximum count of window attenuation supports are deleted. Finally, the strategy (top down and bottom up two way search) is used to mine maximal frequent itemsets over data streams, and case analysis and experiments prove that the algorithm DSMMFI DS has better performance than the algorithm DSM MFI.
Keywords:data mining  data stream  landmark windows  maximal frequent itemsets  window attenuation support count  
本文献已被 CNKI 等数据库收录!
点击此处可从《计算机工程与科学》浏览原始摘要信息
点击此处可从《计算机工程与科学》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号