首页 | 本学科首页   官方微博 | 高级检索  
     

基于滑动窗口的数据流闭合频繁模式的挖掘
引用本文:刘学军,徐宏炳,董逸生,钱江波,王永利.基于滑动窗口的数据流闭合频繁模式的挖掘[J].计算机研究与发展,2006,43(10):1738-1743.
作者姓名:刘学军  徐宏炳  董逸生  钱江波  王永利
作者单位:1. 东南大学计算机科学与技术系,南京,210096;南京工业大学信息科学与工程学院,南京,210009
2. 东南大学计算机科学与技术系,南京,210096
基金项目:江苏省高技术研究发展计划项目;江苏省研究生培养创新工程项目
摘    要:频繁闭合模式集惟一确定频繁模式完全集并且数量小得多,然而,如何挖掘滑动窗口中的频繁闭合模式集是一个很大的挑战.根据数据流的特点,提出了一种发现滑动窗口中频繁闭合模式的新方法DS_CFI.DS_CFI算法将滑动窗口分割为若干个基本窗口,以基本窗口为更新单位。利用已有的频繁闭合模式挖掘算法计算每个基本窗口的潜在频繁闭合项集,将它们及其子集存储到一种新的数据结构DSCFI_tree中,DSCFI_tree能够增量更新,利用DSCFI_tree可以快速地挖掘滑动窗口中的所有频繁闭合模式.最后,通过实验验证了这种方法的有效性.

关 键 词:数据流  闭合频繁项集  滑动窗口  关联规则  知识发现
收稿时间:07 28 2005 12:00AM
修稿时间:2005-07-282006-01-06

Mining Frequent Closed Patterns from a Sliding Window over Data Streams
Liu Xuejun,Xu Hongbing,Dong Yisheng,Qian Jiangbo,Wang Yongli.Mining Frequent Closed Patterns from a Sliding Window over Data Streams[J].Journal of Computer Research and Development,2006,43(10):1738-1743.
Authors:Liu Xuejun  Xu Hongbing  Dong Yisheng  Qian Jiangbo  Wang Yongli
Affiliation:1,Department of Computer Science and Technology, Southeast University, Nanjing 210096;2, College of Information Science and Engineering, Nanjing University of Technology, Nanjing 210009
Abstract:The set of frequent closed patterns determines exactly the complete set of all frequent patterns and is usually much smaller than the latter. But how to mine frequent closed patterns from a sliding window is a very big challenge. According to the features of data streams, a new algorithm, call DS_CFI, is proposed to solve the problem of mining the frequent closed itemsets. A sliding window is divided into several basic windows and the basic window is served as an updating unit. Latency frequent closed itemsets of every basic window are mined by the existing frequent closed pattern algorithms. Those itemsets and their subset are stored in a new data structure called DSCFI_tree. The DSCFI_tree can be incrementally updated and the frequent closed itemsets in a sliding window can be rapidly found based on DSCFI_tree. The experimental results show the feasibility and effectiveness of the algorithm.
Keywords:data stream  frequent closed item  sliding window  association rule  knowledge discovery
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号