首页 | 本学科首页   官方微博 | 高级检索  
     

一种单遍扫描频繁模式树结构
引用本文:谭军,卜英勇,杨勃. 一种单遍扫描频繁模式树结构[J]. 计算机工程, 2010, 36(14): 32-33
作者姓名:谭军  卜英勇  杨勃
作者单位:中南林业科技大学计算机学院,长沙,412006;中南大学机电工程学院,长沙,410083;中南大学机电工程学院,长沙,410083
摘    要:针对频繁模式增长算法无法适应数据流的无限性和流动性的特点,提出一种新颖的FP-tree的变形结构-SP-tree,只需单遍扫描便能容纳全部数据库信息。为使SP-tree具有与FP-tree一样良好的压缩性能,给出一种有效的动态重构树的方法,称为宽度排序方法,该方法能够在挖掘过程中动态地逐条分支地重构树,最终产生一棵频繁递减的前缀树。实验结果表明,SP-tree的压缩性能优于其他单遍扫描的前缀树结构。

关 键 词:数据流  频繁模式增长算法  单遍扫描模式树  宽度排序方法

Single-pass Frequent Pattern Tree Structure
TAN Jun,BU Ying-yong,YANG Bo. Single-pass Frequent Pattern Tree Structure[J]. Computer Engineering, 2010, 36(14): 32-33
Authors:TAN Jun  BU Ying-yong  YANG Bo
Affiliation:(1. College of Computer Science, Central South University of Forestry and Technology, Changsha 412006;2. College of Mechanical Electrical Engineering, Central South University, Changsha 410083)
Abstract:Aiming at the problem that FP-growth algorithm requires two database scans, which are not consistent with efficient data stream processing, this paper presents a novel tree structure which is a variation of FP-tree, called SP-tree, which captures database information with one scan. For making SP-tree have the same compact performance, it presents an efficient dynamic tree restructuring method, called the breadth sorting method, which restructures a frequency-descending prefix-tree branch-by-branch. Experimental results show that compact performance of the SP-tree is better than other prefix-tree structure with one scan.
Keywords:data stream  FP-growth algorithm  single-pass pattern tree  breadth sorting method
本文献已被 维普 万方数据 等数据库收录!
点击此处可从《计算机工程》浏览原始摘要信息
点击此处可从《计算机工程》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号