首页 | 本学科首页   官方微博 | 高级检索  
     

数据流中结构二叉树挖掘算法研究
引用本文:唐向红,元宁.数据流中结构二叉树挖掘算法研究[J].计算机应用研究,2017,34(10).
作者姓名:唐向红  元宁
作者单位:贵州大学现代制造技术教育部重点实验室,贵州大学现代制造技术教育部重点实验室
基金项目:贵州省重大科技专项(黔科合重大专项字(2013)6019,黔科合重大专项字〔2012〕6018)和贵州省基础研究重大项目(黔科合JZ字(2014)2001)
摘    要:针对传统数据流挖掘算法不能挖掘出频繁项之间的关系而且挖掘时间和空间复杂度高、准确度不高的问题,本文提出了一种数据流中结构二叉树挖掘算法(AMST)。该算法利用了二叉树结构的优势,将所处理事务数据库中的数据流转化成结构化二叉树,然后利用数据流矩阵对结构二叉树进行挖掘。整个过程只对事务数据库进行了一次扫描,大大提高了挖掘的效率。此外,算法还找出了具有层次关系的频繁子树。实验结果表明,AMST算法性能稳定,在时间复杂度和空间复杂度方面有很大的优越性,能够快速准确地对数据流进行挖掘。

关 键 词:数据流  频繁项集  结构二叉树  数据流矩阵
收稿时间:2016/7/23 0:00:00
修稿时间:2017/6/30 0:00:00

Study on the algorithm for mining structural binary tree in data stream
Tang Xianghong and Yuan Ning.Study on the algorithm for mining structural binary tree in data stream[J].Application Research of Computers,2017,34(10).
Authors:Tang Xianghong and Yuan Ning
Affiliation:Key Laboratory of Advanced Manufacturing Technology of Ministry of Education, Guizhou University,
Abstract:Aiming at the problem that the traditional algorithms of data stream cannot mine the relationship between the frequent items, and the mining time and space complexity is high, while the accuracy is not high. An algorithm for mining structural binary tree in data stream (AMST) was proposed. The algorithm uses the advantage of binary tree, transforms the data stream into a structured binary tree, and then mines the binary tree with the data stream matrix. The algorithm greatly improves the efficiency of mining because it only scans the database once in the whole process. In addition, the algorithm also finds out frequent subtrees with hierarchical relations. Experiment results showed that the algorithm has stable performance and better advantage in time and space complexity, and can mine the data stream rapidly and accurately.
Keywords:Data Stream  Frequent Items  Structural Binary Tree  Data Stream Matrix  
点击此处可从《计算机应用研究》浏览原始摘要信息
点击此处可从《计算机应用研究》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号