首页 | 本学科首页   官方微博 | 高级检索  
     

基于投影分支的快速频繁子树挖掘算法
引用本文:赵传申,孙志挥,张净.基于投影分支的快速频繁子树挖掘算法[J].计算机研究与发展,2006,43(3):456-462.
作者姓名:赵传申  孙志挥  张净
作者单位:东南大学计算机科学与工程系,南京,210096
摘    要:频繁子树挖掘在生物信息、Web挖掘等很多领域都具有较高的应用价值.在频繁子树挖掘中引入投影分支的概念,并提出基于投影分支的快速频繁子树挖掘算法——FTPB.FTPB算法充分利用树结构本身的特点,在计算投影分支的同时解决树同构的判断问题,扫描数据库后能够根据当前的频繁模式树直接生成新的频繁模式树,可减少数据库的扫描次数和候选模式的搜索空间,从而降低算法复杂度.理论分析和实验结果表明,该算法较其他同类算法相比具有较高的效率,是有效可行的.

关 键 词:数据挖掘  频繁子树  投影分支  枚举树
收稿时间:07 11 2005 12:00AM
修稿时间:2005-07-112005-11-15

Frequent Subtree Mining Based on Projected Branch
Zhao Chuanshen,Sun Zhihui,Zhang Jing.Frequent Subtree Mining Based on Projected Branch[J].Journal of Computer Research and Development,2006,43(3):456-462.
Authors:Zhao Chuanshen  Sun Zhihui  Zhang Jing
Affiliation:Department of Computer Science and Engineering, Southeast University, Nanjing 210096
Abstract:Discovering frequent subtrees from ordered labeled trees is an important research problem in data mining with broad applications in bioinformatics, web log, XML documents and so on. In this paper, A new concept of projected branch is introduced, and a new algorithm FTPB (frequent subtrees mining based on projected branch) is proposed. This algorithm does the work of distinguishing isomorphism while computing projected branch, which decreases the complexity of algorithm, improving the efficiency of the algorithm. Theoretical analysis and experimental results show that the FTPB algorithm is efficient and effective.
Keywords:data mining  frequent subtrees  projected branch  enumeration tree
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号