首页 | 本学科首页   官方微博 | 高级检索  
     

基于递推式右路径扩展的XML频繁模式树挖掘
引用本文:施秀升,张东站.基于递推式右路径扩展的XML频繁模式树挖掘[J].现代计算机,2011(3):3-6,23.
作者姓名:施秀升  张东站
作者单位:福建厦门大学计算机科学系
基金项目:国家自然科学基金(No.50604012)
摘    要:在分析现有的频繁模式树挖掘的经典算法FREQT和FreqtTree基础上,提出一种新的基于递推式右路径扩展的XML频繁模式树挖掘算法。该算法采用最右路径扩展的思想,利用递推式的候选节点集更新技术来压缩候选节点集,产生数量较少的候选模式,并且在计算候选模式树的支持数时,采用增量式技术,提高算法效率。从理论上证明该算法的正确性,并对通过具体实验验证算法的高效性。

关 键 词:XML  数据挖掘  频繁模式树  模式匹配

XML Frequent Pattern Tree Mining Based on Recursive Right Path Extending
SHI Xiu-sheng,ZHANG Dong-zhan.XML Frequent Pattern Tree Mining Based on Recursive Right Path Extending[J].Modem Computer,2011(3):3-6,23.
Authors:SHI Xiu-sheng  ZHANG Dong-zhan
Affiliation:(Deparment of Computer Science,Xiamen University,Xiamen 361005)
Abstract:Proposes a novel XML frequent pattern tree mining based on recursive right path extending algorithm by the analysis of the FREQT and FreqtTree algorithm,which are the frequent subtree mining algorithm.The algorithm adopts the technology of the rightmost expansion.Then it uses a method of recursive updating the set of candidate nodes to reduce the number of candidate nodes.Thus,the number of the candidate patterns is small.And,it adopts incremental method to compute the support of candidate pattern trees,which improves the efficiency of algorithm.The correctness of the algorithm is proved in theory,and through specific experiment shows that the algorithm is efficient.
Keywords:XML  Data Mining  Frequent Pattern Tree  Pattern Matching
本文献已被 CNKI 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号