
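A directed tree based approach for mining maximum frequent access patterns in Web logs
Cite this article: ZHAN Yu-bin, YIN Jian-ping, ZHANG Ling, LONG Jun, CHENG Jie-ren. A directed tree based approach for mining maximum frequent access patterns in Web logs[J]. Journal of Computer Applications, 2006, 26(7): 1662-1665.
Authors: ZHAN Yu-bin  YIN Jian-ping  ZHANG Ling  LONG Jun  CHENG Jie-ren
Affiliation: Computer School, National University of Defense Technology, Changsha, Hunan 410073, China
Abstract: This paper proposes s-Tree, an Apriori-based algorithm for mining maximum frequent access patterns. The algorithm represents each user session as a directed tree, so it can mine maximum forward reference transactions and users' preferred browsing paths. It counts support with a method that gives priority to content pages, so it can find particular user access patterns that traditional algorithms cannot discover. It joins frequent pattern trees with layered frequent arcs, which avoids a drawback of graph-structured data mining algorithms: directly joining two frequent pattern trees requires checking a join condition. It also adopts a pre-pruning strategy, which reduces the algorithm's overhead. Experiments show that s-Tree is scalable and runs more efficiently than applying graph-structured data mining algorithms directly.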

Keywords: Web usage mining  maximum frequent access pattern  directed tree  Web logs
Article ID: 1001-9081(2006)07-1662-04
Received: 2006-01-22
Revised: 2006-03-21

A directed tree based approach for mining maximum frequent access patterns in Web logs
ZHAN Yu-bin, YIN Jian-ping, ZHANG Ling, LONG Jun, CHENG Jie-ren. A directed tree based approach for mining maximum frequent access patterns in Web logs[J]. Journal of Computer Applications, 2006, 26(7): 1662-1665.
Authors: ZHAN Yu-bin  YIN Jian-ping  ZHANG Ling  LONG Jun  CHENG Jie-ren
Affiliation: Computer School, National University of Defense Technology, Changsha, Hunan 410073, China
Abstract: A novel Apriori-based algorithm named s-Tree is proposed for mining maximum frequent access patterns in Web logs. Its main contributions are as follows. First, a directed tree is used to represent each user session, which makes it possible to mine the maximum forward reference transactions and the users' preferred access paths. Second, a novel content-first method for counting support is introduced, which helps discover important and interesting patterns that conventional methods miss. Third, two special strategies are adopted to reduce the overhead of joining frequent patterns. Experimental results show that the s-Tree algorithm is scalable and more efficient than previous graph-based structure pattern mining algorithms such as AGM (Apriori-based Graph Mining) and FSG (Frequent Subgraph Discovery).
Keywords: Web usage mining  maximum frequent access pattern  directed tree  Web logs
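
The abstract does not spell out how a session becomes a directed tree, so the following is only a minimal sketch under stated assumptions, not the s-Tree algorithm itself. It assumes that revisiting a page already on the current path is a backward move, that each tree node can be identified by its path from the root, and that the support of an arc is the number of sessions whose tree contains it. The function names `session_to_tree`, `maximal_forward_references`, and `frequent_arcs` are hypothetical, and the content-first support counting and pre-pruning described in the paper are not reproduced here.

```python
from collections import Counter


def session_to_tree(pages):
    """Turn one click sequence into a directed tree.

    Each node is stored as the tuple of pages on its path from the root.
    Assumption: a page already on the current path means the user moved
    backward to it, so no new node is created.
    """
    nodes = []
    current = ()
    for page in pages:
        if page in current:
            # Backward move: cut the current path back to that page.
            current = current[: current.index(page) + 1]
        else:
            # Forward move: extend the current path by one arc.
            current = current + (page,)
            if current not in nodes:
                nodes.append(current)
    return nodes


def maximal_forward_references(nodes):
    """Return the root-to-leaf paths, i.e. nodes that no other node extends."""
    return [
        n for n in nodes
        if not any(len(m) > len(n) and m[: len(n)] == n for m in nodes)
    ]


def frequent_arcs(sessions, min_support):
    """Return arcs (parent, child) that occur in at least min_support sessions."""
    counts = Counter()
    for pages in sessions:
        # A set, so each session contributes at most once per arc.
        arcs = {(n[-2], n[-1]) for n in session_to_tree(pages) if len(n) > 1}
        counts.update(arcs)
    return {arc for arc, c in counts.items() if c >= min_support}


if __name__ == "__main__":
    tree = session_to_tree(list("ABCDCBEGHGWAOUOV"))
    for path in maximal_forward_references(tree):
        print("".join(path))
    print(sorted(frequent_arcs([list("ABCBE"), list("ABE"), list("AC")], 2)))
```

Identifying nodes by their root path keeps the sketch short and lets the same page appear in different branches, at the cost of storing each prefix explicitly; a real implementation over large logs would more likely use linked nodes.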
This article is indexed in CNKI, VIP (维普), Wanfang Data (万方数据), and other databases.