首页 | 本学科首页   官方微博 | 高级检索  
     

基于BIPL的Web频繁访问模式挖掘
引用本文:吴雅双,张东站. 基于BIPL的Web频繁访问模式挖掘[J]. 计算机工程与应用, 2008, 44(23): 136-138. DOI: 10.3778/j.issn.1002-8331.2008.23.042
作者姓名:吴雅双  张东站
作者单位:厦门大学,计算机科学系,福建,厦门,361005;厦门大学,计算机科学系,福建,厦门,361005
摘    要:挖掘频繁访问模式是Web日志挖掘的一个重要任务。针对类Apriori算法和GITC算法的不足,提出了基于双亲链的单次扫描求交的Web频繁访问模式挖掘算法—BIPL,该算法首先对用户的访问模式两两进行交集运算,生成候选访问模式,并在求交集过程中保存各个候选访问模式的双亲模式,然后通过简单的求和运算,计算出各个候选访问模式的支持数。最后通过理论分析和实验验证,该算法是稳定的和高效的。

关 键 词:Web日志挖掘  交集关系  频繁访问模式
收稿时间:2008-02-22
修稿时间:2008-4-29 

Mining frequent access patterns on Web mining based on BIPL algorithm
WU Yas-huang,ZHANG Dong-zhan. Mining frequent access patterns on Web mining based on BIPL algorithm[J]. Computer Engineering and Applications, 2008, 44(23): 136-138. DOI: 10.3778/j.issn.1002-8331.2008.23.042
Authors:WU Yas-huang  ZHANG Dong-zhan
Affiliation:Department of Computer Science,Xiamen University,Xiamen,Fujian 361005,China
Abstract:Mining frequent access patterns is an important task of Web log mining.In connection with the shortage of the similar Apriori algorithm and the GITC algorithm,the paper presents BIPL algorithm which is used to mine the Web frequent access patterns.The algorithm is based on parents list and intersection,and requests to scan the database only one times.It first gets the intersections of each two access patterns and gives the birth to candidate access patterns.And the parents access patterns of each candidate access pattern are saved in the process of intersection.Then the counts of all the candidate access patterns can be calculated easily through add operational.Finally,the algorithm is proved to be stable and efficient through theoretical analysis and experimental proof.
Keywords:Web log mining  intersection relation  frequent access pattern
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《计算机工程与应用》浏览原始摘要信息
点击此处可从《计算机工程与应用》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号