首页 | 本学科首页   官方微博 | 高级检索  
     

基于Web日志文件的关联规则挖掘模块的实现
引用本文:米娜瓦尔·努拉合买提,玛依拉·别克强塔依娃,张太红,曾明,Osmar. R. Zaiane.基于Web日志文件的关联规则挖掘模块的实现[J].微机发展,2011(9):51-54.
作者姓名:米娜瓦尔·努拉合买提  玛依拉·别克强塔依娃  张太红  曾明  Osmar. R. Zaiane
作者单位:[1]新疆农业大学计算机与信息工程学院,新疆乌鲁木齐830052 [2]西安交通大学软件学院,陕西西安710049 [3]阿尔伯塔大学计算机科学系,埃德蒙顿T6G2E1
基金项目:新疆维吾尔自治区电子信息发展专项资金项目(XJDZZXZJ20109)
摘    要:在对Web应用挖掘的基本步骤作系统性研究的基础上,设计了一个基于Web日志文件的关联规则挖掘模块。该系统应能够对用户访问Web时服务器方留下的访问记录进行挖掘,从中得出用户的访问模式和访问兴趣。为了识别用户浏览模式,实现了利用关联规则挖掘算法Apriori对Web应用挖掘过程中预处理阶段所产生的用户会话文件进行挖掘的模块,该模块针对用户选定的若干页面产生满足最小支持度和最小置信度的页面之间的强关联规则,并以文本的形式显示挖掘的结果。

关 键 词:用户访问序列文件  关联规则  最小支持度  最小置信度

Implementation of Association Mining Model Based on Web Log File
Affiliation:NULAHEMAITI·Mi-nawaer,BIEKEQIANGTAYIWA·Ma-yila,ZHANG Tai-hong, ZENG Ming,Osmar.R.Zaiane(1.College of Computer and Information Engineering,Xinjiang Agricultural University,Urumqi 830052,China; 2.Software Engineering School,Xi'an Jiaotong University,Xi'an 710049,China; 3.Department of Computing Science,Alberta University,Edmonton T6G 2E1,Canada)
Abstract:Underlying the systematic studies on the basic steps of Web usage mining to implement a visual Web usage mining system,which is mainly used to mine the Web log access file that acquired from the Web server,get the user visiting patterns and visiting interests.In order to identify the navigational patterns of Web site visitors,Apriori algorithm is used on the mining of the user session file that has been generated after the data pre-processing process on the Web log file.The association mining model can be used to generate the frequent itemsets that satisfy the minimum support threshold and strong association rules between selected pages that satisfy the both minimum confidence and minimum support thresholds,and display the association rules mining results by text.
Keywords:user visiting sequence file  association rule  minimum confidence  minimum support
本文献已被 维普 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号