首页 | 本学科首页   官方微博 | 高级检索  
     

关于提取Web用户浏览行为特征的研究
引用本文:胡亚慧,ZHAO Hong-jun,赵红军,鲁汉榕,王海杰.关于提取Web用户浏览行为特征的研究[J].计算机工程与设计,2006,27(18):3416-3418.
作者姓名:胡亚慧  ZHAO Hong-jun  赵红军  鲁汉榕  王海杰
作者单位:1. 空军雷达学院,湖北,武汉,430019
2. 武汉理工大学,湖北,武汉,430074
摘    要:当前,Web日志挖掘技术已成为实现网站个性化服务的研究热点.运用Markov模型来预测用户的浏览模式,从而提高站点访问率、为站点重组提供有利信息是该领域广泛采用的方法之一.但传统方法建立的Markov模型,存在着数据冗余复杂、模型庞大繁琐等问题.针对这些问题,介绍了一种改进的Markov模型.其方法主要是在原有模型的基础之上,在数据清洗、用户会话识别过程中删除一些不予考虑的因素,大大简化了建立的Markov模型,提高了Web日志挖掘的效率.

关 键 词:Web日志挖掘  浏览行为  Markov模型  数据清洗  会话识别
文章编号:1000-7024(2006)18-3416-03
收稿时间:2005-07-16
修稿时间:2005-07-16

Research on extracting patterns from web user behavior
ZHAO Hong-jun.Research on extracting patterns from web user behavior[J].Computer Engineering and Design,2006,27(18):3416-3418.
Authors:ZHAO Hong-jun
Affiliation:1. Air Force Radar Academy, Wuhan 430019, China; 2. Wuhan University of Science and Technology, Wuhan 430074, China
Abstract:It is a popular research on web log mining to achieve web personalization. In this field, the user web navigation patterns are usually modeled as a Markov chain to increase web site click rates and provide useful information in reorganizing site. But the original model remains the more complex and redundant data which result in an anfractuous model, It seriousely affects farther research on user navigation patterns. An improved Markov model is introduced. Some irrespective problems are omitted in the process of the data cleaning and the session identification, such as the defeated pageview, the timeouts record and so on. By this way, the model is more simple and more effective than the original.
Keywords:web log mining  navigation patterns  Markov model  data cleaning  session identification
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号