
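A Session Identification Algorithm Based on Frame Pages and Page Thresholds
Cite this article: Fang Yuankang, Hu Xuegang, Xia Qishou, Zhu Yong. A Session Identification Algorithm Based on Frame Pages and Page Thresholds [J]. Computer Applications and Software (计算机应用与软件), 2009, 26(1).
Authors: Fang Yuankang, Hu Xuegang, Xia Qishou, Zhu Yong
Affiliations: 1. Computer Center, Chizhou College, Chizhou 247000, Anhui
2. School of Computer and Information, Hefei University of Technology, Hefei 230009, Anhui
Funding: National Natural Science Foundation of China; Natural Science Foundation of the Anhui Provincial Department of Education; Natural Science Foundation of the Anhui Provincial Department of Education
Abstract: Session identification is an important step in Web log preprocessing. To address the shortcomings of traditional session identification, an improved session identification algorithm is proposed. Once individual users have been identified, the large number of frame pages is filtered out. A relatively reasonable access-time threshold is then constructed for each page from the page's content and the site structure, and this threshold is used to identify user sessions. Finally, a comparison on experimental data with several traditional session identification methods indicates that the algorithm is more reasonable and effective.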

Keywords: Web mining; data preprocessing; threshold; frame page; session identification

A SESSION IDENTIFICATION ALGORITHM BASED ON FRAME PAGE AND PAGE THRESHOLD
Fang Yuankang, Hu Xuegang, Xia Qishou, Zhu Yong. A SESSION IDENTIFICATION ALGORITHM BASED ON FRAME PAGE AND PAGE THRESHOLD [J]. Computer Applications and Software, 2009, 26(1).
Authors: Fang Yuankang, Hu Xuegang, Xia Qishou, Zhu Yong
Affiliation: 1. Center of Computer Technology, Chizhou College, Chizhou 247000, Anhui, China; 2. College of Computer and Information, Hefei University of Technology, Hefei 230009, China
Abstract: Session identification is an important step in the data preprocessing stage of web log mining. To address the shortcomings of traditional session identification, an improved session identification algorithm is proposed. After specific users are identified, a large number of frame pages are filtered out; a relatively reasonable access time threshold is then constructed for each page according to the page's content and the website structure, and users' sessions are identified using this threshold. Finally the algorithm was compared with the tradi...
Keywords: Web mining; Data preprocessing; Threshold; Frame page; Session identification
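
The pipeline outlined in the abstract (identify users, drop frame pages, then split each user's requests with a per-page time threshold) can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not say how thresholds are derived from page content and site structure, so the `page_threshold` table, the `default_threshold` fallback, the `Request` record, and all URLs and values below are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Request:
    user: str    # already-identified user (e.g. from IP + agent); assumed given
    time: float  # request time in seconds
    url: str


def identify_sessions(requests, frame_pages, page_threshold, default_threshold=600.0):
    """Split each user's requests into sessions using per-page time thresholds.

    frame_pages: set of URLs treated as frame pages and dropped (assumed known).
    page_threshold: hypothetical map from URL to max dwell time in seconds.
    """
    by_user = {}
    for r in sorted(requests, key=lambda r: (r.user, r.time)):
        if r.url in frame_pages:
            continue  # filter frame pages before session identification
        by_user.setdefault(r.user, []).append(r)

    sessions = []
    for user_requests in by_user.values():
        current = [user_requests[0]]
        for prev, cur in zip(user_requests, user_requests[1:]):
            # The threshold depends on the page the user was viewing (prev).
            limit = page_threshold.get(prev.url, default_threshold)
            if cur.time - prev.time > limit:
                sessions.append(current)  # gap too long: close the session
                current = [cur]
            else:
                current.append(cur)
        sessions.append(current)
    return sessions


log = [
    Request("u1", 0, "/frameset.html"),
    Request("u1", 1, "/index.html"),
    Request("u1", 30, "/news/article1.html"),
    Request("u1", 1000, "/index.html"),
    Request("u2", 5, "/index.html"),
]
sessions = identify_sessions(
    log,
    frame_pages={"/frameset.html"},
    page_threshold={"/index.html": 60, "/news/article1.html": 600},
)
```

In this toy log, the 970-second gap after `/news/article1.html` exceeds that page's assumed 600-second threshold, so user `u1` ends up with two sessions; a single global timeout larger than 970 seconds would have merged them.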
This article is indexed in CNKI, VIP (维普), Wanfang Data (万方数据), and other databases.