首页 | 本学科首页   官方微博 | 高级检索  
     


Web usage mining: extracting unexpected periods from web logs
Authors:F. Masseglia  P. Poncelet  M. Teisseire  A. Marascu
Affiliation:(1) INRIA Sophia Antipolis – AxIS Project/Team, 2004 route des Lucioles, P. O. Box 93, Sophia Antipolis, 06902, France;(2) EMA-LGI2P/Site EERIE, Parc Scientifique Georges Besse, Nimes Cedex 1, 30035, France;(3) LIRMM UMR CNRS 5506, 161 Rue Ada, Montpellier Cedex 5, 34392, France
Abstract:Existing Web usage mining techniques are currently based on an arbitrary division of the data (e.g. “one log per month”) or guided by presumed results (e.g. “what is the customers’ behaviour for the period of Christmas purchases?”). These approaches have two main drawbacks. First, they depend on the above-mentioned arbitrary organization of data. Second, they cannot automatically extract “seasonal peaks” from among the stored data. In this paper, we propose a specific data mining process (in particular, to extract frequent behaviour patterns) in order to reveal the densest periods automatically. From the whole set of possible combinations, our method extracts the frequent sequential patterns related to the extracted periods. A period is considered to be dense if it contains at least one frequent sequential pattern for the set of users connected to the website in that period. Our experiments show that the extracted periods are relevant and our approach is able to extract both frequent sequential patterns and the associated dense periods.
Keywords:Web usage mining  Sequential patterns  Periods  Users behaviour
本文献已被 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号