Mining web access patterns with super-pattern constraint
Authors: Trang Van, Atsuo Yoshitaka, Bac Le
Affiliation: 1. Faculty of Information Technology, Ho Chi Minh City University of Technology (HUTECH), Ho Chi Minh City, Vietnam; 2. Faculty of Computer Sciences, University of Information Technology, VNU-HCMC, Ho Chi Minh, Vietnam; 3. School of Information Science, Japan Advanced Institute of Science and Technology, Nomi, Japan; 4. Faculty of Information Technology, University of Science, VNU-HCMC, Ho Chi Minh, Vietnam
Abstract: We consider the problem of mining web access patterns with a super-pattern constraint. This constraint requires that the sequential patterns in the sequence database must contain a particular set of patterns as sub-patterns. One common application of this constraint is web usage mining, which mines user access behavior on the web. In this paper, we introduce an efficient strategy for mining web access patterns with a super-pattern constraint that requires only one database scan. First, we present the MWAPC (Mining Web Access Patterns based on super-pattern Constraint) algorithm, in which each frequent pattern is checked to see whether it contains at least one pattern from a user-defined set of patterns. We then develop an effective algorithm, called EMWAPC, which prunes the search space at the beginning of the mining process and, based on three proposed propositions, avoids checking the constraints one by one. We conducted experiments on real web log databases. The experimental results show that the proposed algorithms outperform previous methods.
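The constraint check described in the abstract can be illustrated with a minimal sketch. This is not the authors' MWAPC or EMWAPC implementation: it is a naive filter that assumes each web session is an ordered list of page names, that "contains as a sub-pattern" means subsequence containment, and that candidate patterns are supplied by hand rather than mined. All names below (`is_subsequence`, `support`, `satisfies_constraint`, the sample sessions) are hypothetical.

```python
def is_subsequence(sub, seq):
    """Return True if `sub` occurs in `seq` in order (gaps allowed)."""
    it = iter(seq)
    # `item in it` advances the iterator, so order is enforced.
    return all(item in it for item in sub)


def support(pattern, database):
    """Count the sessions in `database` that contain `pattern`."""
    return sum(1 for session in database if is_subsequence(pattern, session))


def satisfies_constraint(pattern, constraints):
    """Assumed check: the pattern contains at least one constraint pattern."""
    return any(is_subsequence(c, pattern) for c in constraints)


# Hypothetical web sessions (assumption: one ordered page list per user visit).
sessions = [
    ["home", "products", "cart", "checkout"],
    ["home", "cart", "checkout"],
    ["home", "products", "about"],
]
constraints = [["cart", "checkout"]]
candidates = [
    ["home", "cart", "checkout"],
    ["home", "products"],
    ["cart", "checkout"],
]
min_support = 2

# Keep candidates that are frequent AND are super-patterns of a constraint.
result = [
    p for p in candidates
    if support(p, sessions) >= min_support
    and satisfies_constraint(p, constraints)
]
print(result)
```

Checking every frequent pattern one by one, as this sketch does, corresponds to the per-pattern check the abstract attributes to the first algorithm; the pruning that the abstract attributes to EMWAPC is not shown here.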
This article is indexed in SpringerLink and other databases.