首页 | 本学科首页   官方微博 | 高级检索  
     


Ranking Pages by Topology and Popularity within Web Sites
Authors:José Borges  Mark Levene
Affiliation:(1) School of Engineering, University of Porto, R. Dr. Roberto Frias, 4200 Porto, Portugal;(2) School of Computer Science and Information Systems, Birkbeck University of London, Malet Street, London, WC1E 7HX, UK
Abstract:We compare two link analysis ranking methods of web pages in a site. The first, called Site Rank, is an adaptation of PageRank to the granularity of a web site and the second, called Popularity Rank, is based on the frequencies of user clicks on the outlinks in a page that are captured by navigation sessions of users through the web site. We ran experiments on artificially created web sites of different sizes and on two real data sets, employing the relative entropy to compare the distributions of the two ranking methods. For the real data sets we also employ a nonparametric measure, called Spearman's footrule, which we use to compare the top-ten web pages ranked by the two methods. Our main result is that the distributions of the Popularity Rank and Site Rank are surprisingly close to each other, implying that the topology of a web site is very instrumental in guiding users through the site. Thus, in practice, the Site Rank provides a reasonable first order approximation of the aggregate behaviour of users within a web site given by the Popularity Rank.
Keywords:web data mining  web usage mining  Page Rank  Popularity Rank  Site Rank
本文献已被 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号