
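Research and Improved Algorithm of HITS Based on Web Structure Mining
Cite this article: FAN Cong-xian, XU Ting-rong, FAN Qiang-xian. Research and Improved Algorithm of HITS Based on Web Structure Mining [J]. 微计算机信息 (Control & Automation), 2010(3).
Authors: FAN Cong-xian, XU Ting-rong, FAN Qiang-xian
Affiliation: School of Computer Science and Technology, Soochow University
Abstract: With the development of Internet technology, Web pages have become an effective way for people to obtain information, and Web data mining has gradually become a research hotspot in China and abroad. The HITS algorithm used in Web structure mining considers only the link relationships between pages and ignores their actual content, which makes it prone to topic drift [1] and degrades search results. To suppress topic drift, this paper combines hyperlink-based information retrieval with page content and proposes an improved algorithm. Experimental results show that the improved algorithm performs better than the original algorithm and effectively suppresses topic drift, which gives it some practical value.

Keywords: Web data mining; Web structure mining; HITS; Google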

Research and Improved Algorithm of HITS Based on Web Structure Mining
FAN Cong-xian, XU Ting-rong, FAN Qiang-xian. Research and Improved Algorithm of HITS Based on Web Structure Mining [J]. Control & Automation, 2010(3).
Authors: FAN Cong-xian, XU Ting-rong, FAN Qiang-xian
Affiliation: Computer Science & Technology School, Soochow University, Suzhou, 215006, China
Abstract: With the development of Internet technology, Web pages have become an effective way for people to obtain information, and Web data mining has gradually become a hot research topic at home and abroad. The HITS algorithm in Web structure mining considers only the link relations among pages and neglects their actual content, so it is prone to topic drift, which degrades search results. To restrain topic drift, this paper puts forward an improved algorithm whic...
Keywords: web data mining; web structure mining; HITS; Google
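
For context, the link-only behaviour that the abstract criticizes can be seen in a minimal sketch of the standard HITS iteration. This is the textbook baseline, not the authors' improved algorithm, whose content-weighting scheme is not described in the abstract. The function name `hits`, the dictionary-based graph format, and the fixed iteration count are assumptions made for illustration.

```python
import math


def hits(graph, iterations=50):
    """Baseline HITS on a link graph (illustrative sketch only).

    graph: dict mapping each page to a list of pages it links to.
    Returns (hub, authority) score dicts, each L2-normalized.
    Page content is never consulted, which is why the baseline
    can drift toward densely linked but off-topic pages.
    """
    # Collect every page that appears as a source or a target.
    pages = set(graph)
    for targets in graph.values():
        pages.update(targets)

    hub = {p: 1.0 for p in pages}
    auth = {p: 1.0 for p in pages}

    for _ in range(iterations):
        # Authority update: sum the hub scores of pages linking in.
        new_auth = {p: 0.0 for p in pages}
        for src, targets in graph.items():
            for dst in targets:
                new_auth[dst] += hub[src]

        # Hub update: sum the (new) authority scores of linked pages.
        new_hub = {
            p: sum(new_auth[dst] for dst in graph.get(p, ()))
            for p in pages
        }

        # Normalize both vectors so the scores stay bounded.
        a_norm = math.sqrt(sum(v * v for v in new_auth.values())) or 1.0
        h_norm = math.sqrt(sum(v * v for v in new_hub.values())) or 1.0
        auth = {p: v / a_norm for p, v in new_auth.items()}
        hub = {p: v / h_norm for p, v in new_hub.items()}

    return hub, auth
```

Calling `hits({"a": ["c"], "b": ["c"], "c": []})` gives page `c` the highest authority score and gives `a` and `b` equal hub scores, purely from the link structure.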
This article is indexed in CNKI and other databases.