
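Inherit/Feedback: A New Web Topic Mining Method
Cite this article: YANG Pei, ZHENG Qi Lun, PENG Hong. Inherit/Feedback: A New Web Topic Mining Method[J]. Journal of Computer Research and Development, 2004, 41(5): 807-811
Authors: YANG Pei  ZHENG Qi Lun  PENG Hong
Affiliation: School of Computer Science and Engineering, South China University of Technology, Guangzhou 510640 (all three authors)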
Funding: Guangdong Province Key Science and Technology Project Fund (C10201, A1020103)
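Abstract: Classical link analysis methods (such as PageRank and HITS) pay more attention to the authority of a Web page than to its topical relevance, so topic drift sets in quickly when they are used to guide topic-specific search. To address this, and building on a topic-correlation topology model, the Inherit/Feedback method is proposed for Web topic mining. The basic idea is that, along a search path, a node inherits the topical relevance of its parent nodes and feeds its own topical relevance back to them. A topic-specific search algorithm based on Inherit/Feedback (IFC) is also proposed. Experimental results show that the method guides topic-specific search effectively and is suitable for in-depth search and mining of domain-specific websites.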

Keywords: link analysis  topic-specific search  Web mining

Inherit/Feedback: A New Web Topic-Specific Mining Method
YANG Pei, ZHENG Qi Lun, and PENG Hong. Inherit/Feedback: A New Web Topic-Specific Mining Method[J]. Journal of Computer Research and Development, 2004, 41(5): 807-811
Authors: YANG Pei  ZHENG Qi Lun  and PENG Hong
Abstract: Classical hyperlink analysis algorithms (such as PageRank and HITS) focus on the authority of a Web page rather than its topic. Thus, a crawler based on these algorithms rapidly drifts away from the topic in the course of crawling. In this paper, a new hyperlink analysis method called Inherit/Feedback is presented. The key idea is that a page inherits topic-specific correlation from its ancestors and receives feedback from its descendants. Various applications can be enhanced by the Inherit/Feedback method, such as page ranking and topic-specific crawling. A new topic-specific crawling algorithm based on Inherit/Feedback (IFC) is also proposed. Experiments show that IFC performs well in guiding a topic-specific crawling agent and that it can be applied to in-depth discovery and mining of topic-specific websites.
Keywords: hyperlink analysis  topic-specific crawling  Web mining
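
The abstract states the inherit and feedback idea only in words, so the following Python sketch is an illustration under assumptions, not the paper's IFC algorithm. The function name `ifc_crawl_sketch`, the weights `alpha` and `beta`, the additive scoring rule, and the toy graph are all hypothetical. The sketch assumes a best-first crawler in which an unfetched page is prioritised by the relevance it inherits from its parent, and a fetched page passes a share of its score back to its parent.

```python
import heapq


def ifc_crawl_sketch(graph, local_relevance, seed, alpha=0.5, beta=0.3, budget=10):
    """Toy best-first crawl driven by inherited relevance (illustrative only).

    graph: dict mapping a page to the pages it links to (hypothetical data)
    local_relevance: dict mapping a page to the relevance of its own content
    alpha: assumed share of a parent's score inherited by each child
    beta: assumed share of a child's score fed back to its parent
    """
    inherited = {seed: 0.0}   # relevance each discovered page inherits
    parent = {seed: None}     # page from which each page was first discovered
    score = {}                # scores of fetched pages
    frontier = [(-inherited[seed], seed)]  # max-heap via negated priority
    order = []
    while frontier and len(order) < budget:
        _, page = heapq.heappop(frontier)
        # Inherit: fetched page = own relevance + share inherited from parent.
        score[page] = local_relevance[page] + inherited[page]
        order.append(page)
        # Feedback: the parent's score is raised by a share of this page's score.
        if parent[page] is not None:
            score[parent[page]] += beta * score[page]
        # Newly discovered links are queued by the relevance they will inherit.
        for child in graph.get(page, []):
            if child not in inherited:
                inherited[child] = alpha * score[page]
                parent[child] = page
                heapq.heappush(frontier, (-inherited[child], child))
    return order, score


# Hypothetical site: branch "a" is on topic, branch "b" is not.
toy_graph = {"seed": ["a", "b"], "a": ["a1"], "b": ["b1"]}
toy_relevance = {"seed": 1.0, "a": 0.8, "b": 0.1, "a1": 0.9, "b1": 0.0}
order, score = ifc_crawl_sketch(toy_graph, toy_relevance, "seed")
print(order)  # ['seed', 'a', 'a1', 'b', 'b1']
```

In this toy run the crawler follows the on-topic branch to its end before touching the off-topic one, and feedback leaves page `a` with a higher final score than page `b`. The sketch does not re-rank pages already in the frontier after feedback; whether the paper's IFC does so is not stated in the abstract.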
This article is indexed in CNKI, VIP, Wanfang Data, and other databases.