首页 | 本学科首页   官方微博 | 高级检索  
     

网络爬虫性能研究
引用本文:漆志辉,杨天奇. 网络爬虫性能研究[J]. 微型机与应用, 2011, 30(5): 72-74,80
作者姓名:漆志辉  杨天奇
作者单位:暨南大学,信息科学技术学院,计算机系,广东,广州,510632
基金项目:广东省软科学研究项目(2009B070300052)
摘    要:受到学习模型爬虫的启发,主题爬虫结合网页内容和链接信息来估计网页对给定主题的相关性,得到两个新型的爬虫变种。新型爬虫强调的不仅是有学习相关网页内容的能力,而且有引向相关网页的能力,并且在查找特定主题方面的能力有质的提高。

关 键 词:主题爬虫  学习型爬虫  学习型主题爬虫

Research of the function of network crawlers
Qi Zhihui,Yang Tianqi. Research of the function of network crawlers[J]. Microcomputer & its Applications, 2011, 30(5): 72-74,80
Authors:Qi Zhihui  Yang Tianqi
Affiliation:Qi Zhihui,Yang Tianqi (Department of Computer,Institute of Information Science and Technology,Jinan University,Guangzhou 510632,China)
Abstract:Inspired by learning crawler, this paper obtains two new focused crawlers which combine Web page content and link information. The new focused crawlers emphasis not only on the capability of learning the content of relevant pages but also paths leading to relevant pages. Furthermore, the new crawlers′ ability to find more specific topics has improved.
Keywords:focused crawler  learning crawler  learning focused crawler  
本文献已被 CNKI 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号