基于网络蜘蛛的搜索引擎自动发现 Automatically Discovering Search Engines Based on a Spider期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

基于网络蜘蛛的搜索引擎自动发现

引用本文：	藕军,任明仑,靳鹏.基于网络蜘蛛的搜索引擎自动发现[J].现代电子技术,2007,30(12):127-129.

作者姓名：	藕军任明仑靳鹏

作者单位：	合肥工业大学,计算机网络研究所,安徽,合肥,230009

摘要：	自动发现Web上大量的搜索引擎对于构造大规模元搜索引擎是有益的,提出一种用优化爬行规则的网络蜘蛛自动发现搜索引擎并提取其元信息的方法:通过优化爬行规则的网络蜘蛛爬取页面;利用专门的识别规则从爬取到的页面中识别搜索界面,并提取其相关的元信息。试验结果表明该方法简单有效,自动发现的查准率和查全率分别达到97%和91%。
关键词：	元搜索引擎自动发现网络蜘蛛元信息
文章编号：	1004-373X（2007）12-127-03
收稿时间：	2006-12-08
修稿时间：	2006-12-08
Automatically Discovering Search Engines Based on a Spider

OU Jun,REN Minglun,JIN Peng.Automatically Discovering Search Engines Based on a Spider[J].Modern Electronic Technique,2007,30(12):127-129.

Authors:	OU Jun REN Minglun JIN Peng

Affiliation:	Institute of Computer Network,Hefei University of Technology, Hefei,230009,China

Abstract:	Automatically discovering search engines on the web is useful to build a large-scale meta search engines.Presents a method to discover Search Engines(SEs) automatically based on a spider,and extract meta information from them: use a spider whose crawling rules has been optimized to fetch pages,and identify search face by special rules from fetched pages,finally extract meta information.An experimental study shows this automatic discovering method is simple and effective with the precision of 97% and the recall 91%.

Keywords:	meta search engine spider meta information automatically discovering
本文献已被 CNKI 维普万方数据等数据库收录！

设为首页 | 免责声明 | 关于勤云 | 加入收藏