首页 | 本学科首页   官方微博 | 高级检索  
     

基于伪爬行器的主题式元搜索引擎研究与设计
引用本文:马奕平,庄毅,叶延风,张霞. 基于伪爬行器的主题式元搜索引擎研究与设计[J]. 计算机工程, 2008, 34(22): 70-72
作者姓名:马奕平  庄毅  叶延风  张霞
作者单位:南京航空航天大学计算机科学与技术系,南京,210016
基金项目:国家“863”计划基金资助项目(2006AA706103); 航空基金资助项目(05F2037)
摘    要:为提高搜索的查准率和查全率,设计一个主题式的元搜索引擎和一个类似于爬行器的伪爬行器,通过调用通用搜索引擎采集信息,查全率高于通用搜索引擎。利用反馈机制,参考用户查询历史记录,搜索结果更加接近用户的要求。通过采用主题式策略,改进文档相似度算法,提高分类的正确率和搜索引擎的查准率与搜索范围,同时减少系统响应时间,降低对服务器性能的要求。

关 键 词:元搜索  主题式  搜索引擎  伪爬行器
修稿时间: 

Research and Design of Topic-specific Meta-search Engine Based on Bogus Crawler
MA Yi-ping,ZHUANG Yi,YE Yan-feng,ZHANG Xia. Research and Design of Topic-specific Meta-search Engine Based on Bogus Crawler[J]. Computer Engineering, 2008, 34(22): 70-72
Authors:MA Yi-ping  ZHUANG Yi  YE Yan-feng  ZHANG Xia
Affiliation:(Department of Computer Science and Technology, Nanjing University of Aeronautics and Astronautics, Nanjing 210016)
Abstract:To improve the correct-rate and completeness-rate of search, a topic-specific meta-search engine is designed. A bogus crawler is invented, which collects information by the normal search engines, so that the search-area is wider than the normal search engine. The feedback mechanism is adopted and the search-history of user is considered, which make the search result is more imminent to the purpose of the user. Owing to the strategy of topic-specific and mending the arithmetic of similitude-degree of the texts, the correct-rate is improved. Both the correct-rate and completeness-rate of searching are improved, the response time is decreased as well, at the same time, the request of capability of the server is reduced.
Keywords:meta-search  topic-specific  search engine  bogus crawler
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《计算机工程》浏览原始摘要信息
点击此处可从《计算机工程》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号