首页 | 本学科首页   官方微博 | 高级检索  
     

基于策略爬行与混合索引的医药行业垂直搜索引擎的设计与实现
引用本文:王凯,余堃,马增红.基于策略爬行与混合索引的医药行业垂直搜索引擎的设计与实现[J].数字社区&智能家居,2008(4):96-99.
作者姓名:王凯  余堃  马增红
作者单位:电子科技大学开放实验室,四川成都610054
摘    要:随着互联网应用的深入,越来越多的用户希望通过搜索引擎获得特定行业的相关信息,通用搜索引擎无法有效地满足相应需求。文中主要介绍医药行业垂直搜索引擎的设计与实现。设计基于智能搜索引擎的架构,采用了任务驱动的聚焦搜索、隐藏搜索技术;字词混合倒排索引及优化的字倒排索引、检索技术。提供了资源收集阶段的可控策略爬行,和高效的索引、检索功能。实现了针对医药行业的高专业度、高准确率、高效率的信息垂直搜索。

关 键 词:垂直搜索引擎  聚焦搜索  隐藏搜索  混合倒排索引

Design and Implementation of a Search Engine for Medicine Industry Based on Strategy Crawling and Multiple Inverted Index
WANG Kai,SHE Kun,MA Zeng-hong.Design and Implementation of a Search Engine for Medicine Industry Based on Strategy Crawling and Multiple Inverted Index[J].Digital Community & Smart Home,2008(4):96-99.
Authors:WANG Kai  SHE Kun  MA Zeng-hong
Affiliation:(Open Laboratory,UEST,Chengdu 610054,China)
Abstract:With the growth of using on Intemet, more and more users attempt to obtain mtormation ot specifiC undustry by search engine. General search engine can't meet this kind of requirement effectively. This paper presents the design and implementation of an vertical search engine for medicine industry. It's designed based on the architecture of intelligent search engine. It takes Focused Crawling and Hidden Web Crawling technologies which based on mission-driven mode; Multiple Inverted Index and optimized inverted index,search techmologies, provides controllable crawling in the stage of resource gathering and high-performance index and search. It makes the vertical searching on medicine industry professionally and high efficiency.
Keywords:Focused Crawling  Hidden Web Crawling Multiple Inverted Index  Vertical Search Engine
本文献已被 维普 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号