首页 | 本学科首页   官方微博 | 高级检索  
     

网络化制造资源垂直搜索引擎的研究与应用
引用本文:张建,程锦.网络化制造资源垂直搜索引擎的研究与应用[J].计算机应用,2007,27(5):1116-1118.
作者姓名:张建  程锦
作者单位:贵州大学CAD/CIMS工程技术中心
摘    要:着重研究了网络化制造资源垂直搜索系统的主题爬虫和中文分词技术。通过在主题爬虫中增加评价网页模块,优先爬行与主题相似度高的网页中的链接,提高了爬虫的工作效率。在对中文分词词典进行分层存储的基础上,通过一种改进的简洁的中文分词词典匹配算法,有效地改善了分词的速度与精度,并缩减了索引库,增强了用户的响应。

关 键 词:网络化制造  制造资源  垂直搜索  页面解析  中文分词  Lucene
文章编号:1001-9081(2007)05-1116-03
收稿时间:2006-11-27
修稿时间:2006-11-27

Research and application of vertical search engine in networked manufacturing resource
ZHANG Jian,CHENG Jin.Research and application of vertical search engine in networked manufacturing resource[J].journal of Computer Applications,2007,27(5):1116-1118.
Authors:ZHANG Jian  CHENG Jin
Affiliation:Department of Economy, Wuhan University of Science and Technology, Wuhan Hubei 430070, China; 2. Guizhou Agriculture Science Institute, Guiyang Guizhou 550006, China; 3. Institute of CAD/CIMS, Guizhou University, Guiyang Guizhou 550003, China
Abstract:This paper put emphasis on the technologies of the system,including the topic crawler and the Chinese word segmentation.To improve the efficiency of the crawler,a model of page evaluation was added into the crawler module;therefore the urls in a page with a high similarity of the topic will be first crawled.Besides,an improved word matching algorithm was proposed to enhance the speed and precision of word segmentation.
Keywords:networked manufacturing  manufacturing resource  vertical search engine  html parser
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《计算机应用》浏览原始摘要信息
点击此处可从《计算机应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号