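Research on an Intelligent Search Engine Based on NLP Technology and Similarity Calculation
Cite this article: LIANG Xiao-cheng, YUE Xiao-guang, MAI Fan-jin, ZHAO Zi-qiang, LU Ying, WANG Ting. Research on Intelligent Search Engine Based on NLP Technology and Similarity Calculation[J]. Journal of Kunming University of Science and Technology (Natural Science Edition), 2010, 35(4): 76-79, 88.
Authors: LIANG Xiao-cheng, YUE Xiao-guang, MAI Fan-jin, ZHAO Zi-qiang, LU Ying, WANG Ting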
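Affiliations: 1. School of Information Science and Engineering, Guilin University of Technology, Guilin, Guangxi 541004, China
2. Mechanical and Electronic Engineering College, Taiyuan University of Science and Technology, Taiyuan, Shanxi 030024, China
3. Department of Management, Monash University, Melbourne, Victoria 3800, Australia
4. Department of Computer Science, University of Liverpool, Liverpool L69 7ZF, UK; Department of Computer Science and Software Engineering, Xi'an Jiaotong-Liverpool University, Suzhou, Jiangsu 215123, China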
Funding: Supported by the Guangxi Natural Science Foundation
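Abstract: To address the problems that traditional search engines have with natural language understanding, this article studies a new intelligent search engine model based on natural language processing technology and similarity calculation. Its core techniques are NLP-based Chinese word segmentation together with the theories of semantic similarity and contrary degree. These concepts are combined, from the perspective of how users habitually think, with the open-source full-text search engine DotLucene to build an intelligent search engine. The study shows that the model achieves a precision of 86.1% on documents that have already been indexed. The intelligent search engine understands query sentences reasonably well and can give correct answers to users' questions.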

Keywords: natural language processing; Chinese word segmentation; similarity; DotLucene; intelligent search engine

Research on Intelligent Search Engine Based on NLP Technology and Similarity Calculation
LIANG Xiao-cheng,YUE Xiao-guang,MAI Fan-jin,ZHAO Zi-qiang,LU Ying,WANG Ting.Research on Intelligent Search Engine Based on NLP Technology and Similarity Calculation[J].Journal of Kunming University of Science and Technology(Natural Science Edition),2010,35(4):76-79,88.
Authors:LIANG Xiao-cheng  YUE Xiao-guang  MAI Fan-jin  ZHAO Zi-qiang  LU Ying  WANG Ting
Affiliation: 1. School of Information Science and Engineering, Guilin University of Technology, Guilin, Guangxi 541004, China; 2. Mechanical and Electronic Engineering College, Taiyuan University of Science and Technology, Taiyuan 030024, China; 3. Department of Management, Monash University, Melbourne 3800, Australia; 4. Department of Computer Science, University of Liverpool, Liverpool, L69 7ZF, UK; 5. Department of Computer Science and Software Engineering, Xi'an Jiaotong-Liverpool University, Suzhou, Jiangsu 215123, China
Abstract: To address the difficulties that traditional search engines have in understanding natural language, this article proposes a new intelligent search engine model based on natural language processing and similarity calculation. Its core techniques are NLP-based Chinese word segmentation, semantic similarity, and contrary degree. Approaching the problem from the user's point of view, the model combines these concepts with DotLucene. The precision of the intelligent search engine is about 86.1%. The engine can interpret natural-language queries and return correct answers to users.
Keywords: natural language processing; Chinese word segmentation; similarity; DotLucene; intelligent search engine
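
The abstract names three ingredients (Chinese word segmentation, a similarity score between query and document, and precision as the evaluation measure) without giving algorithms. The sketch below is only an illustration under stated assumptions, not the authors' method: it uses forward maximum matching as a stand-in segmenter, cosine similarity over term counts as a stand-in for semantic similarity, and the standard definition of precision. It does not model contrary degree and does not call DotLucene. The lexicon and all function names are hypothetical.

```python
import math
from collections import Counter

# Hypothetical toy lexicon; a real system would load a full dictionary.
LEXICON = {"智能", "搜索", "引擎", "搜索引擎", "自然", "语言",
           "自然语言", "处理", "中文", "分词"}
MAX_WORD_LEN = max(len(word) for word in LEXICON)


def segment(text):
    """Forward maximum matching: at each position take the longest lexicon word."""
    tokens, i = [], 0
    while i < len(text):
        for size in range(min(MAX_WORD_LEN, len(text) - i), 0, -1):
            piece = text[i:i + size]
            # Fall back to a single character when no lexicon word matches.
            if size == 1 or piece in LEXICON:
                tokens.append(piece)
                i += size
                break
    return tokens


def cosine_similarity(tokens_a, tokens_b):
    """Cosine similarity between two bag-of-words term-count vectors."""
    a, b = Counter(tokens_a), Counter(tokens_b)
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def rank(query, documents):
    """Return (score, document) pairs sorted from most to least similar."""
    query_tokens = segment(query)
    scored = [(cosine_similarity(query_tokens, segment(doc)), doc)
              for doc in documents]
    return sorted(scored, key=lambda pair: pair[0], reverse=True)


def precision(retrieved, relevant):
    """Fraction of retrieved documents that are actually relevant."""
    if not retrieved:
        return 0.0
    return sum(1 for doc in retrieved if doc in relevant) / len(retrieved)


if __name__ == "__main__":
    print(segment("智能搜索引擎"))
    print(rank("搜索引擎", ["中文分词处理", "智能搜索引擎"]))
    print(precision(["a", "b", "c", "d"], {"a", "b", "c"}))
```

Greedy longest-match segmentation is chosen here only because it fits in a few lines; it mis-segments ambiguous strings, and a production system would typically use a statistical segmenter and an inverted index for retrieval.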
This article is indexed in CNKI, VIP (维普), Wanfang Data, and other databases.