首页 | 本学科首页   官方微博 | 高级检索  
     

针对中文检索的Lucene改进策略
引用本文:索红光,孙鑫.针对中文检索的Lucene改进策略[J].计算机应用与软件,2009,26(6):175-177.
作者姓名:索红光  孙鑫
作者单位:中国石油大学计算机与通信工程学院,山东,东营,257061
摘    要:为了提高基于Lucene中文检索系统的检索精度和效率,通过分析Lucene的结构,在系统中加入了中文分词模块和索引文档预处理模块。给出了具体的实验方法和实验过程,对改进原理和实验数据进行了分析,表明了加入中文分词模块和在索引预处理模块中采用提取特定数量的特征词来替代文档的方法能够有效提高Lucene检索系统的效率和精度,增强Lucene检索系统中文的性能。

关 键 词:Lucene  索引  中文分词  文档预处理  

STRATEGIES TO IMPROVE LUCENE AIMING AT THE CHINESE SEARCH
Suo Hongguang,Sun Xin.STRATEGIES TO IMPROVE LUCENE AIMING AT THE CHINESE SEARCH[J].Computer Applications and Software,2009,26(6):175-177.
Authors:Suo Hongguang  Sun Xin
Affiliation:College of Computer and Communication Engineering;China University of Petroleum;Dongying 257061 Shandong;China
Abstract:To improve the efficiency and accuracy of retrieval system based on Lucene in searching Chinese information,we add the Chinese word segmentation module and indexing documents pretreatment module into the system by analyzing the structure of Lucene.The specific way and process of experiment are given in the paper.Both the analysis of improvement principle in theoretic and the experimental results prove that,by substituting documents with specific quantity of characteristic words picked up in index pretreatme...
Keywords:Lucene Index Chinese word segmentation Documents pretreatment  
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号