
Research and Development of WORD Document Full Text Search Engine Based on CLucene
Citation: YANG Wen-tao, SI Ying-shuo, ZHANG Sen. Research and Development of WORD Document Full Text Search Engine Based on CLucene[J]. Journal of Luoyang Institute of Science and Technology (Natural Science Edition), 2011, 0(1): 56-60
Authors: YANG Wen-tao, SI Ying-shuo, ZHANG Sen
Affiliation: Zhengzhou Institute of Aeronautical Industry Management, Zhengzhou 450046, China
Abstract: The ability to search quickly and effectively through the large volume of varied information resources on a network or within a site is the basis of a high-quality retrieval service. CLucene is the C++ implementation of Lucene, an excellent open-source full-text search framework. This paper analyzes the system structure of CLucene, explains its indexing and retrieval mechanisms in detail, solves the problem of extracting text from WORD documents on top of CLucene, adds Chinese-language support to CLucene, and implements a CLucene-based application that supports retrieval of Chinese and English WORD documents.
Keywords: CLucene; WORD; index; text extraction; full-text retrieval
This article is indexed in CNKI and other databases.