首页 | 本学科首页   官方微博 | 高级检索  
     

基于Lucene和LSA的法律咨询系统
引用本文:尹芝芳,王鑫,蔡文正,李鹤,阮玲玲.基于Lucene和LSA的法律咨询系统[J].计算机系统应用,2014,23(4):52-56.
作者姓名:尹芝芳  王鑫  蔡文正  李鹤  阮玲玲
作者单位:桂林电子科技大学 计算机科学与工程学院, 桂林 541004;桂林电子科技大学 计算机科学与工程学院, 桂林 541004;桂林电子科技大学 计算机科学与工程学院, 桂林 541004;桂林电子科技大学 计算机科学与工程学院, 桂林 541004;桂林电子科技大学 计算机科学与工程学院, 桂林 541004
基金项目:国家自然科学基金(61262074)
摘    要:本文设计的法律咨询系统,结合法律行业的现状,以中文问答系统为原型,结合了开源数据检索项目Lucene.net,扩展了数据的存储类型. 本文借助中科院研发的中文分词系统,集成到Lucene.Net平台上,弥补了其分词不足. 并使用互信息技术,使同义的法律相关词语优先进行检索. 在中文问答系统的答案提取时,经常出现答案的“漏取”和“错取”的情况,本文提出了一种基于潜在语义分析(LSA)的问题和答案句子相似度计算方法,利用空间向量模型作为表示方法,借助潜在语义分析理论,通过奇异值分解的降维方法构建了一个低维的语义空间,并在语义空间上实现了问题与答案句子相似度计算. 经试验证明,本系统具有较精准的查询正确率以及较少的运行计算时间.

关 键 词:Lucene.Net  LSA  问答系统  互信息
收稿时间:2013/8/30 0:00:00
修稿时间:2013/10/4 0:00:00

Law Consultation System Based on Lucene and LSA
YIN Zhi-Fang,WANG Xin,CAI Wen-Zheng,LI He and RUAN Ling-Ling.Law Consultation System Based on Lucene and LSA[J].Computer Systems& Applications,2014,23(4):52-56.
Authors:YIN Zhi-Fang  WANG Xin  CAI Wen-Zheng  LI He and RUAN Ling-Ling
Affiliation:College of Computer Science and Engineering, Guilin University of Electronic and Technology, Guilin 541004, China;College of Computer Science and Engineering, Guilin University of Electronic and Technology, Guilin 541004, China;College of Computer Science and Engineering, Guilin University of Electronic and Technology, Guilin 541004, China;College of Computer Science and Engineering, Guilin University of Electronic and Technology, Guilin 541004, China;College of Computer Science and Engineering, Guilin University of Electronic and Technology, Guilin 541004, China
Abstract:The designation of this law consultation system, not only considers the situation of the legal profession and based on Chinese Question-Answering System as prototype, but also use searching technology Lucene.net which is a open source project that can preform on many kind of types file. This article also uses ICTCLAS and applies it to the Lucene that makes up for Lucene's lack of word segmentation and mutual information technology to make the law word to be priority search. This paper proposes a method to calculate similarity between question and sentence based on Latent Semantic Analysis (LSA). This method represents the question and sentence with space vector model, under the help of latent semantic analysis theory, and constructs a semantic space, which gets rids of the correlativity between word. And then similarity calculation between question and sentence is implemented in this semantic space. Experiments show that this system has the precision of the operation of the inquiry accuracy and less computation time.
Keywords:Lucene  Net  LSA  Question-Answering system  mutual information
本文献已被 CNKI 等数据库收录!
点击此处可从《计算机系统应用》浏览原始摘要信息
点击此处可从《计算机系统应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号