首页 | 本学科首页   官方微博 | 高级检索  
     

基于问句相似度的中文FAQ问答系统
引用本文:叶正,林鸿飞,杨志豪.基于问句相似度的中文FAQ问答系统[J].计算机工程与应用,2007,43(9):161-163,248.
作者姓名:叶正  林鸿飞  杨志豪
作者单位:大连理工大学,计算机科学与工程系,辽宁,大连,116024
摘    要:常见问题(FAQ)问答系统是一种在已有的“问题—答案”对集合中找到与用户提问相匹配的问句,并将其对应的答案返回给用户的问答式检索系统。其关键问题是用户提出问句与FAQ库中问句进行相似度计算,找出FAQ库中最相近的问句,并返回事先存储好的问题答案。通过对常见问句特点的研究,给出一种基于分解的向量空间模型和语义概念的问句相似度计算方法,其主要思想是对一个问句向量进行分解,提取其三个关键部分:问点、主题词和疑问词,表示成三个分向量,然后对每个分向量计算基于《HIT-IRLab同义词词林(扩展版)》的语义相似度,通过线性加权就可以得出两个问句的语义相似度。试验表明,与传统的基于向量空间模型的TF-DF问句相似度计算方法相比,可以提高问句匹配的精度。

关 键 词:问句相似度  语义相似度  常见问题集  向量空间模型
文章编号:1002-8331(2007)09-0161-03
修稿时间:2006-11

Chinese FAQ system based on sentence similarity
YE Zheng,LIN Hong-fei,YANG Zhi-hao.Chinese FAQ system based on sentence similarity[J].Computer Engineering and Applications,2007,43(9):161-163,248.
Authors:YE Zheng  LIN Hong-fei  YANG Zhi-hao
Affiliation:Department of Computer Science and Engineering,Dalian University of Technology,Dalian,Liaoning 116024,China
Abstract:FAQ system is a QA retrieval system to find the question sentence that is matched with the user question sentence to the set of"question-answer" pairs,and return its corresponding answer to user.Its key question is that questions asked by user and questions in the FAQ carry on similarity computation,discover the closest question in the FAQ and return the question answer stored in advance.This paper presents a question similarity computation approach based on splited vector space model and semantic concept according to the common question characteristic research.The main thought is that splitting a question vector,extracting the main three components--question point,keyword and interrogative,expressing three components,then computing semantic similarity to every component based on "Synonym Word Dictionary" and obtaining two semantic similarity of questions by the linear weighting.The experiment indicates that precision of question match can be improved compared to traditional question similarity computation based on TF-IDF computation of vector space model.
Keywords:sentence similarity  semantic similarity  FAQ  Vector Space Model
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号