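Learning Word Embeddings for Paraphrase Scoring in Knowledge Base Based Question Answering*
Cite this article: ZHAN Chendi, LING Zhenhua, DAI Lirong. Learning Word Embeddings for Paraphrase Scoring in Knowledge Base Based Question Answering*[J]. Pattern Recognition and Artificial Intelligence, 2016, 29(9): 825-831.
Authors: ZHAN Chendi  LING Zhenhua  DAI Lirong
Affiliation: National Engineering Laboratory for Speech and Language Information Processing, University of Science and Technology of China, Hefei 230027
Funding: Supported by the Science and Technology Research Program of Anhui Province (No.2014z02006) and the Fundamental Research Funds for the Central Universities (No.WK2350000001)
Abstract: Conventional methods for constructing word embeddings rely on the co-occurrence probabilities between words within a sentence and use task-independent, unsupervised training. This paper proposes a word embedding construction method based on paraphrase-relation constraints, intended to improve paraphrase question scoring based on word embeddings and the bag-of-words model in knowledge base question answering. First, question pairs that satisfy the paraphrase relation and question pairs that do not are collected from a paraphrase question corpus according to certain rules. Sentence-level semantic constraints are then expressed as inequalities between the similarities of these question pairs, and the inequalities are added as a constraint term to the objective function for word embedding training. Experiments show that, compared with conventional word embedding methods, the proposed method improves both the accuracy of paraphrase-relation evaluation between questions and the accuracy of question answering in a knowledge base question answering system.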

Keywords: Knowledge Base Based Question Answering  Question Paraphrase  Word Embedding
Received: 2016-03-29

Learning Word Embeddings for Paraphrase Scoring in Knowledge Base Based Question Answering
ZHAN Chendi, LING Zhenhua, DAI Lirong. Learning Word Embeddings for Paraphrase Scoring in Knowledge Base Based Question Answering[J]. Pattern Recognition and Artificial Intelligence, 2016, 29(9): 825-831.
Authors: ZHAN Chendi  LING Zhenhua  DAI Lirong
Affiliation: National Engineering Laboratory for Speech and Language Information Processing, University of Science and Technology of China, Hefei 230027
Abstract: Conventional word embeddings are learned from the co-occurrence probabilities between words within the same sentence, using a task-independent, unsupervised learning algorithm. This paper proposes a method for constructing word embeddings that uses paraphrase constraints to improve paraphrase scoring with word embeddings and a bag-of-words model in knowledge base (KB) based question answering (QA). In the proposed method, pairs of paraphrase questions and pairs of non-paraphrase questions are collected from a database of question paraphrases according to designed rules. Inequalities between the similarities of these question pairs then represent the semantic constraints at the sentence level, and these inequalities are integrated into the objective function for training word embeddings. Experimental results show that the proposed method improves the accuracy of both paraphrase scoring and KB-based question answering compared with conventional word embedding methods.
Keywords: Knowledge Base Based Question Answering  Question Paraphrase  Word Embedding
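
The abstract does not give the exact form of the similarity inequality or of the constraint term, so the following is only a minimal sketch of the general idea under stated assumptions: a question is represented by the mean of its word vectors (one common bag-of-words choice), similarity is cosine similarity, and the inequality "a paraphrase pair should be more similar than a non-paraphrase pair" is turned into a hinge penalty with a hypothetical `margin`. The function names, the toy vectors, and the margin value are illustrative and are not taken from the paper.

```python
import numpy as np


def sentence_vector(tokens, emb):
    """Bag-of-words sentence vector: mean of the vectors of known tokens (assumed form)."""
    vecs = [emb[t] for t in tokens if t in emb]
    if not vecs:
        raise ValueError("no token of the question has an embedding")
    return np.mean(vecs, axis=0)


def cosine(u, v):
    """Cosine similarity with a small constant to avoid division by zero."""
    return float(np.dot(u, v) / (np.linalg.norm(u) * np.linalg.norm(v) + 1e-12))


def paraphrase_hinge(q, pos, neg, emb, margin=0.1):
    """Penalty for violating sim(q, pos) >= sim(q, neg) + margin.

    Assumption: a hinge is used to relax the inequality; it is zero when
    the inequality already holds and grows linearly with the violation.
    """
    sq, sp, sn = (sentence_vector(x, emb) for x in (q, pos, neg))
    return max(0.0, margin - cosine(sq, sp) + cosine(sq, sn))


# Toy 4-dimensional embeddings, invented purely for illustration.
emb = {
    "who": np.array([1.0, 0.0, 0.0, 0.0]),
    "wrote": np.array([0.0, 1.0, 0.0, 0.0]),
    "authored": np.array([0.0, 0.9, 0.1, 0.0]),
    "hamlet": np.array([0.0, 0.0, 1.0, 0.0]),
    "where": np.array([0.0, 0.0, 0.0, 1.0]),
    "is": np.array([0.0, 0.0, 0.0, 1.0]),
    "paris": np.array([0.0, 0.0, 0.0, 1.0]),
}

q = ["who", "wrote", "hamlet"]
pos = ["who", "authored", "hamlet"]  # paraphrase of q
neg = ["where", "is", "paris"]       # not a paraphrase of q

satisfied = paraphrase_hinge(q, pos, neg, emb)  # inequality holds
violated = paraphrase_hinge(q, neg, pos, emb)   # roles swapped, inequality fails
```

In a training setup along these lines, a penalty of this kind would be summed over the collected question pairs, weighted, and added to the unsupervised word embedding objective, so that gradients from violated inequalities adjust the word vectors. How the paper weights and optimizes the combined objective is not stated in the abstract.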