首页 | 本学科首页   官方微博 | 高级检索  
     

基于多特征语义匹配的知识库问答系统
引用本文:赵小虎,赵成龙. 基于多特征语义匹配的知识库问答系统[J]. 计算机应用, 2020, 40(7): 1873-1878. DOI: 10.11772/j.issn.1001-9081.2019111895
作者姓名:赵小虎  赵成龙
作者单位:1. 矿山互联网应用技术国家地方联合工程实验室(中国矿业大学), 江苏 徐州 221008;2. 中国矿业大学 信息与控制工程学院, 江苏 徐州 221116
基金项目:国家重点研发计划项目(2017YFC0804400)。
摘    要:知识库问答(KBQA)任务主要目的在于精确地将自然语言问题和知识库(KB)中的三元组进行匹配。传统的KBQA方法通常专注于实体识别和谓语匹配,实体识别的错误会导致错误传播从而无法得到正确的答案。针对上述问题提出一种端到端的解决方案直接匹配问题和三元组,该系统主要包含候选三元组生成和候选三元组排序两个部分来实现精确问答。首先通过BM25算法计算问题和知识库中三元组的相关性生成候选三元组;然后通过多特征语义匹配模型(MFSMM)进行三元组的排序,即用MFSMM分别通过双向长短时记忆网络(Bi-LSTM)和卷积神经网络(CNN)实现语义相似度和字符相似度的计算,并通过融合来对三元组进行排序。该系统在NLPCC-ICCPOL 2016KBQA数据集上的平均F1为80. 35%,接近了现有最好的表现。

关 键 词:知识库  自然语言问题  三元组  多特征语义匹配模型  语义相似度  字符相似度
收稿时间:2019-11-07
修稿时间:2020-04-19

Knowledge base question answering system based on multi-feature semantic matching
ZHAO Xiaohu,ZHAO Chenglong. Knowledge base question answering system based on multi-feature semantic matching[J]. Journal of Computer Applications, 2020, 40(7): 1873-1878. DOI: 10.11772/j.issn.1001-9081.2019111895
Authors:ZHAO Xiaohu  ZHAO Chenglong
Affiliation:1. National and Local Joint Engineering Laboratory of Internet Application Technology on Mine(China University of Mining and Technology), Xuzhou Jiangsu 221008, China;2. School of Information and Control Engineering, China University of Mining and Technology, Xuzhou Jiangsu 221116, China
Abstract:The task of Question Answering over Knowledge Base (KBQA) mainly aims at accurately matching natural language question with triples in the Knowledge Base (KB). However, traditional KBQA methods usually focus on entity recognition and predicate matching, and the errors in entity recognition may lead to error propagation and thus fail to get the right answer. To solve the above problem, an end-to-end solution was proposed to directly match the question and triples. This system consists of two parts:candidate triples generation and candidate triples ranking. Firstly, the candidate triples were generated by the BM25 algorithm calculating the correlation between the question and the triples in the knowledge base. Then, Multi-Feature Semantic Matching Model (MFSMM) was used to realize the ranking of the triples, which means the semantic similarity and character similarity were calculated by MFSMM through Bi-directional Long Short Term Memory Network (Bi-LSTM) and Convolutional Neural Network (CNN) respectively, and the triples were ranked by fusion. With NLPCC-ICCPOL 2016 KBQA as the dataset, the average F1 of the proposed system is 80.35%, which is close to the existing best performance.
Keywords:Knowledge Base (KB)  natural language question  triple  Multi-Feature Semantics Matching Model (MFSMM)  semantic similarity  character similarity  
本文献已被 万方数据 等数据库收录!
点击此处可从《计算机应用》浏览原始摘要信息
点击此处可从《计算机应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号