首页 | 本学科首页   官方微博 | 高级检索  
     

基于动态注意力和多角度匹配的答案选择模型
引用本文:李志超,吐尔地·托合提,艾斯卡尔·艾木都拉.基于动态注意力和多角度匹配的答案选择模型[J].计算机应用,2021,41(11):3156-3163.
作者姓名:李志超  吐尔地·托合提  艾斯卡尔·艾木都拉
作者单位:新疆大学 信息科学与工程学院,乌鲁木齐 830046
基金项目:新疆维吾尔自治区自然科学基金资助项目(2021D01C076)
摘    要:针对当前主流神经网络在处理答案选择任务时无法同时满足句子的充分表示以及句子间信息充分交互的问题,提出了基于动态注意力和多角度匹配(DAMPM)的答案选择模型。首先,调用预训练语言模型的嵌入(ELMo)获得包含简单语义信息的词向量;接着,在注意力层采用过滤机制有效地去除句子中的噪声,从而更好地得到问句和答案句的句子表征;其次,在匹配层同时引入多种匹配策略来完成句子向量之间的信息交互;然后,利用双向长短期记忆(BiLSTM)网络对匹配层输出的句子向量进行拼接;最后,通过分类器来计算拼接向量的相似度大小,从而得到问句和答案句之间的语义关联。在文本检索会议问答(TRECQA)数据集上的实验结果表明,与基于比较聚合框架的基线模型中的动态滑动注意力网络(DCAN)方法相比,DAMPM在平均准确率均值(MAP)和平均倒数排名(MRR)两个性能指标上均提高了1.6个百分点。在维基百科问答(WikiQA)数据集上的实验结果表明,DAMPM相较DCAN在两个性能指标上分别提高了0.7个百分点和0.8个百分点。所提DAMPM相较于基线模型中的方法整体上有更好的性能表现。

关 键 词:神经网络  答案选择  动态注意力机制  多角度匹配  预训练语言模型  
收稿时间:2021-01-11
修稿时间:2021-05-24

Answer selection model based on dynamic attention and multi-perspective matching
LI Zhichao,TURDI Tohti,ASKAR Hamdulla.Answer selection model based on dynamic attention and multi-perspective matching[J].journal of Computer Applications,2021,41(11):3156-3163.
Authors:LI Zhichao  TURDI Tohti  ASKAR Hamdulla
Affiliation:College of Information Science and Engineering,Xinjiang University,Urumqi Xinjiang 830046,China
Abstract:The current mainstream neural networks cannot satisfy the full expression of sentences and the full information interaction between sentences at the same time when processing answer selection tasks. In order to solve the problems, an answer selection model based on Dynamic Attention and Multi-Perspective Matching (DAMPM) was proposed. Firstly, the pre-trained Embeddings from Language Models (ELMo) was introduced to obtain the word vectors containing simple semantic information. Secondly, the filtering mechanism was used in the attention layer to remove the noise in the sentences effectively, so that the sentence representation of question and answer sentences was obtained in a better way. Thirdly, the multiple matching strategies were introduced in the matching layer at the same time to complete the information interaction between sentence vectors. Then, the sentence vectors output from the matching layer were spliced by the Bidirectional Long Short-Term Memory (BiLSTM) network. Finally, the similarity of splicing vectors was calculated by a classifier, and the semantic correlation between question and answer sentences was acquired. The experimental results on the Text REtrieval Conference Question Answering (TRECQA) dataset show that, compared with the Dynamic-Clip Attention Network (DCAN) method, which is one of the comparison aggregation framework based baseline models, the proposed DAMPM improves the Mean Average Precision (MAP) and Mean Reciprocal Rank (MRR) both by 1.6 percentage points. The experimental results on the Wiki Question Answering (WikiQA) dataset show that, the two performance indices of DAMPM is 0.7 percentage points and 0.8 percentage points higher than those of DCAN respectively. The proposed DAMPM has better performance than the methods in the baseline models in general.
Keywords:neural network  answer selection  dynamic attention mechanism  multi-perspective matching  pre-trained language model  
本文献已被 万方数据 等数据库收录!
点击此处可从《计算机应用》浏览原始摘要信息
点击此处可从《计算机应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号