面向问句复述识别的多卷积自交互匹配方法研究 A Multi-Convolution Self-Interaction Method for Question Paraphrase Identification期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

面向问句复述识别的多卷积自交互匹配方法研究

引用本文：	陈鑫,李伟康,洪宇,周夏冰,张民.面向问句复述识别的多卷积自交互匹配方法研究[J].中文信息学报,2019,33(10):99.

作者姓名：	陈鑫李伟康洪宇周夏冰张民

作者单位：	苏州大学计算机科学与技术学院,江苏苏州 215006

基金项目：	国家自然科学基金(61672367,61672368);国家重点研发计划(2017YFB1002104);江苏省研究生科研与实践创新计划项目(SJCX19_0926)

摘要：	问句复述识别旨在识别两个自然问句是否语义一致。目前,基于表示学习和深度神经网络架构的复述识别技术已取得较好效果。但是,这类方法往往面临复杂度较高且训练难度较大的瓶颈。针对这一问题,该文提出一种快速的多卷积自交互匹配方法。该方法融合了多种句子特征和词义特征,并由此形成分布式表示。在此基础上,这一方法利用卷积神经网络获取短语级的句子向量表示,并利用自交互融合技术将词级与短语级的向量表示进行充分融合,借以获得多粒度句子向量表示。该文将这一方法应用于判定自然语句是否互为复述的二元分类任务中,利用Quora标准问句复述识别语料进行测试。实验结果证明,在不引入外部数据的情况下,该文所提方法与基于双向多视角匹配的基准模型具有可比的性能,但在空间复杂度上具有较高的优越性,并且获得更快训练速度。具体地,该方法训练所需的物理显存比基准模型方法下降80%,训练迭代速度快19倍。
关键词：	复述识别多卷积交互效率
A Multi-Convolution Self-Interaction Method for Question Paraphrase Identification

CHEN Xin,LI Weikang,HONG Yu,ZHOU Xiabing,ZHANG Min.A Multi-Convolution Self-Interaction Method for Question Paraphrase Identification[J].Journal of Chinese Information Processing,2019,33(10):99.

Authors:	CHEN Xin LI Weikang HONG Yu ZHOU Xiabing ZHANG Min

Affiliation:	School of Computer Science and Technology, Soochow University, Suzhou, Jiangsu 215006, China

Abstract:	Question paraphrase identification aims to identify whether two natural questions are semantic consistency. At present, paraphrase identification technology based on representation learning and deep neural network architecture has achieved good results. However, these methods often face bottlenecks with high complexity and difficulty in training. To tackle the problem, this paper proposes a fast method named multi-convolution self-interaction match (MCSM) model. This method combines multiple sentence features with word sense features to form a distributed representation. Then it utilizes convolution neural networks to capture phrase-level sentence representation. A self-interaction fusion technology is employed to obtain multi-granularity sentence vector representation, which can fully integrate word-level and phrase-level feature vector representation. Experimented on the Quora standard paraphrase identification corpus, the proposed method has comparable performance to the benchmark model based on bilateral multi-perspective match without external data. But has a training speed of its 19 times faster than the baseline, and the memory required is reduced by 80%.

Keywords:	paraphrase identification multi convolution interaction efficiency

	点击此处可从《中文信息学报》浏览原始摘要信息
	点击此处可从《中文信息学报》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏