首页 | 本学科首页   官方微博 | 高级检索  
     

融合多角度特征的文本匹配模型
引用本文:李广,刘新,马中昊,黄浩钰,张远明. 融合多角度特征的文本匹配模型[J]. 计算机系统应用, 2022, 31(7): 158-164
作者姓名:李广  刘新  马中昊  黄浩钰  张远明
作者单位:湘潭大学 计算机学院·网络空间安全学院, 湘潭 411105
基金项目:智能化公共法律服务关键技术湖南省重点研发项目(2022SK2106)
摘    要:文本匹配是自然语言处理的一个核心研究领域, 深度文本匹配模型大致可以分为表示型和交互型两种类型, 表示型模型容易失去语义焦点难以衡量词上下文重要性, 交互型模型缺少句型、句间等全局性信息. 针对以上问题提出一种融合多角度特征的文本匹配模型, 该模型以孪生网络为基本架构, 利用BERT模型生成词向量进行词相似度融合加强语义特征, 利用Bi-LSTM对文本的句型结构特征进行编码, 即融合文本词性序列的句型结构信息, 使用Transformer编码器对文本句型结构特征和文本特征进行多层次交互, 最后拼接向量推理计算出两个文本之间的相似度. 在Quora部分数据集上的实验表明, 本模型相比于经典深度匹配模型有更好的表现.

关 键 词:文本匹配  句型结构  Transformer框架  孪生网络  Bi-LSTM  特征融合  注意力机制  自然语言处理
收稿时间:2021-09-22
修稿时间:2021-10-19

Text Matching Model Incorporating Multi-angle Features
LI Guang,LIU Xin,MA Zhong-Hao,HUANG Hao-Yu,ZHANG Yuan-Ming. Text Matching Model Incorporating Multi-angle Features[J]. Computer Systems& Applications, 2022, 31(7): 158-164
Authors:LI Guang  LIU Xin  MA Zhong-Hao  HUANG Hao-Yu  ZHANG Yuan-Ming
Affiliation:School of Computer Science & School of Cyberspace Security, Xiangtan University, Xiangtan 411105, China
Abstract:Text matching is a core research area in natural language processing. Deep text matching models can be broadly classified into representational models and interactive models. The former tends to lose semantic focus and fails to measure the contextual importance of words. The latter lacks global information such as sentence type and inter-sentence information. To address these problems, we propose a text matching model incorporating multi-angle features based on Siamese neural network. The model generates word vectors using the BERT model and enhances semantic features by the similarity fusion of words. It then encodes the syntactic structured features using Bi-LSTM, namely the syntactic structured information containing the lexical sequence. A Transformer encoder is utilized to realize the multi-level interaction between the syntactic structured features and the text features. Finally, the similarity is deduced by spliced vectors. Experiments on part of Quora question pair show that this model performs better than the classical deep matching model.
Keywords:text matching  sentence structure  Transformer framework  Siamese neural network  Bi-LSTM  feature fusion  attention mechanism  natural language processing (NLP)
点击此处可从《计算机系统应用》浏览原始摘要信息
点击此处可从《计算机系统应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号