首页 | 本学科首页   官方微博 | 高级检索  
     

融合多粒度信息与外部知识的短文本匹配模型
引用本文:梁登玉,刘大明.融合多粒度信息与外部知识的短文本匹配模型[J].计算机工程,2022,48(8):129.
作者姓名:梁登玉  刘大明
作者单位:上海电力大学 计算机科学与技术学院, 上海 200090
基金项目:甘肃省自然科学基金(SKLLDJ032016021)。
摘    要:中文短文本通常使用单词序列而非字符序列进行语义匹配,以获得更好的语义匹配性能。然而,中文分词可能是错误或模糊的,容易引入噪声或者错误传播,从而损害模型的匹配性能。此外,多数中文词汇具有一词多义的特点,短文本由于缺少上下文环境,相比一词多义的长文本更难理解,这对于模型正确捕获语义信息是一个更大的挑战。提出一种短文本匹配模型,使用词格长短期记忆网络(Lattice LSTM)融合字符和字符序列的多粒度信息。引入外部知识HowNet解决多义词的问题,使用软注意力机制获取2个句子间的交互信息,并利用均值池化和最大池化算法进一步提取句子的特征信息,获取句子级语义编码表示。在数据集LCQMC和BQ上的实验结果表明,与ESIM、BIMPM和Lattice-CNN模型相比,该模型能有效提升中文短文本语义匹配的准确率。

关 键 词:短文本语义匹配  词格长短期记忆网络  多粒度信息  外部知识  软注意力机制  
收稿时间:2021-07-13
修稿时间:2021-09-20

Short Text Matching Model Combined with Multi-Granularity Information and External Knowledge
LIANG Dengyu,LIU Daming.Short Text Matching Model Combined with Multi-Granularity Information and External Knowledge[J].Computer Engineering,2022,48(8):129.
Authors:LIANG Dengyu  LIU Daming
Affiliation:School of Computer Science and Technology, Shanghai University of Electric Power, Shanghai 200090, China
Abstract:Chinese short text semantic matching generally uses word sequences, rather than character sequences, to achieve higher semantic-matching performance.However, Chinese word segmentation may be inaccurate or ambiguous, and word segmentation errors can introduce noise and lead to error propagation, thus deteriorating the final matching performance.Meanwhile, most Chinese words are characterized by polysemy.Short texts are more difficult to understand than long texts due to lack of context, which presents a greater challenge for the model to correctly capture semantic information.To solve this problem, a short text matching model is proposed.This model uses the Lattice Long Short Term Memory(Lattice LSTM) to integrate the multi-granularity information of characters and character sequences, introduces external knowledge HowNet to solve the problem of polyonyms, and employs a soft-attention mechanism to capture the interaction between two sentences.Furthermore, mean and maximum pooling algorithms are used to extract sentence features and obtain semantic encoding representations at the sentence level.Experiments on LCQMC and BQ datasets show that the model effectively improves the performance of Chinese short text semantic matching compared with the ESIM, BIMPM, and Lattice-CNN models.
Keywords:short text semantic matching  Lattice Long Short Term Memory(Lattice LSTM)  multi-granularity information  external knowledge  soft-attention mechanism  
点击此处可从《计算机工程》浏览原始摘要信息
点击此处可从《计算机工程》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号