首页 | 本学科首页   官方微博 | 高级检索  
     

基于神经网络的复句判定及其关系识别研究
引用本文:贾旭楠,魏庭新,曲维光,顾彦慧,周俊生. 基于神经网络的复句判定及其关系识别研究[J]. 计算机工程, 2021, 47(11): 54-61. DOI: 10.19678/j.issn.1000-3428.0059269
作者姓名:贾旭楠  魏庭新  曲维光  顾彦慧  周俊生
作者单位:南京师范大学 计算机科学与技术学院,南京 210023;南京师范大学 国际文化教育学院,南京210097;南京师范大学 文学院,南京 210097;南京师范大学 计算机科学与技术学院,南京 210023;南京师范大学 文学院,南京 210097
基金项目:国家自然科学基金“汉语抽象意义表示关键技术研究”(61772278);江苏省高校哲学社会科学基金“面向机器学习的汉语复句语料库建设研究”(2019JSA0220)。
摘    要:复句是自然语言的基本单位之一,复句的判定及其语义关系的识别,对于句法解析、篇章理解等都有着非常重要的作用。基于神经网络模型识别自然语料中的复句,判断其复句关系,构造复句判定和复句关系识别联合模型,以最大程度地减少误差传递。在复句判定任务中通过Bi-LSTM获得上下文语义信息,采用注意力机制捕获句内跨距离搭配信息,利用CNN捕获句子局部信息。在复句关系识别任务中,使用Bert增强句子的语义表示,运用Tree-LSTM对句法结构和成分标记进行建模。在CAMR中文语料上的实验结果表明,基于注意力机制的复句判定模型F1值达到91.7%,基于Tree-LSTM的复句关系识别模型F1值达到69.15%。在联合模型中,2项任务的F1值分别达到92.15%和66.25%,说明联合学习能够使不同任务获得更多特征,从而提高模型性能。

关 键 词:复句判定  神经网络  复句关系识别  联合模型  语义建模
收稿时间:2020-08-17
修稿时间:2020-10-12

Study on Complex Sentence Identification and Its Relation Recognition Based on Neural Network
JIA Xunan,WEI Tingxin,QU Weiguang,GU Yanhui,ZHOU Junsheng. Study on Complex Sentence Identification and Its Relation Recognition Based on Neural Network[J]. Computer Engineering, 2021, 47(11): 54-61. DOI: 10.19678/j.issn.1000-3428.0059269
Authors:JIA Xunan  WEI Tingxin  QU Weiguang  GU Yanhui  ZHOU Junsheng
Affiliation:1. School of Computer Science and Technology, Nanjing Normal University, Nanjing 210023, China;2. International College for Chinese Studies, Nanjing Normal University, Nanjing 210097, China;3. School of Chinese Language and Literature, Nanjing Normal University, Nanjing 210097, China
Abstract:Complex sentence is one of the basic units in natural languages.The identification of complex sentences and the recognition of their semantic relations are crucial to syntactic parsing and text understanding.In this study,a neural network model is used to recognize the complex sentences in texts and determine the relationships between them.A model is constructed for the joint recognition of complex sentences and their semantic relations to minimize the propagation of errors.For recognition of complex sentences,a Bi-LSTM model is used to obtain sentence-level contextual semantic information,an attention mechanism to capture the cross-distance collocation information within a sentence,and a Convolutional Neural Network(CNN) to capture the local information of the sentences.For recognition of complex sentence relationships,Bert is used to enhance the semantic representation of sentences,and Tree-LSTM is used to model syntactic structure and component tags.The experimental results on the Chinese corpus dataset,CAMR,show that the F1 value of the attention mechanism-based model reaches 91.7% in complex sentence recognition,and that of the Tree-LSTM-based model reaches 69.15% in recognition of complex sentence relationships.The F1 value of the joint model reaches 92.15% and 66.25% in the two tasks respectively,which proves that joint learning increases the number of obtained features and thus improves the model performance.
Keywords:complex sentence identification  neural network  complex sentence relation recognition  joint model  semantic modeling  
本文献已被 万方数据 等数据库收录!
点击此处可从《计算机工程》浏览原始摘要信息
点击此处可从《计算机工程》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号