首页 | 本学科首页   官方微博 | 高级检索  
     

基于篇章主题的中文宏观篇章主次关系识别方法
引用本文:孙振华,周懿,朱巧明,蒋峰,李培峰. 基于篇章主题的中文宏观篇章主次关系识别方法[J]. 中文信息学报, 2021, 34(12): 30-38
作者姓名:孙振华  周懿  朱巧明  蒋峰  李培峰
作者单位:苏州大学 计算机科学与技术学院,江苏 苏州 215006
基金项目:国家自然科学基金(61836007,61773276,61772354);江苏省高校优势学科建设工程资助项目
摘    要:篇章分析是自然语言处理领域研究的热点和重点。作为篇章分析的任务之一,篇章主次关系研究篇章的主要和次要内容,从而更好地理解和把握篇章的核心内容。该文重点研究宏观领域的中文篇章主次关系,提出了一种基于篇章主题的中文宏观篇章主次关系识别方法。该方法利用篇章单元间、篇章单元与篇章主题间的语义交互来识别主次关系,并有选择地应用篇章主题信息,有效提高了主次关系核心的识别。在中文宏观汉语篇章树库(MCDTB)上的实验结果显示,该方法优于目前性能最好的基准系统。

关 键 词:篇章分析  宏观篇章主次识别  BERT  篇章主题  

Recognizing Chinese Macro Discourse Nuclearity Based on Discourse Topic
SUN Zhenhua,ZHOU Yi,ZHU Qiaoming,JIANG Feng,LI Peifeng. Recognizing Chinese Macro Discourse Nuclearity Based on Discourse Topic[J]. Journal of Chinese Information Processing, 2021, 34(12): 30-38
Authors:SUN Zhenhua  ZHOU Yi  ZHU Qiaoming  JIANG Feng  LI Peifeng
Affiliation:School of Computer Science and Technology, Soochow University, Suzhou, Jiangsu 215006, China
Abstract:Discourse analysis is a hot topic in the field of Natural Language Processing. Discourse nuclearity recognition, a subtask of discourse analysis, focuses on recognizing the main and secondary content of a discourse, to better understand and grasp its core content. This paper focuses on the task of macro Chinese discourse nulcearity recognition and proposes a recognition method based on discourse topic. This method introduces the semantic interaction between different discourse units and that between the discourse unit and its topic to identify the nuclearity. Moreover, it applies the selection mechanism of the discourse topic to further improve the performance of nuclearity recognition.Experimental results on MCDTB show that the proposed method outperforms the state-of-the-art baselines.
Keywords:discourse parsing    macro discourse nuclearity recognition    BERT    discourse topic  
点击此处可从《中文信息学报》浏览原始摘要信息
点击此处可从《中文信息学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号