
宏观篇章结构表示体系和语料建设
Cite this article: 褚晓敏,奚雪峰,蒋峰,徐昇,朱巧明,周国栋.宏观篇章结构表示体系和语料建设[J].软件学报,2020,31(2):321-343.
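Authors: 褚晓敏  奚雪峰  蒋峰  徐昇  朱巧明  周国栋
Affiliation: Natural Language Processing Lab, Soochow University, Suzhou, Jiangsu 215006; School of Electronic and Information Engineering, Suzhou University of Science and Technology, Suzhou, Jiangsu 215009
Funding: National Natural Science Foundation of China (61773276, 61673290, 61836007)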
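Abstract: Discourse structure analysis is an important research direction in natural language processing. It helps in understanding the structure and semantics of a discourse and provides strong support for natural language processing applications such as automatic summarization, topic extraction, and question answering. Current work on discourse structure analysis concentrates on the micro level, where the focus is on relations and structures within a sentence or between sentences, while research at the macro level is comparatively scarce. This paper therefore takes discourse structure as its object of study and focuses on the representation schema and corpus resources for macro discourse structure. It discusses the importance of discourse structure analysis; reviews the state of research from three aspects (theoretical systems, corpus resources, and computational models); proposes a unified macro and micro discourse structure representation framework mediated by the primary-secondary relation; and constructs a logical semantic structure and a functional pragmatic structure for the macro discourse level. On this basis, a macro discourse structure corpus of 720 news reports is annotated, and the annotation results are examined through consistency analysis and annotation statistics.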

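Keywords: discourse structure analysis  macro discourse structure  discourse structure representation schema  logical semantic structure  functional pragmatic structure  corpus annotation
Received: 2018/1/9
Revised: 2019/4/19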

Macro Discourse Structure Representation Schema and Corpus Construction
CHU Xiao-Min, XI Xue-Feng, JIANG Feng, XU Sheng, ZHU Qiao-Ming and ZHOU Guo-Dong. Macro Discourse Structure Representation Schema and Corpus Construction[J]. Journal of Software, 2020, 31(2): 321-343.
Authors:CHU Xiao-Min  XI Xue-Feng  JIANG Feng  XU Sheng  ZHU Qiao-Ming and ZHOU Guo-Dong
Affiliation: Natural Language Processing Lab, Soochow University, Suzhou 215006, China; School of Electronic & Information Engineering, Suzhou University of Science and Technology, Suzhou 215009, China; Natural Language Processing Lab, Soochow University, Suzhou 215006, China; Natural Language Processing Lab, Soochow University, Suzhou 215006, China; Natural Language Processing Lab, Soochow University, Suzhou 215006, China; Natural Language Processing Lab, Soochow University, Suzhou 215006, China
Abstract: Discourse structure analysis is an important research topic in natural language processing. It not only helps in understanding the structure and semantics of a discourse, but also provides strong support for applications of natural language processing such as automatic summarization, topic extraction, and question answering. At present, discourse structure analysis concentrates mainly on the micro level, focusing on the relations and structures within sentences or between sentences, while analysis at the macro level has received much less attention. This paper therefore takes discourse structure as its research object and focuses on the representation schema and corpus resources at the macro level. It discusses the importance of discourse structure analysis, reviews the state of research from three aspects (theoretical systems, corpus resources, and computational models), and puts forward a unified macro-micro discourse structure representation framework with the primary-secondary relation as the medium. It then constructs the logical semantic structure and the functional pragmatic structure of the macro discourse level. On this basis, a macro Chinese discourse structure corpus of 720 newswire articles is annotated, and the annotations are analyzed for consistency and summarized statistically.
Keywords: discourse structure analysis  macro discourse structure  discourse structure representation schema  logical semantic structure  functional pragmatic structure  corpus annotation
This article is indexed in 万方数据 (Wanfang Data) and other databases.