首页 | 本学科首页   官方微博 | 高级检索  
     

基于短语成分表示的中文关系抽取
引用本文:刘娜娜,程婧,闵可锐,康昱,王新,周扬帆. 基于短语成分表示的中文关系抽取[J]. 数据采集与处理, 2020, 35(3): 449-457
作者姓名:刘娜娜  程婧  闵可锐  康昱  王新  周扬帆
作者单位:复旦大学计算机科学技术学院,上海,201203;上海智能电子与系统研究院,上海,201203;上海秘塔网络科技有限公司,上海,200135;微软亚洲研究院,北京,100080
基金项目:国家自然科学基金(61702107)资助项目; 赛尔网络下一代互联网技术创新(NGII20180611)资助项目。
摘    要:关系抽取是自然语言处理的重要研究内容,短语成分结构则是学界普遍认为能对关系抽取有重要影响的特征信息。然而目前短语成分应用于关系抽取任务时没有明显效果。这主要有两个原因:短语成分分析模型的泛化能力较差,会在关系抽取上造成错误传播,从而影响了它对关系抽取的有效性;关系抽取任务上使用短语成分特征的方式存在缺陷,即丧失短语成分分析学习到的句子结构信息,或者加大其对关系抽取的错误影响。本文在提升短语成分分析效果的基础上,提出了基于短语成分表示的中文关系抽取方法。该方法将短语成分分析模型学习到的文本表示嵌入到关系抽取模型中,从而提升关系抽取的性能。本文在公开的中文关系抽取数据集上验证了该方法的有效性。

关 键 词:短语成分表示  中文关系抽取  特征融合  短语成分分析
收稿时间:2019-08-20
修稿时间:2019-12-09

Chinese Relation Extraction Based on Constituency Representation
Liu Nan,Cheng Jing,Min Kerui,Kang Yu,Wang Xin,Zhou Yangfan. Chinese Relation Extraction Based on Constituency Representation[J]. Journal of Data Acquisition & Processing, 2020, 35(3): 449-457
Authors:Liu Nan  Cheng Jing  Min Kerui  Kang Yu  Wang Xin  Zhou Yangfan
Abstract:Relation extraction is an important research in the natural language processing (NLP) area. The constituency grammar information, which is widely believed by the academic community, has an important influence on relation extraction. However, there is no obvious effect when the phrase syntactic tree is applied to the relation extraction task. There are two main reasons for this: First, the generalization ability of the constituency parser is poor, which will cause error propagation and then affect its effectiveness in the relation extraction; Second, there are flaws in the way of the use of the phrase syntactic features in the relation extraction task,that is the phrase syntactic structure information learned by the constituency parser is lost, or the wrong influence on the relation extraction is increased. This paper proposes a Chinese relation extraction method based on constituency vector representation to solve the above two problems. The method embeds the text representation learned by the constituency parser into the relation extraction model, thereby improving the relation extraction performance. This paper validates the method on a public Chinese relation extraction data set.
Keywords:constituency vector representation  Chinese relation extraction  feature combination  constituency parser
本文献已被 万方数据 等数据库收录!
点击此处可从《数据采集与处理》浏览原始摘要信息
点击此处可从《数据采集与处理》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号