
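Joint Extraction of Entities and Relations Based on CWHC-AM
Cite this article: LI Hongyu, DUAN Liguo, HOU Chenlei, YAO Longfei. Joint Extraction of Entities and Relations Based on CWHC-AM[J]. Journal of Chinese Information Processing, 2022, 36(11): 79-90.
Authors: LI Hongyu  DUAN Liguo  HOU Chenlei  YAO Longfei
Affiliations: 1. College of Information and Computer, Taiyuan University of Technology, Taiyuan, Shanxi 030024;
2. Transfer Preparatory Office, Shanxi Normal University College of Modern Art and Sciences, Linfen, Shanxi 041000
Funding: Basic Research Program of the Shanxi Provincial Department of Science and Technology (201801D121137)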
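Abstract: Entity and relation extraction aims to extract triples from unstructured natural language text. Traditional pipeline methods extract entities first and relations second, which tends to propagate errors and ignores the intrinsic connections and dependencies between the two subtasks; as a result, they perform poorly on n-ary (multiple) relations and overlapping relations. To address these problems, this paper first converts the n-ary relation problem into several binary relation problems, taking full account of the connection between the two subtasks, and proposes a joint entity and relation extraction model based on CWHC-AM (character word hybrid coding and attention mechanism). A multi-layer pointer network labeling scheme turns the joint extraction task into a sequence labeling problem and makes overlapping relations extractable. Finally, adversarial training is introduced to improve the robustness of the model. Experiments on the Baidu DuIE 2.0 Chinese dataset show that the method can effectively extract n-ary and binary relations at the same time and outperforms all of the baseline models.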

Keywords: relation extraction  joint extraction  multiple relations  adversarial training
Received: 2021-07-16
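
The abstract describes a multi-layer pointer labeling scheme that casts joint extraction as sequence labeling and supports overlapping relations. The sketch below is only a minimal illustration of that general idea, not the paper's implementation: the layer layout (one start/end pointer pair per relation and role), the function names `encode` and `decode`, and the toy sentence are all assumptions introduced here. Pairing every decoded subject with every decoded object inside a relation layer is a further simplification.

```python
from typing import Dict, List, Tuple

Triple = Tuple[str, str, str]  # (subject, relation, object)
Layer = Dict[str, List[int]]   # {"start": [...], "end": [...]}, one flag per character


def find_span(text: str, mention: str) -> Tuple[int, int]:
    """Return inclusive (start, end) character offsets of the first occurrence."""
    start = text.index(mention)
    return start, start + len(mention) - 1


def encode(text: str, triples: List[Triple],
           relations: List[str]) -> Dict[Tuple[str, str], Layer]:
    """Build one start/end pointer layer per (relation, role) pair (assumed layout)."""
    n = len(text)
    labels = {(rel, role): {"start": [0] * n, "end": [0] * n}
              for rel in relations for role in ("subj", "obj")}
    for subj, rel, obj in triples:
        for role, mention in (("subj", subj), ("obj", obj)):
            s, e = find_span(text, mention)
            labels[(rel, role)]["start"][s] = 1
            labels[(rel, role)]["end"][e] = 1
    return labels


def decode_spans(text: str, start: List[int], end: List[int]) -> List[str]:
    """Match each start pointer with the nearest end pointer at or after it."""
    spans = []
    for i, flag in enumerate(start):
        if flag:
            for j in range(i, len(end)):
                if end[j]:
                    spans.append(text[i:j + 1])
                    break
    return spans


def decode(text: str, labels: Dict[Tuple[str, str], Layer],
           relations: List[str]) -> List[Triple]:
    """Recover triples by pairing subjects and objects within each relation layer."""
    triples = []
    for rel in relations:
        subjects = decode_spans(text, **labels[(rel, "subj")])
        objects = decode_spans(text, **labels[(rel, "obj")])
        triples.extend((s, rel, o) for s in subjects for o in objects)
    return triples


# Toy example: the same entity pair takes part in two relations (overlap).
text = "Paris is the capital and largest city of France"
relations = ["capital_of", "located_in"]
gold = [("Paris", "capital_of", "France"), ("Paris", "located_in", "France")]

labels = encode(text, gold, relations)
print(decode(text, labels, relations))
```

Because each relation owns its own layers, the two triples sharing the pair (Paris, France) do not compete for a single tag per character, which is why a layered pointer scheme can represent overlapping relations where a single tag sequence cannot.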

Joint Extraction of Entities and Relations Based on CWHC-AM
LI Hongyu, DUAN Liguo, HOU Chenlei, YAO Longfei. Joint Extraction of Entities and Relations Based on CWHC-AM[J]. Journal of Chinese Information Processing, 2022, 36(11): 79-90.
Authors:LI Hongyu  DUAN Liguo  HOU Chenlei  YAO Longfei
Affiliation:1.College of Information and Computer, Taiyuan University of Technology, Taiyuan, Shanxi 030024, China; 2.Transfer Preparatory Office, Shanxi Normal University College of Modern Art and Sciences, Linfen, Shanxi 041000, China
Abstract:Entity and relation extraction aims to extract triples from unstructured natural language text. Existing pipeline methods extract entities first and then relations, and so fail to capture the internal connections and dependencies between the two subtasks. This article proposes a joint extraction model of entities and relations based on CWHC-AM, which treats each multiple (n-ary) relation as a set of binary relation extraction tasks. A multi-layer pointer network labeling scheme is adopted to transform the joint extraction of entities and relations into a sequence labeling problem, and adversarial training is introduced to improve the robustness of the model. Experiments on the Baidu DuIE 2.0 Chinese dataset show that the method can effectively extract multiple relations and binary relations at the same time, with better results than the baseline models.
Keywords:relation extraction  joint extraction  multiple relations  adversarial training  
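
The abstract states that a multiple (n-ary) relation is treated as several binary relation tasks but does not give the conversion rule. The sketch below shows one plausible decomposition, under stated assumptions: each relation is represented as a subject, a predicate, and a dictionary of object slots; the main slot is named `@value` (an assumption made here for illustration); and every other slot becomes its own binary relation named `predicate_slot`. The function name `to_binary` and the sample values are hypothetical.

```python
from typing import Dict, List, Tuple

Triple = Tuple[str, str, str]  # (subject, relation, object)


def to_binary(subject: str, predicate: str, slots: Dict[str, str]) -> List[Triple]:
    """Split one n-ary relation into binary triples, one per object slot.

    Assumed convention: the main slot "@value" keeps the bare predicate,
    and any other slot is renamed to "<predicate>_<slot>".
    """
    triples = []
    for slot, value in slots.items():
        relation = predicate if slot == "@value" else f"{predicate}_{slot}"
        triples.append((subject, relation, value))
    return triples


# Hypothetical n-ary fact: a film's box office figure, qualified by a region.
binary = to_binary("Film X", "box_office", {"@value": "1 billion", "inArea": "Region Y"})
print(binary)
```

After this step every fact is a plain (subject, relation, object) triple, so a single binary extraction model can cover both the originally binary and the originally n-ary relations.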