首页 | 本学科首页   官方微博 | 高级检索  
     

基于强化学习的实体关系联合抽取模型
引用本文:陈佳沣,滕冲. 基于强化学习的实体关系联合抽取模型[J]. 计算机应用, 2019, 39(7): 1918-1924. DOI: 10.11772/j.issn.1001-9081.2019010182
作者姓名:陈佳沣  滕冲
作者单位:武汉大学国家网络安全学院,武汉,430072;武汉大学国家网络安全学院,武汉,430072
基金项目:国家自然科学基金面上项目(61772378)。
摘    要:针对现有的基于远程监督的实体和关系抽取方法存在着标签噪声问题,提出了一种基于强化学习的实体关系联合抽取方法。该模型有两个模块:句子选择器模块和实体关系联合抽取模块。首先,句子选择器模块选择没有标签噪声的高质量句子,将所选句子输入到实体关系联合抽取模型;然后,实体关系联合抽取模块采用序列标注方法对输入的句子进行预测,并向句子选择器模块提供反馈,指导句子选择器模块挑选高质量的句子;最后,句子选择器模块和实体关系联合抽取模块同时训练,将句子选择与序列标注一起优化。实验结果表明,该模型在实体关系联合抽取中的F1值为47.3%,与CoType为代表的联合抽取模型相比,所提模型的F1值提升了1%;与LINE为代表的串行模型相比,所提模型的F1值提升了14%。结果表明强化学习结合实体关系联合抽取模型能够有效地提高序列标注模型的F1值,其中句子选择器能有效地处理数据的噪声。

关 键 词:强化学习  联合抽取  序列标注  命名实体识别  关系分类
收稿时间:2019-01-24
修稿时间:2019-03-21

Joint entity and relation extraction model based on reinforcement learning
CHEN Jiafeng,TENG Chong. Joint entity and relation extraction model based on reinforcement learning[J]. Journal of Computer Applications, 2019, 39(7): 1918-1924. DOI: 10.11772/j.issn.1001-9081.2019010182
Authors:CHEN Jiafeng  TENG Chong
Affiliation:School of Cyber Science and Engineering, Wuhan University, Wuhan Hubei 430072, China
Abstract:Existing entity and relation extraction methods that rely on distant supervision suffer from noisy labeling problem. A model for joint entity and relation extraction from noisy data based on reinforcement learning was proposed to reduce the impact of noise data. There were two modules in the model:an sentence selector module and a sequence labeling module. Firstly, high-quality sentences without labeling noise were selected by instance selector module and the selected sentences were input into sequence labeling module. Secondly, predictions were made by sequence labeling module and the rewards were provided to sentence selector module to help the module select high-quality sentences. Finally, two modules were trained jointly to optimize instance selection and sequence labeling processes. The experimental results show that the F1 value of the proposed model is 47.3% in the joint entity and relation extraction, which is 1% higher than those of joint extraction models represented by CoType and 14% higher than those of serial models represented by LINE(Large-scale Information Network Embedding). The results show that the joint entity and relation extraction model in combination with reinforcement learning can effectively improve F1 value of sequential labeling model, in which the sentence selector can effectively deal with the noise of data.
Keywords:reinforcement learning   joint extraction   sequence tagging   named entity recognition   relation classification
本文献已被 维普 万方数据 等数据库收录!
点击此处可从《计算机应用》浏览原始摘要信息
点击此处可从《计算机应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号