首页 | 官方网站   微博 | 高级检索  
     

结合噪声网络的强化学习远程监督关系抽取
引用本文:谢斌红,王恩慧,张英俊.结合噪声网络的强化学习远程监督关系抽取[J].计算机工程与应用,2022,58(23):169-177.
作者姓名:谢斌红  王恩慧  张英俊
作者单位:太原科技大学 计算机科学与技术学院,太原 030024
摘    要:针对目前远程监督关系抽取任务中存在的错误标注问题,提出使用强化学习策略设计噪声指示器,通过与由关系分类器和噪声数据组成的环境相交互,动态识别每个关系类别的假正例与假负例,并为其重新分配正确的关系标签,从而将噪声数据转换成有用的训练样本,有利于提高远程监督关系抽取模型的性能;另外,在训练过程中,通过在策略网络权重上添加噪声,平衡策略网络的探索和利用问题,从而增强噪声指示器的探索能力,使噪声指示器更准确地选择出能够正确表达实体-关系的句子。在Freebase对齐NYT公共数据集上的实验结果表明,提出的方法可以显著提高远程监督关系抽取模型的性能,表明模型拥有识别并纠正噪声数据标签的能力,可以更好地学习关系特征。

关 键 词:远程监督关系抽取  强化学习  噪声网络  假负例  

Distant Supervision Relation Extraction Based on Reinforcement Learning with Noisy Network
XIE Binhong,WANG Enhui,ZHANG Yingjun.Distant Supervision Relation Extraction Based on Reinforcement Learning with Noisy Network[J].Computer Engineering and Applications,2022,58(23):169-177.
Authors:XIE Binhong  WANG Enhui  ZHANG Yingjun
Affiliation:School of Computer Science and Technology, Taiyuan University of Science and Technology, Taiyuan 030024, China
Abstract:Aiming at the noisy labeling problem in the current distant supervision relation extraction task, this paper proposes a reinforcement learning strategy to design a noisy indicator. By interacting with the environment composed of relation classifier and noisy data, the false positive instances and false negative instances of each relation category are dynamically identified, and the correct relation labels are redistributed, thus, the noisy data is transformed into useful training samples, which is helpful to improve the performance of the distant supervision relation extraction model. In addition, in the process of training, noise is added to the weight of policy network to balance the exploration and utilization of policy network, so as to enhance the exploration ability of noisy indicator and make the noisy indicator more accurately select sentences that can correctly express entity relationship. The experimental results on freebase aligned NYT public dataset show that the proposed method can significantly improve the performance of the distant supervision relation extraction model, which shows that the model has the ability to recognize and correct noisy data labels, and can better learn the relation features.
Keywords:distant supervision relation extraction  reinforcement learning  noisy network  false negative instances  
点击此处可从《计算机工程与应用》浏览原始摘要信息
点击此处可从《计算机工程与应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号