首页 | 本学科首页   官方微博 | 高级检索  
     

面向科技情报分析的知识库构建方法
引用本文:王勇,江洋,王红滨,侯莎.面向科技情报分析的知识库构建方法[J].计算机工程与应用,2022,58(22):142-149.
作者姓名:王勇  江洋  王红滨  侯莎
作者单位:1.哈尔滨工程大学 计算机科学与技术学院,哈尔滨 150001 2.中国船舶集团有限公司 第七一四研究所,北京 100101
摘    要:在知识库构建中,最重要的部分就是提取文本中的三元组,而三元组的提取需要实体抽取和实体关系抽取技术。针对实体抽取提出了一种CWATT-BiLSTM-LSTMd(character word attention-bidirectional long short-term memory-long short-term memory)模型。该模型可以有效解决实体抽取中一词多义问题,并且可以模拟标签的依赖问题。在实体抽取的基础上进行实体关系的抽取,为解决实体关系抽取中远程监督的局限性,提出一种基于强化深度学习的RL-TreeLSTM(reinforcement learning tree long short-term memory)模型。该模型分为选择器和分类器,选择器选择有效的句子传入分类器,分类器对句子中实体对的关系标签进行预测。选择器和分类器共同训练以优化选择和分类过程,可以有效降低远程监督带来的噪音。实验结果表明,提出的模型和方法能有效地提高实体及其关系的抽取性能。

关 键 词:知识库构建  神经网络  强化学习  实体抽取  实体关系抽取  

Knowledge Base Construction Method for Scientific and Technical Information Analysis
WANG Yong,JIANG Yang,WANG Hongbin,HOU Sha.Knowledge Base Construction Method for Scientific and Technical Information Analysis[J].Computer Engineering and Applications,2022,58(22):142-149.
Authors:WANG Yong  JIANG Yang  WANG Hongbin  HOU Sha
Affiliation:1.College of Computer Science and Technology, Harbin Engineering University, Harbin 150001, China 2.The 714 Research Institute of China State Shipbuilding Corporation Limited, Beijing 100101, China
Abstract:In the construction of knowledge base, the most important part is to extract the triplets in the text, and the extraction of triples requires entity extraction and entity relationship extraction techniques. A CWATT-BiLSTM-LSTMd(character word attention-bidirectional long short-term memory-long short-term memory) model for entity extraction is proposed. The model can effectively solve the polysemy problem in entity extraction, and simulate the dependency of the tag. On the basis of entity extraction, entity relationship extraction is performed. To solve the limitation of remote supervision in entity relationship extraction, a RL-TreeLSTM(reinforcement learning tree long short-term memory) model based on enhanced deep learning is proposed. The selector and classifier are trained together to optimize the selection and classification process, which can effectively reduce the noise caused by remote supervision. The experimental results show that the proposed model in this paper can effectively extract entities and their relationships.
Keywords:knowledge base construction  neural networks  reinforcement learning  entity extraction  entity relation extraction  
点击此处可从《计算机工程与应用》浏览原始摘要信息
点击此处可从《计算机工程与应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号