首页 | 本学科首页   官方微博 | 高级检索  
     

基于SAO结构的中文专利实体关系抽取
引用本文:张永真,吕学强,申闫春,徐丽萍.基于SAO结构的中文专利实体关系抽取[J].计算机工程与设计,2019,40(3):706-712.
作者姓名:张永真  吕学强  申闫春  徐丽萍
作者单位:北京信息科技大学 网络文化与数字传播北京市重点实验室,北京,100101;北京信息科技大学 虚拟现实与系统仿真实验室,北京,100085;北京城市系统工程研究中心,北京,100089
基金项目:国家自然科学基金;北京成像技术高精尖创新中心基金项目;国家社会科学基金;国家自然科学基金
摘    要:针对当前中文专利文本实体关系抽取中采用词法特征、上下文特征、距离特征等传统特征导致抽取效率低的问题,提出一种将传统特征和句法语义特征相结合的方法。将中文专利文本的关系抽取问题转换为SAO结构的识别问题,进行分词和实体标注,抽取专利文本中的候选SAO三元组;提取候选SAO三元组的传统特征和句法语义特征;利用xg-boost算法在这些特征上做训练和预测,对特征的有效性进行实验分析。实验结果表明,该方法较使用传统特征的方法有明显提高,验证了句法语义特征的有效性。

关 键 词:关系抽取  梯度提升算法  SAO结构  句法特征  语义特征

Chinese patent entity relation extraction based on subject action object structure
ZHANG Yong-zhen,LYU Xue-qiang,SHEN Yan-chun,XU Li-ping.Chinese patent entity relation extraction based on subject action object structure[J].Computer Engineering and Design,2019,40(3):706-712.
Authors:ZHANG Yong-zhen  LYU Xue-qiang  SHEN Yan-chun  XU Li-ping
Affiliation:(Beijing Key Laboratory of Internet Culture and Digital Dissemination Research,Beijing Information Science and Technology University,Beijing 100101,China;Laboratory of VR and System Simulation,Beijing Information Science and Technology University,Beijing 100085,China;Beijing Research Center of Urban System Engineering,Beijing 100089,China)
Abstract:To solve the problem that relation exaction from Chinese patent literatures uses traditional features such as word features, context features and distance features, leading to low extraction efficiency, a method combining traditional features with syntactic semantic features was proposed. Relation exaction from Chinese patent literatures was transferred into recognition problem of SAO structure. Word segmentation and entity tagging were used to extract the candidate SAO three tuple in the patent literatures. The traditional features and the syntactic semantic features were extracted in candidate three tuple. The xgboost was used to train these features and the efficiency of those features were analyzed. Experimental results show that the proposed method is more effective than methods using traditional features, and the validity of syntactic semantic features is verified.
Keywords:relation extraction  xgboost  SAO structure  syntactic features  semantic features
本文献已被 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号