首页 | 本学科首页   官方微博 | 高级检索  
     

基于双词主题模型的半监督实体消歧方法研究
引用本文:张雄,陈福才,黄瑞阳.基于双词主题模型的半监督实体消歧方法研究[J].电子学报,2018,46(3):607-613.
作者姓名:张雄  陈福才  黄瑞阳
作者单位:国家数字交换系统工程技术研究中心, 河南郑州 450001
摘    要:针对实体上下文信息主题漂移的问题,提出一种基于双词主题模型的实体消歧方法.方法考虑到实体在一定语义环境下具有不同的主题,且在同一文档中同时出现的其他实体在一定程度上能够帮助待消歧实体确定所指代内容,利用命名实体构建双词的思想,将协同实体关系融合到主题模型中,并在此基础上利用维基百科知识库,进行半监督消歧.本文最后在网络文本数据上进行了相关的实验,验证了所提算法的有效性.实验表明该方法有效的提高了实体消歧精度.

关 键 词:实体消歧  维基百科  双词主题模型  
收稿时间:2016-07-11

Semi-supervised Entity Disambiguation Method Research Based on Biterm Topic Model
ZHANG Xiong,CHEN Fu-cai,HUANG Rui-yang.Semi-supervised Entity Disambiguation Method Research Based on Biterm Topic Model[J].Acta Electronica Sinica,2018,46(3):607-613.
Authors:ZHANG Xiong  CHEN Fu-cai  HUANG Rui-yang
Affiliation:National Digital Switching System Engineering and Technological R & D Center, Zhengzhou, Henan 450001, China
Abstract:Aimed at the problem of theme drift of the entity context information,this paper proposes an entity disambiguation method based on biterm topic model.The proposed method considers that the entity has a different theme in a certain semantic environment and the other entity appearing in the same document at the same time can help the disambiguated entity to determine the referred content to a certain extent.Therefore,using the ideas of named entity constructing double words to incorporate collaborative entity relationship to the topic model,and on this basis,we conduct semi-supervised disambiguation using Wikipedia knowledge base.Finally,this paper conducts some relevant experiments on the web text data,and verifies the effectiveness of the proposed algorithm.The experiments show that the proposed method effectively improve the precision of entity disambiguation.
Keywords:entity disambiguation  Wikipedia  biterm topic model  
点击此处可从《电子学报》浏览原始摘要信息
点击此处可从《电子学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号