Graph-ranking collective Chinese entity linking algorithm |
| |
Authors: | Tao XIE Bin WU Bingjing JIA Bai WANG |
| |
Affiliation: | Beijing Key Laboratory of Intelligent Telecommunications Software and Multimedia, Beijing University of Posts and Telecommunications, Beijing 100876, China |
| |
Abstract: | Entity linking (EL) systems aim to link entity mentions in the document to their corresponding entity records in a reference knowledge base. Existing EL approaches usually ignore the semantic correlation between the mentions in the text, and are limited to the scale of the local knowledge base. In this paper, we propose a novel graphranking collective Chinese entity linking (GRCCEL) algorithm, which can take advantage of both the structured relationship between entities in the local knowledge base and the additional background information offered by external knowledge sources. By improved weighted word2vec textual similarity and improved PageRank algorithm, more semantic information and structural information can be captured in the document. With an incremental evidence mining process, more powerful discrimination capability for similar entities can be obtained.We evaluate the performance of our algorithm on some open domain corpus. Experimental results show the effectiveness of our method in Chinese entity linking task and demonstrate the superiority of our method over state-of-the-art methods. |
| |
Keywords: | collective entity linking knowledge mapping word embedding entity correlation graph PageRank |
本文献已被 维普 等数据库收录! |
| 点击此处可从《Frontiers of Computer Science》浏览原始摘要信息 |
|
点击此处可从《Frontiers of Computer Science》下载全文 |