NERank+: a graph-based approach for entity ranking in document collections |
| |
Authors: | Chengyu Wang Guomin Zhou Xiaofeng He Aoying Zhou |
| |
Affiliation: | 1.Shanghai Key Laboratory of Trustworthy Computing, School of Computer Science and Software Engineering,East China Normal University,Shanghai,China;2.Department of Computer and Information Technology,Zhejiang Police College,Hangzhou,China;3.School of Data Science and Engineering,East China Normal University,Shanghai,China |
| |
Abstract: | Most entity ranking research aims to retrieve a ranked list of entities from a Web corpus given a user query. The rank order of entities is determined by the relevance between the query and contexts of entities. However, entities can be ranked directly based on their relative importance in a document collection, independent of any queries. In this paper, we introduce an entity ranking algorithm named NERank+. Given a document collection, NERank+ first constructs a graph model called Topical Tripartite Graph, consisting of document, topic and entity nodes. We design separate ranking functions to compute the prior ranks of entities and topics, respectively. A meta-path constrained random walk algorithm is proposed to propagate prior entity and topic ranks based on the graph model.We evaluate NERank+ over real-life datasets and compare it with baselines. Experimental results illustrate the effectiveness of our approach. |
| |
Keywords: | |
本文献已被 SpringerLink 等数据库收录! |
|