Event graphs for information retrieval and multi-document summarization |
| |
Affiliation: | 1. Department of Electronics Convergence Engineering, Wonkwang University, 344-2, Shinyong-Dong, Iksan, Jeonbuk 570-749, South Korea;2. Department of Electrical and Computer Engineering, University of Alberta, Edmonton, Alberta T6G 2G7, Canada;3. Systems Research Institute, Polish Academy of Sciences, Warsaw, Poland |
| |
Abstract: | With the number of documents describing real-world events and event-oriented information needs rapidly growing on a daily basis, the need for efficient retrieval and concise presentation of event-related information is becoming apparent. Nonetheless, the majority of information retrieval and text summarization methods rely on shallow document representations that do not account for the semantics of events. In this article, we present event graphs, a novel event-based document representation model that filters and structures the information about events described in text. To construct the event graphs, we combine machine learning and rule-based models to extract sentence-level event mentions and determine the temporal relations between them. Building on event graphs, we present novel models for information retrieval and multi-document summarization. The information retrieval model measures the similarity between queries and documents by computing graph kernels over event graphs. The extractive multi-document summarization model selects sentences based on the relevance of the individual event mentions and the temporal structure of events. Experimental evaluation shows that our retrieval model significantly outperforms well-established retrieval models on event-oriented test collections, while the summarization model outperforms competitive models from shared multi-document summarization tasks. |
| |
Keywords: | Event extraction Information extraction Information retrieval Multi-document summarization Natural language processing |
本文献已被 ScienceDirect 等数据库收录! |
|