Cross-lingual entity matching and infobox alignment in Wikipedia |
| |
Authors: | Daniel Rinser, Dustin Lange, Felix Naumann |
| |
Affiliation: | Hasso Plattner Institute, Potsdam, Germany |
| |
Abstract: | Wikipedia has grown into a huge, multilingual source of encyclopedic knowledge. Apart from textual content, a large and ever-increasing number of articles feature so-called infoboxes, which provide factual information about the articles' subjects. Because the different language versions evolve independently, they provide different information on the same topics. Correspondences between infobox attributes in different language editions can be leveraged for several use cases, such as the automatic detection and resolution of inconsistencies in infobox data across language versions, or the automatic augmentation of infoboxes in one language with data from other language versions. |
| |
Keywords: | Entity resolution; Schema matching; Linked data; Data quality on the web |
This article is indexed in ScienceDirect and other databases. |
|