Semantic annotation of Web data applied to risk in food |
| |
Authors: | Hignette Gaëlle Buche Patrice Couvert Olivier Dibie-Barthélemy Juliette Doussot David Haemmerlé Ollivier Mettler Eric Soler Lydie |
| |
Affiliation: | INRA MIA, Unité Mét@risk UR 1204, 16, rue Claude Bernard 75231 Paris Cédex 5, France; AgroParisTech, UFR Informatique, 16, rue Claude Bernard, 75 231 Paris Cedex 05, France. gaelle.hignette@gmail.com |
| |
Abstract: | A preliminary step to risk in food assessment is the gathering of experimental data. In the framework of the Sym'Previus project (http://www.symprevius.org), a complete data integration system has been designed, grouping data provided by industrial partners and data extracted from papers published in the main scientific journals of the domain. Those data have been classified by means of a predefined vocabulary, called ontology. Our aim is to complement the database with data extracted from the Web. In the framework of the WebContent project (www.webcontent.fr), we have designed a semi-automatic acquisition tool, called @WEB, which retrieves scientific documents from the Web. During the @WEB process, data tables are extracted from the documents and then annotated with the ontology. We focus on the data tables as they contain, in general, a synthesis of data published in the documents. In this paper, we explain how the columns of the data tables are automatically annotated with data types of the ontology and how the relations represented by the table are recognised. We also give the results of our experimentation to assess the quality of such an annotation. |
| |
Keywords: | |
本文献已被 PubMed 等数据库收录! |
|