Semantic annotation of Web data applied to risk in food
Authors:Hignette Gaëlle  Buche Patrice  Couvert Olivier  Dibie-Barthélemy Juliette  Doussot David  Haemmerlé Ollivier  Mettler Eric  Soler Lydie
Affiliation:INRA MIA, Unité Mét@risk UR 1204, 16, rue Claude Bernard 75231 Paris Cédex 5, France; AgroParisTech, UFR Informatique, 16, rue Claude Bernard, 75 231 Paris Cedex 05, France. gaelle.hignette@gmail.com
Abstract:A preliminary step in food risk assessment is the gathering of experimental data. In the framework of the Sym'Previus project (http://www.symprevius.org), a complete data integration system has been designed, which groups data provided by industrial partners with data extracted from papers published in the main scientific journals of the domain. These data have been classified by means of a predefined vocabulary, called an ontology. Our aim is to complement the database with data extracted from the Web. In the framework of the WebContent project (www.webcontent.fr), we have designed a semi-automatic acquisition tool, called @WEB, which retrieves scientific documents from the Web. During the @WEB process, data tables are extracted from the documents and then annotated with the ontology. We focus on data tables because they generally contain a synthesis of the data published in the documents. In this paper, we explain how the columns of the data tables are automatically annotated with data types of the ontology and how the relations represented by each table are recognised. We also report the results of experiments assessing the quality of this annotation.
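The abstract does not spell out how columns are matched to ontology data types, so the following is only an illustrative sketch of the general idea, not the @WEB method. It assumes a toy ontology (the `ONTOLOGY` dictionary, its data type names, and its terms are all invented for this example) and a simple word-overlap (Jaccard) similarity: each cell votes for the data type whose terms it most resembles, and the column takes the majority vote.

```python
from collections import Counter

# Hypothetical toy ontology: data type -> known terms.
# These names and terms are assumptions made for illustration only.
ONTOLOGY = {
    "Microorganism": {"listeria monocytogenes", "escherichia coli", "salmonella"},
    "Food product": {"milk", "cheese", "minced beef"},
}


def tokens(text):
    """Lower-case a string and split it into a set of words."""
    return set(text.lower().split())


def jaccard(a, b):
    """Word-level Jaccard similarity between two strings (0.0 to 1.0)."""
    ta, tb = tokens(a), tokens(b)
    if not ta or not tb:
        return 0.0
    return len(ta & tb) / len(ta | tb)


def annotate_column(cells, threshold=0.5):
    """Return the data type that most cells resemble, or None if no cell matches.

    A cell votes for its best-matching data type only when the similarity
    to one of that type's terms reaches `threshold` (an arbitrary choice here).
    """
    votes = Counter()
    for cell in cells:
        best_type, best_score = None, 0.0
        for data_type, terms in ONTOLOGY.items():
            score = max(jaccard(cell, term) for term in terms)
            if score > best_score:
                best_type, best_score = data_type, score
        if best_type is not None and best_score >= threshold:
            votes[best_type] += 1
    if not votes:
        return None
    return votes.most_common(1)[0][0]


if __name__ == "__main__":
    print(annotate_column(["Listeria monocytogenes", "E. coli", "Salmonella"]))
    print(annotate_column(["whole milk", "cheese"]))
```

Under these assumptions, an abbreviation such as "E. coli" falls below the threshold and casts no vote, which hints at why a real system would need a richer similarity measure than plain word overlap.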
Keywords:
This article is indexed in PubMed and other databases.