Exploiting Wikipedia and EuroWordNet to solve Cross-Lingual Question Answering
Authors: Sergio Ferrández
Affiliation: a Natural Language Processing and Information Systems Group, Department of Computing Languages and Systems, University of Alicante, P.O. Box 99, E-03080 Alicante, Spain
b Istituto di Linguistica Computazionale, Consiglio Nazionale delle Ricerche, Pisa, Italy
Abstract: This paper describes a new approach to solving Cross-Lingual Question Answering (CL-QA) tasks. It is built on three main pillars: (i) the use of several multilingual knowledge resources to cross-reference words between languages (the Inter Lingual Index (ILI) module of EuroWordNet and the multilingual knowledge encoded in Wikipedia); (ii) the consideration of more than one translation per word when searching for candidate answers; and (iii) the analysis of the question in its original language, without any translation process. This novel approach avoids the errors caused by the common use of Machine Translation (MT) services in CL-QA systems. We also present studies and experiments that justify the importance of analyzing whether a Named Entity should be translated or not. Experimental results in bilingual scenarios show that our approach performs better than an MT-based CL-QA approach, achieving an average improvement of 36.7%.
Keywords: Natural Language Processing; Cross-Lingual Question Answering; Inter Lingual Index; EuroWordNet; Wikipedia; Multilingual knowledge resources
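
As a rough illustration of pillars (i) and (ii) in the abstract, the sketch below merges translations of each question term from two resources and keeps every alternative instead of a single MT output. It is not the authors' implementation: the dictionaries `ILI_TOY` and `WIKI_TOY`, the functions `candidate_translations` and `expand_query`, and the English-to-Spanish entries are all hypothetical stand-ins for the EuroWordNet ILI and Wikipedia lookups, which the abstract does not specify in detail.

```python
from itertools import product

# Hypothetical toy data (assumption): stand-ins for the EuroWordNet ILI
# and for Wikipedia's cross-language links. Not taken from the paper.
ILI_TOY = {
    "president": ["presidente"],
    "capital": ["capital", "capitel"],
}
WIKI_TOY = {
    "capital": ["capital"],
    "United Nations": ["Naciones Unidas", "ONU"],
}


def candidate_translations(term, ili=ILI_TOY, wiki=WIKI_TOY):
    """Merge translations from both resources, in order, without duplicates."""
    merged = []
    for source in (ili, wiki):
        for translation in source.get(term, []):
            if translation not in merged:
                merged.append(translation)
    # A term unknown to both resources is kept untranslated, which is one
    # simple (assumed) way to treat Named Entities that should not change.
    return merged or [term]


def expand_query(terms):
    """Build every target-language query obtainable from the alternatives."""
    alternatives = [candidate_translations(term) for term in terms]
    return [" ".join(combo) for combo in product(*alternatives)]


if __name__ == "__main__":
    for query in expand_query(["capital", "United Nations"]):
        print(query)
```

Keeping all alternatives makes the number of queries grow multiplicatively with the number of ambiguous terms, so a real system would presumably rank or prune them; the abstract does not say how, and this sketch makes no attempt to.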
This article is indexed in ScienceDirect and other databases.