首页 | 本学科首页   官方微博 | 高级检索  
     


Semi-supervised model adaptation for statistical machine translation
Authors:Nicola Ueffing  Gholamreza Haffari  Anoop Sarkar
Affiliation:(1) Interactive Language Technologies Group, National Research Council Canada, Gatineau, QC, Canada;(2) School of Computing Science, Simon Fraser University, Burnaby, BC, Canada
Abstract:Statistical machine translation systems are usually trained on large amounts of bilingual text (used to learn a translation model), and also large amounts of monolingual text in the target language (used to train a language model). In this article we explore the use of semi-supervised model adaptation methods for the effective use of monolingual data from the source language in order to improve translation quality. We propose several algorithms with this aim, and present the strengths and weaknesses of each one. We present detailed experimental evaluations on the French–English EuroParl data set and on data from the NIST Chinese–English large-data track. We show a significant improvement in translation quality on both tasks.
Keywords:Statistical machine translation  Self-training  Semi-supervised learning  Domain adaptation  Model adaptation
本文献已被 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号