
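Research on Mongolian-Chinese Machine Translation Based on LSTM
Cite this article: LIU Wanwan, SU Yila, WU Nier, RENQING Daoerji. Research on Mongolian-Chinese machine translation based on LSTM[J]. Computer Engineering & Science (计算机工程与科学), 2018, 40(10): 1890-1896.
Authors: LIU Wanwan (刘婉婉)  SU Yila (苏依拉)  WU Nier (乌尼尔)  RENQING Daoerji (仁庆道尔吉)
Funding: National Natural Science Foundation of China (61363052, 61502255); Natural Science Foundation of Inner Mongolia Autonomous Region (2016MS0605); Inner Mongolia Ethnic Affairs Commission Fund (MW 2017 MGYWXXH 03)
Abstract: Mongolian-Chinese machine translation in Inner Mongolia lags behind, and the available parallel bilingual corpus is small, so traditional statistical machine translation suffers from data sparsity and overfitting during training, which lowers translation quality. To address this, the paper proposes an LSTM-based Mongolian-Chinese neural machine translation method that uses long short-term memory (LSTM) to build an end-to-end neural network framework and models the Mongolian-Chinese translation system. To capture Mongolian semantic information more effectively, Mongolian words are segmented into morphemes according to the characteristics of the language before being fed into the model. A local attention mechanism is introduced to compute the weights of the source-language morphemes associated with each target word, yielding alignment probabilities between Mongolian and Chinese words and thereby improving translation quality. Experimental results show that the method improves translation quality over traditional Mongolian-Chinese translation systems.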

Keywords: attention  end-to-end model  machine translation  Mongolian-Chinese  LSTM neural network
Received: 2017-05-31
Revised: 2018-10-25

Mongolian-Chinese machine translation based on LSTM
LIU Wan wan,SU Yi la,WU Ni er,RENQING Dao er ji.Mongolian-Chinese machine translation based on LSTM[J].Computer Engineering & Science,2018,40(10):1890-1896.
Authors:LIU Wan wan  SU Yi la  WU Ni er  RENQING Dao er ji
Affiliation:(College of Information Engineering,Inner Mongolia University of Technology,Hohhot 010080,China)
Abstract:Because the Mongolian-Chinese bilingual parallel corpus is small, traditional statistical machine translation suffers from data sparsity and overfitting during training, and its translation quality for Mongolian-Chinese needs to be improved. To address this, we propose a Mongolian-Chinese neural machine translation method based on LSTM. It uses the long short-term memory (LSTM) model to build an end-to-end neural network framework and models the Mongolian-Chinese machine translation system. To capture Mongolian semantic information more effectively, Mongolian words are segmented into morphemes according to the characteristics of the Mongolian language, and the morphemes are fed into the model. In addition, a local attention mechanism is introduced to calculate the weights of the source morphemes associated with each target word, which yields alignment probabilities between Mongolian and Chinese words and improves translation quality. Experimental results show that the proposed method achieves better translation quality than the traditional Mongolian-Chinese translation system.
Keywords:attention  end-to-end model  machine translation  Mongolian-Chinese  LSTM neural network  
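
The abstract does not spell out how the local attention weights are computed. As a rough illustration only, the sketch below assumes a common formulation in which dot-product scores are normalized with a softmax over a window of source positions and then scaled by a Gaussian centred on an aligned position. The function names, the `center` and `half_width` parameters, and the toy vectors are hypothetical and are not taken from the paper.

```python
import math


def local_attention(query, keys, center, half_width):
    """Weights over source positions in [center - half_width, center + half_width].

    query: decoder hidden state for the current target word (list of floats).
    keys: encoder hidden states, one per source morpheme (list of lists).
    center: assumed aligned source position for this target word.
    half_width: assumed window half-width D; the Gaussian uses sigma = D / 2.
    """
    if half_width < 1:
        raise ValueError("half_width must be at least 1")
    lo = max(0, math.ceil(center - half_width))
    hi = min(len(keys) - 1, math.floor(center + half_width))
    positions = list(range(lo, hi + 1))

    # Dot-product alignment score between the decoder state and each source state.
    scores = [sum(q * k for q, k in zip(query, keys[s])) for s in positions]

    # Numerically stable softmax restricted to the window.
    top = max(scores)
    exps = [math.exp(x - top) for x in scores]
    total = sum(exps)

    # Scale by a Gaussian so positions near the center are favoured.
    # The weights are not renormalized afterwards in this sketch.
    sigma = half_width / 2.0
    weights = {}
    for s, e in zip(positions, exps):
        gauss = math.exp(-((s - center) ** 2) / (2 * sigma ** 2))
        weights[s] = (e / total) * gauss
    return weights


def context_vector(weights, keys):
    """Weighted sum of the encoder states selected by the attention window."""
    dim = len(keys[0])
    return [sum(w * keys[s][i] for s, w in weights.items()) for i in range(dim)]


if __name__ == "__main__":
    # Toy encoder states for six source morphemes (placeholder values).
    encoder_states = [[0.1, 0.3], [0.5, 0.2], [0.9, 0.1],
                      [0.4, 0.4], [0.2, 0.8], [0.0, 0.6]]
    decoder_state = [1.0, 0.5]
    w = local_attention(decoder_state, encoder_states, center=2.0, half_width=1)
    print(w)
    print(context_vector(w, encoder_states))
```

Restricting the softmax to a window keeps the cost per target word independent of sentence length, and the Gaussian term concentrates weight on morphemes near the assumed alignment point.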