首页 | 本学科首页   官方微博 | 高级检索  
     

面向汉维机器翻译的双语关联度优化模型
引用本文:潘一荣,李晓,杨雅婷.面向汉维机器翻译的双语关联度优化模型[J].计算机应用研究,2020,37(3):726-730.
作者姓名:潘一荣  李晓  杨雅婷
作者单位:中国科学院新疆理化技术研究所,乌鲁木齐830011;中国科学院大学,北京100049;新疆民族语音语言信息处理实验室,乌鲁木齐830011;中国科学院新疆理化技术研究所,乌鲁木齐830011;新疆民族语音语言信息处理实验室,乌鲁木齐830011
基金项目:中国科学院青年创新促进会项目;新疆维吾尔自治区高层次人才引进工程项目;国家自然科学基金;西部之光"人才培养计划;新疆维吾尔自治区项目
摘    要:针对汉语—维吾尔语的统计机器翻译系统中存在的语义无关性问题,提出基于神经网络机器翻译方法的双语关联度优化模型。该模型利用注意力机制捕获词对齐信息,引入双语短语间的语义相关性和内部词汇匹配度,预测双语短语的生成概率并将其作为双语关联度,以优化统计翻译模型中的短语翻译得分。在第十一届全国机器翻译研讨会(CWMT 2015)汉维公开机器翻译数据集上的实验结果表明,与基线系统相比,在使用较小规模的训练数据和词汇表的条件下,所提方法可以有效地同时提高短语级别和句子级别的机器翻译任务性能,分别获得最高2.49和0.59的BLEU值提升。

关 键 词:维吾尔语  神经网络机器翻译  注意力机制  词对齐  生成概率
收稿时间:2018/8/22 0:00:00
修稿时间:2020/1/21 0:00:00

Bilingual relatedness optimization model for Chinese-Uyghur machine translation
panyirong,lixiao and yangyating.Bilingual relatedness optimization model for Chinese-Uyghur machine translation[J].Application Research of Computers,2020,37(3):726-730.
Authors:panyirong  lixiao and yangyating
Affiliation:University of Chinese Academy of Sciences; Xinjiang Technical Institute of Physics DdDd Chemistry, Chinese Academy of Sciences,,
Abstract:Focused on the issue of semantic independence in Chinese-Uyghur statistical machine translation system, this paper proposed a bilingual relatedness optimization model based on neural machine translation method. The model utilized the attention mechanism to capture word alignment information as well as introduced bilingual phrase semantic relevance and inner word correlation to predict the conditional probability of bilingual phrase pair. And it took the probability as bilingual relatedness to optimize the phrase translation scores in statistical translation model. Experimental results on the 11th China Workshop on Machine Translation(CWMT 2015) Chinese-Uyghur public machine translation datasets show that the proposed approach can achieve obvious improvements both in the phrase-level and the sentence-level machine translation tasks, which outperforms the baseline system with a relative small-scale training data and vocabulary. The highest BLEU point of the proposed algorithm gain 2.49 and 0.59 respectively.
Keywords:Uyghur  neural network machine translation  attention mechanism  word alignment  conditional probability
本文献已被 万方数据 等数据库收录!
点击此处可从《计算机应用研究》浏览原始摘要信息
点击此处可从《计算机应用研究》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号