首页 | 本学科首页   官方微博 | 高级检索  
     

基于GAN模型优化的神经机器翻译
引用本文:熊伟,高娟娟,刘锴.基于GAN模型优化的神经机器翻译[J].计算机系统应用,2022,31(12):95-103.
作者姓名:熊伟  高娟娟  刘锴
作者单位:华北电力大学(保定) 计算机系, 保定 071003;华北电力大学(保定) 复杂能源系统智能计算工程研究中心, 保定 071003
摘    要:针对神经机器翻译和人工翻译性能的差异最小化、训练语料不足问题, 提出了一种基于生成对抗网络的神经机器翻译改进方法.首先对目标端句子序列添加微小的噪声干扰, 通过编码器还原原始句子形成新的序列; 其次将编码器的处理结果交给判别器和解码器进一步处理, 在训练过程中, 判别器和双语评估基础值(BLEU)目标函数用于评估生成的句子, 并将结果反馈给生成器, 引导生成器学习及优化.实验结果表明, 对比传统的神经机器翻译模型, 基于GAN模型的方法极大地提高了模型的泛化能力和翻译的精度.

关 键 词:生成对抗网络  神经机器翻译  Transformer  BLEU  深度学习
收稿时间:2022/3/18 0:00:00
修稿时间:2022/4/14 0:00:00

Optimization of Neural Machine Translation with GAN Model
XIONG Wei,GAO Juan-Juan,LIU Kai.Optimization of Neural Machine Translation with GAN Model[J].Computer Systems& Applications,2022,31(12):95-103.
Authors:XIONG Wei  GAO Juan-Juan  LIU Kai
Affiliation:Computer Department, North China Electric Power University (Baoding), Baoding 071003, China;Engineering Research Center of Intelligent Computing for Complex Energy Systems, North China Electric Power University (Baoding), Baoding 071003, China
Abstract:To minimize the performance difference between neural machine translation (NMT) and human translation and solve the problem of insufficient training corpora, this study proposes an improved NMT method based on the generative adversarial network (GAN). First, the sentence sequence of the target end is added with small noise interference, and then the original sentence is restored by the encoder to form a new sequence. Secondly, the results of the encoder are presented to the discriminator and decoder for further processing. In the training process, the discriminator and the bilingual evaluation understudy (BLEU) objective function are employed to evaluate the generated sentences, and the results are fed back to the generator to instruct its learning and optimization. The experimental results demonstrate that compared with the traditional NMT model, the GAN-based model greatly improves the generalization ability and translation accuracy of the model.
Keywords:generative adversarial?network?(GAN)  neural machine translation (NMT)  Transformer  bilingual evaluation?understudy (BLEU)  deep learning
点击此处可从《计算机系统应用》浏览原始摘要信息
点击此处可从《计算机系统应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号