首页 | 本学科首页   官方微博 | 高级检索  
     

注意力机制对生成对抗网络语音增强迁移学习模型的影响
引用本文:曹中辉,黄志华,葛文萍,黄浩.注意力机制对生成对抗网络语音增强迁移学习模型的影响[J].声学技术,2021,40(1):77-81.
作者姓名:曹中辉  黄志华  葛文萍  黄浩
作者单位:新疆大学信息科学与工程学院, 信号检测与处理新疆维吾尔自治区重点实验室, 新疆乌鲁木齐 830001
基金项目:新疆维吾尔自治区自然科学基金项目资助(2017D01C044)
摘    要:基于深度学习的语音增强模型对训练集外语言语音和噪声进行降噪时,性能明显下降.为了解决这一问题,提出一种引入注意力机制的生成对抗网络(Generative Adversarial Network,GAN)语音增强迁移学习模型.在生成对抗语音增强模型的判别模型中引入注意力机制,以高资源场景下的大量语音数据训练得到的语音增强...

关 键 词:生成对抗网络(GAN)  语音增强  迁移学习  跨语言语音增强  注意力机制
收稿时间:2019/12/6 0:00:00
修稿时间:2020/2/3 0:00:00

Influence of attention mechanism on generative adversarial network speech enhancement transfer learning model
CAO Zhonghui,HUANG Zhihu,GE Wenping,HUANG Hao.Influence of attention mechanism on generative adversarial network speech enhancement transfer learning model[J].Technical Acoustics,2021,40(1):77-81.
Authors:CAO Zhonghui  HUANG Zhihu  GE Wenping  HUANG Hao
Affiliation:College of Information Science and Engineering, Key Laboratory of Signal Detection and Processing in Xinjiang Uygur Autonomous Region, Xinjaing University, Urumqi 830001, Xinjiang, China
Abstract:The deep learning based speech enhancement model encounters the problem of enhancement performance degradation when de-noising the unseen languages and noise in training sets. In order to solve this problem, a generative adversarial network (GAN) speech enhancement transfer learning model with attention mechanism (called ATGAN speech enhancement model) is proposed in this paper. The attention mechanism is introduced into the discriminator of GAN speech enhancement model. Based on the well-trained model obtained with high-resource materials and combining a small amount of speech training data in low-resource condition, the weight transfer of the basic enhancement model trained with low-resource data is carried out to improve the enhancement effect in low-resource condition. Experiments show that the use of ATGAN speech enhancement model can effectively enhance the denoising effect of low-resource noisy speech.
Keywords:generative adversarial network (GAN)  speech enhancement  transfer learning  cross-language speech enhancement  attention mechanism
点击此处可从《声学技术》浏览原始摘要信息
点击此处可从《声学技术》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号