首页 | 本学科首页   官方微博 | 高级检索  
     

基于细粒度韵律建模和条件CycleGAN的非平行蒙古语语音转换方法
引用本文:吴则诚,飞龙,张晖,王海波.基于细粒度韵律建模和条件CycleGAN的非平行蒙古语语音转换方法[J].信号处理,2021,37(10):1825-1834.
作者姓名:吴则诚  飞龙  张晖  王海波
作者单位:内蒙古大学计算机学院
基金项目:国家重点研发计划项目(2018YFE0122900);国家自然基金项目(62066033,61773224);内蒙古自治区应用技术研究与开发资金项目(2019GG372,2020GG0046,2021GG0158,2020PT0002);内蒙古自治区成果转化项目(2019CG028)
摘    要:语音转换技术在保持语义内容不变的前提下将源说话人的语音音色转换为目标说话人。目前,蒙古语语音转换面临语料匮乏、蒙古语字词在发音上韵律变化丰富等问题。针对这些问题,本文提出一种基于细粒度韵律建模和条件CycleGAN的非平行蒙古语语音转换方法。该方法首先使用连续小波变换提取细粒度的语音韵律特征,然后向CycleGAN中加入说话人向量构建条件CycleGAN,最后使用条件CycleGAN得到源说话人和目标说话人之间稳定的韵律转换。实验结果表明,该方法与传统CycleGAN语音转换方法相比能够有效提升蒙古语语音转换效果,在语音自然度和说话人相似度的MOS评分上分别提升了0.1和0.2。 

关 键 词:蒙古语语音转换    非平行    条件CycleGAN    细粒度韵律建模
收稿时间:2021-08-13

Non-parallel Mongolian Voice Conversion Method Based on Fine-grained Prosody Modeling and Conditional CycleGAN
Affiliation:College of Computer Science, Inner Mongolia UniversityNational & Local Joint Engineering Research Center of Intelligent Information Processing Technology for Mongolian
Abstract:The voice conversion technique converts the voice tone of the source speaker to the target speaker while keeping the linguistic information unchanged. At present, Mongolian voice conversion is facing problems such as lack of corpus and rich prosodic changes in pronunciation of Mongolian words. To address these problems, this paper presents a non-parallel Mongolian voice conversion method based on fine-grained prosody modeling and conditional CycleGAN. This method used continuous wavelet transform to extract fine-grained prosodic features, then added speaker identity vectors to the CycleGAN to build a conditional CycleGAN, Finally, the conditional CycleGAN was used to obtain a stable prosody conversion between source and target speakers. Experimental results showed that compared with the traditional CycleGAN voice conversion method, this method can effectively improve the Mongolian voice conversion effect, and the MOS scores of speech naturalness and speaker similarity are improved by 0.1 and 0.2 respectively. 
Keywords:
点击此处可从《信号处理》浏览原始摘要信息
点击此处可从《信号处理》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号