首页 | 本学科首页   官方微博 | 高级检索  
     

基于混合码书映射的高效语音转换方法
引用本文:李海燕,王程程,徐宁,胡芳.基于混合码书映射的高效语音转换方法[J].数据采集与处理,2016,31(3):512-524.
作者姓名:李海燕  王程程  徐宁  胡芳
作者单位:1.河海大学物联网工程学院,常州,213022; 2.河海大学与法国Alderbaran机器人与认知联 合实验室,常州,213022; 3.常州市特种机器人与智能技术重点实验室,常州,213022
摘    要:目前主流语音转换算法计算量大,复杂度高, 难以在内核小的嵌入式系统上运行。为了降低语音转换的计算复杂度,缩短训练时间,提出 一种基于混合码书映射的高效语音转换方法。在训练阶段,根据不同的参与训练的语音数据 量 建立不同的码书映射关系,节约训练时长,提高准确度。在转换阶段,系统依据训练阶段建 立的码书映射关系对浊音帧的声道参数进行转换。另外,为了提高转换语音的主观音质,系 统对清音帧的特征参数也作了相应转换,并且修正了转换语音的共振峰频率以克服帧间共振 峰抖动的问题。主客观测试结果表明:在保证转换音质的前提下,本文提出的语音转换方法 降低了计算复杂度、明显缩减了训练时间。

关 键 词:语音转换  混合码书映射  共振峰频率修正  清音转换

Efficient Voice Conversion Method Based on Mixture Mapping of Codebooks
Abstract:The state of the art algorithms for voice conversion are computationally expensive and time consuming, thus they cannot be run in the embedded systems efficiently. An voice conversion method bas ed on mixture mapping of codebooks is proposed. In the training stage, different codebook mapping relationships are built according to the training speech amount, which saves training time and improves conver sion accuracy. In the transformation stage, the system converts the vocal tract parameters of voiced frames according to the corresponding codebook mapping buil t in the training stage. In addition, to improve the quality of the con verted speech, the system converts the feature parameters of unvoiced frames as well as correcting the formant frequency to overcome the formant jitters between frames. Both objective and subjective experiments show that the proposed method reduc es computational complexity and saves training time without degrading or deterio rating the quality of the converted speech.
Keywords:voice conversion  mixture mapping of codebook  formant frequency correction  unv oiced frames conversion
点击此处可从《数据采集与处理》浏览原始摘要信息
点击此处可从《数据采集与处理》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号