基于混合码书映射的高效语音转换方法 Efficient Voice Conversion Method Based on Mixture Mapping of Codebooks期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

基于混合码书映射的高效语音转换方法

引用本文：	李海燕,王程程,徐宁,胡芳.基于混合码书映射的高效语音转换方法[J].数据采集与处理,2016,31(3):512-524.

作者姓名：	李海燕王程程徐宁胡芳

作者单位：	1.河海大学物联网工程学院，常州，213022； 2.河海大学与法国Alderbaran机器人与认知联合实验室，常州，213022； 3.常州市特种机器人与智能技术重点实验室，常州，213022

摘要：	目前主流语音转换算法计算量大，复杂度高，难以在内核小的嵌入式系统上运行。为了降低语音转换的计算复杂度，缩短训练时间，提出一种基于混合码书映射的高效语音转换方法。在训练阶段,根据不同的参与训练的语音数据量建立不同的码书映射关系，节约训练时长，提高准确度。在转换阶段，系统依据训练阶段建立的码书映射关系对浊音帧的声道参数进行转换。另外，为了提高转换语音的主观音质，系统对清音帧的特征参数也作了相应转换，并且修正了转换语音的共振峰频率以克服帧间共振峰抖动的问题。主客观测试结果表明：在保证转换音质的前提下，本文提出的语音转换方法降低了计算复杂度、明显缩减了训练时间。
关键词：	语音转换混合码书映射共振峰频率修正清音转换
Efficient Voice Conversion Method Based on Mixture Mapping of Codebooks

Abstract:	The state of the art algorithms for voice conversion are computationally expensive and time consuming, thus they cannot be run in the embedded systems efficiently. An voice conversion method bas ed on mixture mapping of codebooks is proposed. In the training stage, different codebook mapping relationships are built according to the training speech amount, which saves training time and improves conver sion accuracy. In the transformation stage, the system converts the vocal tract parameters of voiced frames according to the corresponding codebook mapping buil t in the training stage. In addition, to improve the quality of the con verted speech, the system converts the feature parameters of unvoiced frames as well as correcting the formant frequency to overcome the formant jitters between frames. Both objective and subjective experiments show that the proposed method reduc es computational complexity and saves training time without degrading or deterio rating the quality of the converted speech.

Keywords:	voice conversion mixture mapping of codebook formant frequency correction unv oiced frames conversion

	点击此处可从《数据采集与处理》浏览原始摘要信息
	点击此处可从《数据采集与处理》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏