一种汉语语音变换技术 Chinese Speech Conversion Technology Based on LPC and Residual Model期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

一种汉语语音变换技术

引用本文：	孙卓,岳振军.一种汉语语音变换技术[J].电声技术,2007,31(6):37-40.

作者姓名：	孙卓岳振军

作者单位：	1. 总参谋部通信工程设计研究院,辽宁,沈阳,110005 2. 解放军理工大学,理学院基础电子学系,江苏,南京,211101

摘要：	汉语语音变换技术的目的是将汉语语音中源说话人的语音特征转换为目标说话人语音特征。提出的适用于汉语说话人的变换算法分为3个部分:前两部分用高斯混合模型实现了语音的谱包络(线性预测编码)及其激励(残差)的转换;第三部分采用支持向量回归算法实现语音的韵律变换规则建模,结合汉语语音特点利用基音同步叠加算法实现语音的超音段特征调整。与现有的语音变换算法进行比较,算法针对汉语语音超音段发音特点进行韵律调整,有效实现了汉语语音变换并得到高自然度合成语音,是一种有效的汉语语音变换算法。
关键词：	汉语语音变换线性预测编码残差高斯混合模型超音段特征支持向量回归
文章编号：	1002-8684（2007）06-0037-04
修稿时间：	2007-02-08
Chinese Speech Conversion Technology Based on LPC and Residual Model

SUN Zhuo,YUE Zhen-jun.Chinese Speech Conversion Technology Based on LPC and Residual Model[J].Audio Engineering,2007,31(6):37-40.

Authors:	SUN Zhuo YUE Zhen-jun

Affiliation:	1. GCEDRI, Shenyang 110005, China; 2. Department of Basic Electronic IS, PLAUST, Nanjing 211101, China

Abstract:	Speech conversion is a new technology to change the source speaker's features to the target speaker's features in the speech. In this paper, the Chinese speech conversion system is divided in to three parts. In the first and second part, GMM(Gaussian Mixed Model) is used to transform the spectral envelopeLPC(Linear Prediction Coding)] and the impulse (residual). In the third part, the Chinese speech's super-segmental features is regulated with the SVR(Support Vector Regression) and the TD-PSOLA(Time-Domain Pitch Synchronous OverLap-Add). This algorithm is capital of transforming Chinese speech and producing spontaneous voice.

Keywords:	Chinese speech conversion LPC residual GMM super-segmental feature SVR
本文献已被 CNKI 维普万方数据等数据库收录！

设为首页 | 免责声明 | 关于勤云 | 加入收藏