首页 | 本学科首页   官方微博 | 高级检索  
     

一种汉语语音变换技术
引用本文:孙卓,岳振军.一种汉语语音变换技术[J].电声技术,2007,31(6):37-40.
作者姓名:孙卓  岳振军
作者单位:1. 总参谋部通信工程设计研究院,辽宁,沈阳,110005
2. 解放军理工大学,理学院基础电子学系,江苏,南京,211101
摘    要:汉语语音变换技术的目的是将汉语语音中源说话人的语音特征转换为目标说话人语音特征。提出的适用于汉语说话人的变换算法分为3个部分:前两部分用高斯混合模型实现了语音的谱包络(线性预测编码)及其激励(残差)的转换;第三部分采用支持向量回归算法实现语音的韵律变换规则建模,结合汉语语音特点利用基音同步叠加算法实现语音的超音段特征调整。与现有的语音变换算法进行比较,算法针对汉语语音超音段发音特点进行韵律调整,有效实现了汉语语音变换并得到高自然度合成语音,是一种有效的汉语语音变换算法。

关 键 词:汉语语音变换  线性预测编码  残差  高斯混合模型  超音段特征  支持向量回归
文章编号:1002-8684(2007)06-0037-04
修稿时间:2007-02-08

Chinese Speech Conversion Technology Based on LPC and Residual Model
SUN Zhuo,YUE Zhen-jun.Chinese Speech Conversion Technology Based on LPC and Residual Model[J].Audio Engineering,2007,31(6):37-40.
Authors:SUN Zhuo  YUE Zhen-jun
Affiliation:1. GCEDRI, Shenyang 110005, China; 2. Department of Basic Electronic IS, PLAUST, Nanjing 211101, China
Abstract:Speech conversion is a new technology to change the source speaker's features to the target speaker's features in the speech. In this paper, the Chinese speech conversion system is divided in to three parts. In the first and second part, GMM(Gaussian Mixed Model) is used to transform the spectral envelopeLPC(Linear Prediction Coding)] and the impulse (residual). In the third part, the Chinese speech's super-segmental features is regulated with the SVR(Support Vector Regression) and the TD-PSOLA(Time-Domain Pitch Synchronous OverLap-Add). This algorithm is capital of transforming Chinese speech and producing spontaneous voice.
Keywords:Chinese speech conversion  LPC  residual  GMM  super-segmental feature  SVR
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号