首页 | 本学科首页   官方微博 | 高级检索  
     

基于基元段特征和GMM的源-目标说话人F0~t转换
引用本文:孙俊,戴蓓蒨,张剑.基于基元段特征和GMM的源-目标说话人F0~t转换[J].信号处理,2007,23(2):283-287.
作者姓名:孙俊  戴蓓蒨  张剑
作者单位:中国科学技术大学电子科学与技术系,合肥,安徽,230026
摘    要:基音轨迹F_0~t转换是实现高质量源-目标说话人声音转换的重要组成部分。本文给出了一种与说话内容无关的F_0~t转换方法,为了在与文本无关的前提下,提取出尽可能反映说话人个性特征的基音起伏的较长时的信息,本文采用了从n个短时帧组成的基元段提取特征矢量,并以基元段矢量为单元进行转换,采用了基于高斯混合模型(GMM)的概率加权转换算法使每个特征矢量的转换规则是由多个类规则的线性加权组合得到的,从而提高了转换精度,同时还解决了一般分类器中处于类边界数据的分类错误。实验表明,基于基元段特征矢量和GMM的转换方法具有很好的效果。

关 键 词:基音轨迹转换  与文本无关  混合高斯模型  基元段特征矢量
修稿时间:2005年5月31日

F0~t Transformation From source Speaker To Target Speaker Based on Segment Feature Vector and GMM
Sun Jun,Dai Beiqian,Zhang Jian.F0~t Transformation From source Speaker To Target Speaker Based on Segment Feature Vector and GMM[J].Signal Processing,2007,23(2):283-287.
Authors:Sun Jun  Dai Beiqian  Zhang Jian
Abstract:The Transformation of F_0 contour from source speaker to target speaker is a key issue of the high quality voice trans- formation.In precondition of text-Independent,in order to extract the parameter which can reflect the speaker's relative long term pro- sodie characteristic feature,a kind of segment feature vector made up with n frames pitch is proposed as characteristic parameter to be transformed as a unit.This paper utilizes probability weighted transformation algorithm based on Gaussian Mixture Model(GMM), which linearly combines a few rules derived from each subclass transformation,thus the transformation accuracy is improved.Mean- while,for the features which nearly lie on the borders of several classes,the classification error may be eliminated.The proposed ap- proach is proved to be effective by conducted experiments.
Keywords:Fo contour transformation  Text-Independent  GMM  Segment Feature Vector
本文献已被 CNKI 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号