首页 | 本学科首页   官方微博 | 高级检索  
     

考虑语速和前后环境的基频Target模型及实现
引用本文:陈高鹏,胡郁,王仁华.考虑语速和前后环境的基频Target模型及实现[J].中文信息学报,2004,18(3):82-86.
作者姓名:陈高鹏  胡郁  王仁华
作者单位:中国科学技术大学电子工程与信息科学系
摘    要:本文通过一些实验和数据分析,对以音节为单位的基频target模型(认为音节的实际基频是一个有语音学意义的隐藏的目标target和前后环境作用的结果)的实现进行修正,并结合数据挖掘的方法自动得到了一个实用化的target基频模型。文中指出,音节的target必须不受语速影响,但同时受前后语言环境影响,实际的基频曲线是在前后的韵律曲线作用下向target的一个逼近过程。文章的主要任务就是如何假设一个合理的target,实现基频的target参数自动提取,最后进行基于机器学习的模型训练,成功实现了完全自动化的完整句子的基频预测和合成。集外测试结果预测的均方误差为22Hz,相关系数为0.72。

关 键 词:计算机应用  中文信息处理  语音合成  韵律模型  基频  Target  
文章编号:1003-0077(2004)03-0081-05
修稿时间:2003年8月5日

Pitch Target Model's Realization Considering Speech Speed and Environment
CHEN Gao peng,HU Yu,WANG Ren hua.Pitch Target Model''''s Realization Considering Speech Speed and Environment[J].Journal of Chinese Information Processing,2004,18(3):82-86.
Authors:CHEN Gao peng  HU Yu  WANG Ren hua
Affiliation:University of Science and Technology of China , Electronic Engineer and Information Science
Abstract:This paper, aided by experiments and data analysis, improves and realizes the pitch target model which regards syllable as a basic linguistical unit. The F0 contour of a syllable is the representation of a result that a hidden target and environment interact. A useful model is realized automatically by data mining. In this paper it is proposed that the target of a syllable is independent on speech speed while it is effected by the linguistic environment. The real pitch is the approximation of the target effected by the preceding and following F0. This paper's content is how to hypothesize the a reasonable target, how to implement the parameters' auto exaction and how to realize machine-learning of the model. The prediction and resynthesis of a completed utterance is realized successfully. The test result shows that RMSE is 22 Hz, correlation is 0 72.
Keywords:Target
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《中文信息学报》浏览原始摘要信息
点击此处可从《中文信息学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号