Pitch models of Mandarin text-to-speech期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

Pitch models of Mandarin text-to-speech

Authors:	SHAO Yan-qiu SUI Zhi-fang HAN Ji-qing

Affiliation:	1. Institute of Computational Linguistics, Peking University, Peking 100871, China;School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001 ,China 2. Institute of Computational Linguistics, Peking University, Peking 100871, China 3. School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001 ,China

Abstract:	The function of prosody model will directly affect the naturalness of synthesized speech. Aimed at the difficulty in generating the pitch contour in prosody model, two pitch models namely corpus-based pitch model and pitch pattern model are deeply studied in this paper. Key problems in the corpus-based model are calculation of the distance and searching of the optimal path with dynamic programming algorithm. For the pitch pattern model, parameters such as pitch pattern, pitch average and pitch range are used to describe the pitch contour,and six pitch patterns are presented. For the generation of pitch contour, the pitch pattern model is more flexible than the corpus-based model. Both of the two models are linked to the real TTS system, and the MOS results of synthesized Mandarin speech show that the pitch pattern model is better than the corpus-based pitch model.

Keywords:	speech synthesis prosody model pitch model pitch pattern
本文献已被 CNKI 维普万方数据等数据库收录！
	点击此处可从《哈尔滨工业大学学报(英文版)》浏览原始摘要信息
	点击此处可从《哈尔滨工业大学学报(英文版)》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏