A Methodology for Creating a Segment Inventory for Greek Time Domain Speech Synthesis期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

A Methodology for Creating a Segment Inventory for Greek Time Domain Speech Synthesis

Authors:	Stavrou-Laevita?Fotinea author-information" > author-information__contact u-icon-before" > mailto:evita@ilsp.gr" title=" evita@ilsp.gr" itemprop=" email" data-track=" click" data-track-action=" Email author" data-track-label=" " >Email author,George?Tambouratzis

Affiliation:	(1) Institute for Language and Speech Processing, 6, Artemidos str. & Epidavrou, Maroussi, 151 25, Athens, Greece

Abstract:	This article focuses on the systematic design of a segment database which has been used to support a time-domain speech synthesis system for the Greek language. Thus, a methodology is presented for the generation of a corpus containing all possible instances of the segments for the specific language. Issues such as the phonetic coverage, the sentence selection and iterative evaluation techniques employing custom-built tools, are examined. Emphasis is placed on the comparison of the process-derived corpus to naturally-occurring corpora with respect to their suitability for use in time-domain speech synthesis. The proposed methodology generates a corpus characterised by a near-minimal size and which provides a complete coverage of the Greek language. Furthermore, within this corpus, the distribution of segmental units is similar to that of natural corpora, allowing for the extraction of multiple units in the case of the most frequently-occurring segments. The corpus creation algorithm incorporates mechanisms that enable the fine-tuning of the segment database's language-dependent characteristics and thus assists in the generation of high-quality text-to-speech synthesis.

Keywords:	speech synthesis segment database diphone coverage corpus optimisation
本文献已被 SpringerLink 等数据库收录！

设为首页 | 免责声明 | 关于勤云 | 加入收藏