A Methodology for Creating a Segment Inventory for Greek Time Domain Speech Synthesis
Authors:Stavroula-Evita Fotinea (evita@ilsp.gr), George Tambouratzis
Affiliation:(1) Institute for Language and Speech Processing, 6, Artemidos str. & Epidavrou, Maroussi, 151 25, Athens, Greece
Abstract:This article focuses on the systematic design of a segment database that has been used to support a time-domain speech synthesis system for the Greek language. A methodology is presented for generating a corpus that contains all possible instances of the segments for the language. Issues such as phonetic coverage, sentence selection and iterative evaluation techniques employing custom-built tools are examined. Emphasis is placed on comparing the process-derived corpus with naturally occurring corpora with respect to their suitability for use in time-domain speech synthesis. The proposed methodology generates a corpus that is near-minimal in size and provides complete coverage of the Greek language. Furthermore, within this corpus, the distribution of segmental units is similar to that of natural corpora, allowing multiple units to be extracted for the most frequently occurring segments. The corpus creation algorithm incorporates mechanisms that enable fine-tuning of the segment database's language-dependent characteristics and thus assists in the generation of high-quality text-to-speech synthesis.
Keywords:speech synthesis; segment database; diphone coverage; corpus optimisation
This article is indexed in SpringerLink and other databases.