Synthesis and perception of breathy,normal, and Lombard speech in the presence of noise期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

Synthesis and perception of breathy,normal, and Lombard speech in the presence of noise

Affiliation:	1. National Center for Voice and Speech, The University of Utah, Salt Lake City, UT, USA;2. Department of Communication Sciences and Disorders, The University of Iowa, Iowa City, IA, USA

Abstract:	This papers studies the synthesis of speech over a wide vocal effort continuum and its perception in the presence of noise. Three types of speech are recorded and studied along the continuum: breathy, normal, and Lombard speech. Corresponding synthetic voices are created by training and adapting the statistical parametric speech synthesis system GlottHMM. Natural and synthetic speech along the continuum is assessed in listening tests that evaluate the intelligibility, quality, and suitability of speech in three different realistic multichannel noise conditions: silence, moderate street noise, and extreme street noise. The evaluation results show that the synthesized voices with varying vocal effort are rated similarly to their natural counterparts both in terms of intelligibility and suitability.

Keywords:	Statistical parametric speech synthesis Adaptation Vocal effort Lombard speech Breathy speech Intelligibility
本文献已被 ScienceDirect 等数据库收录！