首页 | 本学科首页   官方微博 | 高级检索  
     


Synthesis and perception of breathy,normal, and Lombard speech in the presence of noise
Affiliation:1. National Center for Voice and Speech, The University of Utah, Salt Lake City, UT, USA;2. Department of Communication Sciences and Disorders, The University of Iowa, Iowa City, IA, USA
Abstract:This papers studies the synthesis of speech over a wide vocal effort continuum and its perception in the presence of noise. Three types of speech are recorded and studied along the continuum: breathy, normal, and Lombard speech. Corresponding synthetic voices are created by training and adapting the statistical parametric speech synthesis system GlottHMM. Natural and synthetic speech along the continuum is assessed in listening tests that evaluate the intelligibility, quality, and suitability of speech in three different realistic multichannel noise conditions: silence, moderate street noise, and extreme street noise. The evaluation results show that the synthesized voices with varying vocal effort are rated similarly to their natural counterparts both in terms of intelligibility and suitability.
Keywords:Statistical parametric speech synthesis  Adaptation  Vocal effort  Lombard speech  Breathy speech  Intelligibility
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号