首页 | 本学科首页   官方微博 | 高级检索  
     


A waveform concatenation technique for text-to-speech synthesis
Authors:Soumya Priyadarsini Panda  Ajit Kumar Nayak
Affiliation:1.Department of CSE,Silicon Institute of Technology,Bhubaneswar,India;2.Department of CS&IT,Siksha ‘O’ Anusandhan University,Bhubaneswar,India
Abstract:Designing text-to-speech systems capable of producing natural sounding speech segments in different Indian languages is a challenging and ongoing problem. Due to the large number of possible pronunciations in different Indian languages, a number of speech segments are needed to be stored in the speech database while a concatenative speech synthesis technique is used to achieve highly natural speech segments. However, the large speech database size makes it unusable for small hand held devices or human computer interactive systems with limited storage resources. In this paper, we proposed a fraction-based waveform concatenation technique to produce intelligible speech segments from a small footprint speech database. The results of all the experiments performed shows the effectiveness of the proposed technique in producing intelligible speech segments in different Indian languages even with very less storage and computation overhead compared to the existing syllable-based technique.
Keywords:
本文献已被 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号