Scaling and universality in the human voice
Authors: Jordi Luque, Bartolo Luque, Lucas Lacasa
Affiliation: 1. Telefonica Research, Edificio Telefonica-Diagonal 00, Barcelona, Spain; 2. Departamento de Matemática Aplicada y Estadística, EIAE, Universidad Politécnica de Madrid, Madrid, Spain; 3. School of Mathematical Sciences, Queen Mary University of London, Mile End Road, London E1 4NS, UK
Abstract: Speech is a distinctive complex feature of human capabilities. To understand the physics underlying speech production, we empirically analyse the statistics of large human speech datasets spanning several languages. We first show that during speech, energy is released unevenly and is power-law distributed, reporting a universal, robust Gutenberg–Richter-like law in speech. We further show that such ‘earthquakes in speech’ are temporally correlated, as the interevent statistics are again power-law distributed. As this feature takes place in the intraphoneme range, we conjecture that the process responsible for this complex phenomenon is not cognitive but resides in the physiological (mechanical) mechanisms of speech production. Moreover, we show that these waiting time distributions are scale invariant under a renormalization group transformation, suggesting that the process of speech generation is indeed operating close to a critical point. These results contrast with current paradigms in speech processing, which point towards low-dimensional deterministic chaos as the origin of nonlinear traits in speech fluctuations. As these fluctuations are precisely the aspects that humanize synthetic speech, these findings may have an impact on future speech synthesis technologies. The results are robust and independent of the communication language and the number of speakers, pointing towards a universal pattern and yet another hint of complexity in human speech.
Keywords: criticality, speech, scaling
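The abstract refers to two measurable quantities: a power-law distribution of released energy and the waiting times between energy "events". As a rough illustration only, and not the authors' pipeline, the sketch below shows how such quantities might be estimated. Everything in it is an assumption made for the example: the function names, the synthetic data (drawn by inverse-transform sampling rather than taken from speech), the threshold value, and the use of the standard continuous maximum-likelihood estimator for the exponent. Because the synthetic samples are independent, their waiting times are not expected to be power-law distributed; the `waiting_times` helper only shows how the interevent series would be extracted.

```python
import math
import random


def sample_power_law(n, alpha, x_min, rng):
    """Draw n samples from p(x) ~ x^(-alpha) for x >= x_min (inverse-transform sampling)."""
    # 1 - rng.random() lies in (0, 1], so the negative power is always defined.
    return [x_min * (1.0 - rng.random()) ** (-1.0 / (alpha - 1.0)) for _ in range(n)]


def fit_power_law_exponent(values, x_min):
    """Continuous maximum-likelihood estimate of alpha using values >= x_min."""
    tail = [v for v in values if v >= x_min]
    if not tail:
        raise ValueError("no values at or above x_min")
    log_sum = sum(math.log(v / x_min) for v in tail)
    return 1.0 + len(tail) / log_sum


def waiting_times(energies, threshold):
    """Gaps, in samples, between successive entries whose energy exceeds threshold."""
    event_idx = [i for i, e in enumerate(energies) if e > threshold]
    return [b - a for a, b in zip(event_idx, event_idx[1:])]


# Synthetic stand-in for a per-frame energy series (hypothetical data, not speech).
rng = random.Random(0)
energies = sample_power_law(50_000, alpha=2.0, x_min=1.0, rng=rng)

alpha_hat = fit_power_law_exponent(energies, x_min=1.0)
gaps = waiting_times(energies, threshold=10.0)

print(f"estimated exponent: {alpha_hat:.2f}")
print(f"number of waiting times: {len(gaps)}")
```

On real recordings, the `energies` list would be replaced by a measured energy series, and the choice of `x_min` and `threshold` would need to be justified rather than fixed by hand as here.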