Literature DB >> 25694542

Scaling and universality in the human voice.

Jordi Luque1, Bartolo Luque2, Lucas Lacasa3.   

Abstract

Speech is a distinctive complex feature of human capabilities. In order to understand the physics underlying speech production, in this work, we empirically analyse the statistics of large human speech datasets ranging several languages. We first show that during speech, the energy is unevenly released and power-law distributed, reporting a universal robust Gutenberg-Richter-like law in speech. We further show that such 'earthquakes in speech' show temporal correlations, as the interevent statistics are again power-law distributed. As this feature takes place in the intraphoneme range, we conjecture that the process responsible for this complex phenomenon is not cognitive, but it resides in the physiological (mechanical) mechanisms of speech production. Moreover, we show that these waiting time distributions are scale invariant under a renormalization group transformation, suggesting that the process of speech generation is indeed operating close to a critical point. These results are put in contrast with current paradigms in speech processing, which point towards low dimensional deterministic chaos as the origin of nonlinear traits in speech fluctuations. As these latter fluctuations are indeed the aspects that humanize synthetic speech, these findings may have an impact in future speech synthesis technologies. Results are robust and independent of the communication language or the number of speakers, pointing towards a universal pattern and yet another hint of complexity in human speech.
© 2015 The Author(s) Published by the Royal Society. All rights reserved.

Entities:  

Keywords:  criticality; scaling; speech

Mesh:

Year:  2015        PMID: 25694542      PMCID: PMC4387524          DOI: 10.1098/rsif.2014.1344

Source DB:  PubMed          Journal:  J R Soc Interface        ISSN: 1742-5662            Impact factor:   4.118


  20 in total

1.  Crackling noise.

Authors:  J P Sethna; K A Dahmen; C R Myers
Journal:  Nature       Date:  2001-03-08       Impact factor: 49.962

2.  Modeling of chaotic vibrations in symmetric vocal folds.

Authors:  J J Jiang; Y Zhang; J Stern
Journal:  J Acoust Soc Am       Date:  2001-10       Impact factor: 1.840

3.  Spatio-temporal analysis of irregular vocal fold oscillations: biphonation due to desynchronization of spatial modes.

Authors:  J Neubauer; P Mergell; U Eysholdt; H Herzel
Journal:  J Acoust Soc Am       Date:  2001-12       Impact factor: 1.840

4.  Renormalization-group transformations and correlations of seismicity.

Authors:  Alvaro Corral
Journal:  Phys Rev Lett       Date:  2005-07-07       Impact factor: 9.161

5.  Universality in solar flare and earthquake occurrence.

Authors:  L de Arcangelis; C Godano; E Lippiello; M Nicodemi
Journal:  Phys Rev Lett       Date:  2006-02-06       Impact factor: 9.161

6.  Better speech recognition with cochlear implants.

Authors:  B S Wilson; C C Finley; D T Lawson; R D Wolford; D K Eddington; W M Rabinowitz
Journal:  Nature       Date:  1991-07-18       Impact factor: 49.962

7.  Bifurcations in an asymmetric vocal-fold model.

Authors:  I Steinecke; H Herzel
Journal:  J Acoust Soc Am       Date:  1995-03       Impact factor: 1.840

8.  Distribution of the first return time in fractional Brownian motion and its application to the study of on-off intermittency.

Authors: 
Journal:  Phys Rev E Stat Phys Plasmas Fluids Relat Interdiscip Topics       Date:  1995-07

9.  The Pervasiveness of 1/f Scaling in Speech Reflects the Metastable Basis of Cognition.

Authors:  Christopher T Kello; Gregory G Anderson; John G Holden; Guy C Van Orden
Journal:  Cogn Sci       Date:  2008-10

10.  Beyond word frequency: bursts, lulls, and scaling in the temporal distributions of words.

Authors:  Eduardo G Altmann; Janet B Pierrehumbert; Adilson E Motter
Journal:  PLoS One       Date:  2009-11-11       Impact factor: 3.240

View more
  6 in total

1.  Scale-Free Amplitude Modulation of Neuronal Oscillations Tracks Comprehension of Accelerated Speech.

Authors:  Ana Filipa Teixeira Borges; Anne-Lise Giraud; Huibert D Mansvelder; Klaus Linkenkaer-Hansen
Journal:  J Neurosci       Date:  2017-12-07       Impact factor: 6.167

2.  Multiple Coordination Patterns in Infant and Adult Vocalizations.

Authors:  Drew H Abney; Anne S Warlaumont; D Kimbrough Oller; Sebastian Wallot; Christopher T Kello
Journal:  Infancy       Date:  2016-09-28

3.  Hierarchical temporal structure in music, speech and animal vocalizations: jazz is like a conversation, humpbacks sing like hermit thrushes.

Authors:  Christopher T Kello; Simone Dalla Bella; Butovens Médé; Ramesh Balasubramaniam
Journal:  J R Soc Interface       Date:  2017-10       Impact factor: 4.118

4.  Emergence of linguistic laws in human voice.

Authors:  Iván González Torre; Bartolo Luque; Lucas Lacasa; Jordi Luque; Antoni Hernández-Fernández
Journal:  Sci Rep       Date:  2017-03-08       Impact factor: 4.379

5.  On the physical origin of linguistic laws and lognormality in speech.

Authors:  Iván G Torre; Bartolo Luque; Lucas Lacasa; Christopher T Kello; Antoni Hernández-Fernández
Journal:  R Soc Open Sci       Date:  2019-08-21       Impact factor: 2.963

6.  From Boltzmann to Zipf through Shannon and Jaynes.

Authors:  Álvaro Corral; Montserrat García Del Muro
Journal:  Entropy (Basel)       Date:  2020-02-05       Impact factor: 2.524

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.