
Analysis, synthesis, and perception of voice quality variations among female and male talkers.

D H Klatt, L C Klatt.

Abstract

Voice quality variations include a set of voicing sound source modifications ranging from laryngealized to normal to breathy phonation. Analysis of reiterant imitations of two sentences by ten female and six male talkers has shown that the potential acoustic cues to this type of voice quality variation include: (1) increases to the relative amplitude of the fundamental frequency component as open quotient increases; (2) increases to the amount of aspiration noise that replaces higher frequency harmonics as the arytenoids become more separated; (3) increases to lower formant bandwidths; and (4) introduction of extra pole zeros in the vocal-tract transfer function associated with tracheal coupling. Perceptual validation of the relative importance of these cues for signaling a breathy voice quality has been accomplished using a new voicing source model for synthesis of more natural male and female voices. The new formant synthesizer, KLSYN88, is fully documented here. Results of the perception study indicate that, contrary to previous research which emphasizes the importance of increased amplitude of the fundamental component, aspiration noise is perceptually most important. Without its presence, increases to the fundamental component may induce the sensation of nasality in a high-pitched voice. Further results of the acoustic analysis include the observations that: (1) over the course of a sentence, the acoustic manifestations of breathiness vary considerably, tending to increase for unstressed syllables, in utterance-final syllables, and at the margins of voiceless consonants; (2) on average, females are more breathy than males, but there are very large differences between subjects within each gender; (3) many utterances appear to end in a "breathy-laryngealized" type of vibration; and (4) diplophonic irregularities in the timing of glottal periods occur frequently, especially at the end of an utterance. Diplophonia and other deviations from perfect periodicity may be important aspects of naturalness in synthesis.
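
Cue (1) above, the relative amplitude of the fundamental component, is often quantified as the level difference in dB between the first and second harmonics. The sketch below illustrates that idea on a synthetic two-harmonic signal. It is only an illustration under stated assumptions: the abstract does not say how the authors measured this cue, and the function names, the 8000 Hz sample rate, the 200 Hz fundamental, and the harmonic amplitudes are all hypothetical choices made for the example.

```python
import math


def harmonic_amplitude(signal, sample_rate, freq):
    """Estimate the amplitude of the sinusoidal component at `freq`
    by projecting the signal onto a cosine and a sine at that frequency."""
    n = len(signal)
    re = sum(x * math.cos(2 * math.pi * freq * i / sample_rate)
             for i, x in enumerate(signal))
    im = sum(x * math.sin(2 * math.pi * freq * i / sample_rate)
             for i, x in enumerate(signal))
    return 2.0 * math.hypot(re, im) / n


def h1_minus_h2_db(signal, sample_rate, f0):
    """Level of the first harmonic relative to the second, in dB.
    Assumes f0 is already known; no pitch tracking is done here."""
    h1 = harmonic_amplitude(signal, sample_rate, f0)
    h2 = harmonic_amplitude(signal, sample_rate, 2 * f0)
    return 20.0 * math.log10(h1 / h2)


def synth_two_harmonics(f0, a1, a2, sample_rate=8000, n_samples=800):
    """Hypothetical test signal: two harmonics with amplitudes a1 and a2.
    With the defaults, 800 samples hold a whole number of periods of a
    200 Hz fundamental, so the projection above is exact."""
    return [a1 * math.sin(2 * math.pi * f0 * i / sample_rate)
            + a2 * math.sin(2 * math.pi * 2 * f0 * i / sample_rate)
            for i in range(n_samples)]


# Equal harmonics versus a signal whose fundamental is twice as strong.
equal_harmonics = synth_two_harmonics(200.0, 1.0, 1.0)
strong_fundamental = synth_two_harmonics(200.0, 1.0, 0.5)
print(h1_minus_h2_db(equal_harmonics, 8000, 200.0))
print(h1_minus_h2_db(strong_fundamental, 8000, 200.0))
```

The first value is close to 0 dB and the second close to 6 dB, since halving the second harmonic's amplitude raises the difference by 20·log10(2). Real speech would need pitch estimation and windowing, and, per the abstract, a stronger fundamental alone is not the perceptually dominant cue to breathiness; aspiration noise is.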

Year:  1990        PMID: 2137837     DOI: 10.1121/1.398894

Source DB:  PubMed          Journal:  J Acoust Soc Am        ISSN: 0001-4966            Impact factor:   1.840

