Literature DB >> 7860829

Modeling the glottal volume-velocity waveform for three voice types.

D G Childers1, C Ahn.   

Abstract

The purpose of this study was to model features of the glottal volume-velocity waveform for three voice types: modal voice, vocal fry, and breathy voice. The study analyzed data measured from two sustained vowels and one sentence uttered by nine adult, male subjects who represented examples of the three voice types. The primary analysis procedure was glottal inverse filtering, which estimated the glottal volume-velocity waveform. The estimated glottal volume-velocity waveform was then fit to an LF model waveform. Four parameters of the LF model were adjusted to minimize the mean-squared error between the estimated glottal waveform and the LF model waveform. Statistical averages and standard deviations of the four parameters of the LF glottal waveform model were calculated using the data for each voice type. The four LF model parameters characterize important low-frequency features of the glottal waveform, namely, the glottal pulse width, pulse skewness, abruptness of closure of the glottal pulse, and the spectral tilt of the glottal pulse. Statistical analysis included ANOVA and multiple linear regression analysis. The ANOVA results demonstrated that there was a difference in three of the four LF model parameters for the three voice types. The linear regression analysis between the four LF model parameters and a formal rating by a listening test of the quality of the three voice types was used to determine the most significant LF model parameters for each voice type. A simple rule was devised for synthesizing the three voice types with a formant synthesizer using the LF glottal waveform model. Listener evaluations of the synthesized speech tended to confirm the results determined by the analysis procedures.

Entities:  

Mesh:

Year:  1995        PMID: 7860829     DOI: 10.1121/1.412276

Source DB:  PubMed          Journal:  J Acoust Soc Am        ISSN: 0001-4966            Impact factor:   1.840


  5 in total

1.  Observation and analysis of in vivo vocal fold tissue instabilities produced by nonlinear source-filter coupling: a case study.

Authors:  Matías Zañartu; Daryush D Mehta; Julio C Ho; George R Wodicka; Robert E Hillman
Journal:  J Acoust Soc Am       Date:  2011-01       Impact factor: 1.840

2.  Modal and non-modal voice quality classification using acoustic and electroglottographic features.

Authors:  Michal Borsky; Daryush D Mehta; Jarrad H Van Stan; Jon Gudnason
Journal:  IEEE/ACM Trans Audio Speech Lang Process       Date:  2017-11-27

3.  Measures of spectral slope using an excised larynx model.

Authors:  Fariborz Alipour; Ronald C Scherer; Eileen Finnegan
Journal:  J Voice       Date:  2011-11-06       Impact factor: 2.009

4.  Subglottal Impedance-Based Inverse Filtering of Voiced Sounds Using Neck Surface Acceleration.

Authors:  Matías Zañartu; Julio C Ho; Daryush D Mehta; Robert E Hillman; George R Wodicka
Journal:  IEEE Trans Audio Speech Lang Process       Date:  2013-09

5.  Vocal fold contact patterns based on normal modes of vibration.

Authors:  Simeon L Smith; Ingo R Titze
Journal:  J Biomech       Date:  2018-04-12       Impact factor: 2.712

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.