Literature DB >> 11757945

Measuring vocal quality with speech synthesis.

B R Gerratt1, J Kreiman.   

Abstract

Much previous research has demonstrated that listeners do not agree well when using traditional rating scales to measure pathological voice quality. Although these findings may indicate that listeners are inherently unable to agree in their perception of such complex auditory stimuli, another explanation implicates the particular measurement method-rating scale judgments-as the culprit. An alternative method of assessing quality-listener-mediated analysis-synthesis-was devised to assess this possibility. In this new approach, listeners explicitly compare synthetic and natural voice samples, and adjust speech synthesizer parameters to create auditory matches to voice stimuli. This method is designed to replace unstable internal standards for qualities like breathiness and roughness with externally presented stimuli, to overcome major hypothetical sources of disagreement in rating scale judgments. In a preliminary test of the reliability of this method, listeners were asked to adjust the signal-to-noise ratio for 12 synthetic pathological voices so that the resulting stimuli matched the natural target voices as well as possible For comparison to the synthesis judgments, listeners also judged the noisiness of the natural stimuli in a separate task using a traditional visual-analog rating scale. For 9 of the 12 voices, agreement among listeners was significantly (and substantially) greater for the synthesis task than for the rating scale task. Response variances for the two tasks did not differ for the remaining three voices. However, a second experiment showed that the synthesis settings that listeners selected for these three voices were within a difference limen, and therefore observed differences were perceptually insignificant. These results indicate that listeners can in fact agree in their perceptual assessments of voice quality, and that analysis-synthesis can measure perception reliably.

Mesh:

Year:  2001        PMID: 11757945     DOI: 10.1121/1.1409969

Source DB:  PubMed          Journal:  J Acoust Soc Am        ISSN: 0001-4966            Impact factor:   1.840


  7 in total

1.  Listener effort for highly intelligible tracheoesophageal speech.

Authors:  Kathy F Nagle; Tanya L Eadie
Journal:  J Commun Disord       Date:  2012-01-20       Impact factor: 2.288

2.  [On the auditory evaluation of voice quality].

Authors:  M Ptok; C Schwemmle; C Iven; M Jessen; T Nawka
Journal:  HNO       Date:  2006-10       Impact factor: 1.284

3.  Psychometric properties associated with perceived vocal roughness using a matching task.

Authors:  David A Eddins; Rahul Shrivastav
Journal:  J Acoust Soc Am       Date:  2013-10       Impact factor: 1.840

4.  Identifying a comparison for matching rough voice quality.

Authors:  Sona Patel; Rahul Shrivastav; David A Eddins
Journal:  J Speech Lang Hear Res       Date:  2012-02-21       Impact factor: 2.297

5.  Integrated software for analysis and synthesis of voice quality.

Authors:  Jody Kreiman; Norma Antoñanzas-Barroso; Bruce R Gerratt
Journal:  Behav Res Methods       Date:  2010-11

6.  A model for the prediction of breathiness in vowels.

Authors:  Rahul Shrivastav; Arturo Camacho; Sona Patel; David A Eddins
Journal:  J Acoust Soc Am       Date:  2011-03       Impact factor: 2.482

7.  Perceptual distances of breathy voice quality: a comparison of psychophysical methods.

Authors:  Sona Patel; Rahul Shrivastav; David A Eddins
Journal:  J Voice       Date:  2009-01-29       Impact factor: 2.009

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.