Literature DB >> 31306599

The Effects of Modulating Fundamental Frequency and Speech Rate on the Intelligibility, Communication Efficiency, and Perceived Naturalness of Synthetic Speech.

Jennifer M Vojtech1,2, Jacob P Noordzij1,2, Gabriel J Cler2,3, Cara E Stepp1,2,3,4.   

Abstract

Purpose This study investigated how modulating fundamental frequency (f0) and speech rate differentially impact the naturalness, intelligibility, and communication efficiency of synthetic speech. Method Sixteen sentences of varying prosodic content were developed via a speech synthesizer. The f0 contour and speech rate of these sentences were altered to produce 4 stimulus sets: (a) normal rate with a fixed f0 level, (b) slow rate with a fixed f0 level, (c) normal rate with prosodically natural f0 variation, and (d) normal rate with prosodically unnatural f0 variation. Sixteen listeners provided orthographic transcriptions and judgments of naturalness for these stimuli. Results Sentences with f0 variation were rated as more natural than those with a fixed f0 level. Conversely, sentences with a fixed f0 level demonstrated higher intelligibility than those with f0 variation. Speech rate did not affect the intelligibility of stimuli with a fixed f0 level. Communication efficiency was highest for sentences produced at a normal rate and a fixed f0 level. Conclusions Sentence-level f0 variation increased naturalness ratings of synthesized speech, whether the variation was prosodically natural or not. However, these f0 variations reduced intelligibility. There is evidence of a trade-off in naturalness and intelligibility of synthesized speech, which may impact future speech synthesis designs. Supplemental Material https://doi.org/10.23641/asha.8847833.

Entities:  

Mesh:

Year:  2019        PMID: 31306599      PMCID: PMC6802860          DOI: 10.1044/2019_AJSLP-MSC18-18-0052

Source DB:  PubMed          Journal:  Am J Speech Lang Pathol        ISSN: 1058-0360            Impact factor:   2.408


  35 in total

1.  Direct magnitude estimation and interval scaling of naturalness and severity in tracheoesophageal (TE) speakers.

Authors:  Tanya L Eadie; Philip C Doyle
Journal:  J Speech Lang Hear Res       Date:  2002-12       Impact factor: 2.297

2.  Impact of aberrant acoustic properties on the perception of sound quality in electrolarynx speech.

Authors:  Geoffrey S Meltzner; Robert E Hillman
Journal:  J Speech Lang Hear Res       Date:  2005-08       Impact factor: 2.297

Review 3.  Functional outcome assessment in dysarthria.

Authors:  K C Hustad; D R Beukelman; K M Yorkston
Journal:  Semin Speech Lang       Date:  1998       Impact factor: 1.761

4.  Development of a test of speech intelligibility in noise using sentence materials with controlled word predictability.

Authors:  D N Kalikow; K N Stevens; L L Elliott
Journal:  J Acoust Soc Am       Date:  1977-05       Impact factor: 1.840

5.  Effect of the number of presentations on listener transcriptions and reliability in the assessment of speech intelligibility in children.

Authors:  Tove B Lagerberg; Jakob Åsberg Johnels; Lena Hartelius; Christina Persson
Journal:  Int J Lang Commun Disord       Date:  2015-01-14       Impact factor: 3.020

6.  The effect of rate control on the intelligibility and naturalness of dysarthric speech.

Authors:  K M Yorkston; V L Hammen; D R Beukelman; C D Traynor
Journal:  J Speech Hear Disord       Date:  1990-08

7.  Evaluation of the effect of speech-rate slowing on speech intelligibility in noise using a simulation of cochlear hearing loss.

Authors:  Y Nejime; B C Moore
Journal:  J Acoust Soc Am       Date:  1998-01       Impact factor: 1.840

8.  Relationship Between Speech Intelligibility and Speech Comprehension in Babble Noise.

Authors:  Lionel Fontan; Julien Tardieu; Pascal Gaillard; Virginie Woisard; Robert Ruiz
Journal:  J Speech Lang Hear Res       Date:  2015-06       Impact factor: 2.297

9.  Intelligibility of synthetic speech for normal-hearing and hearing-impaired listeners.

Authors:  K A Kangas; G D Allen
Journal:  J Speech Hear Disord       Date:  1990-11

10.  The impact of speech disorders quality of life: a questionnaire proposal.

Authors:  Gialile de Sá Lúcio; Tatiana Vargas de Castro Perilo; Laélia Cristina Caseiro Vicente; Amélia Augusta de Lima Friche
Journal:  Codas       Date:  2013
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.