Literature DB >> 21973371

Estimating speech spectra for copy synthesis by linear prediction and by hand.

Robert E Remez1, Kathryn R Dubowski, Morgana L Davids, Emily F Thomas, Nina U Paddu, Yael S Grossman, Marina Moskalenko.   

Abstract

Linear prediction is a widely available technique for analyzing acoustic properties of speech, although this method is known to be error-prone. New tests assessed the adequacy of linear prediction estimates by using this method to derive synthesis parameters and testing the intelligibility of the synthetic speech that results. Matched sets of sine-wave sentences were created, one set using uncorrected linear prediction estimates of natural sentences, the other using estimates made by hand. Phoneme restrictions imposed on linguistic properties allowed comparisons between continuous and intermittent voicing, oral or nasal and fricative manner, and unrestricted phonemic variation. Intelligibility tests revealed uniformly good performance with sentences created by hand-estimation and a minimal decrease in intelligibility with estimation by linear prediction due to manner variation with continuous voicing. Poorer performance was observed when linear prediction estimates were used to produce synthetic versions of phonemically unrestricted sentences, but no similar decline was observed with synthetic sentences produced by hand estimation. The results show a substantial intelligibility cost of reliance on uncorrected linear prediction estimates when phonemic variation approaches natural incidence.
© 2011 Acoustical Society of America

Mesh:

Year:  2011        PMID: 21973371      PMCID: PMC3206912          DOI: 10.1121/1.3631667

Source DB:  PubMed          Journal:  J Acoust Soc Am        ISSN: 0001-4966            Impact factor:   1.840


  17 in total

1.  "Putting the face to the voice": matching identity across modality.

Authors:  Miyuki Kamachi; Harold Hill; Karen Lander; Eric Vatikiotis-Bateson
Journal:  Curr Biol       Date:  2003-09-30       Impact factor: 10.834

2.  Across-ear interference from parametrically degraded synthetic speech signals in a dichotic cocktail-party listening task.

Authors:  Douglas S Brungart; Brian D Simpson; Christopher J Darwin; Tanya L Arbogast; Gerald Kidd
Journal:  J Acoust Soc Am       Date:  2005-01       Impact factor: 1.840

3.  Talker identification based on phonetic information.

Authors:  R E Remez; J M Fellowes; P E Rubin
Journal:  J Exp Psychol Hum Percept Perform       Date:  1997-06       Impact factor: 3.332

4.  Development of a test of speech intelligibility in noise using sentence materials with controlled word predictability.

Authors:  D N Kalikow; K N Stevens; L L Elliott
Journal:  J Acoust Soc Am       Date:  1977-05       Impact factor: 1.840

5.  Mathematical treatment of context effects in phoneme and word recognition.

Authors:  A Boothroyd; S Nittrouer
Journal:  J Acoust Soc Am       Date:  1988-07       Impact factor: 1.840

6.  Estimation of formant frequencies in infant cry.

Authors:  M P Robb; A T Cacace
Journal:  Int J Pediatr Otorhinolaryngol       Date:  1995-04       Impact factor: 1.675

7.  Subcategorical phonetic mismatches slow phonetic judgments.

Authors:  D H Whalen
Journal:  Percept Psychophys       Date:  1984-01

8.  Speech quality evaluation using "phoneme-specific" sentences.

Authors:  A W Huggins; R S Nickerson
Journal:  J Acoust Soc Am       Date:  1985-05       Impact factor: 1.840

9.  Capacity demands in short-term memory for synthetic and natural speech.

Authors:  P A Luce; T C Feustel; D B Pisoni
Journal:  Hum Factors       Date:  1983-02       Impact factor: 2.888

10.  Algorithms for separating the speech of interfering talkers: evaluations with voiced sentences, and normal-hearing and hearing-impaired listeners.

Authors:  R J Stubbs; Q Summerfield
Journal:  J Acoust Soc Am       Date:  1990-01       Impact factor: 1.840

View more
  7 in total

1.  Toddlers' comprehension of degraded signals: Noise-vocoded versus sine-wave analogs.

Authors:  Rochelle S Newman; Monita Chatterjee; Giovanna Morini; Robert E Remez
Journal:  J Acoust Soc Am       Date:  2015-09       Impact factor: 1.840

2.  Modulation sensitivity in the perceptual organization of speech.

Authors:  Robert E Remez; Emily F Thomas; Kathryn R Dubowski; Stavroula M Koinis; Natalie A C Porter; Nina U Paddu; Marina Moskalenko; Yael S Grossman
Journal:  Atten Percept Psychophys       Date:  2013-10       Impact factor: 2.199

3.  Constraints on Sensitivity to Auditory Modulation in the Perceptual Organization of Speech.

Authors:  Robert E Remez; Emily F Thomas; Andrea M Wycoff; Rebecca E Giglio; Aislinn T Crank; Chloe B Cheimets; Stavroula M Koinis
Journal:  Exp Aging Res       Date:  2016       Impact factor: 1.645

4.  Information for coarticulation: Static signal properties or formant dynamics?

Authors:  Navin Viswanathan; James S Magnuson; Carol A Fowler
Journal:  J Exp Psychol Hum Percept Perform       Date:  2014-04-14       Impact factor: 3.332

5.  Acoustic source characteristics, across-formant integration, and speech intelligibility under competitive conditions.

Authors:  Brian Roberts; Robert J Summers; Peter J Bailey
Journal:  J Exp Psychol Hum Percept Perform       Date:  2015-03-09       Impact factor: 3.332

6.  Formant-frequency variation and informational masking of speech by extraneous formants: evidence against dynamic and speech-specific acoustical constraints.

Authors:  Brian Roberts; Robert J Summers; Peter J Bailey
Journal:  J Exp Psychol Hum Percept Perform       Date:  2014-05-19       Impact factor: 3.332

7.  Hierarchical Organization of Auditory and Motor Representations in Speech Perception: Evidence from Searchlight Similarity Analysis.

Authors:  Samuel Evans; Matthew H Davis
Journal:  Cereb Cortex       Date:  2015-07-08       Impact factor: 5.357

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.