Literature DB >> 2958525

Review of text-to-speech conversion for English.

D H Klatt1.   

Abstract

The automatic conversion of English text to synthetic speech is presently being performed, remarkably well, by a number of laboratory systems and commercial devices. Progress in this area has been made possible by advances in linguistic theory, acoustic-phonetic characterization of English sound patterns, perceptual psychology, mathematical modeling of speech production, structured programming, and computer hardware design. This review traces the early work on the development of speech synthesizers, discovery of minimal acoustic cues for phonetic contrasts, evolution of phonemic rule programs, incorporation of prosodic rules, and formulation of techniques for text analysis. Examples of rules are used liberally to illustrate the state of the art. Many of the examples are taken from Klattalk, a text-to-speech system developed by the author. A number of scientific problems are identified that prevent current systems from achieving the goal of completely human-sounding speech. While the emphasis is on rule programs that drive a format synthesizer, alternatives such as articulatory synthesis and waveform concatenation are also reviewed. An extensive bibliography has been assembled to show both the breadth of synthesis activity and the wealth of phenomena covered by rules in the best of these programs. A recording of selected examples of the historical development of synthetic speech, enclosed as a 33 1/3-rpm record, is described in the Appendix.

Entities:  

Mesh:

Year:  1987        PMID: 2958525     DOI: 10.1121/1.395275

Source DB:  PubMed          Journal:  J Acoust Soc Am        ISSN: 0001-4966            Impact factor:   1.840


  14 in total

Review 1.  Comprehension of synthetic speech produced by rule: a review and theoretical interpretation.

Authors:  S A Duffy; D B Pisoni
Journal:  Lang Speech       Date:  1992 Oct-Dec       Impact factor: 1.500

2.  Segmental intelligibility of synthetic speech produced by rule.

Authors:  J S Logan; B G Greene; D B Pisoni
Journal:  J Acoust Soc Am       Date:  1989-08       Impact factor: 1.840

3.  Comprehension of synthetic speech produced by rule: word monitoring and sentence-by-sentence listening times.

Authors:  J V Ralston; D B Pisoni; S E Lively; B G Greene; J W Mullennix
Journal:  Hum Factors       Date:  1991-08       Impact factor: 2.888

4.  Mechanics of human voice production and control.

Authors:  Zhaoyan Zhang
Journal:  J Acoust Soc Am       Date:  2016-10       Impact factor: 1.840

5.  Methods for eliciting, annotating, and analyzing databases for child speech development.

Authors:  Mary E Beckman; Andrew R Plummer; Benjamin Munson; Patrick F Reidy
Journal:  Comput Speech Lang       Date:  2017-09       Impact factor: 1.899

6.  Models of speech synthesis.

Authors:  R Carlson
Journal:  Proc Natl Acad Sci U S A       Date:  1995-10-24       Impact factor: 11.205

7.  Learning phonology from surface distributions, considering Dutch and English vowel duration.

Authors:  Daniel Swingley
Journal:  Lang Learn Dev       Date:  2019-02-14

8.  Native language governs interpretation of salient speech sound differences at 18 months.

Authors:  Christiane Dietrich; Daniel Swingley; Janet F Werker
Journal:  Proc Natl Acad Sci U S A       Date:  2007-10-02       Impact factor: 11.205

9.  An Application of Network Science to Phonological Sequence Learning in Children With Developmental Language Disorder.

Authors:  Sara Benham; Lisa Goffman; Richard Schweickert
Journal:  J Speech Lang Hear Res       Date:  2018-09-19       Impact factor: 2.297

10.  Lexical-Semantic Cues Induce Sound Pattern Stability in Children With Developmental Language Disorder.

Authors:  Sara Benham; Lisa Goffman
Journal:  J Speech Lang Hear Res       Date:  2020-11-30       Impact factor: 2.297

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.