Literature DB >> 7479807

Linguistic aspects of speech synthesis.

J Allen1.   

Abstract

The conversion of text to speech is seen as an analysis of the input text to obtain a common underlying linguistic description, followed by a synthesis of the output speech waveform from this fundamental specification. Hence, the comprehensive linguistic structure serving as the substrate for an utterance must be discovered by analysis from the text. The pronunciation of individual words in unrestricted text is determined by morphological analysis or letter-to-sound conversion, followed by specification of the word-level stress contour. In addition, many text character strings, such as titles, numbers, and acronyms, are abbreviations for normal words, which must be derived. To further refine these pronunciations and to discover the prosodic structure of the utterance, word part of speech must be computed, followed by a phrase-level parsing. From this structure the prosodic structure of the utterance can be determined, which is needed in order to specify the durational framework and fundamental frequency contour of the utterance. In discourse contexts, several factors such as the specification of new and old information, contrast, and pronominal reference can be used to further modify the prosodic specification. When the prosodic correlates have been computed and the segmental sequence is assembled, a complete input suitable for speech synthesis has been determined. Lastly, multilingual systems utilizing rule frameworks are mentioned, and future directions are characterized.

Mesh:

Year:  1995        PMID: 7479807      PMCID: PMC40716          DOI: 10.1073/pnas.92.22.9946

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205


  3 in total

1.  Segmental durations in the vicinity of prosodic phrase boundaries.

Authors:  C W Wightman; S Shattuck-Hufnagel; M Ostendorf; P J Price
Journal:  J Acoust Soc Am       Date:  1992-03       Impact factor: 1.840

2.  The use of prosody in syntactic disambiguation.

Authors:  P J Price; M Ostendorf; S Shattuck-Hufnagel; C Fong
Journal:  J Acoust Soc Am       Date:  1991-12       Impact factor: 1.840

3.  Linguistic modality effects on fundamental frequency in speech.

Authors:  D O'Shaughnessy; J Allen
Journal:  J Acoust Soc Am       Date:  1983-10       Impact factor: 1.840

  3 in total
  2 in total

1.  Computer speech synthesis: its status and prospects.

Authors:  M Liberman
Journal:  Proc Natl Acad Sci U S A       Date:  1995-10-24       Impact factor: 11.205

2.  Deployment of human-machine dialogue systems.

Authors:  D B Roe
Journal:  Proc Natl Acad Sci U S A       Date:  1995-10-24       Impact factor: 11.205

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.