Literature DB >> 7479723

Toward the ultimate synthesis/recognition system.

S Furui1.   

Abstract

This paper predicts speech synthesis, speech recognition, and speaker recognition technology for the year 2001, and it describes the most important research problems to be solved in order to arrive at these ultimate synthesis and recognition systems. The problems for speech synthesis include natural and intelligible voice production, prosody control based on meaning, capability of controlling synthesized voice quality and choosing individual speaking style, multilingual and multidialectal synthesis, choice of application-oriented speaking styles, capability of adding emotion, and synthesis from concepts. The problems for speech recognition include robust recognition against speech variations, adaptation/normalization to variations due to environmental conditions and speakers, automatic knowledge acquisition for acoustic and linguistic modeling, spontaneous speech recognition, naturalness and ease of human-machine interaction, and recognition of emotion. The problems for speaker recognition are similar to those for speech recognition. The research topics related to all these techniques include the use of articulatory and perceptual constraints and evaluation methods for measuring the quality of technology and systems.

Entities:  

Mesh:

Year:  1995        PMID: 7479723      PMCID: PMC40732          DOI: 10.1073/pnas.92.22.10040

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205


  13 in total

1.  Perceptual linear predictive (PLP) analysis of speech.

Authors:  H Hermansky
Journal:  J Acoust Soc Am       Date:  1990-04       Impact factor: 1.840

2.  On the role of spectral transition for speech perception.

Authors:  S Furui
Journal:  J Acoust Soc Am       Date:  1986-10       Impact factor: 1.840

3.  Military and government applications of human-machine communication by voice.

Authors:  C J Weinstein
Journal:  Proc Natl Acad Sci U S A       Date:  1995-10-24       Impact factor: 11.205

4.  Models of natural language understanding.

Authors:  M Bates
Journal:  Proc Natl Acad Sci U S A       Date:  1995-10-24       Impact factor: 11.205

5.  State of the art in continuous speech recognition.

Authors:  J Makhoul; R Schwartz
Journal:  Proc Natl Acad Sci U S A       Date:  1995-10-24       Impact factor: 11.205

6.  The role of voice input for human-machine communication.

Authors:  P R Cohen; S L Oviatt
Journal:  Proc Natl Acad Sci U S A       Date:  1995-10-24       Impact factor: 11.205

7.  Models of speech synthesis.

Authors:  R Carlson
Journal:  Proc Natl Acad Sci U S A       Date:  1995-10-24       Impact factor: 11.205

8.  New trends in natural language processing: statistical natural language processing.

Authors:  M Marcus
Journal:  Proc Natl Acad Sci U S A       Date:  1995-10-24       Impact factor: 11.205

9.  Processing of speech signals for physical and sensory disabilities.

Authors:  H Levitt
Journal:  Proc Natl Acad Sci U S A       Date:  1995-10-24       Impact factor: 11.205

10.  Voice-processing technologies--their application in telecommunications.

Authors:  J G Wilpon
Journal:  Proc Natl Acad Sci U S A       Date:  1995-10-24       Impact factor: 11.205

View more
  3 in total

1.  Speech technology in the year 2001.

Authors:  S E Levinson; F Fallside
Journal:  Proc Natl Acad Sci U S A       Date:  1995-10-24       Impact factor: 11.205

2.  Speech recognition technology: a critique.

Authors:  S E Levinson
Journal:  Proc Natl Acad Sci U S A       Date:  1995-10-24       Impact factor: 11.205

Review 3.  What is the Value of Embedding Artificial Emotional Prosody in Human-Computer Interactions? Implications for Theory and Design in Psychological Science.

Authors:  Rachel L C Mitchell; Yi Xu
Journal:  Front Psychol       Date:  2015-11-12
  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.