Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Toward the ultimate synthesis/recognition system.

Literature DB >> 7479723

Toward the ultimate synthesis/recognition system.

Abstract

This paper predicts speech synthesis, speech recognition, and speaker recognition technology for the year 2001, and it describes the most important research problems to be solved in order to arrive at these ultimate synthesis and recognition systems. The problems for speech synthesis include natural and intelligible voice production, prosody control based on meaning, capability of controlling synthesized voice quality and choosing individual speaking style, multilingual and multidialectal synthesis, choice of application-oriented speaking styles, capability of adding emotion, and synthesis from concepts. The problems for speech recognition include robust recognition against speech variations, adaptation/normalization to variations due to environmental conditions and speakers, automatic knowledge acquisition for acoustic and linguistic modeling, spontaneous speech recognition, naturalness and ease of human-machine interaction, and recognition of emotion. The problems for speaker recognition are similar to those for speech recognition. The research topics related to all these techniques include the use of articulatory and perceptual constraints and evaluation methods for measuring the quality of technology and systems.

Entities: Species

Mesh：

Year: 1995 PMID： 7479723 PMCID： PMC40732 DOI： 10.1073/pnas.92.22.10040

Source DB: PubMed Journal: Proc Natl Acad Sci U S A ISSN： 0027-8424 Impact factor: 11.205

13 in total

Review 3. What is the Value of Embedding Artificial Emotional Prosody in Human-Computer Interactions? Implications for Theory and Design in Psychological Science.

Authors: Rachel L C Mitchell; Yi Xu
Journal: Front Psychol Date: 2015-11-12

3 in total

Toward the ultimate synthesis/recognition system.

1. Perceptual linear predictive (PLP) analysis of speech.

2. On the role of spectral transition for speech perception.

3. Military and government applications of human-machine communication by voice.

4. Models of natural language understanding.

5. State of the art in continuous speech recognition.

6. The role of voice input for human-machine communication.

7. Models of speech synthesis.

8. New trends in natural language processing: statistical natural language processing.

9. Processing of speech signals for physical and sensory disabilities.

10. Voice-processing technologies--their application in telecommunications.

1. Speech technology in the year 2001.

2. Speech recognition technology: a critique.

Review 3. What is the Value of Embedding Artificial Emotional Prosody in Human-Computer Interactions? Implications for Theory and Design in Psychological Science.