Literature DB >> 7479806

Research in speech communication.

J Flanagan1.   

Abstract

Advances in digital speech processing are now supporting application and deployment of a variety of speech technologies for human/machine communication. In fact, new businesses are rapidly forming about these technologies. But these capabilities are of little use unless society can afford them. Happily, explosive advances in microelectronics over the past two decades have assured affordable access to this sophistication as well as to the underlying computing technology. The research challenges in speech processing remain in the traditionally identified areas of recognition, synthesis, and coding. These three areas have typically been addressed individually, often with significant isolation among the efforts. But they are all facets of the same fundamental issue--how to represent and quantify the information in the speech signal. This implies deeper understanding of the physics of speech production, the constraints that the conventions of language impose, and the mechanism for information processing in the auditory system. In ongoing research, therefore, we seek more accurate models of speech generation, better computational formulations of language, and realistic perceptual guides for speech processing--along with ways to coalesce the fundamental issues of recognition, synthesis, and coding. Successful solution will yield the long-sought dictation machine, high-quality synthesis from text, and the ultimate in low bit-rate transmission of speech. It will also open the door to language-translating telephony, where the synthetic foreign translation can be in the voice of the originating talker.

Entities:  

Mesh:

Year:  1995        PMID: 7479806      PMCID: PMC40715          DOI: 10.1073/pnas.92.22.9938

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205


  2 in total

1.  Speech recognition technology: a critique.

Authors:  S E Levinson
Journal:  Proc Natl Acad Sci U S A       Date:  1995-10-24       Impact factor: 11.205

2.  Scientific bases of human-machine communication by voice.

Authors:  R W Schafer
Journal:  Proc Natl Acad Sci U S A       Date:  1995-10-24       Impact factor: 11.205

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.