Literature DB >> 489818

Automatic speech recognition using psychoacoustic models.

E Zwicker, E Terhardt, E Paulus.   

Abstract

An approach to automatic speech recognition is described, which, in a straightforward way, follows the concept of (1) preprocessing in terms of auditory parameters and (2) subsequent classification and recognition. The preprocessing system has been realized in analog hardware, while recognition is carried out on a digital computer. In the preprocessing system, the essential psychoacoustic principles of the perception of loudness, pitch, roughness, and subjective duration are implemented with some approximation. The system essentially consists of 24 bandpass filters, nonlinear transformation of each filter output into specific loudness and specific roughness, and final transformation of these parameters into total loudness, total roughness, and three spectral momenta. As a means to further reduce the information flow, continuous selection of dominant parameters is also considered on the basis of psychoacoustic data. The subsequent recognition process is mainly characterized by (1) discrimination between speech and silent periods, (2) detection of syllable peaks and classification of syllable nuclei, and (3) assumption of syllable boundaries and classification of consonant clusters. Though the entire system as yet is far from being complete and perfect, the present results indicate that the concept provides a systematic and promising way towards automatic recognition of continuous speech.

Mesh:

Year:  1979        PMID: 489818     DOI: 10.1121/1.382349

Source DB:  PubMed          Journal:  J Acoust Soc Am        ISSN: 0001-4966            Impact factor:   1.840


  8 in total

1.  Robust Speech Rate Estimation for Spontaneous Speech.

Authors:  Dagen Wang; Shrikanth S Narayanan
Journal:  IEEE Trans Audio Speech Lang Process       Date:  2007-11-01

2.  Acoustic-phonetic representations in word recognition.

Authors:  D B Pisoni; P A Luce
Journal:  Cognition       Date:  1987-03

3.  Some current theoretical issues in speech perception.

Authors:  D B Pisoni
Journal:  Cognition       Date:  1981 Aug-Dec

4.  Perceptual classification of information in vowel-consonant syllables.

Authors:  R B Ivry; P W Jusczyk
Journal:  Percept Psychophys       Date:  1985-02

5.  Speech perception: some new directions in research and theory.

Authors:  D B Pisoni
Journal:  J Acoust Soc Am       Date:  1985-07       Impact factor: 1.840

6.  [Acoustic analyses of snoring sounds: the possibilities and outlook].

Authors:  M Herzog
Journal:  HNO       Date:  2012-04       Impact factor: 1.284

7.  Speech perception by budgerigars (Melopsittacus undulatus): spoken vowels.

Authors:  R J Dooling; S D Brown
Journal:  Percept Psychophys       Date:  1990-06

Review 8.  Syllabic (∼2-5 Hz) and fluctuation (∼1-10 Hz) ranges in speech and auditory processing.

Authors:  Erik Edwards; Edward F Chang
Journal:  Hear Res       Date:  2013-09-12       Impact factor: 3.208

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.