Literature DB >> 10530027

A model of auditory perception as front end for automatic speech recognition.

J Tchorz1, B Kollmeier.   

Abstract

A front end for automatic speech recognizers is proposed and evaluated which is based on a quantitative model of the "effective" peripheral auditory processing. The model simulates both spectral and temporal properties of sound processing in the auditory system which were found in psychoacoustical and physiological experiments. The robustness of the auditory-based representation of speech was evaluated in speaker-independent, isolated word recognition experiments in different types of additive noise. The results show a higher robustness of the auditory front end in noise, compared to common mel-scale cepstral feature extraction. In a second set of experiments, different processing stages of the auditory front end were modified to study their contribution to robust speech signal representation in detail. The adaptive compression stage which enhances temporal changes of the input signal appeared to be the most important processing stage towards robust speech representation in noise. Low-pass filtering of the fast fluctuating envelope in each frequency band further reduces the influence of noise in the auditory-based representation of speech.

Mesh:

Year:  1999        PMID: 10530027     DOI: 10.1121/1.427950

Source DB:  PubMed          Journal:  J Acoust Soc Am        ISSN: 0001-4966            Impact factor:   1.840


  3 in total

1.  A computational model of mechanisms controlling experience-dependent reorganization of representational maps in auditory cortex.

Authors:  E Mercado; C E Myers; M A Gluck
Journal:  Cogn Affect Behav Neurosci       Date:  2001-03       Impact factor: 3.282

2.  Auditory stream formation affects comodulation masking release retroactively.

Authors:  Torsten Dau; Stephan Ewert; Andrew J Oxenham
Journal:  J Acoust Soc Am       Date:  2009-04       Impact factor: 1.840

3.  Mandarin Chinese tone identification in cochlear implants: predictions from acoustic models.

Authors:  Kenneth D Morton; Peter A Torrione; Chandra S Throckmorton; Leslie M Collins
Journal:  Hear Res       Date:  2008-07-31       Impact factor: 3.208

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.