Literature DB >> 21702797

How should a speech recognizer work?

Odette Scharenborg1, Dennis Norris, Louis Bosch, James M McQueen.   

Abstract

Although researchers studying human speech recognition (HSR) and automatic speech recognition (ASR) share a common interest in how information processing systems (human or machine) recognize spoken language, there is little communication between the two disciplines. We suggest that this lack of communication follows largely from the fact that research in these related fields has focused on the mechanics of how speech can be recognized. In Marr's (1982) terms, emphasis has been on the algorithmic and implementational levels rather than on the computational level. In this article, we provide a computational-level analysis of the task of speech recognition, which reveals the close parallels between research concerned with HSR and ASR. We illustrate this relation by presenting a new computational model of human spoken-word recognition, built using techniques from the field of ASR that, in contrast to current existing models of HSR, recognizes words from real speech input. 2005 Lawrence Erlbaum Associates, Inc.

Entities:  

Year:  2005        PMID: 21702797     DOI: 10.1207/s15516709cog0000_37

Source DB:  PubMed          Journal:  Cogn Sci        ISSN: 0364-0213


  5 in total

1.  DIANA, a Process-Oriented Model of Human Auditory Word Recognition.

Authors:  Louis Ten Bosch; Lou Boves; Mirjam Ernestus
Journal:  Brain Sci       Date:  2022-05-23

2.  Individual aptitude in Mandarin lexical tone perception predicts effectiveness of high-variability training.

Authors:  Makiko Sadakata; James M McQueen
Journal:  Front Psychol       Date:  2014-11-25

3.  Words from spontaneous conversational speech can be recognized with human-like accuracy by an error-driven learning algorithm that discriminates between meanings straight from smart acoustic features, bypassing the phoneme as recognition unit.

Authors:  Denis Arnold; Fabian Tomaschek; Konstantin Sering; Florence Lopez; R Harald Baayen
Journal:  PLoS One       Date:  2017-04-10       Impact factor: 3.240

4.  Attention Differentially Affects Acoustic and Phonetic Feature Encoding in a Multispeaker Environment.

Authors:  Emily S Teoh; Farhin Ahmed; Edmund C Lalor
Journal:  J Neurosci       Date:  2021-12-10       Impact factor: 6.167

5.  Using auditory classification images for the identification of fine acoustic cues used in speech perception.

Authors:  Léo Varnet; Kenneth Knoblauch; Fanny Meunier; Michel Hoen
Journal:  Front Hum Neurosci       Date:  2013-12-16       Impact factor: 3.169

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.