A comparison of automatic and human speech recognition in null grammar.

Amit Juneja

Abstract

The accuracy of automatic speech recognition (ASR) systems is generally evaluated using corpora of grammatically sound read speech or natural spontaneous speech. This prevents accurate estimation of the performance of the acoustic modeling part of ASR, because language modeling performance is inherently integrated into the overall performance metric. In this work, ASR and human speech recognition (HSR) accuracies are compared for null grammar sentences at different signal-to-noise ratios and vocabulary sizes: 1000, 2000, 4000, and 8000. The results shed light on differences between ASR and HSR in the relative significance of bottom-up word recognition and context awareness.
© 2012 Acoustical Society of America

Year: 2012   PMID: 22423817   DOI: 10.1121/1.3684744

Source DB: PubMed   Journal: J Acoust Soc Am   ISSN: 0001-4966   Impact factor: 1.840


  1 in total

1.  A psychophysical imaging method evidencing auditory cue extraction during speech perception: a group analysis of auditory classification images.

Authors:  Léo Varnet; Kenneth Knoblauch; Willy Serniclaes; Fanny Meunier; Michel Hoen
Journal:  PLoS One       Date:  2015-03-17       Impact factor: 3.240
