Literature DB >> 21110529

Robust speech recognition from binary masks.

Arun Narayanan1, DeLiang Wang.   

Abstract

Inspired by recent evidence that a binary pattern may provide sufficient information for human speech recognition, this letter proposes a fundamentally different approach to robust automatic speech recognition. Specifically, recognition is performed by classifying binary masks corresponding to a word utterance. The proposed method is evaluated using a subset of the TIDigits corpus to perform isolated digit recognition. Despite dramatic reduction of speech information encoded in a binary mask, the proposed system performs surprisingly well. The system is compared with a traditional HMM based approach and is shown to perform well under low SNR conditions.

Entities:  

Mesh:

Year:  2010        PMID: 21110529     DOI: 10.1121/1.3497358

Source DB:  PubMed          Journal:  J Acoust Soc Am        ISSN: 0001-4966            Impact factor:   1.840


  2 in total

1.  Perceptual learning for speech in noise after application of binary time-frequency masks.

Authors:  Mahnaz Ahmadi; Vauna L Gross; Donal G Sinex
Journal:  J Acoust Soc Am       Date:  2013-03       Impact factor: 1.840

2.  The role of binary mask patterns in automatic speech recognition in background noise.

Authors:  Arun Narayanan; DeLiang Wang
Journal:  J Acoust Soc Am       Date:  2013-05       Impact factor: 1.840

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.