Literature DB >> 16359906

Investigation of an HMM/ANN hybrid structure in pattern recognition application using cepstral analysis of dysarthric (distorted) speech signals.

Prasad D Polur1, Gerald E Miller.   

Abstract

Computer speech recognition of individuals with dysarthria, such as cerebral palsy patients requires a robust technique that can handle conditions of very high variability and limited training data. In this study, application of a 10 state ergodic hidden Markov model (HMM)/artificial neural network (ANN) hybrid structure for a dysarthric speech (isolated word) recognition system, intended to act as an assistive tool, was investigated. A small size vocabulary spoken by three cerebral palsy subjects was chosen. The effect of such a structure on the recognition rate of the system was investigated by comparing it with an ergodic hidden Markov model as a control tool. This was done in order to determine if this modified technique contributed to enhanced recognition of dysarthric speech. The speech was sampled at 11 kHz. Mel frequency cepstral coefficients were extracted from them using 15 ms frames and served as training input to the hybrid model setup. The subsequent results demonstrated that the hybrid model structure was quite robust in its ability to handle the large variability and non-conformity of dysarthric speech. The level of variability in input dysarthric speech patterns sometimes limits the reliability of the system. However, its application as a rehabilitation/control tool to assist dysarthric motor impaired individuals holds sufficient promise.

Entities:  

Mesh:

Year:  2005        PMID: 16359906     DOI: 10.1016/j.medengphy.2005.11.002

Source DB:  PubMed          Journal:  Med Eng Phys        ISSN: 1350-4533            Impact factor:   2.242


  2 in total

1.  Regularized Speaker Adaptation of KL-HMM for Dysarthric Speech Recognition.

Authors:  Myungjong Kim; Younggwan Kim; Joohong Yoo; Jun Wang; Hoirin Kim
Journal:  IEEE Trans Neural Syst Rehabil Eng       Date:  2017-03-13       Impact factor: 3.802

2.  Estimation of phoneme-specific HMM topologies for the automatic recognition of dysarthric speech.

Authors:  Santiago-Omar Caballero-Morales
Journal:  Comput Math Methods Med       Date:  2013-10-08       Impact factor: 2.238

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.