
Automatic recognition of pathological phoneme production.

Robert Wielgat, Tomasz P Zieliński, Tomasz Woźniak, Stanisław Grabias, Daniel Król.

Abstract

OBJECTIVE: Proper diagnosis and therapy of pathological pronunciation of phonemes play an important role in modern logopedics. To enhance the efficiency of diagnosis and therapy, this paper addresses automatic recognition of pathological phoneme pronunciation. The authors focus on the therapy of phoneme substitution disorders.
PATIENTS AND METHODS: The speech samples came from speech-impaired Polish children and, in part, from persons imitating speech disorders. The speech disorders to be recognized were substitutions in pairs (for the correct phonetic characters, please see the online article) embedded in Polish carrier words. To detect substitutions in the recognized words, the recently proposed human factor cepstral coefficients (HFCC) were implemented. The efficiency of the HFCC approach was compared with that of standard mel-frequency cepstral coefficients (MFCC) used as the feature vector. Both dynamic time warping (DTW), working on whole words or embedded phoneme patterns, and hidden Markov models (HMM) were used as classifiers. The HMM classifier was based on whole-word models as well as phoneme models. The results present a comparative analysis of the DTW and HMM methods.
CONCLUSIONS: The superiority of HFCC features over MFCC features was demonstrated. Results obtained by the DTW methods, mainly by the modified phoneme-based DTW classifier, were slightly better than those obtained with the HMM classifier. Results obtained for the detection of substitutions in pairs (for the correct phonetic characters, please see the online article) are very promising. The methods developed for these cases can be integrated into computer systems for speech therapy. For substitutions in pairs (for the correct phonetic characters, please see the online article), further research is necessary. Copyright 2008 S. Karger AG, Basel.
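
As an illustration of the DTW-based classification mentioned in the abstract, the following is a minimal sketch of textbook dynamic time warping with a nearest-template decision rule. It is not the authors' modified phoneme-based DTW classifier. The function names `dtw_distance` and `classify`, the Euclidean local cost, and the one-dimensional toy "feature vectors" are all assumptions made for this example; in practice each frame would be an HFCC or MFCC vector.

```python
import math


def dtw_distance(a, b):
    """Cumulative DTW alignment cost between two sequences of feature vectors.

    Assumption: Euclidean distance as the local frame-to-frame cost and the
    basic step pattern (match, insertion, deletion) with no path constraints.
    """
    n, m = len(a), len(b)
    inf = float("inf")
    # cost[i][j] holds the minimal cumulative cost of aligning a[:i] with b[:j].
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = math.dist(a[i - 1], b[j - 1])
            cost[i][j] = local + min(
                cost[i - 1][j],      # a advances, b frame is repeated
                cost[i][j - 1],      # b advances, a frame is repeated
                cost[i - 1][j - 1],  # both sequences advance
            )
    return cost[n][m]


def classify(sample, templates):
    """Return the label of the template with the smallest DTW distance."""
    return min(templates, key=lambda label: dtw_distance(sample, templates[label]))


# Hypothetical toy templates: one per pronunciation class.
templates = {
    "correct": [[0.0], [1.0], [2.0]],
    "substituted": [[5.0], [6.0], [7.0]],
}
# A time-stretched version of the "correct" pattern.
sample = [[0.0], [0.0], [1.0], [2.0]]
label = classify(sample, templates)
```

Here `label` is the class whose template aligns with the sample at the lowest cumulative cost, so a slower or faster utterance of the same pattern can still match its template.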

Year: 2008    PMID: 19011305    DOI: 10.1159/000170083

Source DB: PubMed    Journal: Folia Phoniatr Logop    ISSN: 1021-7762    Impact factor: 0.849


  2 in total

1.  Formant analysis in dysphonic patients and automatic Arabic digit speech recognition.

Authors:  Ghulam Muhammad; Tamer A Mesallam; Khalid H Malki; Mohamed Farahat; Mansour Alsulaiman; Manal Bukhari
Journal:  Biomed Eng Online       Date:  2011-05-30       Impact factor: 2.819

2.  Modulation Spectra Morphological Parameters: A New Method to Assess Voice Pathologies according to the GRBAS Scale.

Authors:  Laureano Moro-Velázquez; Jorge Andrés Gómez-García; Juan Ignacio Godino-Llorente; Gustavo Andrade-Miranda
Journal:  Biomed Res Int       Date:  2015-10-18       Impact factor: 3.411
