Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Recognizing Whispered Speech Produced by an Individual with Surgically Reconstructed Larynx Using Articulatory Movement Data.

Literature DB >> 29423453

Recognizing Whispered Speech Produced by an Individual with Surgically Reconstructed Larynx Using Articulatory Movement Data.

Beiming Cao¹, Myungjong Kim¹, Ted Mau², Jun Wang^1,3.

Abstract

Individuals with larynx (vocal folds) impaired have problems in controlling their glottal vibration, producing whispered speech with extreme hoarseness. Standard automatic speech recognition using only acoustic cues is typically ineffective for whispered speech because the corresponding spectral characteristics are distorted. Articulatory cues such as the tongue and lip motion may help in recognizing whispered speech since articulatory motion patterns are generally not affected. In this paper, we investigated whispered speech recognition for patients with reconstructed larynx using articulatory movement data. A data set with both acoustic and articulatory motion data was collected from a patient with surgically reconstructed larynx using an electromagnetic articulograph. Two speech recognition systems, Gaussian mixture model-hidden Markov model (GMM-HMM) and deep neural network-HMM (DNN-HMM), were used in the experiments. Experimental results showed adding either tongue or lip motion data to acoustic features such as mel-frequency cepstral coefficient (MFCC) significantly reduced the phone error rates on both speech recognition systems. Adding both tongue and lip data achieved the best performance.

Entities: Chemical Disease Gene Species

Keywords: deep neural network; hidden Markov model; larynx reconstruction; speech articulation; whispered speech recognition

Year: 2016 PMID： 29423453 PMCID： PMC5800526 DOI： 10.21437/SLPAT.2016-14

Source DB: PubMed Journal: Workshop Speech Lang Process Assist Technol ISSN： 2411-9962

Keyword Cloud
References

8 in total

Recognizing Whispered Speech Produced by an Individual with Surgically Reconstructed Larynx Using Articulatory Movement Data.

1. Automatic speech recognition using articulatory features from subject-independent acoustic-to-articulatory inversion.

Review 2. Diagnostic evaluation and management of hoarseness.

3. An Optimal Set of Flesh Points on Tongue and Lips for Speech-Movement Classification.

4. Accuracy of the NDI wave speech research system.

Review 5. Speech production knowledge in automatic speech recognition.

6. Frequency of word occurrence in communication samples produced by adult communication aid users.

7. Articulatory distinctiveness of vowels and consonants: a data-driven approach.

8. Modulating phonation through alteration of vocal fold medial surface contour.