| Literature DB >> 28487827 |
Iman Esmaili1, Nader Jafarnia Dabanloo1, Mansour Vali2.
Abstract
In recent years, many methods have been introduced for supporting the diagnosis of stuttering for automatic detection of prolongation in the speech of people who stutter. However, less attention has been paid to treatment processes in which clients learn to speak more slowly. The aim of this study was to develop a method to help speech-language pathologists (SLPs) during diagnosis and treatment sessions. To this end, speech signals were initially parameterized to perceptual linear predictive (PLP) features. To detect the prolonged segments, the similarities between successive frames of speech signals were calculated based on correlation similarity measures. The segments were labeled as prolongation when the duration of highly similar successive frames exceeded a threshold specified by the speaking rate. The proposed method was evaluated by UCLASS and self-recorded Persian speech databases. The results were also compared with three high-performance studies in automatic prolongation detection. The best accuracies of prolongation detection were 99 and 97.1% for UCLASS and Persian databases, respectively. The proposed method also indicated promising robustness against artificial variation of speaking rate from 70 to 130% of normal speaking rate.Entities:
Keywords: Attention; language; learning; pathologists; speech; speech-language pathology; stuttering
Year: 2017 PMID: 28487827 PMCID: PMC5394801
Source DB: PubMed Journal: J Med Signals Sens ISSN: 2228-7477
Figure 1The block diagram of the proposed system
Figure 2An example of syllable counting method. Vertical lines, solid line, and dashed line are actual syllable boundaries, energy signal, and zero-crossing rate, respectively. Circle marks are considered in the syllable counting process. [The amplitude of signals (i.e., Y axis) were normalized to fit all signals in proper view]
Figure 3The block diagram of the PLP feature extraction method
Figure 4Prolongation detection. (a) Speech sample “to make up” with prolongation in word “make,” (b) highly similar segments, and (c) detected prolonged segment which is longer than threshold (here 400 ms)
The recognition rates for prolongation detection on UCLASS and Persian database
The accuracy of prolongation detection methods for artificial variations of speaking rate from 60 to 140% of normal speaking