| Literature DB >> 28553366 |
Yunfeng Wu1, Pinnan Chen1, Yuchen Yao1, Xiaoquan Ye1, Yugui Xiao1, Lifang Liao1, Meihong Wu1, Jian Chen2.
Abstract
Analysis of quantified voice patterns is useful in the detection and assessment of dysphonia and related phonation disorders. In this paper, we first study the linear correlations between 22 voice parameters of fundamental frequency variability, amplitude variations, and nonlinear measures. The highly correlated vocal parameters are combined by using the linear discriminant analysis method. Based on the probability density functions estimated by the Parzen-window technique, we propose an interclass probability risk (ICPR) method to select the vocal parameters with small ICPR values as dominant features and compare with the modified Kullback-Leibler divergence (MKLD) feature selection approach. The experimental results show that the generalized logistic regression analysis (GLRA), support vector machine (SVM), and Bagging ensemble algorithm input with the ICPR features can provide better classification results than the same classifiers with the MKLD selected features. The SVM is much better at distinguishing normal vocal patterns with a specificity of 0.8542. Among the three classification methods, the Bagging ensemble algorithm with ICPR features can identify 90.77% vocal patterns, with the highest sensitivity of 0.9796 and largest area value of 0.9558 under the receiver operating characteristic curve. The classification results demonstrate the effectiveness of our feature selection and pattern analysis methods for dysphonic voice detection and measurement.Entities:
Mesh:
Year: 2017 PMID: 28553366 PMCID: PMC5434464 DOI: 10.1155/2017/4201984
Source DB: PubMed Journal: Comput Math Methods Med ISSN: 1748-670X Impact factor: 2.238
Pathological stages of 23 patients with Parkinson's disease rated by the modified Hoehn and Yahr (MHAY) scale.
| MHAY stage | Disease description | Number of patients |
|---|---|---|
| 1 | Minimal functional involvement | 3 |
| 1.5 | Unilateral involvement | 3 |
| 2 | Bilateral involvement | 5 |
| 2.5 | Mild bilateral disease | 7 |
| 3 | Mild to moderate bilateral disease | 4 |
| 4 | Severe functional disability | 1 |
Voice perturbation and nonlinear dynamic parameters measured from the acoustic signals of 31 subjects [5, 7].
| Abbreviations | Feature description |
|---|---|
| MDVP:F0 (Hz) | Average vocal fundamental frequency |
| MDVP:Fhi (Hz) | Maximum vocal fundamental frequency |
| MDVP:Flo (Hz) | Minimum vocal fundamental frequency |
| MDVP:Jitter(%) | MDVP jitter in percentage |
| MDVP:Jitter(Abs) | MDVP absolute jitter in ms |
| MDVP:RAP | MDVP relative amplitude perturbation |
| MDVP:PPQ | MDVP five-point period perturbation quotient |
| Jitter:DDP | Average absolute difference of differences between jitter cycles |
| MDVP:Shimmer | MDVP local shimmer |
| MDVP:Shimmer(dB) | MDVP local shimmer in dB |
| Shimmer:APQ3 | Three-point amplitude perturbation quotient |
| Shimmer:APQ5 | Five-point amplitude perturbation quotient |
| MDVP:APQ11 | MDVP 11-point amplitude perturbation quotient |
| Shimmer:DDA | Average absolute differences between the amplitudes of consecutive periods |
| NHR | Noise-to-harmonics ratio |
| HNR | Harmonics-to-noise ratio |
| RPDE | Recurrence period density entropy measure |
| D2 | Correlation dimension |
| DFA | Signal fractal scaling exponent of detrended fluctuation analysis |
| Spread1 | Two nonlinear measures of fundamental |
| Spread2 | Frequency variation |
| PPE | Pitch period entropy |
Pearson correlation coefficients between the vocal features of MDVP:Jitter(%), MDVP:Jitter(Abs), MDVP:RAP, MDVP:PPQ, Jitter:DDP, and NHR.
| Features | MDVP:Jitter(%) | MDVP:Jitter(Abs) | MDVP:RAP | MDVP:PPQ | Jitter:DDP | NHR |
|---|---|---|---|---|---|---|
| MDVP:Jitter(%) | 1 | 0.9357 | 0.9903 | 0.9743 | 0.9903 | 0.907 |
| MDVP:Jitter(Abs) | 0.9357 | 1 | 0.9229 | 0.8978 | 0.9229 | 0.835 |
| MDVP:RAP | 0.9903 | 0.9229 | 1 | 0.9573 | 1 | 0.9195 |
| MDVP:PPQ | 0.9743 | 0.8978 | 0.9573 | 1 | 0.9573 | 0.8446 |
| Jitter:DDP | 0.9903 | 0.9229 | 1 | 0.9573 | 1 | 0.9195 |
| NHR | 0.907 | 0.835 | 0.9195 | 0.8446 | 0.9195 | 1 |
Pearson correlation coefficients between the vocal features of MDVP:Shimmer, MDVP:Shimmer(dB), Shimmer:APQ3, Shimmer:APQ5, MDVP:APQ11, and Shimmer:DDA.
| Features | MDVP:Shimmer | MDVP:Shimmer(dB) | Shimmer:APQ3 | Shimmer:APQ5 | MDVP:APQ11 | Shimmer:DDA |
|---|---|---|---|---|---|---|
| MDVP:Shimmer | 1 | 0.9873 | 0.9876 | 0.9828 | 0.9501 | 0.9876 |
| MDVP:Shimmer(dB) | 0.9873 | 1 | 0.9632 | 0.9738 | 0.961 | 0.9632 |
| Shimmer:APQ3 | 0.9876 | 0.9632 | 1 | 0.9601 | 0.8966 | 1 |
| Shimmer:APQ5 | 0.9828 | 0.9738 | 0.9601 | 1 | 0.9491 | 0.9601 |
| MDVP:APQ11 | 0.9501 | 0.961 | 0.8966 | 0.9491 | 1 | 0.8966 |
| Shimmer:DDA | 0.9876 | 0.9632 | 1 | 0.9601 | 0.8966 | 1 |
Figure 1Estimated probability density functions of (a) recurrence period density entropy measure (RPDE) and (b) Shimmer-LDA features, plotted with blue and red curves for healthy control (CO) and idiopathic Parkinson's disease (IPD) subjects, respectively. The overlapping interclass probability risk (ICPR) areas are shown in gray color. Normalized histograms of (c) RPDE and (d) Shimmer-LDA features are also provided with the overlapped areas in gray color, for the purpose of comparison.
Feature selection by means of the interclass probability risk (ICPR) and modified Kullback-Leibler divergence (MKLD) methods. Bold values are selected features for pattern classifications (ICPR < 0.6; MKLD > 1).
| Features | ICPR | MKLD |
|---|---|---|
| MDVP:F0 |
|
|
| MDVP:Fhi | 0.71 | 0.26 |
| MDVP:Flo | 0.67 |
|
| HNR | 0.66 | 0.73 |
| RPDE | 0.71 | 0.25 |
| D2 | 0.76 | 0.22 |
| Spread1 |
| 0.89 |
| Spread2 | 0.69 | 0.46 |
| MDVP-LDA |
|
|
| Shimmer-LDA |
|
|
| Nonlinear-LDA |
|
|
Figure 2Error curves of the Bagging ensemble input with the (a) modified Kullback-Leibler divergence (MKLD) and (b) interclass probability risk (ICPR) selected features. Boundary of 95% confidence interval is shown in pseudo-red-color.
Figure 3Classification results of the generalized logistic regression analysis (GLRA), support vector machine (SVM), Bagging ensemble, and input with the MKLD and ICPR selected features, respectively. GLRA-MKLD: sensitivity: 0.9116; specificity: 0.5833; MCC: 0.5232; GLRA-ICPR: sensitivity: 0.932; specificity: 0.5833; MCC: 0.5604; SVM-MKLD: sensitivity: 0.9048; specificity: 0.8333; MCC: 0.7105; SVM-ICPR: sensitivity: 0.9252; specificity: 0.8542; MCC: 0.7592; Bagging-MKLD: sensitivity: 0.9592; specificity: 0.6875; MCC: 0.6964; Bagging-ICPR: sensitivity: 0.9796; specificity: 0.6875; MCC: 0.6977.
Figure 4Receiver operating characteristic (ROC) curves and area under the ROC curve (AUC) results of the generalized logistic regression analysis (GLRA), support vector machine (SVM), and Bagging ensemble input with the MKLD and ICPR selected features, respectively. GLRA-MKLD AUC ± standard error (SE): 0.8936 ± 0.024; GLRA-ICPR AUC ± SE: 0.9031 ± 0.0232; SVM-MKLD AUC ± SE: 0.9216 ± 0.023; SVM-ICPR AUC ± SE: 0.9349 ± 0.0219; Bagging-MKLD AUC ± SE: 0.9286 ± 0.0226, Bagging-ICPR AUC ± SE: 0.9558 ± 0.0147.