| Literature DB >> 35242101 |
Antonio Suppa1,2, Giovanni Costantini3, Francesco Asci2, Pietro Di Leo3, Mohammad Sami Al-Wardat4, Giulia Di Lazzaro5, Simona Scalise6, Antonio Pisani7,8, Giovanni Saggio3.
Abstract
INTRODUCTION: Parkinson's disease (PD) is characterized by specific voice disorders collectively termed hypokinetic dysarthria. We here investigated voice changes by using machine learning algorithms, in a large cohort of patients with PD in different stages of the disease, OFF and ON therapy.Entities:
Keywords: L-Dopa; Parkinson's disease; hypokinetic dysarthria; machine learning; voice analysis
Year: 2022 PMID: 35242101 PMCID: PMC8886162 DOI: 10.3389/fneur.2022.831428
Source DB: PubMed Journal: Front Neurol ISSN: 1664-2295 Impact factor: 4.003
Demographic and clinical features of HS and PD.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| PD (whole group) | 68.2 ± 9.2 | 71.8 ± 11.6 | 172.1 ± 9.4 | 5.6 ± 4.7 | 28.4 ± 2.1 | 3.5 ± 1.8 | 16.5 ± 1.4 | 2.2 ± 0.8 | 22.3 ± 14.2 | – | 1.8 ± 1.1 | – | 16.7 ± 16.9 | – |
| 64.2 ± 8.6 | 71.8 ± 10.6 | 172.9 ± 9.8 | 2.1 ± 0.9 | 28.9 ± 1.1 | 3.2 ± 2.0 | 16.6 ± 1.0 | 1.5 ± 0.4 | 12.1 ± 4.1 | – | 0.9 ± 0.7 | – | 7.3 ± 4.9 | – | |
| 72.1 ± 8.1 | 71.9 ± 12.6 | 171.2 ± 9.0 | 9.0 ± 4.4 | 28.0 ± 2.6 | 3.9 ± 1.6 | 16.4 ± 1.6 | 2.8 ± 0.4 | 32.3 ± 13.5 | 28.3 ± 13.8 | 2.7 ± 0.6 | 2.4 ± 0.5 | 25.9 ± 19.2 | 20.0 ± 17.7 | |
| HS | 70.3 ± 10.3 | 68.5 ± 10.6 | 169.0 ± 10.1 | – | 29.0 ± 0.8 | 3.3 ± 1.7 | 16.6 ± 1.1 | – | – | – | – | – | – | – |
DD, disease duration; MMSE, Mini-Mental State Evaluation; HAM-D, Hamilton Depression Rating Scale; FAB, Frontal Assessment Battery; H&Y, Hoehn and Yahr Scale for assessment stage of PD; HS, healthy subjects; PD, patients with Parkinson's disease; UPDRS-III, Unified Parkinson's Disease Rating Scale part III; UPDRS-III-v, Unified Parkinson's Disease Rating Scale part III, voice impairment subitem; VHI, Voice Handicap Index; OFF, not-under the effect of L-Dopa; ON, under the effect of L-Dopa. Results are expressed as average ± standard deviation.
List of the first 30 selected features for the comparison between HS and PD.
|
|
| |||||
|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
| 1 | RASTA coefficients | Coefficient of band 22 | Standard deviation of falling slope | Spectral LLD | Spectral Roll Off point 0.90 | Absolute peak range |
| 2 | Voicing Related LLD | Fundamental Frequency (fo) | Minimum segment length | Spectral LLD | Spectral Roll Off point 0.50 | Inter-quartile 1–3 |
| 3 | Energy Related LLD | Sum of auditory spectrum | Flatness | Spectral LLD | Spectral Roll Off point 0.50 | Quartile 3 |
| 4 | Spectral LLD | Spectral Flux | Quadratic regression coefficient 1 | Energy Related LLD | Zero Crossing Rate | 99% percentile |
| 5 | RASTA coefficients | Coefficient of band 2 | Linear prediction coefficient 4 | Spectral LLD | Spectral Variance | Range |
| 6 | RASTA coefficients | Coefficient of band 21 (de) | Standard deviation of rising slope | Spectral LLD | Spectral Roll Off point 0.25 | Quartile 3 |
| 7 | Spectral LLD | Spectral Slope (de) | Position of max | Spectral LLD | Spectral Roll Off point 0.25 | Linear prediction coefficient 0 |
| 8 | RASTA coefficients | Coefficient of band 25 | Flatness | Spectral LLD | Psychoacoustic Sharpness | 1% percentile |
| 9 | Spectral LLD | Spectral energy 250–650 Hz | Relative min range | RASTA coefficients | Coefficient of band 8 (de) | Flatness |
| 10 | Energy Related LLD | RMS Energy (de) | Linear prediction coefficient 0 | Spectral LLD | Spectral Centroid | 99% percentile |
| 11 | Spectral LLD | Spectral Flux | Standard deviation of falling slope | Spectral LLD | Spectral Roll Off point 0.75 | Absolute peak range |
| 12 | Voicing Related LLD | Fundamental Frequency (fo) | 1% percentile | RASTA coefficients | Coefficient of band 1 | Mean of rising slope |
| 13 | MFCC | 8th Mel Coefficient | Inter-quartile 1–2 | Spectral LLD | Spectral Roll Off point 0.25 | Quadratic regression coefficient 2 |
| 14 | RASTA coefficients | Coefficient of band 25 (de) | Gain of linear prediction | MFCC | 2nd Mel Coefficient | Quadratic regression quadratic |
| 15 | Spectral LLD | Spectral Flux | Range | Spectral LLD | Spectral Roll Off point 0.25 | Inter-quartile 2–3 |
| 16 | Spectral LLD | Spectral Flux | Quadratic regression coefficient 2 | Spectral LLD | Spectral Entropy | Range |
| 17 | Spectral LLD | Spectral Slope | Gain of linear prediction | Energy Related LLD | Zero Crossing Rate | Standard deviation of rising slope |
| 18 | Spectral LLD | Spectral Slope | Standard deviation of rising slope | Spectral LLD | Spectral Roll Off point 0.50 | Quadratic regression coefficient 3 |
| 19 | Spectral LLD | Spectral Variance (de) | Relative peak mean | Voicing Related LLD | Fundamental frequency | Inter-quartile 2–3 |
| 20 | MFCC | 5th Mel Coefficient (de) | Skewness | Spectral LLD | Spectral Entropy | Absolute peak mean |
| 21 | RASTA coefficients | Coefficient of band 4 (de) | Skewness | MFCC | 3rd Mel Coefficient | 1% percentile |
| 22 | Energy Related LLD | RMS Energy | Mean of falling slope | Spectral LLD | Spectral Variance | Inter-quartile 2–3 |
| 23 | Spectral LLD | Spectral Roll Off point 0.75 | Linear prediction coefficient 3 | RASTA coefficients | Coefficient of band 18 | Position of min |
| 24 | RASTA coefficients | Coefficient of band 5 | Linear prediction coefficient 4 | MFCC | 3rd Mel Coefficient | Relative peak mean |
| 25 | Energy Related LLD | Zero Crossing Rate | Linear prediction coefficient 0 | Spectral LLD | Spectral Kurtosis | Absolute peak range |
| 26 | MFCC | 4th Mel Coefficient (de) | Relative peak range | RASTA coefficients | Coefficient of band 9 (de) | Flatness |
| 27 | Voicing Related LLD | Shimmer (Local) | Position of max | RASTA coefficients | Coefficient of band 4 | Position of min |
| 28 | RASTA coefficients | Coefficient of band 2 | Linear prediction coefficient 3 | Spectral LLD | Spectral Centroid | 1% percentile |
| 29 | RASTA coefficients | Coefficient of band 1 (de) | Standard deviation | Spectral LLD | Spectral Skewness | Mean segment length |
| 30 | Voicing Related LLD | Shimmer (Local) (de) | Quadratic regression coefficient 2 | RASTA coefficients | Coefficient of band 22 | Position of min |
The table refers to selected voice features for the comparison between healthy subjects and patients with Parkinson's disease. Ranking of the first 30 features (functionals applied to low-level descriptors—LLDs) extracted using a dedicated software (OpenSMILE) and selected using Information Gain Attribute Evaluation (IGAE) algorithm for the comparison between healthy subjects and the whole group of patients with PD, during the sustained emission of the vowel and sentence. MFCC, mel-frequency cepstral coefficient; de, first derivative of the LLD.
Figure 1Experimental design. (A) recording of voice samples through a high-definition audio recorder; (B) narrow-band spectrogram of the acoustic voice signal; (C) feature extraction; (D) feature selection; (E) feature classification; (F) ROC curve analysis; (G) LR values calculated through ANN.
Figure 2ROC curves calculated through SVM classifier in Parkinson's disease. (A) HS vs. the whole group of PD patients; (B) HS vs. early-stage patients; (C) HS vs. mid-advanced-stage patients OFF therapy; (D) Early-stage vs. mid-advanced-stage patients OFF therapy. Gray lines refer to the emission of the vowel, whereas black lines refer to the sentence.
Performance of the machine learning algorithm.
|
|
|
|
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|---|---|---|---|
| HS vs. PD | Vowel | 98 | 10 folds | −0.03 | 0.60 | 82.7 | 77.1 | 75.0 | 84.3 | 79.6 | 0.870 |
| Sentence | 94 | 10 folds | 0.02 | 0.57 | 72.5 | 84.7 | 88.0 | 66.7 | 77.3 | 0.848 | |
| HS vs. | Vowel | 67 | 10 folds | −0.36 | 0.64 | 87.0 | 77.4 | 74.1 | 88.9 | 81.5 | 0.900 |
| Sentence | 93 | 10 folds | 0.16 | 0.66 | 75.8 | 90.5 | 92.6 | 70.4 | 81.5 | 0.876 | |
| HS vs. mid-a | Vowel | 100 | 10 folds | 0.16 | 0.87 | 92.7 | 94.3 | 94.4 | 92.6 | 93.5 | 0.980 |
| Sentence | 82 | 10 folds | 0.18 | 0.63 | 82.7 | 80.4 | 79.6 | 83.3 | 81.5 | 0.897 | |
| Vowel | 119 | 10 folds | 0.16 | 0.76 | 87.2 | 88.7 | 88.9 | 87.0 | 88.0 | 0.934 | |
| Sentence | 102 | 10 folds | 0.10 | 0.85 | 91.1 | 94.1 | 94.4 | 90.7 | 92.6 | 0.981 | |
| Vowel | 22 | 10 folds | 0.02 | 0.46 | 69.7 | 76.0 | 79.3 | 65.5 | 72.4 | 0.754 | |
| Sentence | 6 | 10 folds | 0.03 | 0.49 | 71.9 | 76.9 | 79.3 | 69.0 | 74.1 | 0.786 | |
| HS vs. | Vowel | 82 | 10 folds | 0.97 | 0.66 | 85.2 | 80.6 | 79.3 | 86.2 | 82.8 | 0.913 |
| Sentence | 69 | 10 folds | −0.01 | 0.93 | 96.6 | 96.6 | 96.6 | 96.6 | 96.6 | 0.985 | |
| Vowel | 71 | 10 folds | −0.18 | 0.94 | 100 | 93.5 | 93.1 | 100 | 96.6 | 0.992 | |
| Sentence | 78 | 10 folds | 0.62 | 0.97 | 100 | 96.7 | 96.6 | 100 | 98.3 | 0.999 |
Performance of SVM linear classifier elaborating the 30 most relevant selected features during the sustained emission of the vowel and the sentence for seven independent conditions: (1) HS vs. the whole group of PD patients; (2) HS vs. early-stage patients; (3) HS vs. mid-advanced-stage patients; (4) Early-stage vs. mid-advanced-stage patients; (5) Mid-advanced-stage patients OFF vs. ON therapy; (6) HS vs. mid-advanced-stage patients ON therapy; (7) Early-stage patients vs. mid-advanced-stage patients ON therapy. Selected features refer to the number of features able to obtain the best results; instances refer to the number of subjects considered in each comparison; cross validation refers to standardized validation procedures (see methods for details). Se, sensitivity; Sp, specificity; PPV, positive predictive value; NPV, negative predictive value; Acc, accuracy; AUC, area under the curve.
Figure 3ROC curves calculated through SVM classifier in Parkinson's disease: the effect of L-Dopa. (A) Mid-advanced-stage patients OFF vs. ON therapy; (B) HS vs. mid-advanced-stage patients ON therapy; (C) Early-stage patients vs. mid-advanced-stage patients ON therapy. Gray lines refer to the emission of the vowel, whereas black lines refer to the sentence.
Figure 4Clinical-instrumental correlations. (A) Disease Duration and VHI; (B) UPDRS-III and VHI; (C) Disease Duration and LRs; (D) UPDRS-III and LRs; (E) VHI and LRs; (F) UPDRS-III ON and LRs. Note that the correlation analysis only refers to the emission of the vowel. Similar results have been achieved when analyzing the emission of a sentence (data not shown). In addition, correlation analysis shown in (A–E) refers to the whole group of PD patients, whereas (F) shows the correlation assessed in the subgroup of mid-advanced stage patients ON therapy.