| Literature DB >> 34422258 |
Federica Amato1, Luigi Borzì1, Gabriella Olmo1, Juan Rafael Orozco-Arroyave2,3.
Abstract
INTRODUCTION: Automatic assessment of speech impairment is a cutting edge topic in Parkinson's disease (PD). Language disorders are known to occur several years earlier than typical motor symptoms, thus speech analysis may contribute to the early diagnosis of the disease. Moreover, the remote monitoring of dysphonia could allow achieving an effective follow-up of PD clinical condition, possibly performed in the home environment.Entities:
Keywords: Artificial Intelligence; Isolated words; Parkinson’s disease; Speech analysis; Speech impairment; Telemedicine; k-Nearest neighbours
Year: 2021 PMID: 34422258 PMCID: PMC8324609 DOI: 10.1007/s13755-021-00162-8
Source DB: PubMed Journal: Health Inf Sci Syst ISSN: 2047-2501
Fig. 1Work flow scheme
Fig. 2UPDRS distribution comparison between PC-GITA and the additional dataset for male and female subjects
Overview of the extracted features, divided according to the domain of analysis
| Region | Study | Feature | Information |
|---|---|---|---|
| Entire signal | [ | IEDCC(1–6) | Vocal tract and vocal folds abnormalities [ |
| [ | Zero crossing rate | Voice activity (Details in [ | |
| [ | DFA | Self-similarity of the voice (Details in [ | |
| Voiced | [ | Bandwidth | Frequency range |
| [ | Harmonic ratio | Ratio of signal over noise [ | |
| [ | F0 | Vocal folds vibration and frequency alteration | |
| [ | Spectral features: flux | Spectrum shape information (Details in [ | |
| [ | LPC(1–3) | Formants and resonances (Details in [ | |
| [ | Short time energy | Energy variation among frames | |
| [ | MFCC(1–13) | Subtle changes in the motion of articulators (Details [ | |
| Transition | Present study | PTS | Ability to promptly interrupt/start vocal fold vibration |
| Present study | ETS | Ability to promptly interrupt/start vocal fold vibration | |
| [ | MFCC(1–12), | Ability to promptly interrupt/start vocal fold vibration | |
| [ | BBE(1–25) | Ability to promptly interrupt/start vocal fold vibration |
The apex letter represents the classification between LLf and HLf subgroups
Fig. 3Comparison between early and late fusion approaches: in the first case, the features derived from each word are joined before performing the supervised learning; in the second one, separate scores are learned for each word, joined, and used as input of a second supervised learning step
Fig. 4Fusion scheme and feature subset analysis. Performance for the first randomly selected subset
Execution time of the three most proficient algorithms
| Model | Computational time (s) |
|---|---|
| Case 1: Late fusion | 3.37 |
| Case 2: Late fusion | 4.19 |
| Case 3: Late fusion | 6.23 |
| Case 3: Early fusion | 0.065 |
Mean values reported between male and female subjects
Performance comparison among 6 classifiers
| Classifier | Male | Female | ||
|---|---|---|---|---|
| Validation set | Test set | Validation set | Test set | |
| SVM | 96% ± 3.22 | 74% ± 18.95 | 98% ± 2.46 | 90% ± 7.12 |
| DT | 95% ± 4.46 | 64% ± 17.34 | 100% ± 0 | 65% ± 19.56 |
| NB | 73% ± 41.10 | 50% ± 28.36 | 92% ± 5.65 | 77% ± 22.36 |
| kNN | 96% ± 2.46 | 74% ± 15.56 | 99% ± 1.61 | 97% ± 3.42 |
| Ensemble bagged trees | 92% ± 5.05 | 60% ± 19.56 | 96% ± 1.31 | 56% ± 0 |
| Ensemble subspace discriminant | 94% ± 5.26 | 71% ± 16.29 | 99% ± 1.31 | 96% ± 3.42 |
The results report the validation (10-fold applied to 70% of PC-GITA) and test set (30%PC-GITA) accuracy averaged over 5 iterations
Set of feature selection parameters employed for the final test
| Parameter | Male value | Female value |
|---|---|---|
| th1 | 0.5 | 0.5 |
| th2 | 0.1 | 0.1 |
| th3 | 10 | 30 |
Performance comparison among validation set (10-fold applied to 70% of PC-GITA), test set (30%PC-GITA) over 5 iterations for male and female groups
| Iter. | Validation set | Test set | kNN optimal parameters | |||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Acc. | Sens. | Spec. | AUC | Acc. | Sens. | Spec. | AUC | |||
| 1 | 100% | 100% | 100% | 1 | 100% | 100% | 100% | 1 | Distance = cityblock K = 3 | |
| 2 | 100% | 100% | 100% | 1 | 94% | 100% | 87% | 0.94 | ||
| 3 | 100% | 100% | 100% | 1 | 100% | 100% | 100% | 1 | ||
| 4 | 97% | 100% | 94% | 1 | 100% | 100% | 100% | 1 | ||
| 5 | 100% | 100% | 100% | 1 | 94% | 87% | 100% | 0.94 | ||
| 99.4% | 100% | 98.8% | 1 | 97.6% | 97.4% | 97.4% | 0.98 | |||
| Male | 1 | 100% | 100% | 100% | 1 | 100% | 100% | 100% | 1 | distance = cityblock K = 6 |
| 2 | 100% | 100% | 100% | 1 | 75% | 63% | 87% | 0.75 | ||
| 3 | 97% | 94% | 100% | 0.97 | 87% | 75% | 100% | 0.88 | ||
| 4 | 100% | 100% | 100% | 1 | 100% | 100% | 100% | 1 | ||
| 5 | 100% | 100% | 100% | 1 | 94% | 88% | 100% | 0.94 | ||
| 99.4% | 98.8% | 100% | 0.99 | 91.2% | 85.2% | 97.4% | 0.91 | |||
The model optimal hyper-parameters are reported
Fig. 5Execution time of the algorithm with different numbers of inputs. The final number of features selected is also reported
Fig. 6Execution time of the algorithm with different numbers of words. The final number of features selected is also reported
Fig. 7Execution time of the algorithm with different numbers of input features. The final number of features selected is also reported
Most significant words and features for male and female subgroups resulted from the post-hoc analysis of the selected models
| Words selected | Feature name | Region | |
|---|---|---|---|
| F | Clavo, Crema, Globo, Name | Roll off point | Voiced |
MFCC, BBE, | Onset | ||
PTS, ETS, MFCC, BBE, | Offset | ||
| M | Bodega, Braso, Globo, Llueve, Name, Presa, Viaje | MFCC, BBE, | Onset |
| PTS, MFCC,BBE | Offset |
F female, M male
Performance comparison with the best results of similar studies employing the PC-GITA database and focusing on the isolated word repetition task
| Author | [ | [ | [ | Present study |
| Year | 2015 | 2020 | 2020 | 2020 |
| Model | SVM | SVM | CNN | kNN |
| Sensibility | 94% | n.r. | n.r. | 99.4% |
| Specificity | 90% | n.r. | n.r | 99.4% |
| Accuracy | 92% | 91% | 77% | 99.4% |
| F1-score | n.r. | 0.83 | n.r. | 0.99 |
n.r. not reported. For the present study mean values between male and female subgroups averaged over 5 repetitions are reported