| Literature DB >> 30577548 |
Lucijano Berus1, Simon Klancnik2, Miran Brezocnik3, Mirko Ficko4.
Abstract
In recent years, neural networks have become very popular in all kinds of prediction problems. In this paper, multiple feed-forward artificial neural networks (ANNs) with various configurations are used in the prediction of Parkinson's disease (PD) of tested individuals, based on extracted features from 26 different voice samples per individual. Results are validated via the leave-one-subject-out (LOSO) scheme. Few feature selection procedures based on Pearson's correlation coefficient, Kendall's correlation coefficient, principal component analysis, and self-organizing maps, have been used for boosting the performance of algorithms and for data reduction. The best test accuracy result has been achieved with Kendall's correlation coefficient-based feature selection, and the most relevant voice samples are recognized. Multiple ANNs have proven to be the best classification technique for diagnosis of PD without usage of the feature selection procedure (on raw data). Finally, a neural network is fine-tuned, and a test accuracy of 86.47% was achieved.Entities:
Keywords: Parkinson’s disease; artificial neural networks; feature selection; voice recognition
Year: 2018 PMID: 30577548 PMCID: PMC6339026 DOI: 10.3390/s19010016
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Extracted time frequency-based features from individual voice samples [37].
| Feature Number | Feature | Mean | Stand. Deviation |
|---|---|---|---|
| 1 | Jitter (local) | 2.67952 | 1.76505 |
| 2 | Jitter (local, absolute) | 0.00017 | 0.00011 |
| 3 | Jitter (rap) | 1.24705 | 0.97946 |
| 4 | Jitter (ppq5) | 1.34832 | 1.13874 |
| 5 | Jitter (ddp) | 3.74116 | 2.93844 |
| 6 | Number of pulses | 12.91839 | 5.45220 |
| 7 | Number of periods | 1.19489 | 0.42007 |
| 8 | Mean period | 5.69960 | 3.01518 |
| 9 | Standard dev. of period | 7.98355 | 4.84089 |
| 10 | Shimmer (local) | 12.21535 | 6.01626 |
| 11 | Shimmer (local, dB) | 17.09844 | 9.04554 |
| 12 | Shimmer (apq3) | 0.84601 | 0.08571 |
| 13 | Shimmer (apq5) | 0.23138 | 0.15128 |
| 14 | Shimmer (apq11) | 9.99954 | 4.29130 |
| 15 | Shimmer (dda) | 163.3683 | 56.02168 |
| 16 | Fraction of locally unvoiced frames | 168.7276 | 55.96991 |
| 17 | Number of voice breaks | 27.54763 | 36.67262 |
| 18 | Degree of voice breaks | 134.5381 | 47.05806 |
| 19 | Median pitch | 234.8760 | 121.5412 |
| 20 | Mean pitch | 109.7442 | 150.0277 |
| 21 | Standard deviation | 105.9692 | 149.4171 |
| 22 | Minimum pitch | 0.00655 | 0.00188 |
| 23 | Maximum pitch | 0.00084 | 0.00072 |
| 24 | Autocorrelation | 27.68286 | 20.97529 |
| 25 | Noise-to-harmonic | 1.13462 | 1.16148 |
| 26 | Harmonic-to-noise | 12.37001 | 15.16192 |
Figure 1Block diagram of the proposed method.
Confusion matrix representation.
| Predicted | ||
|---|---|---|
|
| Positive | Negative |
| Positive | TP | FN |
| Negative | FP | TN |
Selected time frequency-based features using selected Pearson’s correlation factors in the case of testing multiple artificial neural networks (ANNs) on subject no. 1, and training of ANNs is performed on the other 39 subjects.
| ID | Voice Sample | Related Features | Related Features | Related Features | Related Features | Related Features |
|---|---|---|---|---|---|---|
| 1 | Vowel “a” | All | 24 | None | None | None |
| 2 | Vowel “o” | All | 19, 24 | 24, 19 | None | None |
| 3 | Vowel “u” | All | 13, 21 | None | None | None |
| 4 | Number 1 | All | 1, 2, 3, 4, 5, 24 | 1, 2, 3, 4, 5, 24 | 1, 2, 4 | 1, 4 |
| 5 | Number 2 | All | 1, 2, 8, 9, 10, 11 | 2, 8, 9, 10, 11 | 10 | None |
| 6 | Number 3 | All | 12, 13, 14, 17, 19, 23, 25, 26 | 17, 19, 23, 25, 26 | 17, 19, 23, 25, 26 | 17, 25 |
| 7 | Number 4 | All | 1, 2, 3, 4, 5, 10, 20, 21 | 1, 2, 3, 4, 5, 10 | 1, 2, 3, 4, 5 | 1, 2, 3, 4, 5 |
| 8 | Number 5 | All | 24 | 24 | 24 | None |
| 9 | Number 6 | All | 10, 23, 26 | None | None | None |
| 10 | Number 7 | All | 17, 19, 24, 26 | None | None | None |
| 11 | Number 8 | All | 9, 10 | 9 | None | None |
| 12 | Number 9 | All | 26 | 26 | None | None |
| 13 | Number 10 | All | 1, 2, 3, 5, 8, 9, 11, 23 | None | None | None |
| 14 | Short sentence 1 | All | None | None | None | None |
| 15 | Short sentence 2 | All | 3, 4, 5, 24, 25, 26 | 25, 26 | 25 | 25 |
| 16 | Short sentence 3 | All | 3, 4, 5, 10, 25, 26 | 4, 10, 25, 26 | 10, 26 | 26 |
| 17 | Short sentence 4 | All | 1, 2, 3, 4, 5, 10, 24, 25, 26 | 1, 2, 3, 4, 5, 10, 26 | 1, 2, 3, 4, 5, 10, 26 | 3, 4, 5, 10 |
| 18 | Word 1 | All | 1, 2, 4, 7 | 1, 2 | None | None |
| 19 | Word 2 | All | 10 | None | None | None |
| 20 | Word 3 | All | 17, 19, 23, 25 | 17, 19, 23, 25 | 17, 19 | 17, 19 |
| 21 | Word 4 | All | 3, 5 | None | None | None |
| 22 | Word 5 | All | 26 | 26 | None | None |
| 23 | Word 6 | All | 2, 10 | None | None | None |
| 24 | Word 7 | All | 17 | None | None | None |
| 25 | Word 8 | All | 1, 2, 3, 4, 5, 10, 17, 19, 23, 24, 25 | 1, 2, 3, 5, 17, 19, 23, 25 | 4, 17, 19 | 17, 19 |
| 26 | Word 9 | All | 2, 24 | 24 | None | None |
| Number of classifiers | 26 | 25 | 16 | 10 | 8 | |
Selected time frequency-based features using selected Kendall’s correlation factors in the case of testing multiple ANNs on subject no. 1, and training of ANNs is performed on the other 39 subjects.
| ID | Voice Sample | Related Features | Related Features | Related Features | Related Features | Related Features |
|---|---|---|---|---|---|---|
| 1 | Vowel “a” | All | 6, 7, 9, 10, 14 | 10 | None | None |
| 2 | Vowel “o” | All | 17, 24 | 24 | 24 | 24 |
| 3 | Vowel “u” | All | 24 | 24 | None | None |
| 4 | Number 1 | All | 1, 2, 3, 4, 5, 6, 7, 9,10, 24 | 1, 2, 3, 4, 5, 6, 24 | 1, 2, 4, 24 | None |
| 5 | Number 2 | All | 1, 2, 3, 4, 5, 6, 8, 9, 10, 11 | 1, 8, 9, 10, 11 | 9 | None |
| 6 | Number 3 | All | 12, 13, 14, 17, 19, 23, 24, 25, 26 | 12, 13, 17, 19, 23, 25, 26 | 17, 23, 25, 26 | 17,25,26 |
| 7 | Number 4 | All | 1, 2, 3, 4, 5, 10, 20, 21 | 1, 2, 3, 4, 5, 10, | 1, 2, 3, 4, 5, | 1,2,3,4,5 |
| 8 | Number 5 | All | 24 | 24 | 24 | None |
| 9 | Number 6 | All | 10, 24, 26 | 10, 26 | None | None |
| 10 | Number 7 | All | 1, 3, 4, 5, 8, 11, 24 | 4, 5 | 4 | None |
| 11 | Number 8 | All | 9 | 9 | 9 | None |
| 12 | Number 9 | All | 2, 3, 4, 5, 21, 26 | 4, 26 | 4 | None |
| 13 | Number 10 | All | 1, 3, 5, 20, 23 | 23 | None | None |
| 14 | Short sentence 1 | All | 25, 26 | None | None | None |
| 15 | Short sentence 2 | All | 3, 4, 5, 8, 10, 11, 17, 25, 26 | 24, 25, 26 | 25 | 25 |
| 16 | Short sentence 3 | All | 1, 2, 3, 4, 5, 10, 17, 24, 25, 26 | 10, 26 | 26 | None |
| 17 | Short sentence 4 | All | 1, 2, 3, 4, 5, 10 | 1, 2, 3, 4, 5, 10 | 1, 3, 4, 5, 10, 25, 26 | 3,5 |
| 18 | Word 1 | All | 1, 2, 3, 4, 5, 7 | 1, 2, 4, 7 | 1, 4 | None |
| 19 | Word 2 | All | None | None | None | None |
| 20 | Word 3 | All | 17, 19, 23, 25 | 17, 19, 25 | 17, 25 | 17 |
| 21 | Word 4 | All | 3, 5 | None | None | None |
| 22 | Word 5 | All | 17, 19, 26 | None | None | None |
| 23 | Word 6 | All | 10, 17 | 10 | None | None |
| 24 | Word 7 | All | 3, 5, 23 | None | None | None |
| 25 | Word 8 | All | 1, 2, 3, 4, 5, 10, 14, 17, 19, 23, 25 | 2, 17, 19, 25 | 17, 19 | 17 |
| 26 | Word 9 | All | 2, 3 4, 5, 24 | 24 | None | None |
| Number of classifiers | 26 | 25 | 21 | 15 | 7 | |
Figure 2Accuracy (a) and training accuracy (b) measures of ANN 5, ANN 10, ANN 5-5, ANN 10-10, and ANN 5-10-5 configurations with Pearson’s-based feature selection.
Figure 3Sensitivity (a) and specificity (b) measures of ANN 5, ANN 10, ANN 5-5, ANN 10-10, and ANN 5-10-5 configurations with Pearson’s-based feature selection.
Figure 4Test set accuracy of Kendall’s (a), PCA (b), and SOM based feature selection (c) with ANN 5, ANN 10, ANN 5-5, ANN 10-10, and ANN 5-10-5 configurations.
Figure 5Comparison of best test set accuracies of different ANN topologies.
Selected features using a medium correlation factor.
| ID | Voice Sample | Related Features |
|---|---|---|
| 1 | Vowel “a” | None |
| 2 | Vowel “o” | 24 |
| 3 | Vowel “u” | None |
| 4 | Number 1 | 1, 2, 3, 4, 5, 24 |
| 5 | Number 2 | 2, 9, 10 |
| 6 | Number 3 | 17, 19, 23, 25, 26 |
| 7 | Number 4 | 1, 2, 3, 4, 5, 10 |
| 8 | Number 5 | 24 |
| 9 | Number 6 | None |
| 10 | Number 7 | None |
| 11 | Number 8 | 9 |
| 12 | Number 9 | 26 |
| 13 | Number 10 | None |
| 14 | Short sentence 1 | None |
| 15 | Short sentence 2 | 25, 26 |
| 16 | Short sentence 3 | 4, 10, 25, 26 |
| 17 | Short sentence 4 | 1, 2, 3, 4, 5, 10, 26 |
| 18 | Word 1 | 2 |
| 19 | Word 2 | None |
| 20 | Word 3 | 17, 19, 23, 25 |
| 21 | Word 4 | None |
| 22 | Word 5 | None |
| 23 | Word 6 | None |
| 24 | Word 7 | None |
| 25 | Word 8 | 1, 2, 3, 5, 6, 17, 19, 23, 25 |
| 26 | Word 9 | 24 |
| Number of classifiers | 15 | |
Comparison of different classifiers performance on PD dataset.
| Classifier | Feature Selection | Accuracy (%) | Sensitivity (%) | Specificity (%) | MCC |
|---|---|---|---|---|---|
| k-NN (k = 1) | / [ |
|
|
|
|
| A-MCFS [ |
|
|
|
| |
| k-NN (k = 3) | / [ |
|
|
|
|
| A-MCFS [ |
|
|
|
| |
| k-NN (k = 5) | / [ |
|
|
|
|
| A-MCFS [ |
|
|
|
| |
| k-NN (k = 7) | / [ |
|
|
|
|
| A-MCFS [ |
|
|
|
| |
| SVM (linear kernel) | / [ |
|
|
|
|
| A-MCFS [ |
|
|
|
| |
| SVM (RBF kernel) | / [ |
|
|
|
|
| A-MCFS [ |
|
|
|
| |
| ANN 10 | / |
|
|
|
|
| ANN 5-10-5 | Pearson’s |
|
|
|
|
| ANN 10 | Kendall’s |
|
|
|
|
| ANN 10-10 | PCA |
|
|
|
|
| ANN 10-10 | SOM |
|
|
|
|
| ANN (fine-tuned) | A-MCFS |
|
|
|
|