| Literature DB >> 27190506 |
Mahnaz Behroozi1, Ashkan Sami1.
Abstract
Recently, speech pattern analysis applications in building predictive telediagnosis and telemonitoring models for diagnosing Parkinson's disease (PD) have attracted many researchers. For this purpose, several datasets of voice samples exist; the UCI dataset named "Parkinson Speech Dataset with Multiple Types of Sound Recordings" has a variety of vocal tests, which include sustained vowels, words, numbers, and short sentences compiled from a set of speaking exercises for healthy and people with Parkinson's disease (PWP). Some researchers claim that summarizing the multiple recordings of each subject with the central tendency and dispersion metrics is an efficient strategy in building a predictive model for PD. However, they have overlooked the point that a PD patient may show more difficulty in pronouncing certain terms than the other terms. Thus, summarizing the vocal tests may lead into loss of valuable information. In order to address this issue, the classification setting must take what has been said into account. As a solution, we introduced a new framework that applies an independent classifier for each vocal test. The final classification result would be a majority vote from all of the classifiers. When our methodology comes with filter-based feature selection, it enhances classification accuracy up to 15%.Entities:
Year: 2016 PMID: 27190506 PMCID: PMC4844904 DOI: 10.1155/2016/6837498
Source DB: PubMed Journal: Int J Telemed Appl ISSN: 1687-6415
Time-frequency based features presented in Parkinson speech dataset with multiple types of sound recordings [13].
| Feature number | Feature name | Group |
|---|---|---|
| 1 | Jitter (local) |
|
| 2 | Jitter (local, absolute) | |
| 3 | Jitter (rap) | |
| 4 | Jitter (ppq5) | |
| 5 | Jitter (ddp) | |
|
| ||
| 6 | Number of pulses |
|
| 7 | Number of periods | |
| 8 | Mean period | |
| 9 | Standard deviation of period | |
|
| ||
| 10 | Shimmer (local) |
|
| 11 | Shimmer (local, dB) | |
| 12 | Shimmer (apq3) | |
| 13 | Shimmer (apq5) | |
| 14 | Shimmer (apq11) | |
| 15 | Shimmer (dda) | |
|
| ||
| 16 | Fraction of locally unvoiced frames |
|
| 17 | Number of voice breaks | |
| 18 | Degree of voice breaks | |
|
| ||
| 19 | Median pitch |
|
| 20 | Mean pitch | |
| 21 | Standard deviation | |
| 22 | Minimum pitch | |
| 23 | Maximum pitch | |
|
| ||
| 24 | Autocorrelation |
|
| 25 | Noise-to-harmonic | |
| 26 | Harmonic-to-noise | |
Figure 1An illustration of the proposed method.
Each vocal test and its related features after applying filter-based feature selection.
| ID | Vocal test | Related features |
|---|---|---|
| 1 | Vowel “a” | None |
| 2 | Vowel “o” | 24 |
| 3 | Vowel “u” | None |
| 4 | Number 1 | 1, 2, 3, 4, 5, 24 |
| 5 | Number 2 | 2, 9, 10 |
| 6 | Number 3 | 17, 19, 23, 25, 26 |
| 7 | Number 4 | 1, 2, 3, 4, 5, 10 |
| 8 | Number 5 | 24 |
| 9 | Number 6 | None |
| 10 | Number 7 | None |
| 11 | Number 8 | 9 |
| 12 | Number 9 | 26 |
| 13 | Number 10 | None |
| 14 | Short sentence 1 | None |
| 15 | Short sentence 2 | 25, 26 |
| 16 | Short sentence 3 | 4, 10, 25, 26 |
| 17 | Short sentence 4 | 1, 2, 3, 4, 5, 10, 26 |
| 18 | Word 1 | 2 |
| 19 | Word 2 | None |
| 20 | Word 3 | 17, 19, 23, 25 |
| 21 | Word 4 | None |
| 22 | Word 5 | None |
| 23 | Word 6 | None |
| 24 | Word 7 | None |
| 25 | Word 8 | 1, 2, 3, 4, 5, 6, 17, 19, 23, 25 |
| 26 | Word 9 | 24 |
Figure 2Frequency of each selected feature as relevant feature of a vocal test.
Results obtained from applying different methods and classifiers.
| Classifier | Method | Accuracy (%) | Sensitivity (%) | Specificity (%) | MCC |
|---|---|---|---|---|---|
|
| LOSO | 53.37 | 49.62 | 57.12 | 0.0007 |
| s-LOO (1–4) | 42.50 | 30.00 | 55.00 | 0.0015 | |
| s-LOO (2–5) | 52.50 | 45.00 | 60.00 | 0.0005 | |
| s-LOO (3–6) | 50.00 | 55.00 | 45.00 | 0.0000 | |
| s-LOO (all) | 55.00 | 55.00 | 55.00 | 0.1000 | |
| MCFS | 67.50 | 75.00 | 60.00 | 0.3549 | |
| A-MCFS |
|
|
|
| |
|
| |||||
|
| LOSO | 54.04 | 53.27 | 54.81 | 0.0008 |
| s-LOO (1–4) | 55.00 | 45.00 | 65.00 | 0.1021 | |
| s-LOO (2–5) | 60.00 | 55.00 | 65.00 | 0.2010 | |
| s-LOO (3–6) | 42.50 | 55.00 | 30.00 | 0.0015 | |
| s-LOO (all) | 55.00 | 55.00 | 55.00 | 0.1000 | |
| MCFS | 65.00 | 60.00 | 70.00 | 0.3015 | |
| A-MCFS |
|
|
|
| |
|
| |||||
|
| LOSO | 54.42 | 53.65 | 55.19 | 0.0009 |
| s-LOO (1–4) | 55.00 | 45.00 | 65.00 | 0.1201 | |
| s-LOO (2–5) | 57.50 | 65.00 | 50.00 | 0.1517 | |
| s-LOO (3–6) | 50.00 | 70.00 | 30.00 | 0.0000 | |
| s-LOO (all) | 55.00 | 70.00 | 40.00 | 0.1048 | |
| MCFS | 67.5 | 60.00 | 75.00 | 0.3540 | |
| A-MCFS |
|
|
|
| |
|
| |||||
|
| LOSO | 53.94 | 54.04 | 53.85 | 0.0008 |
| s-LOO (1–4) | 65.00 | 55.00 | 75.00 | 0.3062 | |
| s-LOO (2–5) | 62.50 | 60.00 | 65.00 | 0.2503 | |
| s-LOO (3–6) | 42.50 | 65.00 | 20.00 | 0.0017 | |
| s-LOO (all) | 57.50 | 65.00 | 50.00 | 0.1517 | |
| MCFS | 62.5 | 65.00 | 60.00 | 0.2503 | |
| A-MCFS |
|
|
|
| |
|
| |||||
| SVM (linear kernel) | LOSO | 52.50 | 52.50 | 52.50 | 0.0006 |
| s-LOO (1–4) | 77.50 | 80.00 | 75.00 | 0.5507 | |
| s-LOO (2–5) | 70.00 | 80.00 | 60.00 | 0.4082 | |
| s-LOO (3–6) | 60.00 | 65.00 | 45.00 | 0.2000 | |
| s-LOO (all) | 67.50 | 70.00 | 65.00 | 0.3504 | |
| MCFS | 75.00 | 75.00 | 75.00 | 0.5000 | |
| A-MCFS |
|
|
|
| |
|
| |||||
| SVM (RBF kernel) | LOSO | 55.00 | 60.00 | 50.00 | 0.1005 |
| s-LOO (1–4) | 65.00 | 60.00 | 70.00 | 0.3015 | |
| s-LOO (2–5) | 70.00 | 70.00 | 70.00 | 0.4000 | |
| s-LOO (3–6) | 72.50 | 70.00 | 75.00 | 0.4506 | |
| s-LOO (all) | 65.00 | 70.00 | 60.00 | 0.3015 | |
| MCFS | 75.00 | 80.00 | 70.00 | 0.5025 | |
| A-MCFS |
|
|
|
| |
|
| |||||
| Naïve Bayes | MCFS | 75.00 | 90.00 | 60.00 | 0.5241 |
| A-MCFS |
|
|
|
| |
|
| |||||
| Discriminant analysis | MCFS | 72.50 | 75.00 | 70.00 | 0.4506 |
| A-MCFS |
|
|
|
| |
Central tendency metrics used in s-LOO method: 1: mean, 2: median, and 3: trimmed mean (25% removed).
Dispersion metrics used in s-LOO method: 4: standard deviation, 5: mean absolute deviation, and 6: interquartile range.
Figure 3Obtained accuracies based on the reported results in Table 3.
Prediction ability of each vocal test, based on their obtained classification accuracy.
| Vocal test ID | Classification accuracy (%) | ||||||||
|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
| SVM | SVM | Naïve Bayes | DA | Mean accuracy ± standard deviation | |
| 1 | 42.5 | 35 | 27.5 | 27.5 | 47.5 | 25 | 52.5 | 37.5 |
|
| 2 | 57.5 | 67.5 | 70 | 70 | 62.5 | 62.5 | 60 | 70 | 65 ± 5 |
| 3 | 40 | 42.5 | 60 | 55 | 27.5 | 50 | 50 | 47.5 |
|
| 4 | 67.5 | 70 | 75 | 67.5 | 60 | 65 | 65 | 62.5 | 66.6 ± 4.6 |
| 5 | 67.5 | 57.5 | 60 | 65 | 67.5 | 67.5 | 67.5 | 65 | 64.7 ± 3.9 |
| 6 | 62.5 | 67.5 | 67.5 | 72.5 | 62.5 | 72.5 | 72.5 | 65 | 67.8 ± 4.3 |
| 7 | 52.5 | 60 | 57.5 | 55 | 67.5 | 67.5 | 70 | 50 | 60 ± 7.6 |
| 8 | 57.5 | 62.5 | 62.5 | 70 | 65 | 67.5 | 65 | 62.5 | 64 ± 3.8 |
| 9 | 47.5 | 62.5 | 65 | 50 | 60 | 50 | 55 | 57.5 | 55.9 ± 6.4 |
| 10 | 62.5 | 62.5 | 65 | 55 | 55 | 55 | 57.5 | 55 | 58.4 ± 4.2 |
| 11 | 42.5 | 60 | 72.5 | 72.5 | 75 | 72.5 | 70 | 72.5 | 67.2 ± 11 |
| 12 | 50 | 45 | 42.5 | 57.5 | 65 | 65 | 65 | 65 | 56.9 ± 9.7 |
| 13 | 40 | 45 | 57.5 | 57.5 | 52.5 | 60 | 60 | 45 |
|
| 14 | 52.5 | 60 | 55 | 55 | 50 | 60 | 45 | 55 |
|
| 15 | 57.5 | 60 | 62.5 | 65 | 72.5 | 72.5 | 65 | 72.5 | 65.9 ± 6 |
| 16 | 50 | 55 | 62.5 | 65 | 72.5 | 72.5 | 65 | 72.5 | 64.4 ± 8.4 |
| 17 | 60 | 57.5 | 57.5 | 65 | 72.5 | 60 | 72.5 | 67.5 | 64.1 ± 6.3 |
| 18 | 45 | 45 | 55 | 65 | 67.5 | 67.5 | 65 | 67.5 | 59.7 ± 9.9 |
| 19 | 40 | 37.5 | 45 | 35 | 40 | 40 | 22.5 | 42.5 |
|
| 20 | 52.5 | 55 | 55 | 47.5 | 60 | 62.5 | 65 | 62.5 | 57.5 ± 6 |
| 21 | 47.5 | 40 | 40 | 27.5 | 45 | 55 | 52.5 | 52.5 |
|
| 22 | 42.5 | 35 | 52.5 | 52.5 | 67.5 | 55 | 55 | 65 |
|
| 23 | 47.5 | 55 | 55 | 47.5 | 62.5 | 60 | 60 | 50 |
|
| 24 | 35 | 35 | 22.5 | 35 | 42.5 | 42.5 | 37.5 | 50 |
|
| 25 | 62.5 | 62.5 | 67.5 | 67.5 | 62.5 | 67.5 | 65 | 62.5 | 64.7 ± 2.5 |
| 26 | 55 | 62.5 | 57.5 | 62.5 | 57.5 | 57.5 | 55 | 55 | 57.8 ± 3.1 |