| Literature DB >> 30634600 |
Nicolas Khoury1, Ferhat Attal2, Yacine Amirat3, Latifa Oukhellou4, Samer Mohammed5.
Abstract
This article presents a machine learning methodology for diagnosing Parkinson's disease (PD) based on the use of vertical Ground Reaction Forces (vGRFs) data collected from the gait cycle. A classification engine assigns subjects to healthy or Parkinsonian classes. The diagnosis process involves four steps: data pre-processing, feature extraction and selection, data classification and performance evaluation. The selected features are used as inputs of each classifier. Feature selection is achieved through a wrapper approach established using the random forest algorithm. The proposed methodology uses both supervised classification methods including K-nearest neighbour (K-NN), decision tree (DT), random forest (RF), Naïve Bayes (NB), support vector machine (SVM) and unsupervised classification methods such as K-means and the Gaussian mixture model (GMM). To evaluate the effectiveness of the proposed methodology, an online dataset collected within three different studies is used. This data set includes vGRF measurements collected from eight force sensors placed under each foot of the subjects. Ninety-three patients suffering from Parkinson's disease and 72 healthy subjects participated in the experiments. The obtained performances are compared with respect to various metrics including accuracy, precision, recall and F-measure. The classification performance evaluation is performed using the leave-one-out cross validation. The results demonstrate the ability of the proposed methodology to accurately differentiate between PD subjects and healthy subjects. For the purpose of validation, the proposed methodology is also evaluated with an additional dataset including subjects with neurodegenerative diseases (Amyotrophic Lateral Sclerosis (ALS) and Huntington's disease (HD)). The obtained results show the effectiveness of the proposed methodology to discriminate PD subjects from subjects with other neurodegenerative diseases with a relatively high accuracy.Entities:
Keywords: Parkinson diseases; classification; features selection method; gait cycle; vertical ground reaction forces (vGRFs); wearable sensors
Mesh:
Year: 2019 PMID: 30634600 PMCID: PMC6359125 DOI: 10.3390/s19020242
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1Gait-phase representation; swing, stance and double stance phases for both feet [26].
Synthetic review of studies on PD diagnosis.
| References | Sensors | Sensors Type | Method | Validation Methods | Accuracy |
|---|---|---|---|---|---|
| Jean et al., 2008 [ | In-shoe dynamic foot pressure | Wearable | SVM | 15-fold cross validation | 91.73% |
| Cho et al., 2009 [ | CCD camera | Non wearable | MDC | Not specified | 95.49% |
| Muniz et al., 2010 [ | Force platform | Non wearable | LR, PNN, SVM | Bootstrap method | 91–94% |
| Wu and Krishnan 2010 [ | Force sensors | Wearable | LS-SVM | Leave-one-out | 90.32% |
| Sarbaz et al., 2011 [ | Force sensors | Wearable | Nearest mean scaled | 70% (train), 30% (test) | 95.6% |
| Daliri 2012 [ | Force sensors | Wearable | SVM | 50% (train), 50% (test) | 89.33% |
| Lee et al., 2012 [ | Force sensors | Wearable | NEWFM | 50% (train), 50% (test) | 74–77% |
| Daliri 2013 [ | Force sensors | Wearable | SVM | 50% (train), 50% (test) | 84–91% |
| Khorasani et al., 2014 [ | Force sensors | Wearable | HMM with GM | Leave-one-out | 90.3% |
| Dror et al., 2014 [ | Microsoft 3D camera sensor | Non wearable | SVM | leave-one-out | 100% |
| Dyshel et al., 2015 [ | Microsoft Kinect For Windows SDK | Non wearable | SVM | 5-fold cross validation | - |
| Su et al., 2015 [ | Force sensors | Wearable | Threshold-based | 80% (train), 10% | 72% |
| and MLP models | (Valid.), 10% (test) | ||||
| Zeng et al., 2016 [ | Force sensors | Wearable | RBF NN | 5-fold cross-validation | 96.39% |
| Jane et al., 2016 [ | Force sensors | Wearable | Q-BTDNN | Cross-validation | 90–92% |
| Ertugrul et al., 2016 [ | Force sensors | Wearable | BayesNT, NB, LR, MLP, | 10-folds | 87–88% |
| PART, Jrip, RF, and FT | cross-validation | ||||
| Cuzzolin et al., 2017 [ | IMU sensors | Wearable | HMM | Cross-validation | 85.51% |
| Açici et al., 2017 [ | Force sensors | Wearable | RF | 10-fold Cross Validation | 74–98% |
| Joshi et al., 2017 [ | Force sensors | Wearable | SVM | Leave one-out | 90.32% |
| Wu et al., 2017 [ | Force sensors | Wearable | SVM | Leave one-out | 84.48% |
| Alam et al., 2017 [ | Force sensors | Wearable | SVM, RF, K-NN, and DT | Leave one-out | 85–95% |
| Bhoi et al., 2017 [ | Force sensors | Wearable | K-means | - | - |
| Khoury et al., 2018 [ | Force sensors | Wearable | K-NN, CART, RF, SVM, K-means and GMM | 10-fold Cross Validation | 80–97% |
| Aharonson et al., 2018 [ | Force sensors and accelerometer | Non wearable | K-means | - | - |
| Haji Ghassemi et al., 2018 [ | IMU sensors | wearable | GMM | - | - |
Figure 2Placement of the 16 sensors under both feet.
Figure 3vGRFs measured on left foot (blue) and right foot (red); (a) healthy subject, (b) subject with PD.
Figure 4Example of vGRF data pre-processing; (a) raw vGRFs data; (b) processed vGRF data.
List of the nineteen extracted features.
| Features References | Extracted Features | Explanation |
|---|---|---|
| 1 | Coefficients of Variations in percentage (%) | |
| 2 | Coefficient of Variations in duration (s) | |
| 3 | Coefficient of Variations in duration (s) | |
| 4 | Coefficient of Variation in percentage (%) | |
| 5 | Coefficient of Variation in duration (s) | |
| 6 | Coefficient of Variation in duration (s) | |
| 7 | Coefficient of Variation of the | |
| 8 | Coefficient of Variation of the | |
| 9 | Coefficient of Variation of the | |
| 10 | Mean in percentage (%) | |
| 11 | Mean in duration (s) of the Swing Time | |
| 12 | Mean in duration (s) of the Stride Time | |
| 13 | Mean in percentage (%) | |
| 14 | Mean in duration (s) of the Swing Time | |
| 15 | Mean in duration (s) of the Stride Time | |
| 16 | Mean in percentage (%) | |
| 17 | Mean of the | |
| 18 | Mean of the | |
| 19 | Mean of the |
Accuracy obtained with/without the use of the feature selection method, for each sub-dataset.
| Sub-Dataset | Features Selection | Supervised | Unsupervised | ||||||
|---|---|---|---|---|---|---|---|---|---|
| Performances | K-NN | CART | RF | NB | SVM | K-Means | GMM | ||
| Yogev et al. | With | Accuracy | 86.05% | 83.72% | 86.05% | 74.42% | 86.05% | 63.72% | 64.77% |
| Without | Accuracy | 82.56% | 80.23% | 84.88% | 72.09% | 82.56% | 53.72% | 61.39% | |
| Hausdorff et al. | With | Accuracy | 90.91% | 84.30% | 87.60% | 77.69% | 90.08% | 55.12% | 65.12% |
| Without | Accuracy | 88.43% | 82.64% | 85.95% | 72.73% | 87.60% | 51.07% | 58.93% | |
| Frenkel-Toledo et al. | With | Accuracy | 81.25% | 79.69% | 82.81% | 79.69% | 82.81% | 57.19% | 65.31% |
| Without | Accuracy | 79.69% | 76.56% | 78.12% | 75% | 81.25% | 53.75% | 57.34% | |
The five most relevant features from each sub-dataset obtained using the RF feature selection method.
| Sub-Datasets | References of Features | Selected Features |
|---|---|---|
| Yogev et al. | 13 | Mean in percentage (%) of the Swing Time of the right foot |
| 7 | Coefficient of Variation of the Short Swing Time (SSWCV) | |
| 10 | Mean in percentage (%) of the Swing Time of the left foot | |
| 6 | Coefficient of Variation in duration (s) of the Stride Time of the right foot | |
| 11 | Mean in duration (s) of the Swing Time of the left foot | |
| Hausdorff et al. | 7 | Coefficient of Variation of the Short Swing Time (SSWCV) |
| 5 | Coefficient of Variation in duration (s) of the Swing Time of the right foot | |
| 4 | Coefficient of Variation in percentage (%) of the Swing Time of the right foot | |
| 13 | Mean in percentage (%) of the Swing Time of the right foot | |
| 6 | Coefficient of Variation in duration (s) of the Stride Time of the right foot | |
| Frenkel-Toledo et al. | 7 | Coefficient of Variation of the Short Swing Time (SSWCV) |
| 19 | Mean of the Gait Asymmetry | |
| 11 | Mean in duration (s) of the Swing Time of the left foot | |
| 8 | Coefficient of Variation of the Long Swing Time (LSWCV) | |
| 9 | Coefficient of Variation of the Gait Asymmetry |
Accuracy, Precision, Recall and F-measure for each classifier in the case of the Yogev et al. sub-dataset.
| Performances | Supervised | Unsupervised | |||||
|---|---|---|---|---|---|---|---|
| K-NN | CART | RF | NB | SVM | K-Means | GMM | |
| Accuracy | 86.05% | 83.72% | 86.05% | 74.42% | 86.05% | 63.72% | 64.77% |
| Precision | 84.89% | 82.86% | 85.07% | 74.73% | 84.90% | 64.34% | 62.63% |
| Recall | 86.34% | 81.94% | 85.07% | 76.45% | 85.71% | 65.31% | 62.91% |
| F-measure | 85.61% | 82.40% | 85.07% | 75.58% | 85.30% | 64.82% | 62.77% |
Accuracy, Precision, Recall and F-measure for each classifier in the case of the Hausdorff et al. sub-dataset.
| Performances | Supervised | Unsupervised | |||||
|---|---|---|---|---|---|---|---|
| K-NN | CART | RF | NB | SVM | K-Means | GMM | |
| Accuracy | 90.91% | 84.30% | 87.60% | 77.69% | 90.08% | 55.12% | 65.12% |
| Precision | 85.35% | 82.34% | 89.41% | 70.64% | 89.31% | 52.58% | 57.95% |
| Recall | 88.35% | 64.96% | 71.48% | 78.54% | 78.96% | 53.91% | 61.29% |
| F-measure | 86.83% | 72.62% | 79.45% | 74.38% | 83.82% | 53.24% | 59.57% |
Accuracy, Precision, Recall and F-measure for each classifier in the case of the Frenkel-Toledo et al. sub-dataset.
| Performances | Supervised | Unsupervised | |||||
|---|---|---|---|---|---|---|---|
| K-NN | CART | RF | NB | SVM | K-Means | GMM | |
| Accuracy | 81.25% | 79.69% | 82.81% | 79.69% | 82.81% | 57.19% | 65.31% |
| Precision | 81.43% | 79.56% | 83.10% | 79.69% | 82.81% | 61.07% | 64.95% |
| Recall | 81.67% | 79.36% | 82.22% | 79.95% | 83.10% | 59.23% | 64.62% |
| F-measure | 81.55% | 79.46% | 82.65% | 79.82% | 82.96% | 60.14% | 64.78% |
Global confusion matrix obtained using the different classifiers in the case of the Yogev et al. sub-dataset.
| Obtained Classes | |||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Supervised | Unsueprvised | ||||||||||||||
| K-NN | CART | RF | NB | SVM | K-Means | GMM | |||||||||
| Healthy | PD | Healthy | PD | Healthy | PD | Healthy | PD | Healthy | PD | Healthy | PD | Healthy | PD | ||
| True | Healthy | 87.50% | 12.50% | 75.00% | 25.00% | 81.25% | 18.75% | 84.38% | 15.62% | 84.38% | 15.62% | 71.53% | 28.47% | 55.63% | 44.37% |
| Classes | PD | 14.81% | 85.19% | 11.11% | 88.89% | 11.11% | 88.89% | 31.48% | 68.52% | 12.96% | 87.04% | 40.91% | 59.09% | 29.81% | 70.19% |
Global confusion matrix obtained using the different classifiers in the case of the Hausdorff et al. sub-dataset.
| Obtained Classes | |||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Supervised | Unsueprvised | ||||||||||||||
| K-NN | CART | RF | NB | SVM | K-Means | GMM | |||||||||
| Healthy | PD | Healthy | PD | Healthy | PD | Healthy | PD | Healthy | PD | Healthy | PD | Healthy | PD | ||
| True | Healthy | 84.00% | 16.00% | 32.00% | 68.00% | 44.00% | 56.00% | 80.00% | 20.00% | 60.00% | 40.00% | 51.84% | 48.16% | 54.76% | 45.24% |
| Classes | PD | 7.29% | 92.71% | 2.08% | 97.92% | 1.04% | 98.96% | 22.92% | 77.08% | 2.08% | 97.92% | 44.02% | 55.98% | 32.19% | 67.81% |
Global confusion matrix obtained using the different classifiers in the case of the Frenkel-Toledo et al. sub-dataset.
| Obtained Classes | |||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Supervised | Unsueprvised | ||||||||||||||
| K-NN | CART | RF | NB | SVM | K-Means | GMM | |||||||||
| Healthy | PD | Healthy | PD | Healthy | PD | Healthy | PD | Healthy | PD | Healthy | PD | Healthy | PD | ||
| True | Healthy | 86.21% | 13.79% | 75.86% | 24.14% | 75.86% | 24.14% | 82.76% | 17.24% | 86.21% | 13.79% | 80.97% | 19.03% | 57.24% | 42.76% |
| Classes | PD | 22.86% | 77.14% | 17.14% | 82.86% | 11.43% | 88.57% | 22.86% | 77.14% | 20.00% | 80.00% | 62.51% | 37.49% | 28.00% | 72.00% |
Classification accuracy results obtained in recent related studies based on PhysioNet datasets.
| Reference | Gait Parameters Features | Classifiers | Accuracy |
|---|---|---|---|
| Sarbaz et al., 2011 [ | Time-domain features | Nearest mean scaled classifier | 95.6% |
| Daliri 2012 [ | Time domain features | SVM | 89.33% |
| Lee et al., 2012 [ | frequency domain features | NEWFM | 74–77% |
| Daliri 2013 [ | Time domain features | SVM | 84–91% |
| Khorasani et al., 2014 [ | Raw gait data | HMM with GM | 90.3% |
| Ertugrul et al., 2016 [ | Entropy, Energy, Correlation, Coefficient | BayesNT, NB, LR, MLP, | 87–88% |
| Wu et al., 2017 [ | ApEn, NSE, STC | SVM | 84.48% |
| Jane et al., 2016 [ | Left and right vGRF signals | Q-BTDNN | 90–92% |
| Alam et al., 2017 [ | Time and Frequency domain | SVM | 85–95% |
| Açici et al., 2017 [ | Time and Frequency domain | RF | 74–98% |
| Khoury et al., 2018 [ | CDTW-Distance | K-NN, CART, RF, SVM, K-means, and GMM | 82–97% |
| Proposed methodology | Time domain features | K-NN, CART, RF, SVM, K-means, and GMM | 80–91% |
Accuracy, Precision, Recall and F-measure for each classifier, obtained from the binary classification between PD and ALS subjects.
| Performances | Supervised | Unsupervised | |||||
|---|---|---|---|---|---|---|---|
| K-NN | CART | RF | NB | SVM | K-Means | GMM | |
| Accuracy | 92.86% | 82.14% | 85.71% | 78.57% | 89.29% | 73.21% | 66.79% |
| Precision | 92.82% | 82.14% | 85.64% | 78.46% | 89.58% | 77.89% | 68.19% |
| Recall | 92.82% | 82.31% | 85.64% | 78.46% | 88.97% | 71.62% | 65.46% |
| F-measure | 92.82% | 82.83% | 85.64% | 78.46% | 89.28% | 74.62% | 66.88% |
Accuracy, Precision, Recall and F-measure for each classifier, obtained from the binary classification between PD and HD subjects.
| Performances | Supervised | Unsupervised | |||||
|---|---|---|---|---|---|---|---|
| K-NN | CART | RF | NB | SVM | K-Means | GMM | |
| Accuracy | 83.33% | 73.33% | 76.67% | 70% | 80 % | 69.33% | 64.67% |
| Precision | 83.43% | 73.76% | 76.79% | 70.83% | 80.54 % | 69.39% | 64.88% |
| Recall | 83.33% | 73.33% | 76.67% | 70% | 80% | 69.33% | 64.67% |
| F-measure | 83.41% | 73.54% | 76.73% | 70.41% | 80.27% | 69.36% | 64.77% |
Accuracy, Precision, Recall and F-measure for each classifier, obtained from the binary classification between PD and Healthy subjects.
| Performances | Supervised | Unsupervised | |||||
|---|---|---|---|---|---|---|---|
| K-NN | CART | RF | NB | SVM | K-Means | GMM | |
| Accuracy | 87.10% | 80.65% | 87.10% | 83.87% | 90.32% | 57.42% | 65.16% |
| Precision | 87.08% | 80.63% | 87.82% | 85.31% | 90.55% | 58.52% | 65.94% |
| Recall | 87.08% | 80.63% | 86.88% | 83.54% | 90.21% | 57.88% | 64.73% |
| F-measure | 87.08% | 80.62% | 87.35% | 84.42% | 90.38% | 58.20% | 65.33% |
Accuracy, Precision, Recall and F-measure for each classifier, obtained from the binary classification between PD and ALL (ALS, HD, and Healthy) subjects.
| Performances | Supervised | Unsupervised | |||||
|---|---|---|---|---|---|---|---|
| K-NN | CART | RF | NB | SVM | K-Means | GMM | |
| Accuracy | 90% | 80% | 83.33% | 76.67% | 86.67% | 75.33% | 69.33% |
| Precision | 90.18% | 80.54% | 83.48% | 76.79% | 87.33% | 76% | 69.62% |
| Recall | 90% | 80% | 83.33% | 76.67% | 86.67% | 75.33% | 69.33% |
| F-measure | 90.09% | 80.27% | 83.41% | 76.73% | 87.00% | 75.66% | 69.47% |