| Literature DB >> 34025389 |
Jie Mei1, Christian Desrosiers2, Johannes Frasnelli1,3.
Abstract
Diagnosis of Parkinson's disease (PD) is commonly based on medical observations and assessment of clinical signs, including the characterization of a variety of motor symptoms. However, traditional diagnostic approaches may suffer from subjectivity as they rely on the evaluation of movements that are sometimes subtle to human eyes and therefore difficult to classify, leading to possible misclassification. In the meantime, early non-motor symptoms of PD may be mild and can be caused by many other conditions. Therefore, these symptoms are often overlooked, making diagnosis of PD at an early stage challenging. To address these difficulties and to refine the diagnosis and assessment procedures of PD, machine learning methods have been implemented for the classification of PD and healthy controls or patients with similar clinical presentations (e.g., movement disorders or other Parkinsonian syndromes). To provide a comprehensive overview of data modalities and machine learning methods that have been used in the diagnosis and differential diagnosis of PD, in this study, we conducted a literature review of studies published until February 14, 2020, using the PubMed and IEEE Xplore databases. A total of 209 studies were included, extracted for relevant information and presented in this review, with an investigation of their aims, sources of data, types of data, machine learning methods and associated outcomes. These studies demonstrate a high potential for adaptation of machine learning methods and novel biomarkers in clinical decision making, leading to increasingly systematic, informed diagnosis of PD.Entities:
Keywords: Parkinson's disease; deep learning; diagnosis; differential diagnosis; machine learning
Year: 2021 PMID: 34025389 PMCID: PMC8134676 DOI: 10.3389/fnagi.2021.633752
Source DB: PubMed Journal: Front Aging Neurosci ISSN: 1663-4365 Impact factor: 5.750
Boolean search strings used for the retrieval of relevant publications on PubMed and IEEE Xplore databases.
| PubMed | (“Parkinson Disease”[Mesh] OR Parkinson*) AND (“Machine Learning”[Mesh] OR machine learn* OR machine-learn* OR deep learn* OR deep-learn*) AND (human OR patient) AND |
| IEEE | (Parkinson*) AND |
Figure 1PRISMA Flow Diagram of Literature Search and Selection Process showing the number of studies identified, screened, extracted, and included in the review.
Source of data of the included studies.
| independent recruitment of human participants | 93 | 43.06% |
| UCI Machine Learning Repository | 44 | 20.37% |
| PPMI database | 33 | 15.28% |
| PhysioNet | 15 | 6.94% |
| HandPD dataset | 6 | 2.78% |
| mPower database | 4 | 1.85% |
| Other databases | 6 | 2.78% |
| Collected postmortem | 1 | 0.46% |
| Commercially sourced | 1 | 0.46% |
| Acquired at another institution | 1 | 0.46% |
| From another study | 1 | 0.46% |
| From the author's institutional database | 1 | 0.46% |
| Others | 3 | 1.39% |
PACS, Picture Archiving and Communication System; PaHaW, Parkinson's Disease Handwriting Database.
Figure 2Sample size of the included studies. (A) Cumulative relative frequency graph depicting the frequency of the sample sizes studied. (B) Histogram depicting the frequency of a sample size of 0–50, 50–100, 100–200, 200–500, 500–100, and over 1,000 for studies using locally recruited human participants and studies using previously published open databases. Green, studies using locally recruited human participants; gray, studies using data sourced from public databases. (C) Model performance as measured by accuracy in relation to sample size, shown in means (SD).
Figure 3Data modality (A) and number of subjects (B,C) of included studies, summarized by objectives (i.e., methodology or clinical application). Orange, studies with a focus on the development of a novel technical approach to be used in the diagnosis of Parkinson's disease (i.e., methodology); blue, studies that investigate the use of published machine learning models or novel data modalities (i.e., clinical application). (A) Proportion of data modalities in included studies displayed as percentages. (B) Sample size in all included studies. (C) Sample size in studies that collected data from recruited human participants. Data shown are means (SD).
Performance metrics used in the evaluation of machine learning models.
| Accuracy | 174 | |
| Sensitivity (recall) | 110 | |
| Specificity (TNR) | 94 | |
| AUC | The two-dimensional area under the Receiver Operating Characteristic (ROC) curve | 60 |
| MCC | 9 | |
| Precision (PPV) | 31 | |
| NPV | 8 | |
| F1 score | 25 | |
| Others | N/A | 28 |
TNR, true negative rate; AUC, Area under the ROC Curve; MCC, Matthews correlation coefficient; PPV, positive predictive value; NPV, negative predictive value; EER, equal error rate; MSE, mean squared error; LOR, log odds ratio; YI, Youden's Index; FPR, false positive rate; FNR, false negative rate; PE, probability excess.
Figure 4Data type, machine learning models applied, and accuracy. (A) Accuracy achieved in individual studies and average accuracy for each data type. Error bar: standard deviation. (B) Distribution of machine learning models applied per data type. MRI, magnetic resonance imaging; SPECT, single-photon emission computed tomography; PET, positron emission tomography; CSF, cerebrospinal fluid; SVM, support vector machine; NN, neural network; EL, ensemble learning; k-NN, nearest neighbor; regr, regression; DT, decision tree; NB, naïve Bayes; DA, discriminant analysis; other: data/models that do not belong to any of the given categories.
Studies that applied machine learning models to voice recordings to diagnose PD (n = 55).
| Classification of PD from HC | Diagnosis | UCI machine learning repository | 31; 8 HC + 23 PD | Fuzzy neural system with 10-fold cross validation | Testing accuracy = 100% | 2016 | Abiyev and Abizade, |
| Classification of PD from HC | Diagnosis | UCI machine learning repository | 31; 8 HC + 23 PD | RPART, C4.5, PART, Bagging CART, random forest, Boosted C5.0, SVM | SVM: | 2019 | Aich et al., |
| Accuracy = 97.57% | |||||||
| Sensitivity = 0.9756 | |||||||
| Specificity = 0.9987 | |||||||
| NPV = 0.9995 | |||||||
| Classification of PD from HC | Diagnosis | UCI machine learning repository | 31; 8 HC + 23 PD | DBN of 2 RBMs | Testing accuracy = 94% | 2016 | Al-Fatlawi et al., |
| Classification of PD from HC | Diagnosis | UCI machine learning repository | 31; 8 HC + 23 PD | EFMM-OneR with 10-fold cross validation or 5-fold cross validation | Accuracy = 94.21% | 2019 | Sayaydeha and Mohammad, |
| Classification of PD from HC | Diagnosis | UCI machine learning repository | 40; 20 HC + 20 PD | Linear regression, LDA, Gaussian naïve Bayes, decision tree, KNN, SVM-linear, SVM-RBF with leave-one-subject-out cross validation | Logistic regression or SVM-linear accuracy = 70% | 2019 | Ali et al., |
| Classification of PD from HC | Diagnosis | UCI machine learning repository | 40; 20 HC + 20 PD | LDA-NN-GA with leave-one-subject-out cross validation | Training: | 2019 | Ali et al., |
| Accuracy = 95% | |||||||
| Sensitivity = 95% | |||||||
| Test: | |||||||
| Accuracy = 100% | |||||||
| Sensitivity = 100% | |||||||
| Classification of PD from HC | Diagnosis | UCI machine learning repository | 31; 8 HC + 23 PD | NNge with AdaBoost with 10-fold cross validation | Accuracy = 96.30% | 2018 | Alqahtani et al., |
| Classification of PD from HC | Diagnosis | UCI machine learning repository | 31; 8 HC + 23 PD | Logistic regression, KNN, naïve Bayes, SVM, decision tree, random forest, DNN with 10-fold cross validation | KNN accuracy = 95.513% | 2018 | Anand et al., |
| Classification of PD from HC | Diagnosis | UCI machine learning repository | 31; 8 HC + 23 PD | MLP with a train-validation-test ratio of 50:20:30 | Training accuracy = 97.86% | 2012 | Bakar et al., |
| Test accuracy = 92.96% | |||||||
| MSE = 0.03552 | |||||||
| Classification of PD from HC | Diagnosis | UCI machine learning repository | 31 (8 HC + 23 PD) for dataset 1 and 68 (20 HC + 48 PD) for dataset 2 | FKNN, SVM, KELM with 10-fold cross validation | FKNN accuracy = 97.89% | 2018 | Cai et al., |
| Classification of PD from HC | Diagnosis | UCI machine learning repository | 40; 20 HC + 20 PD | SVM, logistic regression, ET, gradient boosting, random forest with train-test split ratio = 80:20 | Logistic regression accuracy = 76.03% | 2019 | Celik and Omurca, |
| Classification of PD from HC | Diagnosis | UCI machine learning repository | 40; 20 HC + 20 PD | MLP, GRNN with a training-test ratio of 50:50 | GRNN: | 2016 | Çimen and Bolat, |
| Error rate = 0.0995 (spread parameter = 195.1189) | |||||||
| Error rate = 0.0958 (spread parameter = 1.2) | |||||||
| Error rate = 0.0928 (spread parameter = 364.8) | |||||||
| Classification of PD from HC | Diagnosis | UCI machine learning repository | 31; 8 HC + 23 PD | ECFA-SVM with 10-fold cross validation | Accuracy = 97.95% | 2017 | Dash et al., |
| Sensitivity = 97.90% | |||||||
| Precision = 97.90% | |||||||
| F-measure = 97.90% | |||||||
| Specificity = 96.50% | |||||||
| AUC = 97.20% | |||||||
| Classification of PD from HC | Diagnosis | UCI machine learning repository | 40; 20 HC + 20 PD | Fuzzy classifier with 10-fold cross validation, leave-one-out cross validation or a train-test ratio of 70:30 | Accuracy = 100% | 2019 | Dastjerd et al., |
| Classification of PD from HC | Diagnosis | UCI machine learning repository | 31; 8 HC + 23 PD | Averaged perceptron, BPM, boosted decision tree, decision forests, decision jungle, locally deep SVM, logistic regression, NN, SVM with 10-fold cross-validation | Boosted decision trees: | 2017 | Dinesh and He, |
| Accuracy = 0.912105 | |||||||
| Precision = 0.935714 | |||||||
| F-score = 0.942368 | |||||||
| AUC = 0.966293 | |||||||
| Classification of PD from HC | Diagnosis | UCI machine learning repository | 50; 8 HC + 42 PD | KNN, SVM, ELM with a train-validation ratio of 70:30 | SVM: | 2017 | Erdogdu Sakar et al., |
| Accuracy = 96.43% | |||||||
| MCC = 0.77 | |||||||
| Classification of PD from HC | Diagnosis | UCI machine learning repository | 252; 64 HC + 188 PD | CNN with leave-one-person-out cross validation | Accuracy = 0.869 | 2019 | Gunduz, |
| F-measure = 0.917 | |||||||
| MCC = 0.632 | |||||||
| Classification of PD from HC | Diagnosis | UCI machine learning repository | 31; 8 HC + 23 PD | SVM, logistic regression, KNN, DNN with a train-test ratio of 70:30 | DNN: | 2018 | Haq et al., |
| Accuracy = 98% | |||||||
| Specificity = 95% | |||||||
| sensitivity = 99% | |||||||
| Classification of PD from HC | Diagnosis | UCI machine learning repository | 31; 8 HC + 23 PD | SVM-RBF, SVM-linear with 10-fold cross validation | Accuracy = 99% | 2019 | Haq et al., |
| Specificity = 99% | |||||||
| Sensitivity = 100% | |||||||
| Classification of PD from HC | Diagnosis | UCI machine learning repository | 31; 8 HC + 23 PD | LS-SVM, PNN, GRNN with conventional (train-test ratio of 50:50) and 10-fold cross validation | LS-SVM or PNN or GRNN: | 2014 | Hariharan et al., |
| Accuracy = 100% | |||||||
| Precision = 100% | |||||||
| Sensitivity = 100% | |||||||
| specificity = 100% | |||||||
| AUC = 100 | |||||||
| Classification of PD from HC | Diagnosis | UCI machine learning repository | 31; 8 HC + 23 PD | Random tree, SVM-linear, FBANN with 10-fold cross validation | FBANN: | 2014 | Islam et al., |
| Accuracy = 97.37% | |||||||
| Sensitivity = 98.60% | |||||||
| Specificity = 93.62% | |||||||
| FPR = 6.38% | |||||||
| Precision = 0.979 | |||||||
| MSE = 0.027 | |||||||
| Classification of PD from HC | Diagnosis | UCI machine learning repository | 31; 8 HC + 23 PD | SVM-linear with 5-fold cross validation | Error rate ~0.13 | 2012 | Ji and Li, |
| Classification of PD from HC | Diagnosis | UCI machine learning repository | 40; 20 HC + 20 PD | Decision tree, random forest, SVM, GBM, XGBoost | SVM-linear: | 2018 | Junior et al., |
| FNR = 10% | |||||||
| Accuracy = 0.725 | |||||||
| Classification of PD from HC | Diagnosis | UCI machine learning repository | 31; 8 HC + 23 PD | CART, SVM, ANN | SVM accuracy = 93.84% | 2020 | Karapinar Senturk, |
| Classification of PD from HC | Diagnosis | UCI machine learning repository | Dataset 1: 31; 8 HC + 23 PD | EWNN with a train-test ratio of 90:10 and cross validation | Dataset 1: | 2018 | Khan et al., |
| Ensemble classification accuracy = 100.0% | |||||||
| Sensitivity = 100.0% | |||||||
| MCC = 100.0% | |||||||
| Dataset 2: | |||||||
| Accuracy = 66.3% | |||||||
| Ensemble classification accuracy = 90.0% | |||||||
| Sensitivity = 93.0% | |||||||
| Specificity = 97.0% | |||||||
| MCC = 87.0% | |||||||
| Classification of PD from HC | Diagnosis | UCI machine learning repository | 40; 20 HC + 20 PD | Stacked generalization with CMTNN with 10-fold cross validation | Accuracy = ~70% | 2015 | Kraipeerapun and Amornsamankul, |
| Classification of PD from HC | Diagnosis | UCI machine learning repository | 40; 20 HC + 20 PD | HMM, SVM | HMM: | 2019 | Kuresan et al., |
| Accuracy = 95.16% | |||||||
| Sensitivity = 93.55% | |||||||
| Specificity = 91.67% | |||||||
| Classification of PD from HC | Diagnosis | UCI machine learning repository | 31; 8 HC + 23 PD | IGWO-KELM with 10-fold cross validation | Iteration number = 100 | 2017 | Li et al., |
| Accuracy = 97.45% | |||||||
| Sensitivity = 99.38% | |||||||
| Specificity = 93.48% | |||||||
| Precision = 97.33% | |||||||
| G-mean = 96.38% | |||||||
| F-measure = 98.34% | |||||||
| Classification of PD from HC | Diagnosis | UCI machine learning repository | 31; 8 HC + 23 PD | SCFW-KELM with 10-fold cross validation | Accuracy = 99.49% | 2014 | Ma et al., |
| Sensitivity = 100% | |||||||
| Specificity = 99.39% | |||||||
| AUC = 99.69% | |||||||
| F-measure = 0.9966 | |||||||
| Kappa = 0.9863 | |||||||
| Classification of PD from HC | Diagnosis | UCI machine learning repository | 31; 8 HC + 23 PD | SVM-RBF with 10-fold cross validation | Accuracy = 96.29% | 2016 | Ma et al., |
| Sensitivity = 95.00% | |||||||
| Specificity = 97.50% | |||||||
| Classification of PD from HC | Diagnosis | UCI machine learning repository | 31; 8 HC + 23 PD | Logistic regression, NN, SVM, SMO, Pegasos, AdaBoost, ensemble selection, FURIA, rotation forest Bayesian network with 10-fold cross-validation | Average accuracy across all models = 97.06% | 2013 | Mandal and Sairam, |
| Classification of PD from HC | Diagnosis | UCI machine learning repository | 31; 8 HC + 23 PD | Logistic regression, KNN, SVM, naïve Bayes, decision tree, random forest, ANN | ANN: | 2018 | Marar et al., |
| Accuracy = 94.87% | |||||||
| Specificity = 96.55% | |||||||
| Sensitivity = 90% | |||||||
| Classification of PD from HC | Diagnosis | UCI machine learning repository | Dataset 1: 31; 8 HC + 23 PD | KNN | Dataset 1 accuracy = 90% | 2017 | Moharkan et al., |
| Dataset 2: 40; 20 HC + 20 PD | Dataset 2 accuracy = 65% | ||||||
| Classification of PD from HC | Diagnosis | UCI machine learning repository | 31; 8 HC + 23 PD | Rotation forest ensemble with 10-fold cross validation | Accuracy = 87.1% | 2011 | Ozcift and Gulten, |
| Kappa error = 0.63 | |||||||
| AUC = 0.860 | |||||||
| Classification of PD from HC | Diagnosis | UCI machine learning repository | 31; 8 HC + 23 PD | Rotation forest ensemble | Accuracy = 96.93% | 2012 | Ozcift, |
| Kappa = 0.92 | |||||||
| AUC = 0.97 | |||||||
| Classification of PD from HC | Diagnosis | UCI machine learning repository | 31; 8 HC + 23 PD | SVM-RBF with 10-fold cross validation or a train-test ratio of 50:50 | 10-fold cross validation: | 2016 | Peker, |
| Accuracy = 98.95% | |||||||
| Sensitivity = 96.12% | |||||||
| Specificity = 100% | |||||||
| F-measure = 0.9795 | |||||||
| Kappa = 0.9735 | |||||||
| AUC = 0.9808 | |||||||
| Classification of PD from HC | Diagnosis | UCI machine learning repository | 31; 8 HC + 23 PD | ELM with 10-fold cross validation | Accuracy = 88.72% | 2016 | Shahsavari et al., |
| Recall = 94.33% | |||||||
| Precision = 90.48% | |||||||
| F-score = 92.36% | |||||||
| Classification of PD from HC | Diagnosis | UCI machine learning repository | 31; 8 HC + 23 PD | Ensemble learning with 10-fold cross validation | Accuracy = 90.6% | 2019 | Sheibani et al., |
| Sensitivity = 95.8% | |||||||
| Specificity = 75% | |||||||
| Classification of PD from HC | Diagnosis | UCI machine learning repository | 31; 8 HC + 23 PD | GLRA, SVM, bagging ensemble with 5-fold cross validation | Bagging: | 2017 | Wu et al., |
| Sensitivity = 0.9796 | |||||||
| Specificity = 0.6875 | |||||||
| MCC = 0.6977 | |||||||
| AUC = 0.9558 | |||||||
| SVM: | |||||||
| Sensitivity = 0.9252 | |||||||
| specificity = 0.8542 | |||||||
| MCC = 0.7592 | |||||||
| AUC = 0.9349 | |||||||
| Classification of PD from HC | Diagnosis | UCI machine learning repository | 31; 8 HC + 23 PD | Decision tree classifier, logistic regression, SVM with 10-fold cross validation | SVM: | 2011 | Yadav et al., |
| Accuracy = 0.76 | |||||||
| Sensitivity = 0.9745 | |||||||
| Specificity = 0.13 | |||||||
| Classification of PD from HC | Diagnosis | UCI machine learning repository | 80; 40 HC + 40 PD | KNN, SVM with 10-fold cross validation | SVM: | 2019 | Yaman et al., |
| Accuracy = 91.25% | |||||||
| Precision = 0.9125 | |||||||
| Recall = 0.9125 | |||||||
| F-Measure = 0.9125 | |||||||
| Classification of PD from HC | Diagnosis | UCI machine learning repository | 31; 8 HC + 23 PD | MAP, SVM-RBF, FLDA with 5-fold cross validation | MAP: | 2014 | Yang et al., |
| Accuracy = 91.8% | |||||||
| Sensitivity = 0.986 | |||||||
| Specificity = 0.708 | |||||||
| AUC = 0.94 | |||||||
| Classification of PD from other disorders | Differential diagnosis | Collected from participants | 50; 30 PD + 9 MSA + 5 FND + 1 somatization + 1 dystonia + 2 CD + 1 ET + 1 GPD | SVM, KNN, DA, naïve Bayes, classification tree with LOSO | SVM-linear: | 2016 | Benba et al., |
| Accuracy = 90% | |||||||
| Sensitivity = 90% | |||||||
| Specificity = 90% | |||||||
| MCC = 0.794067 | |||||||
| PE = 0.788177 | |||||||
| Classification of PD from other disorders | Differential diagnosis | Collected from participants | 40; 20 PD + 9 MSA + 5 FND + 1 somatization + 1 dystonia + 2 CD + 1ET + 1 GPD | SVM (RBF, linear, polynomial, and MLP kernels) with LOSO | SVM-linear accuracy = 85% | 2016 | Benba et al., |
| Classification of PD from HC and assess the severity of PD | Diagnosis | Collected from participants | 52; 9 HC + 43 PD | SVM-RBF with cross validation | Accuracy = 81.8% | 2014 | Frid et al., |
| Classification of PD from HC | Diagnosis | Collected from participants | 54; 27 HC + 27 PD | SVM with stratified 10-fold cross validation or leave-one-out cross validation | Accuracy = 94.4% | 2018 | Montaña et al., |
| Specificity = 100% | |||||||
| Sensitivity = 88.9% | |||||||
| Classification of PD from HC | Diagnosis | Collected from participants | 40; 20 HC + 20 PD | KNN, SVM-linear, SVM-RBF with leave-one-subject-out or summarized leave-one-out | SVM-linear: | 2013 | Sakar et al., |
| Accuracy = 77.50% | |||||||
| MCC = 0.5507 | |||||||
| Sensitivity = 80.00% | |||||||
| Specificity = 75.00% | |||||||
| Classification of PD from HC | Diagnosis | Collected from participants | 78; 27 HC + 51 PD | KNN, SVM-linear, SVM-RBF, ANN, DNN with leave-one-out cross validation | SVM-RBF: | 2017 | Sztahó et al., |
| Accuracy = 84.62% | |||||||
| Precision = 88.04% | |||||||
| Recall = 78.65% | |||||||
| Classification of PD from HC and assess the severity of PD | Diagnosis | Collected from participants | 88; 33 HC + 55 PD | KNN, SVM-linear, SVM-RBF, ANN, DNN with leave-one-subject-out cross validation | SVM-RBF: | 2019 | Sztahó et al., |
| Accuracy = 89.3% | |||||||
| Sensitivity = 90.2% | |||||||
| Specificity = 87.9% | |||||||
| Classification of PD from HC | Diagnosis | Collected from participants | 43; 10 HC + 33 PD | Random forests, SVM with 10-fold cross validation and a train-test ratio of 90:10 | SVM accuracy = 98.6% | 2012 | Tsanas et al., |
| Classification of PD from HC | Diagnosis | Collected from participants | 99; 35 HC + 64 PD | Random forest with internal out-of-bag (OOB) validation | EER = 19.27% | 2017 | Vaiciukynas et al., |
| Classification of PD from HC | Diagnosis | UCI machine learning repository and participants | 40 and 28; 20 HC + 20 PD and 28 PD, respectively | ELM | Training data: | 2016 | Agarwal et al., |
| Accuracy = 90.76% | |||||||
| MCC = 0.815 | |||||||
| Test data: | |||||||
| Accuracy = 81.55% | |||||||
| Classification of PD from HC | Diagnosis | The Neurovoz corpus | 108; 56 HC + 52 PD | Siamese LSTM-based NN with 10-fold cross- validation | EER = 1.9% | 2019 | Bhati et al., |
| Classification of PD from HC | Diagnosis | mPower database | 2,289; 2,023 HC + 246 PD | L2-regularized logistic regression, random forest, gradient boosted decision trees with 5-fold cross validation | Gradient boosted decision trees: | 2019 | Tracy et al., |
| Recall = 0.797 | |||||||
| Precision = 0.901 | |||||||
| F1-score = 0.836 | |||||||
| Classification of PD from HC | Diagnosis | PC-GITA database | 100; 50 HC + 50 PD | ResNet with train-validation ratio of 90:10 | Precision = 0.92 | 2019 | Wodzinski et al., |
| Recall = 0.92 | |||||||
| F1-score = 0.92 | |||||||
| Accuracy = 91.7% |
ANN, artificial neural network; AUC, area under the receiver operating characteristic (ROC) curve; CART, classification and regression trees; CD, cervical dystonia; CMTNN, complementary neural network; CNN, convolutional neural network; DA, discriminant analysis; DBN, deep belief network; DNN, deep neural network; ECFA, enhanced chaos-based firefly algorithm; EFMM-OneR, enhanced fuzzy min-max neural network with the OneR attribute evaluator; ELM, extreme Learning machine; ET, extra trees or essential tremor; EWNN, evolutionary wavelet neural network; FBANN, feedforward back-propagation based artificial neural network; FKNN, fuzzy k-nearest neighbor; FLDA, Fisher's linear discriminant analysis; FND, functional neurological disorder; FNR, false negative rate; FPR, false positive rate; FURIA, fuzzy unordered rule induction algorithm; GA, genetic algorithm; GBM, gradient boosting machine; GLRA, generalized logistic regression analysis; GPD, generalized paroxysmal dystonia; GRNN, general(ized) regression neural network; HC, healthy control; HMM, hidden Markov model; IGWO-KELM, improved gray wolf optimization and kernel(-based) extreme learning machine; KELM, kernel-based extreme learning machine; KNN, k-nearest neighbors; LDA, linear discriminant analysis; LOSO, leave-one-subject-out; LS-SVM, least-square support vector machine; LSTM, long short-term memory; MAP, maximum a posteriori decision rule; MCC, Matthews correlation coefficient; MLP, multilayer perceptron; MSA, multiple system atrophy; MSE, mean squared error; NN, neural network; NNge, non-nested generalized exemplars; NPV, negative predictive value; PD, Parkinson's disease; PNN, probabilistic neural network; RBM, restricted Boltzmann machine; ResNet, residual neural network; RPART, recursive partitioning and regression trees; SCFW-KELM, subtractive clustering features weighting and kernel-based extreme learning machine; SMO, sequential minimal optimization; SVM, support vector machine; SVM-linear, support vector machine with linear kernel; SVM-RBF, support vector machine with radial basis function kernel; XGBoost, extreme gradient boosting.
Studies that applied machine learning models to movement data to diagnose PD (n = 51).
| Classification of PD from HC | Diagnosis | Collected from participants | 103; 71 HC + 32 PD | Ensemble method of 8 models (SVM, MLP, logistic regression, random forest, NSVC, decision tree, KNN, QDA) | Sensitivity = 96% | 2017 | Adams, |
| Classification of PD, HC and other neurological stance disorders | Diagnosis and differential diagnosis | Collected from participants | 293; 57 HC + 27 PD + 49 AVS + 12 PNP + 48 CA + 16 DN + 25 OT + 59 PPV | Ensemble method of 7 models (logistic regression, KNN, shallow and deep ANNs, SVM, random forest, extra-randomized trees) with 90% training and 10% testing data in stratified k-fold cross-validation | 8-class classification accuracy = 82.7% | 2019 | Ahmadi et al., |
| Classification of PD from HC | Diagnosis | Collected from participants | 137; 38 HC + 99 PD | SVM with leave-one-out-cross validation | PD vs. HC accuracy = 92.3% | 2016 | Bernad-Elazari et al., |
| Mild vs. severe accuracy = 89.8% | |||||||
| Mild vs. HC accuracy = 85.9% | |||||||
| Classification of PD from HC | Diagnosis | Collected from participants | 30; 14 HC + 16 PD | SVM (linear, quadratic, cubic, Gaussian kernels), ANN, with 5-fold cross-validation | Classification with ANN: | 2019 | Buongiorno et al., |
| Accuracy = 89.4% | |||||||
| Sensitivity = 87.0% | |||||||
| Specificity = 91.8% | |||||||
| Severity assessment with ANN: | |||||||
| Accuracy = 95.0% | |||||||
| sensitivity = 90.0% | |||||||
| Specificity = 99.0% | |||||||
| Classification of PD from HC | Diagnosis | Collected from participants | 28; 12 HC + 16 PD | NN with a train-validation-test ratio of 70:15:15, SVM with leave-one-out cross-validation, logistic regression with 10-fold cross validation | SVM: | 2017 | Butt et al., |
| Classification of PD from HC | Diagnosis | Collected from participants | 28; 12 HC + 16 PD | Logistic regression, naïve Bayes, SVM with 10-fold cross validation | Naïve Bayes: | 2018 | Butt et al., |
| Accuracy = 81.45% | |||||||
| Sensitivity = 76% | |||||||
| Specificity = 86.5% | |||||||
| AUC = 0.811 | |||||||
| Classification of PD from HC | Diagnosis | Collected from participants | 54; 27 HC + 27 PD | Naïve Bayes, LDA, KNN, decision tree, SVM-linear, SVM-RBF, majority of votes with 5-fold cross validation | Majority of votes (weighted) accuracy = 96% | 2018 | Caramia et al., |
| Classification of PD, HC and PD, HC, IH | Diagnosis | Collected from participants | 90; 30 PD + 30 HC + 30 IH | SVM, random forest, naïve Bayes with 10-fold cross validation | Random forest: | 2019 | Cavallo et al., |
| HC vs. PD: | |||||||
| Accuracy = 0.950 | |||||||
| F-measure = 0.947 | |||||||
| HC + IH vs. PD: | |||||||
| Accuracy = 0.917 | |||||||
| F-measure = 0.912 | |||||||
| HC vs. IH vs. PD: | |||||||
| Accuracy = 0.789 | |||||||
| F-measure = 0.796 | |||||||
| Classification of PD from HC and classification of HC, MCI, PDNOMCI, and PDMCI | Diagnosis, differential diagnosis and subtyping | Collected from participants | PD vs. HC: | Decision tree, naïve Bayes, random forest, SVM, adaptive boosting (with decision tree or random forest) with 10-fold cross validation | Adaptive boosting with decision tree: | 2015 | Cook et al., |
| 75; 50 HC + 25 PD | PD vs. HC: | ||||||
| Accuracy = 0.79 | |||||||
| Subtyping: | AUC = 0.82 | ||||||
| 52; 18 HC + 16 PDNOMCI + 9 PDMCI + 9 MCI | Subtyping (HOA vs. MCI vs. PDNOMCI vs. PDMCI): | ||||||
| Accuracy = 0.85 | |||||||
| AUC = 0.96 | |||||||
| Classification of PD from HC | Diagnosis | Collected from participants | 580; 424 HC + 156 PD | Hidden Markov models with nearest neighbor classifier with cross validation and train-test ratio of 66.6:33.3 | Accuracy = 85.51% | 2017 | Cuzzolin et al., |
| Classification of PD from HC | Diagnosis | Collected from participants | 80; 40 HC + 40 PD | Random forest, SVM with 10-fold cross validation | SVM-RBF: | 2017 | Djurić-Jovičić et al., |
| Accuracy = 85% | |||||||
| Sensitivity = 85% | |||||||
| Specificity = 82% | |||||||
| PPV = 86% | |||||||
| NPV = 83% | |||||||
| Classification of PD from HC | Diagnosis | Collected from participants | 13; 5 HC + 8 PD | SVM-RBF with leave-one-out cross validation | 100% HC and PD classified correctly (confusion matrix) | 2014 | Dror et al., |
| Classification of PD from HC | Diagnosis | Collected from participants | 75; 38 HC + 37 PD | SVM with leave-one-out cross validation | Accuracy = 85.61% | 2014 | Drotár et al., |
| Sensitivity = 85.95% | |||||||
| Specificity = 85.26% | |||||||
| Classification of PD from ET | Differential diagnosis | Collected from participants | 24; 13 PD + 11 ET | SVM-linear, SVM-RBF with leave-one-out cross validation | Accuracy = 83% | 2016 | Ghassemi et al., |
| Classification of PD from HC | Diagnosis | Collected from participants | 41; 22 HC + 19 PD | SVM, decision tree, random forest, linear regression with 10-fold and leave-one-individual out (L1O) cross validation | SVM accuracy = 0.89 | 2018 | Klein et al., |
| Classification of PD from HC | Diagnosis | Collected from participants | 74; 33 young HC + 14 elderly HC + 27 PD | SVM with 10-fold cross validation | Sensitivity = ~90% | 2017 | Javed et al., |
| Classification of PD from HC and assess the severity of PD | Diagnosis | Collected from participants | 55; 20 HC + 35 PD | SVM with leave-one-out cross validation | PD diagnosis: | 2016 | Koçer and Oktay, |
| Accuracy = 89% | |||||||
| Precision = 0.91 | |||||||
| Recall = 0.94 | |||||||
| Severity assessment: | |||||||
| HYS 1 accuracy = 72% | |||||||
| HYS 2 accuracy = 77% | |||||||
| HYS 3 accuracy = 75% | |||||||
| HYS 4 accuracy = 33% | |||||||
| Classification of PD from HC | Diagnosis | Collected from participants | 45; 20 HC + 25 PD | Naïve Bayes, logistic regression, SVM, AdaBoost, C4.5, BagDT with 10-fold stratified cross-validation apart from BagDT | BagDT: | 2015 | Kostikis et al., |
| Classification of PD from HC | Diagnosis | Collected from participants | 40; 26 HC + 14 PD | Random forest with leave-one-subject-out cross-validation | Accuracy = 94.6% | 2017 | Kuhner et al., |
| Classification of PD from HC | Diagnosis | Collected from participants | 177; 70 HC + 107 PD | ESN with 10-fold cross validation | AUC = 0.852 | 2018 | Lacy et al., |
| Classification of PD from HC | Diagnosis | Collected from participants | 39; 16 young HC + 12 elderly HC + 11 PD | LDA with leave-one-out cross validation | Multiclass classification (young HC vs. age-matched HC vs. PD): | 2018 | Martínez et al., |
| Accuracy = 64.1% | |||||||
| Sensitivity = 47.1% | |||||||
| Specificity = 77.3% | |||||||
| Classification of PD from HC | Diagnosis | Collected from participants | 38; 10 HC + 28 PD | SVM-Gaussian with leave-one-out cross validation | Training accuracy = 96.9% | 2018 | Oliveira H. M. et al., |
| Test accuracy = 76.6% | |||||||
| Classification of PD from HC | Diagnosis | Collected from participants | 30; 15 HC + 15 PD | SVM-RBF, PNN with 10-fold cross validation | SVM-RBF: | 2015 | Oung et al., |
| Accuracy = 88.80% | |||||||
| Sensitivity = 88.70% | |||||||
| Specificity = 88.15% | |||||||
| AUC = 88.48 | |||||||
| Classification of PD from HC | Diagnosis | Collected from participants | 45; 14 HC + 31 PD | Deep-MIL-CNN with LOSO or RkF | With LOSO: | 2019 | Papadopoulos et al., |
| Precision = 0.987 | |||||||
| Sensitivity = 0.9 | |||||||
| specificity = 0.993 | |||||||
| F1-score = 0.943 | |||||||
| With RkF: | |||||||
| Precision = 0.955 | |||||||
| Sensitivity = 0.828 | |||||||
| Specificity = 0.979 | |||||||
| F1-score = 0.897 | |||||||
| Classification of PD, HC and post-stroke | Diagnosis and differential diagnosis | Collected from participants | 11; 3 HC + 5 PD + 3 post-stroke | MTFL with 10-fold cross validation | PD vs. HC AUC = 0.983 | 2017 | Papavasileiou et al., |
| Classification of PD from HC | Diagnosis | Collected from participants | 182; 94 HC + 88 PD | LSTM, CNN-1D, CNN-LSTM with 5-fold cross-validation and a training-test ratio of 90:10 | CNN-LSTM: | 2019 | Reyes et al., |
| Accuracy = 83.1% | |||||||
| Precision = 83.5% | |||||||
| Recall = 83.4% | |||||||
| F1-score = 81% | |||||||
| Kappa = 64% | |||||||
| Classification of PD from HC | Diagnosis | Collected from participants | 60; 30 HC + 30 PD | Naïve Bayes, KNN, SVM with leave-one-out cross validation | SVM: | 2019 | Ricci et al., |
| Accuracy = 95% | |||||||
| Precision = 0.951 | |||||||
| AUC = 0.950 | |||||||
| Classification of PD, HC and IH | Diagnosis and differential diagnosis | Collected from participants | 90; 30 HC + 30 PD + 30 IH | SVM-polynomial, random forest, naïve Bayes with 10-fold cross validation | HC vs. PD, naïve Bayes or random forest: | 2018 | Rovini et al., |
| Precision = 0.967 | |||||||
| Recall = 0.967 | |||||||
| Specificity = 0.967 | |||||||
| Accuracy = 0.967 | |||||||
| F-measure = 0.967 | |||||||
| HC + IH vs. PD, random forest: | |||||||
| Precision = 1.000 | |||||||
| Recall = 0.933 | |||||||
| Specificity = 1.000 | |||||||
| Accuracy = 0.978 | |||||||
| F-measure = 0.966 | |||||||
| Multiclass classification, random forest: | |||||||
| Precision = 0.784 | |||||||
| Recall = 0.778 | |||||||
| Specificity = 0.889 | |||||||
| Accuracy = 0.778 | |||||||
| F-measure = 0.781 | |||||||
| Classification of PD, HC and IH | Diagnosis and differential diagnosis | Collected from participants | 45; 15 HC + 15 PD + 15 IH | SVM-polynomial, random forest with 5-fold cross validation | HC vs. PD, random forest: | 2019 | Rovini et al., |
| Precision = 1.000 | |||||||
| Recall = 1.000 | |||||||
| Specificity = 1.000 | |||||||
| Accuracy = 1.000 | |||||||
| F-measure = 1.000 | |||||||
| Multiclass classification (HC vs. IH vs. PD), random forest: | |||||||
| Precision = 0.930 | |||||||
| Recall = 0.911 | |||||||
| Specificity = 0.956 | |||||||
| Accuracy = 0.911 | |||||||
| F-measure = 0.920 | |||||||
| Classification of PD from ET | Differential diagnosis | Collected from participants | 52; 32 PD + 20 ET | SVM-linear with 10-fold cross validation | Accuracy = 1 | 2016 | Surangsrirat et al., |
| Sensitivity = 1 | |||||||
| Specificity = 1 | |||||||
| Classification of PD from HC | Diagnosis | Collected from participants | 12; 10 HC + 2 PD | Naive Bayes, LogitBoost, random forest, SVM with 10-fold cross-validation | Random forest: | 2017 | Tahavori et al., |
| Accuracy = 92.29% | |||||||
| Precision = 0.99 | |||||||
| Recall = 0.99 | |||||||
| Classification of PD from HC | Diagnosis | Collected from participants | 39; 16 HC + 23 PD | SVM-RBF with 10-fold stratified cross validation | Sensitivity = 88.9% | 2010 | Tien et al., |
| Specificity = 100% | |||||||
| Precision = 100% | |||||||
| FPR = 0.0% | |||||||
| Classification of PD from HC | Diagnosis | Collected from participants | 60; 30 HC + 30 PD | Logistic regression, naïve Bayes, random forest, decision tree with 10-fold cross validation | Random forest: | 2018 | Urcuqui et al., |
| Accuracy = 82% | |||||||
| False negative rate = 23% | |||||||
| False positive rate = 12% | |||||||
| Classification of PD from HC | Diagnosis | PhysioNet | 47; 18 HC + 29 PD | SVM, KNN, random forest, decision tree | SVM with cubic kernel: | 2017 | Alam et al., |
| Accuracy = 93.6% | |||||||
| Sensitivity = 93.1% | |||||||
| Specificity = 94.1% | |||||||
| Classification of PD from HC | Diagnosis | PhysioNet | 34; 17 HC + 17 PD | MLP, SVM, decision tree | MLP: | 2018 | Alaskar and Hussain, |
| Accuracy = 91.18% | |||||||
| Sensitivity = 1 | |||||||
| Specificity = 0.83 | |||||||
| Error = 0.09 | |||||||
| AUC = 0.92 | |||||||
| Classification of PD from HC and assess the severity of PD | Diagnosis | PhysioNet | 166; 73 HC + 93 PD | 1D-CNN, 2D-CNN, LSTM, decision tree, logistic regression, SVM, MLP | 2D-CNN and LSTM accuracy = 96.0% | 2019 | Alharthi and Ozanyan, |
| Classification of PD from HC | Diagnosis | PhysioNet | 146; 60 HC + 86 PD | SVM-Gaussian with 3- or 5-fold cross validation | Accuracy = 100%, 88.88%, and 100% in three test groups | 2019 | Andrei et al., |
| Classification of PD from HC | Diagnosis | PhysioNet | 166; 73 HC + 93 PD | ANN, SVM, naïve Bayes with cross validation | ANN accuracy = 86.75% | 2017 | Baby et al., |
| Classification of PD from HC | Diagnosis | PhysioNet | 31; 16 HC + 15 PD | SVM-linear, KNN, naïve Bayes, LDA, decision tree with leave-one-out cross validation | SVM, KNN and decision tree accuracy = 96.8% | 2019 | Félix et al., |
| Classification of PD from HC | Diagnosis | PhysioNet | 31; 16 HC + 15 PD | SVM-linear with leave-one-out cross validation | Accuracy = 100% | 2017 | Joshi et al., |
| Classification of PD from HC | Diagnosis | PhysioNet | 165; 72 HC + 93 PD | KNN, CART, decision tree, random forest, naïve Bayes, SVM-polynomial, SVM-linear, K-means, GMM with leave-one-out cross validation | SVM: | 2019 | Khoury et al., |
| Classification of ALS, HD, PD from HC | Diagnosis | PhysioNet | 64; 16 HC + 15 PD + 13 ALS + 20 HD | String grammar unsupervised possibilistic fuzzy C-medians with FKNN, with 4-fold cross validation | PD vs. HC accuracy = 96.43% | 2018 | Klomsae et al., |
| Classification of PD from HC | Diagnosis | PhysioNet | 166; 73 HC + 93 PD | Logistic regression, decision trees, random forest, SVM-Linear, SVM-RBF, SVM-Poly, KNN with cross validation | KNN: | 2018 | Mittra and Rustagi, |
| Accuracy = 93.08% | |||||||
| Precision = 89.58% | |||||||
| Recall = 84.31% | |||||||
| F1-score = 86.86% | |||||||
| Classification of PD from HC | Diagnosis | PhysioNet | 85; 43 HC + 42 PD | LS-SVM with leave-one-out, 2- or 10-fold cross validation | Leave-one-out cross validation: | 2018 | Pham, |
| AUC = 1 | |||||||
| Sensitivity = 100% | |||||||
| Specificity = 100% | |||||||
| Accuracy = 100% | |||||||
| 10-fold cross validation: | |||||||
| AUC = 0.89 | |||||||
| Sensitivity = 85.00% | |||||||
| Specificity = 73.21% | |||||||
| Accuracy = 79.31% | |||||||
| Classification of PD from HC | Diagnosis | PhysioNet | 165; 72 HC + 93 PD | LS-SVM with leave-one-out, 2- or 5- or 10-fold cross validation | Accuracy = 100% | 2018 | Pham and Yan, |
| Sensitivity = 100% | |||||||
| Specificity = 100% | |||||||
| AUC = 1 | |||||||
| Classification of PD from HC | Diagnosis | PhysioNet | 166; 73 HC + 93 PD | DCALSTM with stratified 5-fold cross validation | Sensitivity = 99.10% | 2019 | Xia et al., |
| Specificity = 99.01% | |||||||
| Accuracy = 99.07% | |||||||
| Classification of HC, PD, ALS and HD | Diagnosis and differential diagnosis | PhysioNet | 64; 16 HC + 15 PD + 13 ALS + 20 HD | SVM-RBF with 10-fold cross validation | PD vs. HC: | 2009 | Yang et al., |
| Accuracy = 86.43% | |||||||
| AUC = 0.92 | |||||||
| Classification of PD, HD, ALS and ND from HC | Diagnosis | PhysioNet | 64; 16 HC + 15 PD + 13 ALS + 20 HD | Adaptive neuro-fuzzy inference system with leave-one-out cross validation | PD vs. HC: | 2018 | Ye et al., |
| Accuracy = 90.32% | |||||||
| Sensitivity = 86.67% | |||||||
| Specificity = 93.75% | |||||||
| Classification of PD from HC and assess the severity of PD | Diagnosis | mPower database | 50; 22 HC + 28 PD | Random forest, bagged trees, SVM, KNN with 10-fold cross validation | Random forest: | 2017 | Abujrida et al., |
| PD vs. HC accuracy = 87.03% | |||||||
| PD severity assessment accuracy = 85.8% | |||||||
| Classification of PD from HC | Diagnosis | mPower database | 1,815; 866 HC + 949 PD | CNN with 10-fold cross validation | Accuracy = 62.1% | 2018 | Prince and de Vos, |
| F1 score = 63.4% | |||||||
| AUC = 63.5% | |||||||
| Classification of PD from HC | Diagnosis | Dataset from Fernandez et al., | 49; 26 HC + 23 PD | KFD-RBF, naïve Bayes, KNN, SVM-RBF, random forest with 10-fold cross validation | Random forest accuracy = 92.6% | 2015 | Wahid et al., |
ALS, amyotrophic lateral sclerosis; ANN, artificial neural network; AUC, area under the receiver operating characteristic (ROC) curve; AVS, acute unilateral vestibulopathy; BagDT, bootstrap aggregation for a random forest of decision trees; CA, anterior lobe cerebella atrophy; CART, classification and regression trees; DCALSTM, dual-modal with each branch has a convolutional network followed by an attention-enhanced bi-directional LSTM; DN, downbeat nystagmus syndrome; ESN, echo state network; FKNN, fuzzy k-nearest neighbor; GMM, Gaussian mixture model; HC, healthy control; HD, Huntington's disease; IH, idiopathic hyposmia; KFD, kernel Fisher discriminant; KNN, k-nearest neighbors; LDA, linear discriminant analysis; LOSO, leave-one-subject-out; LS-SVM, least-squares support vector machine; LSTM, long short-term memory; MCI, mild cognitive impairment; MIL, multiple-instance learning; MLP, multilayer perceptron; MTFL, multi-task feature learning; NN, neural network; NSVC, nu-support vector classification; OT, primary orthostatic tremor; PD, Parkinson's disease; PDMCI, PD participants who met criteria for mild cognitive impairment; PDNOMCI, PD participants with no indication of mild cognitive impairment; PNN, probabilistic neural network; PNP, sensory polyneuropathy; PPV, phobic postural vertigo; QDA, quadratic discriminant analysis; RkF, repeated k-fold; SVM, support vector machine; SVM-Poly, support vector machine with polynomial kernel; SVM-RBF, support vector machine with radial basis function kernel.
Studies that applied machine learning models to MRI data to diagnose PD (n = 36).
| Classification of PD from MSA | Differential diagnosis | Collected from participants | 150; 54 HC + 65 PD + 31 MSA | SVM with leave-one-out-cross validation | MSA vs. PD: | 2019 | Abos et al., |
| Accuracy = 0.79 | |||||||
| Sensitivity = 0.71 | |||||||
| Specificity = 0.86 | |||||||
| MSA vs. HC: | |||||||
| Accuracy = 0.79 | |||||||
| Sensitivity = 0.84 | |||||||
| Specificity = 0.74 | |||||||
| MSA vs. subsample of PD: | |||||||
| Accuracy = 0.84 | |||||||
| Sensitivity = 0.77 | |||||||
| Specificity = 0.90 | |||||||
| Classification of PD from MSA | Differential diagnosis | Collected from participants | 151; 59 HC + 62 PD + 30 MSA | SVM with leave-one-out-cross validation | Accuracy = 77.17% | 2019 | Baggio et al., |
| Sensitivity = 83.33% | |||||||
| Specificity = 74.19% | |||||||
| Classification of PD from HC | Diagnosis | Collected from participants | 94; 50 HC + 44 PD | CNN with 85 subjects for training and 9 for testing | Training accuracy = 95.24% | 2019 | Banerjee et al., |
| Testing accuracy = 88.88% | |||||||
| Classification of PD from HC | Diagnosis | Collected from participants | 47; 26 HC + 21 PD | SVM-linear with leave-one-out cross validation | Accuracy = 93.62% | 2015 | Chen et al., |
| Sensitivity = 90.47% | |||||||
| Specificity = 96.15% | |||||||
| Classification of PD from PSP | Differential diagnosis | Collected from participants | 78; 57 PD + 21 PSP | SVM with leave-one-out cross validation | Accuracy = 100% | 2013 | Cherubini et al., |
| Sensitivity = 1 | |||||||
| Specificity = 1 | |||||||
| Classification of PD, MSA, PSP and HC | Diagnosis and differential diagnosis | Collected from participants | 106; 36 HC + 35 PD + 16 MSA + 19 PSP | Elastic Net regularized logistic regression with nested 10-fold cross validation | HC vs. PD/MSA-P/PSP: | 2017 | Du et al., |
| AUC = 0.88 | |||||||
| Sensitivity = 0.80 | |||||||
| Specificity = 0.83 | |||||||
| PPV = 0.82 | |||||||
| NPV = 0.81 | |||||||
| HC vs. PD: | |||||||
| AUC = 0.91 | |||||||
| Sensitivity = 0.86 | |||||||
| Specificity = 0.80 | |||||||
| PPV = 0.82 | |||||||
| NPV = 0.89 | |||||||
| PD vs. MSA/PSP: | |||||||
| AUC = 0.94 | |||||||
| Sensitivity = 0.86 | |||||||
| Specificity = 0.87 | |||||||
| PPV = 0.88 | |||||||
| NPV = 0.84 | |||||||
| PD vs. MSA: | |||||||
| AUC = 0.99 | |||||||
| Sensitivity = 0.97 | |||||||
| Specificity = 1.00 | |||||||
| PPV = 1.00 | |||||||
| NPV = 0.93 | |||||||
| PD vs. PSP: | |||||||
| AUC = 0.99 | |||||||
| Sensitivity = 0.97 | |||||||
| Specificity = 1.00 | |||||||
| PPV = 1.00 | |||||||
| NPV = 0.94 | |||||||
| MSA vs. PSP: | |||||||
| AUC = 0.98 | |||||||
| Sensitivity = 0.94 | |||||||
| Specificity = 1.00 | |||||||
| PPV = 1.00 | |||||||
| NPV = 0.93 | |||||||
| Classification of HC, PD, MSA and PSP | Diagnosis and differential diagnosis | Collected from participants | 64; 22 HC + 21 PD + 11 MSA + 10 PSP | SVM-linear with leave-one-out cross validation | PD vs. HC: | 2011 | Focke et al., |
| Accuracy = 41.86% | |||||||
| Sensitivity = 38.10% | |||||||
| Specificity = 45.45% | |||||||
| PD vs. MSA: | |||||||
| Accuracy = 71.87% | |||||||
| Sensitivity = 36.36% | |||||||
| Specificity = 90.48% | |||||||
| PD vs. PSP: | |||||||
| Accuracy = 96.77% | |||||||
| Sensitivity = 90% | |||||||
| Specificity = 100% | |||||||
| MSA vs. PSP: | |||||||
| Accuracy = 76.19% | |||||||
| MSA vs. HC: | |||||||
| Accuracy = 78.78% | |||||||
| Sensitivity = 54.55% | |||||||
| Specificity = 90.91% | |||||||
| PSP vs. HC: | |||||||
| Accuracy = 93.75% | |||||||
| Sensitivity = 90.00% | |||||||
| Specificity = 95.45% | |||||||
| Classification of PD and atypical PD | Differential diagnosis | Collected from participants | 40; 17 PD + 23 atypical PD | SVM-RBF with 10-fold cross-validation | Accuracy = 97.50% | 2012 | Haller et al., |
| TPR = 0.94 | |||||||
| FPR = 0.00 | |||||||
| TNR = 1.00 | |||||||
| FNR = 0.06 | |||||||
| Classification of PD and other forms of Parkinsonism | Differential diagnosis | Collected from participants | 36; 16 PD + 20 other Parkinsonism | SVM-RBF with 10-fold cross validation | Accuracy = 86.92% | 2012 | Haller et al., |
| TP = 0.87 | |||||||
| FP = 0.14 | |||||||
| TN = 0.87 | |||||||
| FN = 0.13 | |||||||
| Classification of HC, PD, PSP, MSA-C and MSA-P | Diagnosis and differential diagnosis | Collected from participants | 464; 73 HC + 204 PD + 106 PSP + 21 MSA-C + 60 MSA-P | SVM-RBF with 10-fold cross validation | PD vs. HC: | 2016 | Huppertz et al., |
| Sensitivity = 65.2% | |||||||
| Specificity = 67.1% | |||||||
| Accuracy = 65.7% | |||||||
| PD vs. PSP: | |||||||
| Sensitivity = 82.5% | |||||||
| Specificity = 86.8% | |||||||
| Accuracy = 85.3% | |||||||
| PD vs. MSA-C: | |||||||
| Sensitivity = 76.2% | |||||||
| Specificity = 96.1% | |||||||
| Accuracy = 94.2% | |||||||
| PD vs. MSA-P: | |||||||
| Sensitivity = 86.7% | |||||||
| Specificity = 92.2% | |||||||
| Accuracy = 90.5% | |||||||
| Classification of PD from HC | Diagnosis | Collected from participants | 42; 21 HC + 21 PD | SVM-linear with stratified 10-fold cross validation | Accuracy = 78.33% | 2017 | Kamagata et al., |
| Precision = 85.00% | |||||||
| Recall = 81.67% | |||||||
| AUC = 85.28% | |||||||
| Classification of PD, PSP, MSA-P and HC | Diagnosis and differential diagnosis | Collected from participants | 419; 142 HC + 125 PD + 98 PSP + 54 MSA-P | CNN with train-validation ratio of 85:15 | PD: | 2019 | Kiryu et al., |
| Sensitivity = 94.4% | |||||||
| Specificity = 97.8% | |||||||
| Accuracy = 96.8% | |||||||
| AUC = 0.995 | |||||||
| PSP: | |||||||
| Sensitivity = 84.6% | |||||||
| Specificity = 96.0% | |||||||
| Accuracy = 93.7% | |||||||
| AUC = 0.982 | |||||||
| MSA-P: | |||||||
| Sensitivity = 77.8% | |||||||
| Specificity = 98.1% | |||||||
| Accuracy = 95.2% | |||||||
| AUC = 0.990 | |||||||
| HC: | |||||||
| Sensitivity = 100.0% | |||||||
| Specificity = 97.5% | |||||||
| Accuracy = 98.4% | |||||||
| AUC = 1.000 | |||||||
| Classification of PD from HC | Diagnosis | Collected from participants | 65; 31 HC + 34 PD | FCP with 36 out of the 65 subjects as the training set | AUC = 0.997 | 2016 | Liu H. et al., |
| Classification of PD, PSP, MSA-C and MSA-P | Differential diagnosis | Collected from participants | 85; 47 PD + 22 PSP + 9 MSA-C + 7 MSA-P | SVM-linear with leave-one-out cross validation | 4-class classification (MSA-C vs. MSA-P vs. PSP vs. PD) accuracy = 88% | 2017 | Morisi et al., |
| Classification of PD from HC | Diagnosis | Collected from participants | 89; 47 HC + 42 PD | Boosted logistic regression with nested cross-validation | Accuracy = 76.2% | 2019 | Rubbert et al., |
| Sensitivity = 81% | |||||||
| Specificity = 72.7% | |||||||
| Classification of PD, PSP and HC | Diagnosis and differential diagnosis | Collected from participants | 84; 28 HC + 28 PSP + 28 PD | SVM-linear with leave-one-out cross validation | PD vs. HC: | 2014 | Salvatore et al., |
| Accuracy = 85.8% | |||||||
| Specificity = 86.0% | |||||||
| Sensitivity = 86.0% | |||||||
| PSP vs. HC: | |||||||
| Accuracy = 89.1% | |||||||
| Specificity = 89.1% | |||||||
| Sensitivity = 89.5% | |||||||
| PSP vs. PD: | |||||||
| Accuracy = 88.9% | |||||||
| Specificity = 88.5% | |||||||
| Sensitivity = 89.5% | |||||||
| Classification of PD, APS (MSA, PSP) and HC | Diagnosis and differential diagnosis | Collected from participants | 100; 35 HC + 45 PD + 20 APS | CNN-DL, CR-ML, RA-ML with 5-fold cross-validation | PD vs. HC with CNN-DL: | 2019 | Shinde et al., |
| Test accuracy = 80.0% | |||||||
| Test sensitivity = 0.86 | |||||||
| Test specificity = 0.70 | |||||||
| Test AUC = 0.913 | |||||||
| PD vs. APS with CNN-DL: | |||||||
| Test accuracy = 85.7% | |||||||
| Test sensitivity = 1.00 | |||||||
| Test specificity = 0.50 | |||||||
| Test AUC = 0.911 | |||||||
| Classification of PD from HC | Diagnosis | Collected from participants | 101; 50 HC + 51 PD | SVM-RBF with leave-one-out cross validation | Sensitivity = 92% | 2017 | Tang et al., |
| Classification of PD from HC | Diagnosis | Collected from participants | 85; 40 HC + 45 PD | SVM-linear with leave-one-out, 5-fold, 0.632-fold (1-1/e), 2-fold cross validation | Accuracy = 97.7% | 2016 | Zeng et al., |
| Classification of PD from HC | Diagnosis | PPMI database | 543; 169 HC + 374 PD | RLDA with JFSS with 10-fold cross validation | Accuracy = 81.9% | 2016 | Adeli et al., |
| Classification of PD from HC | Diagnosis | PPMI database | 543; 169 HC + 374 PD | RFS-LDA with 10-fold cross validation | Accuracy = 79.8% | 2019 | Adeli et al., |
| Classification of PD from HC | Diagnosis | PPMI database | 543; 169 HC + 374 PD | Random forest (for feature selection and clinical score); SVM with 10-fold stratified cross validation | Accuracy = 0.93 | 2018 | Amoroso et al., |
| AUC = 0.97 | |||||||
| Sensitivity = 0.93 | |||||||
| Specificity = 0.92 | |||||||
| Classification of PD, HC and prodromal | Diagnosis | PPMI database | 906; 203 HC + 66 prodromal + 637 PD | MLP, XgBoost, random forest, SVM with 5-fold cross validation | MLP: | 2020 | Chakraborty et al., |
| Accuracy = 95.3% | |||||||
| Recall = 95.41% | |||||||
| Precision = 97.28% | |||||||
| F1-score = 94% | |||||||
| Classification of PD from HC | Diagnosis | PPMI database | Dataset 1: 15; 6 HC + 9 PD | SVM with leave-one-out cross validation | Dataset 1: | 2014 | Chen et al., |
| EER = 87% | |||||||
| Dataset 2: 39; 21 HC + 18 PD | Accuracy = 80% | ||||||
| AUC = 0.907 | |||||||
| Dataset 2: | |||||||
| EER = 73% | |||||||
| Accuracy = 68% | |||||||
| AUC = 0.780 | |||||||
| Classification of PD from HC | Diagnosis | PPMI database | 80; 40 HC + 40 PD | Naïve Bayes, SVM-RBF with 10-fold cross validation | SVM: | 2019 | Cigdem et al., |
| Accuracy = 87.50% | |||||||
| Sensitivity = 85.00% | |||||||
| Specificity = 90.00% | |||||||
| AUC = 90.00% | |||||||
| Classification of PD from HC | Diagnosis | PPMI database | 37; 18 HC + 19 PD | SVM-linear with leave-one-out cross validation | Accuracy = 94.59% | 2017 | Kazeminejad et al., |
| Classification of PD, HC and SWEDD | Diagnosis and subtyping | PPMI database | 238; 62 HC + 142 PD + 34 SWEDD | Joint learning with 10-fold cross validation | HC vs. PD: | 2018 | Lei et al., |
| Accuracy = 91.12% | |||||||
| AUC = 94.88% | |||||||
| HC vs. SWEDD: | |||||||
| Accuracy = 94.89% | |||||||
| AUC = 97.80% | |||||||
| PD vs. SWEDD: | |||||||
| accuracy = 92.12% | |||||||
| AUC = 93.82% | |||||||
| Classification of PD and SWEDD from HC | Diagnosis | PPMI database | Baseline: 238; 62 HC + 142 PD + 34 SWEDD12 months: 186; 54 HC + 123 PD + 9 SWEDD | SSAE with 10-fold cross validation | HC vs. PD: | 2019 | Li et al., |
| Classification of PD from HC | Diagnosis | PPMI database | 112; 56 HC + 56 PD | RLDA with 8-fold cross validation | Accuracy = 70.5% | 2016 | Liu L. et al., |
| AUC = 71.1 | |||||||
| Classification of PD from HC | Diagnosis | PPMI database | 60; 30 HC + 30 PD | SVM, ELM with train-test ratio of 80:20 | ELM: | 2016 | Pahuja and Nagabhushan, |
| Training accuracy = 94.87% | |||||||
| Testing accuracy = 90.97% | |||||||
| Sensitivity = 0.9245 | |||||||
| Specificity = 0.9730 | |||||||
| Classification of PD from HC | Diagnosis | PPMI database | 172; 103 HC + 69 PD | Multi-kernel SVM with 10-fold cross validation | 2017 | Peng et al., | |
| Accuracy = 85.78% | |||||||
| Specificity = 87.79% | |||||||
| Sensitivity = 87.64% | |||||||
| AUC = 0.8363 | |||||||
| Classification of PD from HC | Diagnosis and subtyping | PPMI database | 109; 32 HC + 77 PD (55 PD-NC + 22 PD-MCI) | SVM with 2-fold cross validation | PD vs. HC: | 2016 | Peng et al., |
| Accuracy = 92.35% | |||||||
| Sensitivity = 0.9035 | |||||||
| Specificity = 0.9431 | |||||||
| AUC = 0.9744 | |||||||
| PD-MCI vs. HC: | |||||||
| Accuracy = 83.91% | |||||||
| Sensitivity = 0.8355 | |||||||
| Specificity = 0.8587 | |||||||
| AUC = 0.9184 | |||||||
| PD-MCI vs. PD-NC: | |||||||
| Accuracy = 80.84% | |||||||
| Sensitivity = 0.7705 | |||||||
| Specificity = 0.8457 | |||||||
| AUC = 0.8677 | |||||||
| Classification of PD, HC and SWEDD | Diagnosis and subtyping | PPMI database | 831; 245 HC + 518 PD + 68 SWEDD | LSSVM-RBF with cross validation | Accuracy = 99.9% | 2015 | Singh and Samavedham, |
| Classification of PD, HC and SWEDD | Diagnosis and differential diagnosis | PPMI database | 741; 262 HC + 408 PD + 71 SWEDD | LSSVM-RBF with 10-fold cross validation | PD vs. HC accuracy = 95.37% | 2018 | Singh et al., |
| PD vs. SWEDD accuracy = 96.04% | |||||||
| SWEDD vs. HC accuracy = 93.03% | |||||||
| Classification of PD from HC | Diagnosis | PPMI database | 408; 204 HC + 204 PD | CNN (VGG and ResNet) | ResNet50 accuracy = 88.6% | 2019 | Yagis et al., |
| Classification of PD from HC | Diagnosis | PPMI database | 754; 158 HC + 596 PD | FCN, GCN with 5-fold cross validation | AUC = 95.37% | 2018 | Zhang et al., |
APS, atypical parkinsonian syndromes; AUC, area under the receiver operating characteristic (ROC) curve; CNN, convolutional neural network; CNN-DL, convolutional neural network with discriminative localization; CR-ML, contrast ratio classifier; EER, equal error rate; ELM, extreme learning machine; FCN, fully connected network; FCP, folded concave penalized (learning); FN, false negative; FNR, false negative rate; FP, false positive; FPR, false positive rate; GCN, graph convolutional network; HC, healthy control; JFSS, joint feature-sample selection; LSSVM, least-squares support vector machine; MLP, multilayer perceptron; MSA, multiple system atrophy; MSA-C, multiple system atrophy with a cerebellar syndrome; MSA-P, multiple system atrophy with a parkinsonian type; PD, Parkinson's disease; PD-MCI, PD participants who met criteria for mild cognitive impairment; PD-NC, PD participants with no indication of mild cognitive impairment; PSP, progressive supranuclear palsy; RA-ML, radiomics based classifier; ResNet, residual neural network; RFS-LDA, robust feature-sample linear discriminant analysis; RLDA, robust linear discriminant analysis; SSAE, stacked sparse auto-encoder; SVM, support vector machine; SVM-RBF, support vector machine with radial basis function kernel; SWEDD, PD with scans without evidence of dopaminergic deficit; TN, true negative; TNR, true negative rate; TP, true positive; TPR, true positive rate; XgBoost, extreme gradient boosting.
Studies that applied machine learning models to handwritten patterns, SPECT, PET, CSF, other data types and combinations of data to diagnose PD (n = 67).
| Classification of PD from HC | Diagnosis | HandPD | Handwritten patterns | 92; 18 HC + 74 PD | LDA, KNN, Gaussian naïve Bayes, decision tree, Chi2 with Adaboost with 5- or 4-fold stratified cross validation | Chi-2 with Adaboost: | 2019 | Ali et al., |
| Classification of PD (PD + SWEDD) from HC | Diagnosis | PPMI database | More than one | 388; 194 HC + 168 PD + 26 SWEDD | Ensemble method of several SVM with linear kernel with leave-one-out cross validation | Accuracy = 94.38% | 2018 | Castillo-Barnes et al., |
| Classification of PD from HC | Diagnosis | PPMI database | More than one | 586; 184 HC + 402 PD | MLP, BayesNet, random forest, boosted logistic regression with a train-test ratio of 70:30 | Boosted logistic regression: | 2016 | Challa et al., |
| Classification of tPD from rET | Differential diagnosis | Collected from participants | More than one | 30; 15 tPD + 15rET | Multi-kernel SVM with leave-one-out cross validation | Accuracy = 100% | 2014 | Cherubini et al., |
| Classfication of PD, HC and atypical PD | Diagnosis, differential diagnosis and subtyping | PPMI database and SNUH cohort | SPECT imaging data | PPMI: 701; 193 HC + 431 PD + 77 SWEDD | CNN with train-validation ratio of 90:10 | PPMI: | 2017 | Choi et al., |
| Classification of PD from HC | Diagnosis | Collected from participants | Other | 270; 120 HC + 150 PD | Random forest | Classification error = 49.6% (rs11240569) | 2019 | Cibulka et al., |
| Classification of PD from HC | Diagnosis | HandPD | Handwritten patterns | 92; 18 HC + 74 PD | Naïve Bayes, OPF, SVM with cross-validation | SVM-RBF accuracy = 85.54% | 2018 | de Souza et al., |
| Classification of PD from HC | Diagnosis | PPMI database | More than one | 1194; 816 HC + 378 PD | BoostPark | Accuracy = 0.901 | 2017 | Dhami et al., |
| Classification of PD and HC, and PD + SWEDD and HC | Diagnosis | PPMI database | More than one | 430; 127 HC + 263 PD + 40 SWEDD | AdaBoost, SVM, naïve Bayes, decision tree, KNN, K-Means with 5-fold cross validation | PD vs. HC (adaboost): | 2016 | Dinov et al., |
| Classification of PD from HC | Diagnosis | Collected from participants | CSF | Cohort 1: 160; 80 HC + 80 PD | Elastic Net and gradient boosted regression with 10-fold cross validation | Ensemble of 60 decision trees identified with gradient boosted model: | 2018 | Dos Santos et al., |
| Classification of PD from HC | Diagnosis | Collected from participants | Handwritten patterns | 75; 38 HC + 37 PD | SVM-RBF with stratified 10-fold cross-validation | Accuracy = 88.13% | 2015 | Drotár et al., |
| Classification of PD from HC | Diagnosis | Collected from participants | Handwritten patterns | 75; 38 HC + 37 PD | KNN, ensemble AdaBoost, SVM | SVM: | 2016 | Drotár et al., |
| Classification of IPD, VaP and HC | Differential diagnosis | Collected from participants | More than one | 45; 15 HC + 15 IPD + 15 VaP | MLP, DBN with 10-fold cross validation | IPD + VaP vs HC with MLP: | 2018 | Fernandes et al., |
| Classification of PD from HC | Diagnosis | Collected from participants | More than one | 75; 15 HC + 60 PD | SVM-linear, random forest with leave-one-out cross validation | SVM AUC for FDOPA + metabolomics: 0.98 | 2019 | Glaab et al., |
| Classification of PD, HC and SWEDD | Diagnosis and subtyping | PPMI database | More than one | 666; 415 HC + 189 PD + 62 SWEDD | EPNN, PNN, SVM, KNN, classification tree with train-test ratio of 90:10 | EPNN: PD vs SWEDD vs HC accuracy = 92.5% | 2015 | Hirschauer et al., |
| Classification of PD from HC and assess the severity of PD | Diagnosis | Picture Archiving and Communication System (PACS) | SPECT imaging data | 202; 6 HC + 102 mild PD + 94 severe PD | Linear regression, SVM-RBF with a train-test ratio of 50:50 | SVM-RBF: | 2019 | Hsu et al., |
| Classification of PD from VP | Differential diagnosis | Collected from participants | SPECT imaging data | 244; 164 PD + 80 VP | Logistic regression, LDA, SVM with 10-fold cross-validation | SVM: | 2014 | Huertas-Fernández et al., |
| Classification of PD from HC | Diagnosis | Collected from participants | SPECT imaging data | 208; 108 HC + 100 PD | SVM, KNN, NM with 3-fold cross validation | SVM: | 2012 | Illan et al., |
| Classification of PD from HC | Diagnosis | Collected from participants | Handwritten patterns | 72; 15 HC + 57 PD | CNN with 10-fold cross validation or leave-one-out cross validation | Accuracy = 88.89% | 2018 | Khatamino et al., |
| Classification of PD from HC | Diagnosis | Collected from participants | Other | 10; 5 HC + 5 PD | SVM with leave-one-subject-out cross validation | Sensitivity = 0.90 | 2013 | Kugler et al., |
| Classification of PD from HC | Diagnosis | UCI machine learning repository | Handwritten patterns | 72; 15 HC + 57 PD | SVM-linear, SVM-RBF, KNN with leave-one-subject-out cross validation | SVM-linear: | 2019 | İ et al., |
| Classification of PD from HC | Diagnosis | Collected postmortem | CSF | 105; 57 HC + 48 PD | SVM with 10-fold cross validation | Sensitivity = 65% | 2013 | Lewitt et al., |
| Classification of PD from HC | Diagnosis | Collected from participants | CSF | 78; 42 HC + 36 PD | Random forest and extreme gradient tree boosting with 10-fold cross validation | Extreme gradient tree boosting: | 2018 | Maass et al., |
| Classification of PD from HC or NPH | Diagnosis and differential diagnosis | Collected from participants | CSF | 157; 68 HC + 82 PD + 7 NPH | SVM with 10-fold cross validation or leave-one-out cross validation | Cohort 1, PD vs HC: | 2020 | Maass et al., |
| Classification of PD from HC | Diagnosis | PPMI database | More than one | 550; 157 HC + 342 PD + 51 SWEDD | SVM, random forest, MLP, logistic regression, KNN with nested cross-validation | Motor features, SVM: | 2018 | Mabrouk et al., |
| Classification of PD from HC | Diagnosis | PPMI database | SPECT imaging data | 642; 194 HC + 448 PD | CNN (LENET53D, ALEXNET3D) with 10-fold stratified cross-validation | ALEXNET3D: | 2018 | Martinez-Murcia et al., |
| Classification of PD from HC | Diagnosis | Collected from participants | Handwritten patterns | 75; 10 HC + 65 PD | MLP, non-linear SVM, random forest, logistic regression with stratified 10-fold cross-validation | MLP: | 2015 | Memedi et al., |
| Classification of PD from HC | Diagnosis | Parkinson's Disease Handwriting Database (PaHaW) | Handwritten patterns | 69; 36 HC + 33 PD | Random forest with stratified 7-fold cross-validation | Accuracy = 89.81% | 2018 | Mucha et al., |
| Classification of PD, MSA, PSP, CBS and HC | Differential diagnosis | Collected from participants | SPECT imaging data | 578; 208 HC + 280 PD + 21 MSA + 41 PSP + 28 CBS | SVM with 5-fold cross-validation | Accuracy = 58.4–92.9% | 2019 | Nicastro et al., |
| Classification of PD from HC | Diagnosis | Collected from participants | Handwritten patterns | 30; 15 HC + 15 PD | KNN, decision tree, random forest, SVM, AdaBoost with 3-fold cross validation | Random forest accuracy = 0.91 | 2018 | Nõmm et al., |
| Classification of HC, AD and PD | Diagnosis and differential diagnosis | The authors' institutional oct database | Other | 75; 27 HC + 28 PD + 20 AD | SVM-RBF with 2-, 5- and 10-fold cross validation | Accuracy = 87.7% | 2019 | Nunes et al., |
| Classification of idiopathic PD, atypical Parkinsonian and ET | Differential diagnosis | Collected from participants | Other | 85; 50 idiopathic PD + 26 atypical PD + 9 ET | SVM, random forest with leave-one-out cross validation | SVM accuracy = 100% | 2019 | Nuvoli et al., |
| Classification of PD from HC | Diagnosis | PPMI database | SPECT imaging data | 654; 209 HC + 445 PD | SVM-linear with leave-one-out cross validation | Accuracy = 97.86% | 2015 | Oliveira and Castelo-Branco, |
| Classification of PD from HC | Diagnosis | PPMI database | SPECT imaging data | 652; 209 HC + 443 PD | SVM-linear, KNN, logistic regression with leave-one-out cross validation | SVM-linear: | 2017 | Oliveira F. et al., |
| Classification of PD and non-PD (ET, drug-induced Parkinsonism) | Differential diagnosis | Collected from participants | SPECT imaging data | 90; 56 PD + 34 non-PD | SVM-RBF with leave-one-out or 5-fold cross validation | Accuracy = 95.6% | 2014 | Palumbo et al., |
| Classification of PD from HC | Diagnosis | Collected from participants | Handwritten patterns | 55; 18 HC + 37 PD | Naïve Bayes, OPF, SVM-RBF with 10-fold cross validation | Naïve Bayes accuracy = 78.9% | 2015 | Pereira et al., |
| Classification of PD from HC | Diagnosis | HandPD | Handwritten patterns | 92; 18 HC + 74 PD | Naïve Bayes, OPF, SVM-RBF with cross-validation | SVM-RBF recognition rate (sensitivity) = 66.72% | 2016 | Pereira et al., |
| Classification of PD from HC | Diagnosis | Extended handpd dataset with signals extracted from a smart pen | Handwritten patterns | 35; 21 HC + 14 PD | CNN with cross validation with a train:test ratio of 75:25 or 50:50 | Accuracy = 87.14% | 2016 | Pereira et al., |
| Classification of PD from HC | Diagnosis | HandPD | Handwritten patterns | 92; 18 HC + 74 PD | CNN, OPF, SVM, naïve Bayes with train-test split = 50:50 | CNN-Cifar10 accuracy = 99.30% | 2018 | Pereira et al., |
| Classification of PD from HC | Diagnosis | UCI machine learning repository | More than one | Dataset 1: 40; 20 HC + 20 PD | Random forest, KNN, SVM-RBF, ensemble method with 5-fold cross validation | Ensemble method: | 2019 | Pham et al., |
| Classification of PD from HC | Diagnosis | PPMI database | More than one | 618; 195 HC + 423 PD | SVM-linear, SVM-RBF, classification tree with a train-test ratio of 70:30 | SVM-RBF, test set: | 2014 | Prashanth et al., |
| Classification of PD from HC | Diagnosis and subtyping | PPMI database | SPECT imaging data | 715; 208 HC + 427 PD + 80 SWEDD | SVM, naïve Bayes, boosted trees, random forest with 10-fold cross validation | SVM: | 2016 | Prashanth et al., |
| Classification of PD from HC | Diagnosis | PPMI database | More than one | 584; 183 HC + 401 PD | Naïve Bayes, SVM-RBF, boosted trees, random forest with 10-fold cross validation | SVM: | 2016 | Prashanth et al., |
| Classification of PD from HC | Diagnosis | PPMI database | Other | 626; 180 HC + 446 PD | Logistic regression, random forests, boosted trees, SVM with cross validation | Accuracy > 95% | 2018 | Prashanth and Dutta Roy, |
| Classification of PD from HC | Diagnosis | mPower database | More than one | 133 out of 1,513 with complete source data; 46 HC + 87 PD | Logistic regression, random forests, DNN, CNN, Classifier Ensemble, Multi-Source Ensemble learning with stratified 10-fold cross validation | Ensemble learning: | 2019 | Prince et al., |
| Classification of PD from HC | Diagnosis | HandPD | Handwritten patterns | 35; 21 HC + 14 PD | Bidirectional Gated Recurrent Units with a train-validation-test ratio of 40:10:50 or 65:10:25 | The Spiral dataset: | 2019 | Ribeiro et al., |
| Classification of PD from HC | Diagnosis | Collected from participants | Handwritten patterns | 130; 39 elderly HC + 40 young HC + 39 PD + 6 PD (validation set) + 6 HC (validation set) | KNN, SVM-Gaussian, random forest with leave-one-out cross validation | SVM for PD vs young HC: | 2019 | Rios-Urrego et al., |
| Classification of IPD from non-IPD | Differential diagnosis | Collected from participants | PET imaging | 87; 39 IPD + 48 non-IPD (24 MSA + 24 PSP) | SVM with leave-one-out cross validation | Accuracy = 78.16% | 2015 | Segovia et al., |
| Classification of PD from HC | Diagnosis | Dataset from “Virgen de la Victoria” hospital | SPECT imaging data | 189; 94 HC + 95 PD | SVM with 10-fold cross validation | Accuracy = 94.25% | 2019 | Segovia et al., |
| Classification of PD from HC | Diagnosis | Collected from participants | Other | 486; 233 HC + 205 PD + 48 NDD | SVM-linear with leave-batch-out cross validation | Validation AUC = 0.79 | 2017 | Shamir et al., |
| Classification of PD from HC | Diagnosis | Collected from participants | PET imaging | 350; 225 HC + 125 PD | GLS-DBN with a train-validation ratio of 80:20 | Test dataset 1: | 2019 | Shen et al., |
| Classification of PD from HC | Diagnosis | Collected from participants | Other | 33; 18 HC + 15 PD | SMMKL-linear with leave-one-out cross validation | Accuracy = 84.85% | 2018 | Shi et al., |
| Classification of PD from HC | Diagnosis | Collected from participants | More than one | Plasma samples: 156; 76 HC + 80 PD; | PLS, random forest with 10-fold cross validation with train-test ratio of 70:30 | PLS: | 2018 | Stoessel et al., |
| Classification of PD from HC | Diagnosis | PPMI database | SPECT imaging data | 658; 210 HC + 448 PD | Logistic Lasso with 10-fold cross validation | Test errors: | 2017 | Tagare et al., |
| Classification of PD from HC | Diagnosis | PDMultiMC | handwritten patterns | 42; 21 HC + 21 PD | CNN, CNN-BLSTM with stratified 3-fold cross validation | CNN: | 2019 | Taleb et al., |
| Classification of PD from HC | Diagnosis | PPMI database and local database | SPECT imaging data | Local: 304; 113 Non-PDD + 191 PD | SVM with stratified, nested 10-fold cross-validation | Local data: | 2017 | Taylor and Fenner, |
| Classification of PD from HC | Diagnosis | Collected from participants | CSF | 87; 43 HC + 44 PD | Logistic regression | Sensitivity = 0.797 | 2017 | Trezzi et al., |
| Classification of PD from HC | Diagnosis | Collected from participants | Other | 38; 24 HC + 14 PD | SVM-RFE with repeated leave-one-out bootstrap validation | Accuracy = 89.6% | 2013 | Tseng et al., |
| Classification of MSA and PD | Differential diagnosis | Collected from participants | More than one | 85; 25 HC + 30 PD + 30 MSA-P | NN | AUC = 0.775 | 2019 | Tsuda et al., |
| Classification of PD from HC | Diagnosis | Collected from participants | Other | 59; 30 HC + 29 PD | Logistic regression, decision tree, extra tree | Extra tree AUC = 0.99422 | 2018 | Vanegas et al., |
| Classification of PD from HC | Diagnosis | Commercially sourced | Other | 30; 15 HC + 15 PD | Decision tree | Cross validation score = 0.86 (male) | 2019 | Váradi et al., |
| Classification of PD from HC | Diagnosis | Collected from participants | More than one | 84; 40 HC + 44 PD | CNN with train-validation-test ratio of 80:10:10 | Accuracy = 97.6% | 2018 | Vásquez-Correa et al., |
| Classification of PD and Parkinsonism | Differential diagnosis | The NTUA Parkinson Dataset | More than one | 78; 55 PD + 23 Parkinsonism | MTL with DNN | Accuracy = 0.91 | 2018 | Vlachostergiou et al., |
| Classification of PD from HC | Diagnosis | PPMI database | More than one | 534; 165 HC + 369 PD | pGTL with 10-fold cross validation | Accuracy = 97.4% | 2017 | Wang et al., |
| Classification of PD from HC | Diagnosis | PPMI database | SPECT imaging data | 645; 207 HC + 438 PD | CNN with train-validation-test ratio of 60:20:20 | Accuracy = 0.972 | 2019 | Wenzel et al., |
| Classification of PD from HC | Diagnosis | Collected from participants | PET imaging | Cohort 1: 182; 91 HC + 91 PD | SVM-linear, SVM-sigmoid, SVM-RBF with 5-fold cross validation | Cohort 1: | 2019 | Wu et al., |
| Classification of PD, MSA and PSP | Differential diagnosis | Collected from participants | PET imaging | 920; 502 PD + 239 MSA + 179 PSP | 3D residual CNN with 6-fold cross validation | Classification of PD: | 2019 | Zhao et al., |
AD, Alzheimer's disease; AUC or AUC-ROC, area under the receiver operating characteristic (ROC) curve; AUC-PR, area under the precision-recall (PR) curve; BLSTM, bidirectional long short-term memory; CBS, corticobasal syndrome; CNN, convolutional neural network; CSF, cerebrospinal fluid; DBN, deep belief network; DNN, deep neural network; EPNN, enhanced probabilistic neural network; ET, essential tremor; FN, false negative; FP, false positive; GLS-DBN, group Lasso sparse deep belief network; HC, healthy control; IPD, idiopathic Parkinson's disease; KNN, k-nearest neighbors; LDA, linear discriminant analysis; LOR, log odds ratio; MCC, Matthews correlation coefficient; MLP, multilayer perceptron; MSA, multiple system atrophy; MSA-P, Parkinson's variant of multiple system atrophy; MTL, multi-task learning; NDD, neurodegenerative disease; NM, nearest mean; non-PDD, patients without pre-synaptic dopaminergic deficit; NPH, normal pressure hydrocephalus; NPV, negative predictive value; OPF, optimum-path forest; PD, Parkinson's disease; PET, positron emission tomography; pGTL, progressive graph-based transductive learning; PLS, partial least square; PNN, probabilistic neural network; PPV, positive predictive value; PSP, progressive supranuclear palsy; rET, essential tremor with rest tremor; SMMKL, soft margin multiple kernel learning; SPECT, single-photon emission computed tomography; SVM, support vector machine; SVM-RBF, support vector machine with radial basis function kernel; SVM-RFE, support vector machine-recursive feature elimination; SWEDD, PD with scans without evidence of dopaminergic deficit; tPD, tremor-dominant Parkinson's disease; VaP or VP, vascular Parkinsonism; YI, Youden's Index.