| Literature DB >> 36010353 |
Arti Rana1, Ankur Dumka2, Rajesh Singh3,4, Manoj Kumar Panda5, Neeraj Priyadarshi6, Bhekisipho Twala7.
Abstract
Parkinson's disease (PD) is a neurodegenerative disease that affects the neural, behavioral, and physiological systems of the brain. This disease is also known as tremor. The common symptoms of this disease are a slowness of movement known as 'bradykinesia', loss of automatic movements, speech/writing changes, and difficulty with walking at early stages. To solve these issues and to enhance the diagnostic process of PD, machine learning (ML) algorithms have been implemented for the categorization of subjective disease and healthy controls (HC) with comparable medical appearances. To provide a far-reaching outline of data modalities and artificial intelligence techniques that have been utilized in the analysis and diagnosis of PD, we conducted a literature analysis of research papers published up until 2022. A total of 112 research papers were included in this study, with an examination of their targets, data sources and different types of datasets, ML algorithms, and associated outcomes. The results showed that ML approaches and new biomarkers have a lot of promise for being used in clinical decision-making, resulting in a more systematic and informed diagnosis of PD. In this study, some major challenges were addressed along with a future recommendation.Entities:
Keywords: Parkinson’s disease; artificial neural network; classification; logistic regression; machine learning; support vector machine
Year: 2022 PMID: 36010353 PMCID: PMC9407112 DOI: 10.3390/diagnostics12082003
Source DB: PubMed Journal: Diagnostics (Basel) ISSN: 2075-4418
Figure 1A number of articles cited between 1996 and 2022.
Figure 2Comparative analysis of machine learning algorithms used to diagnose Parkinson’s disease w.r.t. accuracy rate. (a) Accuracy rate of detecting Parkinson’s disease based on speech feature (2015–2020). (b) Accuracy rate of detecting Parkinson’s disease based on handwritten pattern features (2015–2020).
Figure 3Symptoms of Parkinson’s disease.
Figure 4Machine learning algorithm used to diagnose Parkinson’s disease.
Comparative studies of machine learning approaches in speech recording to diagnose PD.
| Reference | Machine | Objective | Tools Used | Source of Data | No. of | Outcomes |
|---|---|---|---|---|---|---|
| Benba, A. et al., 2015 [ | Linear kernel SVM | Classification of PD from HC | Not mentioned | Department of Neurology Cerrahpas‚ a Faculty of Medicine, Istanbul University | 34, 17 PD + 17 HC | Classification |
| Mathur, R. et al., 2019 [ | ANN, KNN with K-fold cross validation; K = 10 | Classification of PD from HC | Weka | UCI machine learning repository | 195 instances, 24 attributes | Accuracy of: KNN with Adaboosta.M1—91.28% |
| Sakar et al., 2019 [ | Naïve Bayes, Logistic regression, SVM (RBF and Linear), KNN, random Forest, MLP | Classification of PD from HC | JupyterLab with python programming language | Collected from participants | 252, 188 PD + 64 HC | Highest accuracy obtained from SVM (RBF)—86% |
| Yasar, A. et al., 2019 [ | Artificial Neural Network | Classification of PD from HC | MATLAB | Collected from participants | 80, 40 PD + 40 HC | Accuracy of ANN—94.93% |
| Almeida, J.S. et al., 2019 [ | KNN, MLP, Optimum Path Forest (OPF), SVM with RBF, Linear and Polynomial kernel | Classification of PD from HC | OpenCV-2.49 | UCI machine learning | 98, 63 PD + 35 HC | acoustic cardioid (AC) accuracy—94.55% |
| Alqahtani, E.J. et al., 2018 [ | NNge and ensemble algorithm, AdaBoostM1 with 10- fold cross validation | Classification of PD from HC | Weka | Collected from participants | 31, 23 PD + 8 HC | Accuracy—96.30% |
| Avuçlu, E., Elen, A., 2020 [ | KNN, random forest, naïve Bayes, SVM | Classification of PD from HC | JupyterLab with python programming language | UCI machine learning | 31, 23 PD + 8 HC | Highest accuracy achieved from SVM—88.72% and lowest accuracy from naïve Bayes—70.26% |
| Zehra Karapinar, 2020 [ | CART, ANN, SVM | Classification of PD from HC | Weka | Collected from participants | 31, 23 PD + 8 HC | Highest accuracy from SVM—93.84% |
| Yaman, O. et al., 2019 [ | SVM, KNN | Classification of PD from HC | MATLAB | Collected from participants | 31, 23 PD + 8 HC | Accuracy rate of SVM—91.25% and KNN—91.23% |
| Aich, S. et al., 2019 [ | Random forest, Bagging CART, SVM, Boosted C5.0 | Classification of PD from HC | Not mentioned | Collected from participants | 31, 23 PD + 8 HC | Highest accuracy obtained from SVM with RBF kernel—97.57% |
| Haq, A.U. et al., 2019 [ | L1-Norm SVM with K- fold cross validation; K = 10 | Classification of PD from HC | Python | University of Oxford (UO) | 31, 23 PD + 8 HC | Accuracy rate—99% |
| Wu et al., 2017 [ | Generalized Logistic Regression Analysis (GLRA), SVM, Bagging ensemble | Classification of PD from HC | Not mentioned | Collected from participants | 31, 23 PD + 8 Healthy control (HC) | Optimal result obtained from bagging ensemble; sensitivity—97.96%, |
| Peker, 2016 [ | SVM with RBF kernel | Classification of PD from HC | Weka | University of Oxford (UO) | 31, 23 PD + 8 HC | Accuracy—98.95% |
| Montaña et al., 2018 [ | SVM with k-fold cross validation; k = 10 | Classification of PD from HC | Weka | UCI machine learning | 54, 27 PD + 27 HC | Accuracy—94.4% |
| Kuresan et al., 2019 [ | Hidden Markov Models (HMM), SVM | Classification of PD from HC | MATLAB | Collected from participants | 40, 20 PD + 20 HC | Highest accuracy obtained from HMM with accuracy—95.16%, |
| Marar et al., 2018 [ | Naïve Bayes, ANN, KNN, random forest, SVM, logistic regression, decision tree (DT) | Classification of PD from HC | R programming | Collected from participants | 31, 23 PD + 8 HC | Highest accuracy obtained from ANN—94.87% |
| Sheibani, R. et al., 2019 [ | Ensemble-based method | Classification of PD from HC | JupyterLab with python programming language | UCI machine learning | 31, 23 PD + 8 HC | Accuracy obtained from ensemble learning—90.6%, |
| Moharkan et al., 2017 [ | KNN | Classification of PD from HC | Python | Collected from participants | 31, 23 PD + 8 HC | Accuracy obtained from KNN—90%, |
| Sztahó, D. et al., 2019 [ | ANN, KNN, SVM with RBF and linear kernel, DNN | Classification of PD from HC | Not mentioned | UCI machine learning | 88, 55 PD + 33 HC | Highest accuracy obtained from SVM with RBF kernel—89.3%, sensitivity—90.2%, specificity—87.9% |
| Tracy, J.M. et al., 2020 [ | Logistic regression (L2- Regularized), random forest, Gradient Boosted trees | Classification of PD from HC | Python | mPower | 2289, 246 PD + 2023 HC | Highest accuracy obtained from gradient boosted trees recall—79.7%, precision—90.1%, F1-score—83.6% |
Comparative studies of machine learning approaches in handwritten patterns to diagnose PD.
| Reference | Machine | Objective | Tools Used | Source of Data | No. of | Outcomes |
|---|---|---|---|---|---|---|
| Taylor, J.C. and Fenner, 2017 [ | SVM with 10-fold cross-validation | Classification of PD from HC | MATLAB | PPMI and local database | PPMI: 657, 448 PD + 209 HC and local: 304,191 PD + 113 HC | Local data: Accuracy for local data range between 88 to 92% and for PPMI range from 95 to 97% |
| Oliveira et al., 2017 [ | SVM with linear kernel, logistic regression with LOOCV, KNN | Classification of PD from HC | C++ Programming language and MATLAB R2014a | PPMI database | 652, 443 PD + 209 HC | SVM (linear kernel) with highest accuracy rate—97.9% |
| de Souza et al., 2018 [ | OPF, naïve Bayes, SVM (RBF) with cross validation | Classification of PD from HC | Python | HandPD | 92, 74 PD + 18 HC | Highest accuracy obtained from SVM with RBF kernel—85.54% |
| Drotár et al., 2016 [ | SVM, KNN, | Classification of PD from HC | MATLAB | PaHaW | 75, 37 PD + 38 HC | Highest Accuracy obtained from SVM—81.3% with specificity—80.9% and sensitivity—87.4% |
| Hsu, S.-Y. et al., 2019 [ | SVM with RBF kernel, logistic regression | Classification of PD from HC | Weka | PACS | 202, 94 Severe PD + 102 mild PD + 6 HC | Highest accuracy obtained from SVM-RBF 83.2%, having sensitivity 82.8%, |
| Khatamino et al., 2018 [ | Convolutional Neural Network (CNN) | Classification of PD from HC | Python | Collected from participants | 72, 57 PD + 15 HC | Accuracy—88.89% |
| Kurt, İ.et al., 2019 [ | SVM (linear and RBF kernel), KNN | Classification of PD from HC | Not mentioned | UCI machine learning | 72, 57 PD + 15 HC | Highest accuracy obtained from SVM (linear)—97.52%. |
| Mabrouk et al., 2019 [ | Random forest, SVM, MLP, KNN | Classification of PD from HC | Not mentioned | PPMI Database | 550, 342 PD + 157 HC + 51 Scan without evidence of dopaminergic deficit (SWEDD) | For motor features, highest accuracy obtained from SVM—78.4%, and for non-motor features, highest accuracy obtained |
| Fabian Maass et al., 2020 [ | SVM | Classification of PD from HC | Weka | UCI machine learning | 157, 82 PD + 68 HC +7 Normal Pressure Hydrocephalus (NPH) | Sensitivity—80%, and specificity—83% |
| Mucha, J. et al., 2018 [ | Random forest classifier | Classification of PD from HC | Python | PaHaW | 69, 33 PD + 36 HC | Obtained classification accuracy—90% with sensitivity 89%, and |
| Cibulka et al., 2019 [ | Random forest | Classification of PD from HC | Not mentioned | Collected from participants | 270, 150 PD + 120 HC | Classification error for rs11240569, rs708727, rs823156 is 49.6%, 44.8%, 49.3%, respectively. |
| Pereira, C.R. et al., 2016 [ | CNN with cross validation | Classification of PD from HC | Not mentioned | Collected from participants | 35, 14 PD + 21 HC | Accuracy rate of CNN—87.14% |
| Prashanth, R. et al., 2016 [ | Naïve Bayes, random forest SVM, boosted trees | Classification of PD from HC | MATLAB | PPMI database | 584, 401 PD + 183 HC | Highest accuracy obtained from SVM with RBF kernel—96.40% having sensitivity 97.03% and |
| Shi, et al., 2018 [ | Soft margin multiple kernel learning (SMMKL) with LOOCV | Classification of PD from HC | Not mentioned | PPMI database | 33, 15 PD + 18 HC | Accuracy rate—84.85% with sensitivity 80% and specificity 88.89% |
| Trezzi, J. P et al., 2017 [ | Logistic | Classification of PD from HC | Not mentioned | UCI machine learning | 87, 44 PD + 43 HC | Sensitivity 79.7% and specificity 80% |
| Wenzel et al., 2019 [ | CNN | Classification of PD from HC | MATLAB | PPMI database | 645, 438 PD + 207 HC | Accuracy—97.2% |
| Segovia, F. et al., 2019 [ | SVM with 10 cross validation | Classification of PD from HC | Python | Virgen De La Victoria | 189, 95 PD + 94 HC | Accuracy—94.25% |
| Memedi, M. et al., 2015 [ | Random forest, logistic regression, MLP and non-linear SVM | Classification of PD from HC | Weka | PPMI database | 75, 65 PD + 10 HC | Highest accuracy obtained from MLP—84% having sensitivity—75.7% and specificity—88.9% |
| Nõmm, S. et al., 2018 [ | Random forest, decision tree, KNN, AdaBoost, SVM | Classification of PD from HC | Python programming (Scikit–Learn Library) | Collected from participants | 30, 15 PD + 15 HC | Highest accuracy obtained from Random forest—91% |
| Challa et al., 2016 [ | MLP, BayesNet, boosted logistic regression, | Classification of PD from HC | Weka | Parkinson’s Progression Markers Initiative (PPMI) | 586, 402 PD + 184 HC | Optimal result obtained from boosted logistic |
Figure 5Proposed methodology to diagnose Parkinson’s disease by [87].
Figure 6Proposed methodology to diagnose Parkinson’s disease using handwriting in a spiral format by [68].
Comparative studies of machine learning approaches in gait dataset to diagnose PD.
| Reference | Machine | Objective | Tools Used | Source of Data | No. of | Outcomes |
|---|---|---|---|---|---|---|
| Ye, Q. et al., 2018 [ | Least square (LS)—SVM, particle swarm | Classification of PD, ALS, HD from HC | Not mentioned | Neurology Outpatient Clinic at Massachusetts | 64, 15 PD + 16 HC + 13 (Amyotrophic lateral | Accuracy to diagnose PD from |
| Wahid, F. et al., 2015 [ | Random forest, SVM, kernel Fisher | Classification of PD from HC | MATLAB R2013b | Collected from participants | 49, 23 PD + 26 HC | The accuracy obtained from random forest, SVM, and KFD was 92.6%, 80.4% and 86.2%, respectively. |
| Pham, T.D.and Yan, H., 2018 [ | LS-SVM | Classification of PD from HC | MATLAB | Laboratory for Gait and Neurodynamics | 166, 93 PD + 73 HC | Sensitivity—100% and |
| Y. Mittra and V. Rustagi, 2018 [ | Logistic regression, decision tree, SVM | Classification of PD from HC | Not mentioned | Collected from | 49, 23 PD + 26 HC | Highest accuracy obtained from SVM (RBF) and random |
| Klomsae, A. et al., 2018 [ | Fuzzy KNN | Classification of PD, ALS, HD from HC | Not mentioned | Neurology Outpatient Clinic at Massachusetts General Hospital, Boston, MA, USA [ | 64, 15 PD + 20 HD + 13 ALS + 16 HC | Accuracy to diagnose PD from HC—96.43%, accuracy to diagnose HD from HC—97.22%, accuracy to diagnose ALS from HC—96.88% |
| Milica et al., 2017 [ | SVM-RBF | Classification of PD from HC | Python | Collected from participants from Institute of Neurology CCS, School of Medicine, University of Belgrade | 80, 40 PD + 40 HC | Overall accuracy from SVM-RBF—85% |
| Cuzzolin, F. et al., 2017 [ | HMM | Classification of PD from HC | Not mentioned | Collected from participants | 424, 156 PD + 268 HC | Accuracy—85.51% |
| Félix, J.P. et al., 2019 [ | SVM, KNN, | Classification of PD from HC | MATLAB R2017a | Neurology Outpatient Clinic at | 31, 15 PD + 16 HC | Highest accuracy obtained from SVM, KNN, and decision tree—96.8% |
| Baby, M.S. et al., 2017 [ | ANN | Classification of PD from HC | MATLAB | Laboratory for Gait and Neurodynamics | 166, 93 PD + 73 HC | Accuracy—86.75% |
| Andrei et al., 2019 [ | SVM | Classification of PD from HC | Not mentioned | Laboratory for Gait and Neurodynamics | 166, 93 PD + 73 HC | Accuracy—100% |
| Priya, S.J. et al., 2021 [ | ANN | Classification of PD from HC | MATLAB R2018b | Laboratory for Gait and Neurodynamics | 166, 93 PD + 73 HC | Accuracy—96.28% |
| Perumal, S.V. & Sankar, R., 2016 [ | SVM, ANN | Classification of PD from HC | MATLAB | Laboratory for Gait and Neurodynamics | 166, 93 PD + 73 HC | Average Accuracy—86.9% |
| Nancy, Y. et al., 2016 [ | Q-Backpropagated time delay neural network (Q-BTDNN) | Classification of PD from HC | MATLAB 2013 | Laboratory for Gait and Neurodynamics | 166, 93 PD + 73 HC | Accuracy—91.49% |
| Oğul, et al., 2020 [ | ANN | Classification of PD from HC | MATLAB | Laboratory for Gait and Neurodynamics | 166, 93 PD + 73 HC | Classification |
| Li, B. et al., 2020 [ | Deep CNN | Classification of PD from HC | Not mentioned | Collected from participants | 20, 10 PD + 10 HC | Accuracy—91.9% |
| Gao, C. et al., 2018 [ | Logistic | Classification of PD from HC | Not mentioned | University of Michigan | 80, 40 PD + 40 HC | Highest accuracy obtained from random forests—79.6% |
| Rehman et al., 2019 [ | SVM, logistic regression | Classification of PD from HC | Python programming | Not mentioned | 303, 119 PD + 184 HC | Average accuracy—97% |
| Natasa et al., 2020 [ | Random forest, XGBoosting, | Classification of PD from HC | Not mentioned | Collected from the participants | 10 PD | Best performance obtained from SVM(RBF) with the sensitivity value 72.34%, 91.49%, 75.00% and specificity value 87.36%, 88.51% and 93.62%, for the FoG, transition and normal activity classes, respectively. |
Figure 7Proposed architecture of cloud and machine learning-based framework for the diagnosis of Parkinson’s disease.