| Literature DB >> 32644870 |
Daehyuk Yim1, Tae Young Yeo1, Moon Ho Park1.
Abstract
OBJECTIVE: To develop a machine learning algorithm to identify cognitive dysfunction based on neuropsychological screening test results.Entities:
Keywords: Machine learning; cognitive dysfunction; dementia; diagnostic tool; mild cognitive impairment; screening
Mesh:
Year: 2020 PMID: 32644870 PMCID: PMC7350047 DOI: 10.1177/0300060520936881
Source DB: PubMed Journal: J Int Med Res ISSN: 0300-0605 Impact factor: 1.671
Demographic and clinical characteristics.
Cognitive dysfunction | |||||
|---|---|---|---|---|---|
| MCI( | Dementia | Control( | Statistics | ||
| Age (years) | 70.43 (9.36) | 70.45 (11.32) | 69.41 (10.12) | 0.270 | |
| Sex (female) | 191 (57.4%) | 197 (57.8%) | 166 (59.1%) | χ2 = 0.20 ( | 0.906 |
| Education (years) | 7.56 (4.68) | 7.55 (5.04) | 8.33 (4.80) | 0.118 | |
Values are presented as the mean (standard deviation) or number (percentage).
The sub-scores of neuropsychological screening tests and their comparison between groups.
Cognitive dysfunction | |||||
|---|---|---|---|---|---|
| MCI( | Dementia( | Control( | Cohen’s | ||
| MMSE | |||||
| Total | 24.84 (3.75) | 18.20 (12.11) | 26.94 (2.44) | 0.192 | <0.001 |
| Time | 4.38 (0.94) | 2.47 (1.47) | 4.76 (0.53) | 0.468 | <0.001 |
| Place | 4.83 (0.44) | 3.95 (1.18) | 4.88 (0.32) | 0.240 | <0.001 |
| Memory | 2.99 (0.09) | 2.80 (0.57) | 3.00 (0.00) | 0.071 | <0.001 |
| Attention | 3.21 (1.53) | 1.51 (1.45) | 3.90 (1.18) | 0.340 | <0.001 |
| Recall | 1.65 (0.95) | 0.59 (0.80) | 2.11 (0.83) | 0.354 | <0.001 |
| Others | 7.80 (1.36) | 6.31 (1.99) | 8.28 (0.95) | 0.235 | <0.001 |
| MoCA | |||||
| Total | 19.41 (5.67) | 11.17 (5.80) | 23.93 (4.13) | 0.496 | <0.001 |
| Visuospatial | 2.92 (1.42) | 1.55 (1.04) | 3.90 (1.24) | 0.375 | <0.001 |
| Naming | 2.48 (0.86) | 1.78 (1.19) | 2.94 (0.24) | 0.225 | <0.001 |
| Attention | 4.33 (1.68) | 2.42 (1.96) | 5.20 (1.11) | 0.333 | <0.001 |
| Language | 1.72 (0.93) | 1.11 (0.90) | 2.29 (0.83) | 0.220 | <0.001 |
| Abstract | 1.18 (0.73) | 0.68 (0.73) | 1.42 (0.70) | 0.153 | <0.001 |
| Memory | 1.47 (1.45) | 0.43 (0.89) | 2.45 (1.54) | 0.278 | <0.001 |
| Orientation | 5.33 (1.02) | 3.21 (1.66) | 5.74 (0.60) | 0.462 | <0.001 |
| KDSQ | 5.00 (3.64) | 15.77 (7.52) | 2.58 (2.58) | 0.555 | <0.001 |
MMSE, Mini-Mental State Examination; MoCA, Montreal Cognitive Assessment; KDSQ, Korea Dementia Screening Questionnaire.
The performance of screening models created using different machine learning algorithms.
| Machine learning algorithms | MCI vs. control | Dementia vs. control | Cog dys vs. control |
|---|---|---|---|
| Overall accuracy (kappa) | Overall accuracy (kappa) | Overall accuracy (kappa) | |
| Screening model | 55.5% (0.051) | 55.6% (0.030) | 70.6% (0.000) |
| Logistic regression | 68.2% (0.359) | 97.2% (0.943) | 82.2% (0.551) |
| Penalized logistic regression | 72.7% (0.463) | 96.8% (0.935) | 81.9% (0.590) |
| Linear SVM | 67.8% (0.351) | 97.2% (0.943) | 80.8% (0.522) |
| Linear discriminant analysis | 70.6% (0.412) | 95.2% (0.903) | 82.7% (0.576) |
| Decision tree | 75.1% (0.499) | 97.2% (0.943) | 75.8% (0.637) |
| Radial basis function kernel SVM | 72.2% (0.436) | 97.6% (0.951) | 82.4% (0.552) |
| Random forest | 80.8% (0.618) | 97.2% (0.943) | 89.2% (0.747) |
| Gradient boosting | 93.5% (0.869) | 99.9% (1.000) | 95.5% (0.891) |
| Neural network | 76.7% (0.530) | 97.6% (0.951) | 85.6% (0.640) |
Accuracy is presented as a percentage with Cohen’s kappa in parentheses. MCI, mild cognitive impairment; Cog dys, cognitive dysfunction; SVM, support vector machine.
Figure 1.ROC curves for the screening of each cognitive dysfunction according to different machine learning methods. Comparison of the power of the ROC curve of different machine learning models in predicting (a) MCI versus control, (b) dementia versus control, and (c) cognitive dysfunction versus control. Using different line styles, the AUCs of the different machine learning models are presented as values. Superscript letters indicating the first letter of each machine learning method’s name (or second letter, in the case of LDA [D] and DT [T]) show that the AUCs of RF and GB are significantly higher than those of other machine learning methods (P<0.001). ROC, receiver operating characteristic; MCI, mild cognitive impairment; AUC, area under the ROC curve; LR, binary logistic regression; PLR, penalized binary logistic regression; lSVM, linear support vector machine; LDA, linear discriminant analysis; rSVM, radial basis function kernel support vector machine; RF, random forest; GB, gradient boosting; DT, decision tree; NN, neural network.