| Literature DB >> 30285646 |
Hyun-Soo Choi1, Jin Yeong Choe2, Hanjoo Kim1, Ji Won Han3, Yeon Kyung Chi3, Kayoung Kim3, Jongwoo Hong3, Taehyun Kim3, Tae Hui Kim4, Sungroh Yoon5, Ki Woong Kim6,7,8.
Abstract
BACKGROUND: The conventional scores of the neuropsychological batteries are not fully optimized for diagnosing dementia despite their variety and abundance of information. To achieve low-cost high-accuracy diagnose performance for dementia using a neuropsychological battery, a novel framework is proposed using the response profiles of 2666 cognitively normal elderly individuals and 435 dementia patients who have participated in the Korean Longitudinal Study on Cognitive Aging and Dementia (KLOSCAD).Entities:
Keywords: Alzheimer disease; Data mining; Deep learning; Dementia; Neuropsychological tests
Mesh:
Year: 2018 PMID: 30285646 PMCID: PMC6171238 DOI: 10.1186/s12877-018-0915-z
Source DB: PubMed Journal: BMC Geriatr ISSN: 1471-2318 Impact factor: 3.921
Fig. 1Overall scheme. The proposed diagnostic framework includes five steps. a Data Acquisition, b Missing Data Imputation, c Design and Validation of Classifier, d Input Variable Selection, e Two-stage Classification
Characteristics of the subjects
| Controls | Dementia | Statistics | |||
|---|---|---|---|---|---|
| CDR=0.5 | CDR=1 | For | post | ||
| Number | 2666 | 189 | 246 | ||
| Age (years) | 69.54± | 75.01± | 76.61± | 174.927∗∗∗ | |
| 6.52a | 7.23b | 7.43b | |||
| Sex | 53.2 | 56.6 | 65.4 | 20.138∗∗ | |
| (female, %) | |||||
| Education | 9.57± | 8.40± | 6.61± | 30.520∗∗ | |
| (years) | 5.33a | 5.75b | 5.75c |
∗∗∗p<.001, ∗∗p<.01, Games-Howell post hoc comparisons
a, b, c: the same letters indicate homogeneous groups
Fig. 2Architecture of proposed deep neural networks for KLOSCAD-N assessment and demographic information
Classification performances on the imputed dataset indicated by the area under the receiver operator curve (AUC)
| Proposed DNNs | XGBoost | Logistic Regression | Random Forest | Adaboost | Bagging | Support Vector Machine | |
|---|---|---|---|---|---|---|---|
| MinMax | 0.9489 | 0.9506 | 0.9083 | 0.9405 | 0.9149 | 0.9334 | 0.8898 |
| kNN | 0.9603 | 0.9541 | 0.9356 | 0.9466 | 0.9444 | 0.9559 | 0.9321 |
| MI | 0.9586 | 0.9524 | 0.9312 | 0.9211 | 0.9184 | 0.9418 | 0.9347 |
| LLS | 0.9594 | 0.9471 | 0.9295 | 0.9343 | 0.9109 | 0.9339 | 0.9383 |
MinMax: minimum-maximum imputation, kNN: k nearest neighbor imputation, MI: multiple imputation, LLS: local least square imputation
Classification performances of various deep neural network architectures on Mini Mental Status Exam (MMSE) and Korean Longitudinal Study on Cognitive Aging and Dementia Neuropsychological Battery (KLOSCAD-N) indicated by the area under the receiver operator curve (AUC) via five-cross validation on train dataset
| 2D-CNN | 2D-CNN Naïve | 2D-CNN w/o SC | 1D-CNN | 1D-CNN w/o SC | FCN | FCN w/o SC | NasNet | ||
|---|---|---|---|---|---|---|---|---|---|
| MMSEa | mean | - | - | - | - | - | 0.9702 | 0.9583 | - |
| std | - | - | - | - | - | 0.0144 | 0.0139 | - | |
| KLOSCAD-N | mean | 0.9863 | 0.9850 | 0.9782 | 0.9848 | 0.9805 | 0.9830 | 0.9771 | 0.9813 |
| std | 0.0048 | 0.0058 | 0.0057 | 0.0053 | 0.0042 | 0.0060 | 0.0070 | 0.0046 |
aSince MMSE is composed with only five dimension (four demographic variables and one MMSE total-score, the other architecture are not applicable except FCN
Comparative analysis with other conventional classifiers indicated by the area under the receiver operator curve (AUC) via five-cross validation on train dataset
| Proposed DNNs | XGBoost | AdaBoost | Random Forest | Bagging | Support Vector Machine | Logistic Regression | ||
|---|---|---|---|---|---|---|---|---|
| MMSE | mean | 0.9702 | 0.9605 | 0.9573 | 0.9581 | 0.9631 | 0.9627 | 0.9642 |
| std | 0.0144 | 0.0144 | 0.0171 | 0.0192 | 0.0169 | 0.0196 | 0.0171 | |
| KLOSCAD-N | mean | 0.9863 | 0.9850 | 0.9774 | 0.9762 | 0.9724 | 0.9744 | 0.9807 |
| std | 0.0048 | 0.0065 | 0.0107 | 0.0079 | 0.0069 | 0.0093 | 0.0080 |
Comparative results of two-stage classification on test dataset
| KLOSCAD-N w/ DNNs | Proposed Two-stage Classification | MMSE w/ DNNs | KLOSCAD-N w/o DNNs | MMSE w/o DNNs | |
|---|---|---|---|---|---|
| Accuracy (%) | 92.74 | 92.90 | 87.74 | 86.13 | 84.84 |
| AUC | 0.9790 | -a | 0.9383 | 0.9349 | 0.9143 |
| F1 Score | 0.7805 | 0.7800 | 0.6667 | 0.6356 | 0.6179 |
| Sensitivity | 0.9287 | 0.9343 | 0.8780 | 0.8621 | 0.8736 |
| Specificity | 0.9195 | 0.8966 | 0.8736 | 0.8612 | 0.8443 |
| Likelihood Ratio Plus | 11.5425 | 9.0319 | 6.9446 | 6.2092 | 5.6097 |
| Likelihood Ratio Minus | 0.0775 | 0.0732 | 0.1396 | 0.1602 | 0.1498 |
| Positive Predictive Value | 0.5673 | 0.5064 | 0.4410 | 0.4136 | 0.3892 |
| Negative Predictive Value | 0.9913 | 0.9917 | 0.9844 | 0.9821 | 0.9833 |
| Pre Test Odd | 0.1136 | 0.1136 | 0.1136 | 0.1136 | 0.1136 |
| Post Test Odd | 1.3111 | 1.0259 | 0.7888 | 0.7053 | 0.6372 |
| Post Test Probability | 0.5673 | 0.5064 | 0.4410 | 0.4136 | 0.3892 |
| Costb | $111,600 | $40,030 | $6,200 | $111,600 | $6,200 |
aSince each stage provides their own probability, single AUC value can not be calculated
bTotal cost for test dataset including 620 subjects
Fig. 3Dependency on the variables. Trends of the area under the receiver operator curve (AUC) as a function of the number of variables included in order from the highest ranging variable
Top 43 variables selected for classifying dementia from normal controls
| Ranking | Variable description |
|---|---|
| 1 | Time to complete the Trail Making Test A |
| 2 | Retention index of Constructional Recall Testa |
| 3 | Age |
| 4 | Response bias index of the Word List Recognition Testb |
| 5 | Recency index of the Word List Memory Testc |
| 6 | Executive Clock Drawing Test (CLOX) 1 score |
| 7 | Consistency index of the Word List Memory Testd |
| 8 | Correct responses at the second quarter (15–30 s) in the Verbal Fluency Test |
| 9 | The number of repetitive recalls in trial 3 of the Word List Memory Test |
| 10 | Geriatric Depression Scale score |
| 11 | Cube recall score of the Constructional Recall Test |
| 12 | Clustering index of Verbal Fluency Test |
| 13 | Correct responses in the middle-frequency objects of the 15-item Boston Naming Test without cues |
| 14 | The number of correct recall in trial 2 of the Word List Memory Test |
| 15 | Digit Span Test Forward score |
| 16 | Years of education |
| 17 | Perceptual error index in the low-frequency objects of the 15-item Boston Naming Test |
| 18 | Ineffective switch index of the Verbal Fluency Test |
| 19 | Retention index of the Word List Recall Teste |
| 20 | Consistency index of the Word List Recall Testf |
| 21 | Primacy index of the Word List Memory Testg |
| 22 | Word List Recall Test score |
| 23 | Switch index of the Verbal Fluency Testh |
| 24 | The number of correct recall in trial 1 of the Word List Memory Test |
| 25 | Forward span of the Digit Span Test |
| 26 | Word List Recognition Test total score |
| 27 | Correct responses in the low-frequency objects of the 15-item Boston Naming Test with phonemic cues |
| 28 | Learning curve of the Word List Memory Testi |
| 29 | Digit Span Test Backward score |
| 30 | Correct responses at the last quarter (45–60 s) in the Verbal Fluency Test |
| 31 | Constructional Recognition Test score |
| 32 | Go-No-Go score of the Frontal Assessment Battery |
| 33 | The umber of correct recall in trial 3 of the Word List Memory Test |
| 34 | Correct responses in the high-frequency objects of the 15-item Boston Naming Test without cues |
| 35 | Correct responses at the first quarter (0–15 s) in the Verbal Fluency Test |
| 36 | ’Do not know’ responses in the low-frequency objects of the 15-item Boston Naming Test |
| 37 | The number of intrusion errors in the Word List Recall Test |
| 38 | Intersecting rectangles recall score of the Constructional Recall Test |
| 39 | Recency index in trial 1 of the Word List Memory Test |
| 40 | Correct responses at the third quarter (30–45 s) in the Verbal Fluency Test |
| 41 | Backward span of the Digit Span Test |
| 42 | Diamond recall score of the Constructional Recall Test |
| 43 | Cube score of the Constructional Praxis Test |
a(Constructional recall test score /constructional praxis test) ×100
b(False positive score −false negative score) /(false positive score+false negative score)
c(The number of recalled words among the last 3 words of the Word List Memory Test /Word List Memory Test score) ×100
dThe sum of the numbers of words consistently recalled in between trial 1, trial 2 and trial 3 of the Word List Memory Test
e(Word List Recall Test total score/trial 3 score of Word List Memory Test) ×100
f(The number of words consistently recalled in the World List Recall Test among the recalled words in the Word List Memory Test) × 100
g(The number of recalled words among the first 3 words of the Word List Memory Test /Word List Memory Test score) ×100
hThe number of switches between clusters during Verbal Fluency Test
iThe number of recalled words in trial 3 of the Word List Memory Test - the number of recalled words in trial 1 of the Word List Memory Test
Fig. 4Dependency on the sweeping first classification threshold. Two-stage classification performance trends as function of a sweeping threshold of deep neural networks (DNNs) with MMSE for the second-stage diagnosis with Korean Longitudinal Study on Cognitive Aging and Dementia Neuropsychological Battery. a Equal error rate (EER) curve on DNNs for MMSE. b Empirically estimated performance and cost on test dataset. When first-stage classification threshold values is 0.0362, cost is minimized without any loss on performance (f1 score)
Fig. 5Histogram of MMSE scores. The distribution of the MMSE scores of the test set subjects requiring only first-stage and those requiring two-stages. The two distributions are roughly divided around 25 points, but can not be clearly distinguished only by the MMSE score