| Literature DB >> 32290134 |
Abstract
Because it is possible to delay the progression of dementia if it is detected and treated in an early stage, identifying mild cognitive impairment (MCI) is an important primary goal of dementia treatment. The objectives of this study were to develop a random forest-based Parkinson's disease with mild cognitive impairment (PD-MCI) prediction model considering health behaviors, environmental factors, medical history, physical functions, depression, and cognitive functions using the Parkinson's Dementia Clinical Epidemiology Data (a national survey conducted by the Korea Centers for Disease Control and Prevention) and to compare the prediction accuracy of our model with those of decision tree and multiple logistic regression models. We analyzed 96 subjects (PD-MCI = 45; Parkinson's disease with normal cognition (PD-NC) = 51 subjects). The prediction accuracy of the model was calculated using the overall accuracy, sensitivity, and specificity. Based on the random forest analysis, the major risk factors of PD-MCI were, in descending order of magnitude, Clinical Dementia Rating (CDR) sum of boxes, Untitled Parkinson's Disease Rating (UPDRS) motor score, the Korean Mini Mental State Examination (K-MMSE) total score, and the K- Korean Montreal Cognitive Assessment (K-MoCA) total score. The random forest method achieved a higher sensitivity than the decision tree model. Thus, it is advisable to develop a protocol to easily identify early stage PDD based on the PD-MCI prediction model developed in this study, in order to establish individualized monitoring to track high-risk groups.Entities:
Keywords: Parkinson’s disease with mild cognitive impairment; cognitive function; data mining; neuropsychological test; random forest
Year: 2020 PMID: 32290134 PMCID: PMC7178031 DOI: 10.3390/ijerph17072594
Source DB: PubMed Journal: Int J Environ Res Public Health ISSN: 1660-4601 Impact factor: 3.390
Figure 1Framework of study.
Measurement and definition of variables.
| Variable. | Measurement | Characteristics |
|---|---|---|
| Sociodemographic factors | Gender | Male or female |
| Education | Middle school graduate and below or high school graduate and above | |
| Mainly used hand | Left hand, right hand, or both hands | |
| Family dementia history | Yes or no | |
| Family PD history | Yes or no | |
| Pack-years | Non-smoking, 1–20, 21–40, or ≥41 pack-years | |
| Health behaviors | Coffee-drinking | Yes or no |
| Mean coffee intake per day (cups/day) | No, ≤1, 2–3, or ≥4 cups | |
| Coffee drinking period (year) | No, ≤5, 6–9, or ≥10 years | |
| Exposure to pesticide | Never, currently not exposed but exposed previously, or currently exposed to pesticide | |
| Environmental factors | Carbon monoxide poisoning | Yes or no |
| Disease history | Manganese poisoning | Yes or no |
| Traumatic brain injury | Yes or no | |
| Stroke | Yes or no | |
| Diabetes | Yes or no | |
| Hypertension | Yes or no | |
| Hyperlipidemia | Yes or no | |
| Atrial fibrillation | Yes or no | |
| Tremor | Yes or no | |
| Exercise characteristics related to PD (PD related motor signs) | Rigidity | Yes or no |
| Bradykinesia | Yes or no | |
| Postural instability | Yes or no | |
| Rapid eye movement (REM) and sleep behavior disorders (RBD) | Yes or no | |
| Sleep behavior disorders | Total score of K-MMSE | Continuous variable |
| Neuropsychological characteristics | Total score of K-MoCA | Continuous variable |
| CDR global score | ||
| CDR sum of boxes | ||
| K-IADL | ||
| Total score of UPDRS | ||
| Motor score of UPDRS | ||
| H&Y staging (Hoehn and Yahr staging) | ||
| Schwab and England ADL |
Pack-years: Cumulative amount of smoking, based on one pack of smoking per day. For example, 30 pack-years means smoking one pack of cigarettes per day for 30 years or two packs of cigarettes per day for 15 years. CDR—Clinical Dementia Rating; K-IADL—Korean Instrumental Activities of Daily Living; UDPRS—Untitled Parkinson’s Disease Rating; ADL—Schwab and England Activities of Daily Living scale.
Figure 2Ensemble classifiers that combines many single decision trees.
General characteristics of the subjects, n (%).
| Characteristics | After Match | ||
|---|---|---|---|
| PD-MCI ( | PD-NC ( | Total ( | |
| Gender | |||
| Male | 24 (53.3) | 22 (43.1) | 46 (47.9) |
| Female | 21 (46.7) | 29 (56.9) | 50 (52.1) |
| Education | |||
| Middle school graduate and below | 27 (60.0) | 32 (62.7) | 59 (61.5) |
| High school graduate and above | 18 (40.0) | 19 (37.3) | 37 (38.5) |
| Mainly used hand | |||
| Right hand | 44 (97.8) | 47 (92.2) | 91 (94.8) |
| Left hand | 1 (2.2) | 1 (2.0) | 2 (2.1) |
| Both hands | 0 | 3 (5.9) | 3 (3.1) |
| Family PD history | |||
| No | 36 (92.3) | 33 (91.7) | 69 (92.0) |
| Yes | 3 (7.7) | 3 (8.3) | 6 (8.0) |
| Family dementia history | |||
| No | 36 (94.7) | 32 (91.4) | 68 (93.2) |
| Yes | 2 (5.3) | 3 (8.6) | 5 (6.8) |
| Pack year (Smoking) | |||
| 1–20 | 6 (13.3) | 3 (5.9) | 9 (9.4) |
| 21–40 | 3 (6.7) | 2 (3.9) | 5 (5.2) |
| 41+ | 36 (80.0) | 46 (90.2) | 82 (85.4) |
| Coffee-drinking | |||
| No | 15 (33.3) | 19 (37.3) | 34 (35.4) |
| Yes | 30 (66.7) | 32 (62.7) | 57 (64.6) |
| Carbon monoxide poisoning | |||
| No | 42 (97.7) | 38 (86.4) | 80 (92.0) |
| Yes | 1 (2.3) | 6 (13.6) | 7 (8.0) |
| Traumatic brain injury | |||
| No | 40 (93.0) | 42 (95.5) | 82 (94.3) |
| Yes | 3 (7.0) | 2 (4.5) | 5 (5.7) |
| Stroke | |||
| No | 41 (95.3) | 44 (100) | 85 (97.7) |
| Yes | 2 (4.7) | 0 | 2 (2.3) |
| Diabetes | |||
| No | 36 (80.0) | 37 (74.4) | 73 (76.8) |
| Yes | 9 (20.0) | 13 (26.0) | 22 (23.2) |
| Hypertension | |||
| No | 32 (71.1) | 25 (50.0) | 57 (60.0) |
| Yes | 13 (28.9) | 25 (50.0) | 38 (40.0) |
| Hyperlipidemia | |||
| No | 41 (91.1) | 43 (86.0) | 84 (88.4) |
| Yes | 4 (8.9) | 7 (14.0) | 11 (11.6) |
| Atrial fibrillation | |||
| No | 44 (97.8) | 47 (94.0) | 91 (95.8) |
| Yes | 1 (2.2) | 3 (6.0) | 4 (4.2) |
| Tremor | |||
| No | 14 (33.3) | 8 (17.4) | 22 (25.0) |
| Yes | 28 (66.7) | 38 (82.6) | 66 (75.0) |
| Rigidity | |||
| No | 3 (7.0) | 8 (17.0) | 11 (12.2) |
| Yes | 40 (93.0) | 39 (83.0) | 79 (87.8) |
| Bradykinesia | |||
| No | 2 (4.7) | 6 (12.8) | 8 (8.9) |
| Yes | 41 (95.3) | 41 (87.2) | 82 (91.1) |
| Postural instability | |||
| No | 22 (55.0) | 28 (60.9) | 50 (58.1) |
| Yes | 18 (45.0) | 18 (39.1) | 36 (41.9) |
| REM sleep behavior disorders | |||
| No | 29 (67.4) | 27 (56.3) | 56 (61.5) |
| Yes | 14 (32.6) | 21 (43.7) | 35 (38.5) |
| Depression (GDS) | |||
| No | 22 (62.9) | 22 (75.9) | 44 (68.8) |
| Yes | 13 (37.1) | 7 (24.1) | 20 (31.3) |
| K-MMSE, mean ± SD | 25.8 ± 2.7 | 25.4 ± 4.7 | 25.6 ± 3.9 |
| K-MoCA, mean ± SD | 20.6 ± 4.0 | 20.5 ± 6.2 | 20.5 ± 5.3 |
| Global CDR score, mean ± SD | 0.5 ± 0.2 | 0.5 ± 0.6 | 0.5 ± 0.4 |
| Sum of boxes in CDR, mean ± SD | 1.4 ± 1.4 | 0.8 ± 1.3 | 1.2 ± 1.4 |
| K-IADL, mean ± SD | 1.0 ± 2.6 | 0.7 ± 1.0 | 0.8 ± 2.0 |
| Total UPDRS, mean ± SD | 34.9 ± 18.9 | 29.9 ± 13.1 | 33.0 ± 16.9 |
| Motor UPDRS, mean ± SD | 22.6 ± 11.6 | 17.9 ± 8.6 | 20.0 ± 10.3 |
| H&Y staging score, mean ± SD | 2.1 ± 0.8 | 1.8 ± 0.6 | 2.0 ± 0.7 |
| Schwab and England ADL, mean ± SD | 80.0 ± 16.0 | 87.7 ± 8.1 | 83.6 ± 13.3 |
REM sleep behavior disorders—rapid eye movement sleep behavior disorders; PD-MCI—Parkinson’s Disease with Mild Cognitive Impairment; PD-NC—Parkinson’s Disease with Normal Cognition; K-MMSE—Korean Mini Mental State Examination; K-MoCA—Korean Montreal Cognitive Assessment; CDR—Clinical Dementia Rating; K-IADL—Korean Instrumental Activities of Daily Living; UPDRS—Untitled Parkinson’s Disease Rating; H&Y staging—Hoehn and Yahr staging; Schwab and England ADL—Schwab and England Activities of Daily Living scale.
Figure 3Variable importance in a random forest model (showing only the top 12 factors).
Figure 4Partial dependence plot (CDR sum of boxes).
Error of out-of-bag.
| Numbers of mtry | Error of Out-of-Bag |
|---|---|
| 5 | 0.344 |
| 6 | 0.375 |
| 7 | 0.396 |
| 8 | 0.375 |
| 9 | 0.396 |
| 10 | 0.365 |
| 11 | 0.385 |
| 12 | 0.375 |
| 13 | 0.375 |
| 14 | 0.375 |
| 15 | 0.375 |
Comparison of accuracies developed prediction models, %.
| Model | Overall Accuracy | Sensitivity | Specificity |
|---|---|---|---|
| Multiple logistic regression | NA | NA | NA |
| Decision tree | 67.7 | 51.1 | 82.4 |
| Random Forest | 65.6 | 70.6 | 60.0 |
NA—not available.
Figure 5Out-of-bag error rate curve (random forest model). Black line—overall accuracy; red line—sensitivity; Green line—specificity.