| Literature DB >> 36212530 |
Masataka Kikuchi1,2, Kaori Kobayashi1,3, Sakiko Itoh1, Kensaku Kasuga4, Akinori Miyashita4, Takeshi Ikeuchi4, Eiji Yumoto5, Yuki Kosaka5, Yasuto Fushimi3, Toshihiro Takeda6, Shirou Manabe6, Satoshi Hattori7,8, Akihiro Nakaya9, Kenichi Kamijo3, Yasushi Matsumura6.
Abstract
Mild cognitive impairment (MCI) is a high-risk condition for conversion to Alzheimer's disease (AD) dementia. However, individuals with MCI show heterogeneous patterns of pathology and conversion to AD dementia. Thus, detailed subtyping of MCI subjects and accurate prediction of the patients in whom MCI will convert to AD dementia is critical for identifying at-risk populations and the underlying biological features. To this end, we developed a model that simultaneously subtypes MCI subjects and predicts conversion to AD and performed an analysis of the underlying biological characteristics of each subtype. In particular, a heterogeneous mixture learning (HML) method was used to build a decision tree-based model based on multimodal data, including cerebrospinal fluid (CSF) biomarker data, structural magnetic resonance imaging (MRI) data, APOE genotype data, and age at examination. The HML model showed an average F1 score of 0.721, which was comparable to the random forest method and had significantly more predictive accuracy than the CART method. The HML-generated decision tree was also used to classify-five subtypes of MCI. Each MCI subtype was characterized in terms of the degree of abnormality in CSF biomarkers, brain atrophy, and cognitive decline. The five subtypes of MCI were further categorized into three groups: one subtype with low conversion rates (similar to cognitively normal subjects); three subtypes with moderate conversion rates; and one subtype with high conversion rates (similar to AD dementia patients). The subtypes with moderate conversion rates were subsequently separated into a group with CSF biomarker abnormalities and a group with brain atrophy. The subtypes identified in this study exhibited varying MCI-to-AD conversion rates and differing biological profiles.Entities:
Keywords: Alzheimer’s disease; Decision trees; Mild cognitive impairment
Year: 2022 PMID: 36212530 PMCID: PMC9513733 DOI: 10.1016/j.csbj.2022.08.007
Source DB: PubMed Journal: Comput Struct Biotechnol J ISSN: 2001-0370 Impact factor: 6.155
Summary of sample characteristics at baseline.
| CN | MCI | AD dementia | |
|---|---|---|---|
| N | 305 | 480 | 156 |
| Age in years, mean ± SE | 73.7 ± 0.328 | 71.8 ± 0.339 | 74 ± 0.666 |
| Sex (Female:Male) | 162:143 | 200:280 | 69:87 |
| Education year, mean ± SE | 16.3 ± 0.151 | 16 ± 0.127 | 15.5 ± 0.214 |
| CSF Aβ(1–42) (pg/mL), mean ± SE | 1226 ± 25.27 | 964.2 ± 19.94 | 644.6 ± 22.97 |
| CSF tTau (pg/mL), mean ± SE | 238.4 ± 5.128 | 287.4 ± 6.264 | 373.8 ± 11.1 |
| CSF pTau (pg/mL), mean ± SE | 21.9 ± 0.529 | 27.9 ± 0.693 | 37.3 ± 1.18 |
| tTau / Aβ(1–42), mean ± SE | 2.33e-01 ± 9.01e-03 | 3.86e-01 ± 1.32e-02 | 6.51e-01 ± 2.45e-02 |
| pTau / Aβ(1–42), mean ± SE | 2.19e-02 ± 9.78e-04 | 3.82e-02 ± 1.43e-03 | 6.52e-02 ± 2.55e-03 |
| Whole-brain volume / ICV, mean ± SE | 6.94e-01 ± 2.52e-03 | 6.83e-01 ± 2.28e-03 | 6.45e-01 ± 3.37e-03 |
| Hippocampus volume / ICV, mean ± SE | 5.01e-03 ± 3.37e-05 | 4.5e-03 ± 3.74e-05 | 3.84e-03 ± 5.27e-05 |
| Brain-ventricular volume / ICV, mean ± SE | 2.14e-02 ± 5.9e-04 | 2.45e-02 ± 5.87e-04 | 3.05e-02 ± 9.25e-04 |
| Entorhinal cortex volume / ICV, mean ± SE | 2.58e-03 ± 2.16e-05 | 2.33e-03 ± 2.19e-05 | 1.86e-03 ± 3.48e-05 |
| WMH volume, mean ± SE | 4.03 ± 0.429 | 5.02 ± 0.385 | 4.75 ± 0.539 |
| 81 (26.6%) | 239 (49.8%) | 108 (69.2%) |
Abbreviations are as follows: CN, Cognitively normal; MCI, Mild cognitive impairment; AD, Alzheimer's disease; CSF, Cerebrospinal fluid; Aβ, Amyloid-beta; tTau, Total tau; pTau, Phosphorylated tau; ICV, intracranial volume; WMH, White matter hyperintensity; APOE, Apolipoprotein E; SE, Standard error.
Fig. 1Procedures for 5-fold cross-validation (CV) to estimate model parameters based on heterogeneous mixture learning (HML).
Test performance of each method.
| HML (Approach 1) | HML (Approach 2) | CART | Random forest | |
|---|---|---|---|---|
| Sensitivity | 0.574 ± 0.079 | 0.765 ± 0.011 | 0.682 ± 0.024 | 0.725 ± 0.012 |
| Specificity | 0.848 ± 0.036 | 0.825 ± 0.008 | 0.806 ± 0.011 | 0.834 ± 0.005 |
| Precision | 0.694 ± 0.055 | 0.695 ± 0.010 | 0.658 ± 0.011 | 0.705 ± 0.005 |
| Accuracy | 0.760 ± 0.037 | 0.792 ± 0.006 | 0.760 ± 0.008 | 0.795 ± 0.003 |
| F1 | 0.610 ± 0.039 | 0.721 ± 0.008 | 0.663 ± 0.015 | 0.713 ± 0.006 |
| # of leaf nodes | 3.8 ± 1.095 | 4.2 ± 0.236 | 26.2 ± 2.145 | – |
Each value shows mean ± SE of performances obtained from 5-fold CV.
1Tukey's HSD tests were performed on three groups: HML (Approach 2), CART, and random forest, and the p-value between HML and the corresponding algorithm is shown.
2The p-value between HML (Approach 2) and CART by Student's t-test is shown.
Fig. 2The HML-based decision tree classifies the MCI subjects into five subtypes. (A) Decision tree model generated by HML. The dotted lines in each gating node represent each threshold. The numbers in yellow and green areas of each pie chart indicate the number of subjects with CN and AD, respectively. (B) Conversion rates over time for each MCI subtype (p = 4.62e-15 in the log-rank test). (C) Conversion rates within three years for each MCI subtype. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
Summary of each subtype.
| Subtype 1 | Subtype 2 | Subtype 3 | Subtype 4 | Subtype 5 | |
|---|---|---|---|---|---|
| N | 68 | 173 | 188 | 14 | 37 |
| Age in years, mean ± SE | 73.9 ± 0.99 | 71.8 ± 0.568 | 71.4 ± 0.529 | 66 ± 1.62 | 72 ± 0.948 |
| Sex (female:male) | 25:43 | 72:101 | 82:106 | 9:5 | 12:25 |
| Years of education, mean ± SE | 15.8 ± 0.355 | 16.3 ± 0.202 | 15.8 ± 0.206 | 17 ± 0.756 | 16.1 ± 0.452 |
| CSF Aβ(1–42) (pg/mL), mean ± SE | 923.3 ± 46.37 | 1211 ± 32.32 | 852.1 ± 28.32 | 713.9 ± 68.01 | 549.7 ± 30.4 |
| CSF tTau (pg/mL), mean ± SE | 273.2 ± 17.62 | 235.8 ± 7.491 | 323.7 ± 10.54 | 416 ± 56.81 | 321.7 ± 18.28 |
| CSF pTau (pg/mL), mean ± SE | 25.9 ± 1.95 | 22.1 ± 0.843 | 32 ± 1.15 | 42.6 ± 6.5 | 31.6 ± 1.96 |
| tTau / Aβ(1–42), mean ± SE | 3.53e-01 ± 2.85e-02 | 2.37e-01 ± 1.36e-02 | 4.66e-01 ± 2.2e-02 | 6.95e-01 ± 1.47e-01 | 6.25e-01 ± 3.56e-02 |
| pTau / Aβ(1–42), mean ± SE | 3.4e-02 ± 3.07e-03 | 2.29e-02 ± 1.52e-03 | 4.68e-02 ± 2.38e-03 | 7.21e-02 ± 1.66e-02 | 6.13e-02 ± 3.59e-03 |
| Whole-brain volume / ICV, mean ± SE | 6.49e-01 ± 4.88e-03 | 6.99e-01 ± 3.63e-03 | 6.81e-01 ± 3.65e-03 | 7.24e-01 ± 8.36e-03 | 6.72e-01 ± 6.86e-03 |
| Hippocampus volume / ICV, mean ± SE | 3.83e-03 ± 7.68e-05 | 4.87e-03 ± 5.18e-05 | 4.49e-03 ± 6e-05 | 4.91e-03 ± 2.16e-04 | 3.87e-03 ± 8.39e-05 |
| Brain-ventricular volume / ICV, mean ± SE | 3.26e-02 ± 1.69e-03 | 2.21e-02 ± 8.44e-04 | 2.44e-02 ± 9.93e-04 | 1.18e-02 ± 6.66e-04 | 2.61e-02 ± 1.36e-03 |
| Entorhinal cortex volume / ICV, mean ± SE | 1.83e-03 ± 3.46e-05 | 2.59e-03 ± 2.24e-05 | 2.3e-03 ± 3.65e-05 | 2.63e-03 ± 1.42e-04 | 2.05e-03 ± 6.6e-05 |
| WMH volume, mean ± SE | 5.78 ± 1.17 | 5.48 ± 0.624 | 4.29 ± 0.618 | 4.71 ± 1.71 | 5.22 ± 1.25 |
| 0 (0%) | 0 (0%) | 188 (100%) | 14 (100%) | 37 (100%) |
Abbreviations are as follows: CN, Cognitively normal; MCI, Mild cognitive impairment; AD, Alzheimer's disease; CSF, Cerebrospinal fluid; Aβ, Amyloid-beta; tTau, Total tau; pTau, Phosphorylated tau; ICV, intracranial volume; WMH, White matter hyperintensity; APOE, Apolipoprotein E; SE, Standard error.
Fig. 3Features of each MCI subtype. (A) Cerebrospinal fluid (CSF) Aβ(1–42) level, (B) CSF tTau level, (C) CSF pTau level, (D) tTau/Aβ(1–42) ratio, (E) pTau/Aβ(1–42) ratio, (F) normalized brain-ventricular volume, (G) normalized hippocampal volume, (H) normalized whole-brain volume, (I) normalized entorhinal cortex volume, (J) normalized white matter hyperintensity (WMH) volume, (K) age, and (L) dosage of APOE ε4 alleles. The blue and red lines indicate the cut-off values based on the CN subjects and the AD dementia patients. The adopted cut-off value was that which distinguished the CN subjects and the AD dementia patients with the highest accuracy in the ROC analysis. Points below the blue line or above the red line represent levels similar to those observed in AD dementia patients. (M) The spot matrix shows the proportions of individuals with features similar to those of AD dementia patients. The spot size represents the proportion of MCI subjects with values exceeding the cut-off value. Yellow indicates a value greater than 50% of the proportion. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
Fig. 4Cognitive function in each subtype. (A, D, G, J) Comparison of the cognitive function at baseline among subtypes. Higher composite scores in each cognitive domain were indicative of higher cognitive function. Multiple pairwise comparisons were performed with Tukey’s HSD tests to verify the differences in scores between subtypes. (B, E, H, K) Longitudinal changes in cognitive function by subtype. Each trajectory is indicated by a linear regression line. The error bars represent the 95 % confidence intervals. (C, F, I, L) Cognitive decline over time compared with cognitive function in subtype 2. The bar plots represent the interactions between subtypes and follow-up time, as estimated by the linear mixed model (LMM) with subtype 2 as the reference. *p < 0.05, **p < 0.01, ***p < 0.001.
Fig. 5Levels of the CSF neuronal and synaptic injury markers VILIP-1 (A), SNAP-25 (B), and NGRN (C) and the inflammatory response marker YKL-40 (D). Multiple pairwise comparisons were performed with Tukey’s HSD tests to verify the differences in CSF levels between subtypes. *p < 0.05, **p < 0.01, ***p < 0.001.