| Literature DB >> 34490422 |
James Howlett1, Steven M Hill1, Craig W Ritchie2, Brian D M Tom1.
Abstract
A key challenge for the secondary prevention of Alzheimer's dementia is the need to identify individuals early on in the disease process through sensitive cognitive tests and biomarkers. The European Prevention of Alzheimer's Dementia (EPAD) consortium recruited participants into a longitudinal cohort study with the aim of building a readiness cohort for a proof-of-concept clinical trial and also to generate a rich longitudinal data-set for disease modelling. Data have been collected on a wide range of measurements including cognitive outcomes, neuroimaging, cerebrospinal fluid biomarkers, genetics and other clinical and environmental risk factors, and are available for 1,828 eligible participants at baseline, 1,567 at 6 months, 1,188 at one-year follow-up, 383 at 2 years, and 89 participants at three-year follow-up visit. We novelly apply state-of-the-art longitudinal modelling and risk stratification approaches to these data in order to characterise disease progression and biological heterogeneity within the cohort. Specifically, we use longitudinal class-specific mixed effects models to characterise the different clinical disease trajectories and a semi-supervised Bayesian clustering approach to explore whether participants can be stratified into homogeneous subgroups that have different patterns of cognitive functioning evolution, while also having subgroup-specific profiles in terms of baseline biomarkers and longitudinal rate of change in biomarkers.Entities:
Keywords: Alzheimer’s disease; Bayesian profile regression; European prevention of Alzheimer’s dementia; biomarkers; cognitive functioning; disease modelling; latent class mixed models; precision medicine
Year: 2021 PMID: 34490422 PMCID: PMC8417903 DOI: 10.3389/fdata.2021.676168
Source DB: PubMed Journal: Front Big Data ISSN: 2624-909X
FIGURE 1Graphical representation of the proposed two-stage approach.
Baseline characteristics of the 1,574 participants with more than one visit.
| Variable | Mean (SD) | Frequency (%) | No. Unknown | ||
|---|---|---|---|---|---|
| Risk Factors | Age, years | 65.4 (7.4) | 0 | ||
| Sex | Female | 888 (56.4) | 0 | ||
| Male | 686 (43.6) | ||||
| Education | Level 1 | 587 (37.3) | 0 | ||
| Level 2 | 393 (25.0) | ||||
| Level 3 | 594 (37.7) | ||||
| Family history of AD | No | 543 (34.5) | 0 | ||
| Yes | 1,031 (65.5) | ||||
| APOE | No | 965 (62.5) | 31 | ||
| Yes | 578 (37.5) | ||||
| Outcomes | CDRSB | 0 | 1,162 (73.9) | 2 | |
| 0.5 | 214 (13.6) | ||||
| ≥1 | 196 (12.5) | ||||
| MMSE | 29–30 | 999 (63.5) | 1 | ||
| 27–28 | 417 (26.5) | ||||
| ≤26 | 157 (10.0) | ||||
| Transformed CDRSB, tCDRSB | 4.60 (1.04) | 2 | |||
| Normalised MMSE, nMMSE | 83.6 (14.6) | 1 | |||
| Biomarkers | pTau/A | ≤0.024 | 1,240 (80.5) | 33 | |
| >0.024 | 301 (19.5) | ||||
| MTA average | 0 | 800 (51.2) | 13 | ||
| 0.5 | 375 (24.0) | ||||
| ≥1 | 386 (24.7) | ||||
| Fazekas scale deep | <2 | 1,317 (84.4) | 13 | ||
| ≥2 | 244 (15.6) | ||||
| Fazekas scale periventricular | <1 | 947 (60.7) | 13 | ||
| ≥1 | 614 (39.3) | ||||
| ARWMC basal ganglia | <1 | 1,379 (88.3) | 13 | ||
| ≥1 | 182 (11.7) | ||||
| ARWMC frontal | <1 | 506 (32.4) | 13 | ||
| ≥1 | 1,055 (67.6) | ||||
| ARWMC infratentorial | <1 | 1,465 (93.9) | 13 | ||
| ≥1 | 96 (6.1) | ||||
| ARWMC parieto-occipital | <1 | 786 (50.4) | 13 | ||
| ≥1 | 775 (49.6) | ||||
| ARWMC temporal | <1 | 1,268 (81.2) | 13 | ||
| ≥1 | 293 (18.8) | ||||
| ARWMC combined | <3 | 1,194 (76.5) | 13 | ||
| ≥3 | 367 (23.5) | ||||
| Total hippocampal volume (adj), | 5,793 (703) | 62 | |||
| Total ventricular volume (adj), | 32,991 (17,669) | 168 |
FIGURE 2(A) Predicted trajectories with 95% confidence bands for each class on the latent process scale given mean values for each of the covariates. (B) and (C) Predicted trajectories with 95% confidence bands for each class on the CDRSB and MMSE scale given mean values for the each of the covariates with observed outcomes for each participant.
Results of the 4-class MLCMM on the 1,543 participants.
| Coefficient (SE) | |||
|---|---|---|---|
| Class membership model | Intercept class 0 | 1.30 (0.07) | <0.0001 |
| Intercept class 1 | −1.04 (0.12) | <0.0001 | |
| Intercept class 2 | −0.87 (0.13) | <0.0001 | |
| Fixed effects model | Intercept class 0 | 0 (not estimated) | — |
| Intercept class 1 | −2.33 (0.15) | <0.0001 | |
| Intercept class 2 | −0.65 (0.14) | <0.0001 | |
| Intercept class 3 | −3.14 (0.22) | <0.0001 | |
| Time in study class 0 | −0.0040 (0.014) | 0.773 | |
| Time in study class 1 | 2.43 (0.16) | <0.0001 | |
| Time in study class 2 | −1.58 (0.11) | <0.0001 | |
| Time in study class 3 | 0.16 (0.04) | 0.0001 | |
| Age | −0.0033 (0.0014) | 0.022 | |
| Sex male | −0.015 (0.019) | 0.443 | |
| Education level 2 | 0.022 (0.024) | 0.367 | |
| Education level 3 | 0.043 (0.022) | 0.049 | |
| Family history of AD | 0.016 (0.021) | 0.453 | |
| APOE | −0.021 (0.019) | 0.290 | |
| Link function parameters | tCDRSB | 5.31 (0.07) | <0.0001 |
| tCDRSB | 0.73 (0.04) | <0.0001 | |
| nMMSE | 87.96 (0.48) | <0.0001 | |
| nMMSE | 3.80 (0.30) | <0.0001 |
Characterisation of the baseline and change variables by latent phenotype classes.
| Mean (SD) | ||||||
|---|---|---|---|---|---|---|
| Variable | Class 0 | Class 1 | Class 2 | Class 3 | ANOVA | |
| Age, years | 63.9 (7.0) | 65.6 (6.6) | 68.1 (7.0) | 69.5 (7.0) | <0.0001 | |
| Total hippocampal | ||||||
| volume (adj), | 5,911 (644) | 5,814 (725) | 5,609 (715) | 5,429 (768) | <0.0001 | |
| Total ventricular | ||||||
| volume (adj), | 30,715 (16,348) | 35,404 (19,823) | 37,962 (18,838) | 38,396 (19,405) | <0.0001 | |
| Annual (adj) hippocampal | ||||||
| volume change, | −9.4 (83.5) | −30.4 (61.9) | −40.2 (85.3) | −55.3 (99.1) | <0.0001 | |
| Annual (adj) ventricular | ||||||
| volume change, | 988 (910) | 1,430 (1,354) | 1,958 (1,651) | 1,688 (1,586) | <0.0001 | |
FIGURE 3Posterior similarity matrix for the consensus across the six MCMC chains from the Bayesian profile regression analysis on the 1,543 EPAD participants. Each entry (i, j) of this 1,543 × 1,543-matrix represents the proportion of times participants i and j are assigned to the same cluster over the 250,000 × 6 MCMC iterations. Color bars indicate the seven final PAM consensus representative clusters of participants identified. See Figure 4 for more information regarding these clusters.
FIGURE 4Results from Bayesian Profile Regression analysis. (A) Cluster sizes for the final PAM representative consensus clusters. (B–D) Posterior distributions for mean mixture component parameter values for each of the “representative” clusters (see Section 2.2.2). (B) Outcome variable (parameters are the probability of belonging to each MLCMM latent class). (C) Binary covariates (parameters are the probability of the covariate having value of one). (D) Continuous covariates (parameters are the mean covariate value). For (A–D), colors indicate clusters (see also Figure 3). For (B–D), black horizontal lines indicate the mean parameter values across all subjects and the coloured circles indicate the upper and lower limit of the 90% credible interval.
Results from the Bayesian profile regression analysis on the 1,543 participants.
| Posterior means | ||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Probability of abnormal pathology | SD distance from overall mean | Class membership probability | ||||||||||||
| Clusters | N (%) | pTau/A | MTA | FSD | FSPV | ARWMC combined | Mean HV | Mean HV rate | Mean VV | Mean VV rate | Class 0 | Class 1 | Class 2 | Class 3 |
| 1 | 575 (37.3) | 0.113 | 0.096 | 0.079 | 0.269 | 0.143 | 0.549 | 0.362 | −0.674 | −0.463 | 0.917 | 0.040 | 0.038 | 0.005 |
| 2 | 110 (7.1) | 0.166 | 0.575 | 0.278 | 0.567 | 0.357 | −0.706 | 0.047 | 1.590 | 0.321 | 0.791 | 0.133 | 0.038 | 0.039 |
| 3 | 409 (26.5) | 0.145 | 0.200 | 0.195 | 0.436 | 0.287 | −0.111 | −0.066 | 0.101 | −0.042 | 0.976 | 0.015 | 0.006 | 0.003 |
| 4 | 227 (14.7) | 0.353 | 0.337 | 0.135 | 0.405 | 0.202 | −0.300 | −0.166 | −0.063 | 0.020 | 0.010 | 0.040 | 0.009 | 0.941 |
| 5 | 82 (5.3) | 0.553 | 0.810 | 0.380 | 0.675 | 0.524 | −1.229 | −1.093 | 1.583 | 1.558 | 0.101 | 0.024 | 0.154 | 0.721 |
| 6 | 72 (4.7) | 0.308 | 0.309 | 0.276 | 0.529 | 0.319 | −0.354 | 0.000 | 0.287 | 0.407 | 0.064 | 0.177 | 0.731 | 0.028 |
| 7 | 68 (4.4) | 0.228 | 0.247 | 0.157 | 0.376 | 0.240 | 0.025 | 0.026 | −0.109 | −0.004 | 0.248 | 0.464 | 0.173 | 0.115 |
| Overall empirical mean | 0.194 | 0.247 | 0.158 | 0.393 | 0.236 | 5,793 | -23.8 | 32,997 | 1,274 | 0.680 | 0.063 | 0.069 | 0.188 | |
| SD | 705 | 88.7 | 17,687 | 1,264 | ||||||||||