| Literature DB >> 36002550 |
Agnès Pérez-Millan1,2, José Contador1, Raúl Tudela3, Aida Niñerola-Baizán3,4, Xavier Setoain3,4, Albert Lladó1,5, Raquel Sánchez-Valle1, Roser Sala-Llonch6,7.
Abstract
Linear mixed effects (LME) modelling under both frequentist and Bayesian frameworks can be used to study longitudinal trajectories. We studied the performance of both frameworks on different dataset configurations using hippocampal volumes from longitudinal MRI data across groups-healthy controls (HC), mild cognitive impairment (MCI) and Alzheimer's disease (AD) patients, including subjects that converted from MCI to AD. We started from a big database of 1250 subjects from the Alzheimer's disease neuroimaging initiative (ADNI), and we created different reduced datasets simulating real-life situations using a random-removal permutation-based approach. The number of subjects needed to differentiate groups and to detect conversion to AD was 147 and 115 respectively. The Bayesian approach allowed estimating the LME model even with very sparse databases, with high number of missing points, which was not possible with the frequentist approach. Our results indicate that the frequentist approach is computationally simpler, but it fails in modelling data with high number of missing values.Entities:
Mesh:
Year: 2022 PMID: 36002550 PMCID: PMC9402558 DOI: 10.1038/s41598-022-18129-4
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
Characteristics of the longitudinal ADNI sample used.
| Variable | sHC | cHC | sMCI | cMCI | AD | p-value |
|---|---|---|---|---|---|---|
| N | 273 | 78 | 361 | 319 | 219 | |
| Baseline age (years) | 74.3 ± 5.7 | 76.2 ± 5.1 | 72.9 ± 7.4 | 72.4 ± 7.5 | 74.7 ± 7.9 | 0.19 |
| Sex (M/F) | 142/131 | 40/38 | 212/149 | 184/135 | 123/96 | 0.41 |
| APOE-e4 (nc/c) | 207/66 | 52/26 | 203/158 | 121/198 | 64/155 | < 0.0005 |
Baseline age values are in mean ± standard deviation. M = male, F = female, nc = non-carriers, c = carriers.
p-values indicate differences between group. We used ANOVA for baseline age, and Fisher’s exact test for the other data.
Number of scans per time point by clinical group and time between scans.
| Time point | sHC (N) | cHC (N) | sMCI (N) | cMCI (N) | AD (N) | Time from baseline (years) |
|---|---|---|---|---|---|---|
| Baseline | 273 | 78 | 361 | 319 | 219 | 0.00 |
| Year 0.5 | 243 | 71 | 326 | 274 | 187 | 0.51 ± 0.05 |
| Year 1 | 234 | 70 | 294 | 275 | 173 | 1.01 ± 0.06 |
| Year 2 | 206 | 61 | 233 | 240 | 97 | 2.02 ± 0.08 |
Time from baseline values are in mean ± standard deviation.
Characteristics of the balanced longitudinal ADNI sample used.
| Variable | sHC | cHC | sMCI | cMCI | AD | p-value |
|---|---|---|---|---|---|---|
| N | 172 | 53 | 187 | 186 | 72 | |
| Baseline age (years) | 74.2 ± 5.6 | 76.1 ± 5.4 | 72.0 ± 6.9 | 71.6 ± 7.6 | 74.2 ± 7.9 | 0.004 |
| Sex (M/F) | 96/76 | 24/29 | 104/83 | 107/79 | 40/32 | 0.62 |
| APOE-ɛ4 (nc/c) | 133/39 | 33/20 | 116/71 | 72/114 | 21/51 | < 0.0005 |
Baseline age values are in mean ± standard deviation. M = male, F = female, nc = non-carriers, c=carriers.
p-values indicate differences between group. We used ANOVA for baseline age, and Fisher’s exact test for the other data.
Figure 1Simulations’ scheme (a) Strategy for minimum N simulations. The initial data is dataset 1 or dataset 2. (b) Strategy of the simulation of missing time points. RE= random effects.
Summary of the null hypotheses tested and results of the statistical inference.
| Contrast | Dataset 1 | Dataset 2 | Dataset 3 | Dataset 4 |
|---|---|---|---|---|
| sMCI vs cMCI | 39.2 5.6 × 10–10 * | 36.1 3.2 × 10–9* | 31.8 2.5 × 10–8* | 24.2 1.3 × 10–6* |
| All groups | 22.8 4.1 × 10–18 * | – | 15.1 8.5 × 10–12* | – |
| AD vs cMCI | 2.0 0.2 | – | 0.2 0.6 | – |
| sHC vs sMCI | 2.3 0.1 | – | 1.0 0.3 | – |
| sHC vs AD | 53.8 4.1 × 10–13* | – | 27.7 1.9 × 10–7* | – |
| sHC vs cMCI | 53.4 5.7 × 10–13* | – | 40.4 3.8 × 10–10* | – |
| sHC vs cHC | 2.3 0.1 | – | 2.7 0.1 | – |
*Indicates p-value < 0.05 (Bonferroni corrected).
Estimation and 95% Credible Intervals (CrI) of the ßs of interest LME model fitted with a Bayesian approach.
| Parameter | Interpretation | Estimate | 95% CrI | |
|---|---|---|---|---|
| ß11 | cHC × time | − 0.03 | − 0.06 | 0.01 |
| ß12 | sMCI × time | − 0.02 | − 0.04 | 0.01 |
| ß13 | cMCI × time | − 0.08 | − 0.11 | − 0.06* |
| ß14 | AD × time | − 0.11 | − 0.13 | − 0.08* |
CrI borders are expressed as the 2.5% and 97.5% percentiles.
*Indicates that the effect is significant (i.e., CrI does not contain zero).
Figure 2Distribution of subjects within each group for all the obtained databases (a) minimum N simulation across five clinical groups (b) minimum N simulation across MCI group.
Figure 3Hippocampus volume versus age. Top plots (a,b) represent dataset 1 (a) and 2 (b) with initial N. Bottom plots (a,b) represent four different databases obtained after the simulation of minimum N, resulting significant for frequentist and Bayesian approaches and only for frequentist approach. Top plots (c,d) represent dataset 3 (c) and 4 (d). Bottom plots (c,d) represent different databases obtained after the simulation of missing time points, being only estimable for Bayesian approach.