| Literature DB >> 26397878 |
Jennifer Weuve1, Cécile Proust-Lima2, Melinda C Power3, Alden L Gross4, Scott M Hofer5, Rodolphe Thiébaut6, Geneviève Chêne6, M Maria Glymour7, Carole Dufouil8.
Abstract
Clinical and population research on dementia and related neurologic conditions, including Alzheimer's disease, faces several unique methodological challenges. Progress to identify preventive and therapeutic strategies rests on valid and rigorous analytic approaches, but the research literature reflects little consensus on "best practices." We present findings from a large scientific working group on research methods for clinical and population studies of dementia, which identified five categories of methodological challenges as follows: (1) attrition/sample selection, including selective survival; (2) measurement, including uncertainty in diagnostic criteria, measurement error in neuropsychological assessments, and practice or retest effects; (3) specification of longitudinal models when participants are followed for months, years, or even decades; (4) time-varying measurements; and (5) high-dimensional data. We explain why each challenge is important in dementia research and how it could compromise the translation of research findings into effective prevention or care strategies. We advance a checklist of potential sources of bias that should be routinely addressed when reporting dementia research.Entities:
Keywords: Alzheimer disease; Big data; Brain imaging; Dementia; Epidemiologic factors; Genomics; Longitudinal studies; Neuropsychological tests; Selection bias; Statistical models; Survival bias
Mesh:
Year: 2015 PMID: 26397878 PMCID: PMC4655106 DOI: 10.1016/j.jalz.2015.06.1885
Source DB: PubMed Journal: Alzheimers Dement ISSN: 1552-5260 Impact factor: 21.566
Fig. 1Guidelines for reporting methodological challenges and evaluating bias in cognitive decline and dementia research.
Selection processes: problems and commonly adopted analytic approaches*
| Differential attrition of enrolled participants | Differential survival of enrolled participants | Differential study enrollment or “muting” |
|---|---|---|
| Ignore | Ignore | Ignore |
| Sensitivity analyses for magnitude of bias under plausible set of selection processes | Sensitivity analyses for magnitude of bias under plausible set of selection processes | Sensitivity analyses for magnitude of bias under plausible set of selection processes |
| Assess bounds based on best or worst case assumptions | Assess bounds based on best or worst case assumptions | Assess bounds based on best or worst case assumptions |
| Model determinants of selection to evaluate whether ignoring selection is appropriate | Model determinants of selection to evaluate whether ignoring selection is appropriate | Model determinants of selection to evaluate whether ignoring selection is appropriate |
| Adjust for determinants of selection | Adjust for determinants of selection | Adjust for determinants of selection |
| Weight on the inverse of the probability of selection | Weight on the inverse of the probability of selection | Weight on the inverse of the probability of selection |
| Instrumental variable methods, if an instrument for selection is available | Instrumental variable methods, if an instrument for selection is available | Instrumental variable methods, if an instrument for selection is available |
| Joint modeling of selection process (dropout, death, enrollment) and outcome | Joint modeling of selection process (dropout, death, enrollment) and outcome | Joint modeling of selection process (dropout, death, enrollment) and outcome |
| Multiple imputation and likelihood-based estimation including covariates related to missingness mechanism | Competing risks analysis (only when dementia is the outcome) | Principal stratification |
For many of these approaches, there is currently limited empirical or theoretical evidence comparing the performance (i.e., providing a precise estimate of the effect of interest) in dementia research.
Fig. 2Hypothetical illustration of selection processes before and after study enrollment. At age 20, smoking and APOE status are unrelated, but these risk factors synergistically affect mortality, with more than multiplicative effects on survival up to age 70 [72]. By the time of study initiation at age 70, smokers are very unlikely to be APOE ε4 carriers. Analyses that did not control for APOE ε4 would conflate APOE status and smoking and spuriously underestimate effects of smoking.
Measurement challenges: problems and commonly adopted analytic approaches*
| Validity of measurement | Reliability/random measurement error | Practice or retest effects | Unequal-interval scaling (including ceilings/floors on measures) |
|---|---|---|---|
| Ignore | Ignore | Ignore | Ignore |
| Multivariate latent variable methods or measurement error models | Multivariate latent variable methods or measurement error models | ||
| Compare to a gold standard/criterion validity | Instrumental variable analyses | Drop the first assessment or average first two assessments | Drop observations at the ceiling/ floor or otherwise condition on the baseline score |
| Compare to measures of theoretically correlated variables | Use composite scores from multiple neuropsychological assessments (e.g., summed Z-scores | Choose tests with limited retest effects | Item response theory or factor analysis based models. Factor analyses, imposing distributional assumptions |
| Evaluate Differential Item Functioning (DIF) and implement statistical corrections or adjustment for source of DIF | Randomize time of first assessment | Rescale by Z-scoring | |
| Indicator for first assessment | Transform the measure with a monotonic transformation intended to reduce non-interval scales (e.g. logarithm, box-cox, specifically designed normalizing transformation) | ||
| Other models of practice (linear or non-linear increases in practice effects) | Categorize the outcome (impaired vs. not impaired) | ||
| Mixed models identifying practice effects based on time-varying interview delays | Tobit regression models (for ceilings/floors) or quantile regressions | ||
| Joint estimation of a normalizing transformation of the outcome and the coefficients | |||
For many of these approaches, there is currently limited empirical or theoretical evidence comparing the performance (i.e., providing a precise estimate of the effect of interest) in dementia research.
Z-scoring rescales each individual’s raw score with respect to the distribution of scores for other individuals in the sample. From each individual’s raw score, the Z-score is calculated by subtracting the sample mean (usually at baseline) and dividing by the sample standard deviation (also at baseline).
Defining the time scale for longitudinal analyses: problems and commonly adopted analytic approaches*
| Divergence of within-person change and between-person age differences | Analysis of terminal decline preceding death, dementia, or other “milestone” events | Nonlinear cognitive trajectories |
|---|---|---|
| Ignore | Ignore | Ignore |
| Age as the time-scale with adjustment for age at entry or time-from-entry as the time-scale adjusting for age at entry | Analysis among the participants who had the event | Polynomial trajectory (quadratic, cubic) |
| Use of age at assessment as the time scale, without adjustment for age at entry. | Time to event as time scale in the group with event versus time to last measure for the healthy participants matched by or adjusted for the age at the last measure among others | Trajectories with random, pre-specified, or empirically selected change-points |
| Other time scale of interest adjusting for a cross-sectional age (possibly other than age at entry) | Joint model of the longitudinal outcome and the time to the event of interest (death, dementia or others) | Flexible parametric (splines, fractional polynomials) or non-parametric trajectories |
For many of these approaches, there is currently limited empirical or theoretical evidence comparing the performance (i.e., providing a precise estimate of the effect of interest) in dementia research.
Handling time-varying exposures and time-varying confounding: problems and commonly adopted analytic approaches*
| Time-varying exposures | Time-varying confounding |
|---|---|
| Ignore | Ignore |
| Marginal structural models and inverse probability weighting Structural nested models | Marginal structural models and inverse probability weighting Structural nested models |
| Time-to-event models, allowing exposure to update or lag | Compare effect estimates with or without adjustment for time-varying confounders |
| Summaries of time-varying exposure (e.g. average, duration, age at initiation) | Longitudinal propensity score models |
| Compare estimates from several models using exposure status at a single point in time or moving time windows; formally test alternative lifecourse models. | Instrumental variables models |
For many of these approaches, there is currently limited empirical or theoretical evidence comparing the performance (i.e., providing a precise estimate of the effect of interest) in dementia research.
Handling high dimensional data: challenges and commonly adopted analytic approaches
| Multiple comparisons/false discovery | Summarizing multiple highly correlated variables | Regression with high dimensional data |
|---|---|---|
| Family wise error correction (e.g. Bonferroni) | Theoretically motivated summaries or selected indicators based on prior knowledge, e.g., candidate gene approaches | Preselection of the variables of interest for adjustment |
| False discovery rate (e.g. BH correction) | Combination of variables (e.g. principal components analysis, partial least square) | Regularization methods (e.g. Lasso, ridge regression, elastic net) |