| Literature DB >> 31598577 |
Dena Zeraatkar1, Kevin Cheung2, Kirolos Milio2, Max Zworth2, Arnav Gupta3, Arrti Bhasin1, Jessica J Bartoszko1, Michel Kiflen1,4, Rita E Morassut5, Salmi T Noor1, Daeria O Lawson1, Bradley C Johnston6, Shrikant I Bangdiwala1,4, Russell J de Souza1,4.
Abstract
BACKGROUND: Observational studies provide important information about the effects of exposures that cannot be easily studied in clinical trials, such as nutritional exposures, but are subject to confounding. Investigators adjust for confounders by entering them as covariates in analytic models.Entities:
Keywords: confounding; covariate; model building; nutritional epidemiology; research methods
Year: 2019 PMID: 31598577 PMCID: PMC6778415 DOI: 10.1093/cdn/nzz104
Source DB: PubMed Journal: Curr Dev Nutr ISSN: 2475-2991
Study characteristics
| 2007/2008 | 2017/2018 | All articles | |
|---|---|---|---|
| ( | ( | ( | |
| Journal | |||
| | 2 (3.8) | 3 (3.1) | 5 (3.3) |
| | 4 (7.5) | 11 (11.3) | 15 (10.0) |
| | 3 (5.7) | 5 (5.2) | 8 (5.3) |
| | 1 (1.9) | 6 (6.2) | 7 (4.7) |
| | 0 (0) | 1 (1.0) | 1 (0.7) |
| | 14 (26.4) | 10 (10.3) | 24 (16.0) |
| | 14 (26.4) | 10 (10.3) | 24 (16.0) |
| | 0 (0) | 24 (24.7) | 24 (16.0) |
| | 14 (26.4) | 10 (10.3) | 24 (16.0) |
| | 1 (1.9) | 17 (17.5) | 18 (12.0) |
| Study design | |||
| Cohort | 30 (56.6) | 67 (69.1) | 97 (64.7) |
| Case-control | 8 (15.1) | 5 (5.2) | 13 (8.7) |
| Nested case-control | 0 (0) | 2 (2.1) | 2 (1.3) |
| Case-cohort | 1 (1.9) | 0 (0) | 1 (0.7) |
| Cross-sectional | 14 (26.4) | 23 (23.7) | 37 (24.7) |
| Participants, | 5823 [1864–26,238] | 11,879 [2121–88,184] | 8072 [2035–62,461] |
| Primary exposures investigated | |||
| Micronutrient | 10 (18.9) | 10 (10.3) | 20 (13.3) |
| Macronutrient | 6 (11.3) | 13 (13.4) | 19 (12.7) |
| Food | 9 (17.0) | 23 (23.7) | 32 (21.3) |
| Food group | 10 (18.9) | 13 (13.4) | 23 (15.3) |
| Dietary pattern | 18 (34.0) | 38 (39.2) | 56 (37.3) |
| Primary outcomes investigated | |||
| All-cause mortality | 4 (7.5) | 16 (16.5) | 20 (13.3) |
| Cardiovascular mortality | 1 (1.9) | 2 (2.1) | 3 (2.0) |
| Cardiovascular disease | 1 (1.9) | 3 (3.1) | 4 (2.7) |
| Stroke | 0 (0) | 3 (3.1) | 3 (2.0) |
| Myocardial infarction | 3 (5.7) | 3 (3.1) | 6 (4.0) |
| Brain cancer and tumors of the spinal cord | 1 (1.9) | 1 (1.0) | 2 (1.3) |
| Digestive cancers | 7 (13.2) | 11 (11.3) | 18 (12.0) |
| Endocrine-related cancers | 0 (0) | 1 (1.0) | 1 (0.7) |
| Female cancers | 3 (5.7) | 1 (1.0) | 4 (2.7) |
| Prostate cancer | 4 (7.5) | 1 (1.0) | 5 (3.3) |
| Overall cancer mortality | 1 (1.9) | 0 (0) | 1 (0.7) |
| Type 2 diabetes | 4 (7.5) | 9 (9.3) | 13 (8.7) |
| Other | 24 (45.3) | 46 (47.4) | 70 (46.7) |
| Methods for controlling for effects of covariates | |||
| Regression methods | 31 (58.5) | 52 (53.6) | 83 (55.3) |
| Combination of regression methods and stratification | 14 (26.4) | 39 (40.2) | 53 (35.3) |
| Combination of regression methods and individual matching | 4 (7.5) | 3 (3.1) | 7 (4.7) |
| Combination of regression methods and frequency matching | 4 (7.5) | 2 (2.1) | 6 (4.0) |
| None | 0 (0) | 1 (1.0) | 1 (0.7) |
| Analytic model | |||
| Multivariable linear regression | 5 (9.4) | 9 (9.3) | 14 (9.3) |
| Logistic regression | 21 (39.6) | 30 (30.9) | 51 (34.0) |
| Cox proportional hazards model | 22 (41.5) | 51 (52.6) | 73 (48.7) |
| ANOVA methods | 2 (3.8) | 4 (4.1) | 6 (4.0) |
| Poisson regression | 2 (3.8) | 0 (0) | 2 (1.3) |
| Other | 1 (1.9) | 3 (3.1) | 4 (2.7) |
| Reported statistically significant association between the primary exposure and outcome of interest? | |||
| Yes | 35 (66.0) | 82 (84.5) | 117 (78.0) |
| No | 18 (34.0) | 15 (15.5) | 33 (22.0) |
1Values are n (%) or median [IQR].
Reporting of methods for selection of covariates
| 2007/2008 | 2017/2018 | All articles | |
|---|---|---|---|
| ( | ( | ( | |
| Reported whether covariates were selected a priori? | |||
| Some (but not all) covariates were selected a priori | 1 (1.9) | 1 (1.0) | 2 (1.3) |
| All covariates were selected a priori | 0 (0) | 7 (7.2) | 7 (4.7) |
| Not reported | 52 (98.1) | 89 (91.8) | 141 (94.0) |
| Reported methods for selection of covariates for analysis? | |||
| Reported criteria for selection of all covariates | 9 (17.0) | 21 (21.7) | 30 (20.0) |
| Reported criteria for selection of some covariates | 10 (18.9) | 15 (15.5) | 25 (16.7) |
| Not reported | 34 (64.2) | 61 (62.9) | 95 (63.3) |
| Among studies that reported methods for selection of covariates, covariates were selected from: | |||
| Factors known or suspected to be associated with the exposure | 2 (3.8) | 3 (3.0) | 5 (3.3) |
| Known or established risk factors for the outcome | 13 (24.5) | 26 (26.8) | 39 (26.0) |
| Factors known or suspected to be associated with both the exposure and outcome | 1 (1.9) | 4 (4.1) | 5 (3.3) |
| Factors known or suspected to be associated with either the exposure or the outcome | 2 (3.8) | 0 (0) | 2 (1.3) |
| Confounders (factors associated with the exposure that also act on the outcome) as identified by Directed Acyclic Graphs | 0 (0) | 4 (4.1) | 4 (2.7) |
| Other | 4 (7.5) | 2 (2.1) | 6 (4.0) |
| Not reported | 34 (64.2) | 61 (62.9) | 95 (63.3) |
| Sources cited to support choice of covariates? | |||
| Systematic review | 1 (1.9) | 5 (5.2) | 6 (4.0) |
| Authoritative document (e.g., World Cancer Research Fund report) | 0 (0) | 4 (4.1) | 4 (2.7) |
| Narrative review | 0 (0) | 1 (1.0) | 1 (0.7) |
| Epidemiological study | 9 (17.0) | 11 (11.3) | 20 (13.3) |
| De novo literature search conducted by authors | 1 (1.9) | 9 (9.3) | 10 (6.7) |
| Methodology article | 0 (0) | 1 (1.0) | 1 (0.7) |
| No source cited | 44 (83.0) | 76 (78.4) | 120 (80.0) |
| Reported use of data-driven methods for selection of covariates for inclusion in final analytic model? | |||
| Reported use of data-driven methods for selection of all covariates for inclusion in final analytic model | 6 (11.3) | 8 (8.3) | 14 (9.3) |
| Reported use of a combination of data-driven and hypothesis-driven methods to select covariates for inclusion in final analytic model | 11 (20.8) | 15 (15.4) | 26 (17.3) |
| Did not report using any data-driven methods to select covariates | 36 (67.9) | 74 (76.3) | 110 (73.3) |
| Among studies that reported use of data-driven methods for selection of covariates, covariates were selected based on: | |||
| If their inclusion appreciably changed the effect estimate of the primary exposure (change-in-estimate criterion) | 11 (20.8) | 11 (11.3) | 22 (14.7) |
| | 2 (3.8) | 1 (1.0) | 3 (2.0) |
| | 1 (1.9) | 3 (3.1) | 4 (2.7) |
| | 3 (5.7) | 6 (6.1) | 9 (6.0) |
| Backward elimination | 1 (1.9) | 0 (0) | 1 (0.7) |
| Stepwise selection | 0 (0) | 2 (2.1) | 2 (1.3) |
| Magnitude of correlation with exposure | 0 (0) | 1 (1.0) | 1 (0.7) |
| Whether inclusion reduced the SE of the effect estimate of the primary exposure | 1 (1.9) | 1 (1.0) | 2 (1.3) |
| Model fit | 0 (0) | 1 (1.0) | 1 (0.7) |
| Some description provided but unclear which specific method was used | 4 (7.5) | 3 (3.1) | 7 (4.7) |
| Did not report using any data-driven methods to select covariates | 36 (67.9) | 74 (76.3) | 110 (73.3) |
| Conducted quantitative bias analysis to evaluate impact of potential unadjusted/unmeasured confounders on results? | |||
| Yes, according to methods described by Lin et al. ( | 0 (0) | 1 (1.0) | 1 (0.7) |
| Yes, according to methods described by Ding and VanderWeele ( | 0 (0) | 1 (1.0) | 1 (0.7) |
| Yes, by constructing a hypothetical confounder | 1 (1.9) | 0 (0) | 1 (0.7) |
| No | 52 (98.1) | 95 (97.9) | 147 (98.0) |
1Values are n (%).
2Categories are not mutually exclusive.
3Specific measure of model fit used not reported.
Interpretation of results in nutritional epidemiology studies
| 2007/2008 | 2017/2018 | All articles | |
|---|---|---|---|
| ( | ( | ( | |
| Acknowledge potential for residual confounding? | |||
| Yes | 31 (58.5) | 67 (69.1) | 98 (65.3) |
| No | 22 (41.5) | 30 (30.9) | 52 (34.7) |
| Acknowledge existence of unadjusted confounders? | |||
| Yes, because the confounder was not measured | 4 (7.5) | 11 (11.3) | 15 (10.0) |
| Yes, for other reasons | 0 (0) | 4 (4.1) | 4 (2.7) |
| Yes, no reason is provided | 6 (11.3) | 9 (9.3) | 15 (10.0) |
| No, the authors do not acknowledge the existence of confounders not included in the analysis | 43 (81.1) | 74 (76.3) | 117 (78.0) |
| Acknowledge measurement error as a potential source of confounding? | |||
| Yes | 4 (7.5) | 11 (11.3) | 15 (10.0) |
| No | 49 (92.5) | 86 (88.7) | 135 (90.0) |
| Described likelihood of residual confounding affecting results? | |||
| Likely | 1 (1.9) | 2 (2.1) | 3 (2.0) |
| Possible | 23 (43.4) | 62 (63.9) | 85 (56.7) |
| Unlikely | 6 (11.3) | 1 (1.0) | 7 (4.7) |
| Not possible | 0 (0) | 0 (0) | 0 (0) |
| Not discussed | 23 (43.4) | 32 (33.0) | 55 (36.7) |
| Report a causal link between exposure and outcome? | |||
| Yes | 6 (11.3) | 6 (6.2) | 12 (8.0) |
| No | 47 (88.7) | 91 (93.8) | 138 (92.0) |
1Values are n (%).
2Categories are not mutually exclusive.