| Literature DB >> 35027007 |
Rachel Visontay1,2, Matthew Sunderland3, Tim Slade3, Jack Wilson3, Louise Mewton3,4.
Abstract
BACKGROUND: Research has long found 'J-shaped' relationships between alcohol consumption and certain health outcomes, indicating a protective effect of moderate consumption. However, methodological limitations in most studies hinder causal inference. This review aimed to identify all observational studies employing improved approaches to mitigate confounding in characterizing alcohol-long-term health relationships, and to qualitatively synthesize their findings.Entities:
Keywords: Alcohol abstinence; Alcohol drinking; Causality; Protective factors; Risk factors; Systematic review
Mesh:
Year: 2022 PMID: 35027007 PMCID: PMC8759175 DOI: 10.1186/s12874-021-01486-5
Source DB: PubMed Journal: BMC Med Res Methodol ISSN: 1471-2288 Impact factor: 4.615
Fig. 1Exemplar functional forms between alcohol consumption and health outcomes. Legend: A = J-shaped; B=U-shaped; C = reverse J-shaped; D = positive linear; E = negative linear; F = monotonically increasing; G = monotonically decreasing
Methods to enhance causal inference in observational research
| Method | Relevant sub-methods | Description | Can address: | Advantages | Limitations | |||
|---|---|---|---|---|---|---|---|---|
| Confounding | Reverse causality | Selection bias | Measurement error | |||||
Propensity scores (PS) [ | Covariate balancing propensity scores (CBPS) | -The PS is a single value reflecting the probability of exposure for an individual given their values on all relevant covariates -PS generation occurs as a data ‘pre-processing’ step prior to main analysis -Usually generated via logistic regression -Once generated, the PS can be used for matching, stratification, weighting, or as a covariate for adjustment in regression | ✓ | -Unlike standard methods, can handle large numbers of covariates -Not reliant on correctly modelling covariate- outcome relationships -Covariate balance after matching/weighting can be assessed | -Still relies on appropriate choice of covariates and accurate measurement -PS matching may not find matches for some treatment cases, leading to reduced sample size and limiting generalizability -Effect estimation with PS doesn’t always perform better than regular adjustment -Most PS methods rely on manual covariate balance checking and refitting | |||
G-methods [ | -A family of methods intended for use with time-dependent variables -Developed as a solution to the problem of time-varying covariates affected by past exposure, including those that act as both confounders and mediators over time -The three G-methods are the G-formula, marginal structural models, and G-estimation, each relying on its own modelling assumptions | -Unlike standard methods to control for confounding, G-methods do not fix values of covariates, thus do not block mediation via the covariate, and avoid introducing collider bias -Accounting for changes in variables over time mitigates misclassification | -Still relies on appropriate choice of covariates and accurate measurement | |||||
G-formula (aka G-computation or G-standardization) [ | -First models relationship given observed data (using actual exposure for each individual), and then predicts outcomes under counterfactual exposures, with the difference taken as the causal effect -Is a generalization of standardization (conditioning on covariates and then marginalizing) that accounts for dynamic variables by considering covariate distribution over follow-up time | ✓ | ✓ (specifically, bias due to censoring) | ✓ | -Can be used to calculate risk ratios and risk differences -Well-suited to assess time-varying exposures | -Vulnerable to the ‘g-null paradox’- null hypotheses tend to be rejected in large studies even when true -Usually requires specifying a statistical model, hence also being known as the ‘parametric’ G-formula | ||
Marginal structural models (MSMs) [ | -Use weights based on inverse probability of exposure at each time point to create a pseudo-population where each combination of covariates is equally present in each exposure condition -Using these weights, MSMs then estimate the causal effect -The most popular of the G-methods | ✓ | ✓ (specifically, bias due to censoring) | ✓ | -Simplest of the G-methods to understand and implement -Can also integrate censoring weights to account for differential attrition | -Not ideal for assessing exposure-confounder interactions, with standard MSMs unable to estimate interactions involving dynamic variables -Requires checking weight distribution, may require refitting (as with PS methods) -Cannot be used if all participants are exposed/unexposed on a particular level of a confounder | ||
G-estimation of structural nested models [ | -At each wave assesses the relationship between exposure and likelihood of outcome given covariates, adjusting for exposure and covariate values from past waves, thus accounting for dynamic confounders affected by past exposure -Considered semi-parametric in that mean counterfactual outcomes under no exposure are unspecified | ✓ | ✓ | -Can be used even if all participants are exposed/ unexposed on a particular level of a confounder -Can be used to assess exposure-confounder interactions | -Unlike the other G-methods, cannot account for selection bias arising from censoring, so data requires preliminary weighting to account for bias from censoring | |||
Doubly robust methods [ | Targeted maximum likelihood estimation; Augmented inverse probability weighting | -Incorporates both an estimation of the outcome mechanism (as in regression adjustment) and the exposure mechanism (as in propensity scores) | ✓ | -Only the outcome mechanism or the exposure mechanism need be consistently estimated to generate an unbiased estimate of exposure effect | -Still relies on appropriate choice of covariates and accurate measurement | |||
| Fixed effects regression [ | -A technique developed in the econometrics literature for use with longitudinal data with repeat outcome measurements, only using information on within-subject variation, thus controlling for all time-invariant sources of confounding -Treats time-invariant characteristics that differ between individuals as fixed parameters (unlike in mixed models), allowing estimation of parameters of interest net of stable confounders -Each participant serves as own control | ✓ | -Removes the threat of all observed and unobserved time-invariant confounding -Models can be extended to include time-varying covariates | -Cannot overcome time-varying confounding without extending the model, and these variables must be observed/measured -Individuals with stable exposure values do not contribute to estimates; also leads to imprecision when exposures change little over time -Reducing confounding comes at the cost of more sampling variability -Cannot generate parameter estimates for stable characteristics like race | ||||
| Causal mediation analysis [ | -Integrates traditional mediation analysis (which separately estimates total effect of exposure on outcome, indirect effect via mediators, and direct effect unexplained by mediators) with the potential outcomes framework to allow for exposure-mediator interaction and non-linear relationships (i.e., is a non-parametric method) -Uses the concepts of ‘controlled direct effect’, ‘natural direct effect’, and ‘natural indirect effect’ -Makes explicit underlying assumptions related to unmeasured confounding, and encourages sensitivity analyses to test robustness to assumption violations | ✓ (see advantages and limitations*) | -Effect decomposition is still possible given exposure-mediator interaction, nonlinearity, and categorical variables -Makes underlying assumptions explicit -Helps identify the mechanism/s of an exposure’s effect; especially useful when heterogenous causal mechanisms at play -Can be extended to situations with multiple mediators -*If a variable completely mediates the exposure-outcome relationship and is shielded from confounders, confounder measurement is not needed | -*In practice, likely there will always be exposure-mediator or mediator-outcome confounding, so still need to observe and accurately measure covariates -Analyses make strong assumptions, necessitating sensitivity analyses | ||||
Natural experiments [ | -Mimic RCTs by exploiting exogenous events that are truly randomized/approximate random assignment -Differ from true experiments in that exposure is not assigned by the researcher -Assignment may be as a result of naturally occurring phenomena (e.g., a weather event), or of human intervention implemented for reasons other than the research question (e.g., army draft lottery) | -In approximating randomization, obviates the need for accurate measurement of confounders -Potential to overcome measurement error, reverse causation, and selection bias | -Rare to find truly random or as if-random exposure assignment | |||||
| Standard natural experiments | -Natural experiments where individuals are as-if/randomly assigned to exposure and control groups | ✓ | ✓ | ✓ | ✓ | -May be difficult to find a standard natural experiment that maps on well to the actual research question of interest | ||
| Instrumental variable analysis | -Assesses the relationship between an as-if/ randomly assigned -A valid instrumental variable must be associated with the exposure of interest, be independent of confounders of the exposure-outcome relationship, and should affect the outcome only via the exposure | ✓ | ✓ | ✓ | ✓ | -Useful when the exposure itself is difficult to manipulate or measure | -Difficult to find valid instrumental variables -Potential for weak instrument bias i.e., when the instrument explains a small amount of variance in exposure -Relies on assumption that the instrumental variable is not associated with exposure-outcome confounding | |
| Genetic instrumental variables | -Kind of instrumental variable analysis using genetic variants as proxies for exposure -The most prominent technique is Mendelian Randomisation | ✓ | ✓ | ✓ | ✓ | -Genes cannot be confounded by environment, cannot be subject to reverse causality, and are stable over time -Multiple variants can be combined to explain more variance in exposure, mitigating weak instrument bias | -Genetic instrumental variables are proxies for lifelong exposure - this period may be longer than what the research question is interested in -Potential for weak instrument bias, pleiotropy (gene affecting more than one phenotype) and linkage disequilibrium (genes more likely to be inherited together) -Possible population stratification (there may be population subgroups with different distributions of genes) | |
| Quasi-experiments [ | -Like natural experiments, exploit exogenous events to assess relationships between exposures and outcomes, but lack random or as-if random assignment | ✓ | ✓ | ✓ | -Same as for natural experiments | -Without random assignment, confounding is still possible i.e., cause of exposure may also contribute to outcome | ||
Family-based designs [ | Twin studies; Sibling comparison | -By comparing genetically related participants discordant for the exposure of interest, accounts for confounding from genetic or shared environmental sources | ✓ | -Controls for unmeasured/unobservable confounding (for shared covariates) -Comparing monozygotic and dizygotic twins enhances understanding of genetic vs. environmental confounding | -Still need to observe and accurately measure non-shared environmental covariates to control for this kind of confounding -May be difficult to find family members discordant for exposure of interest | |||
| Negative controls [ | Negative control exposures; Negative control outcomes | -Have the same confounding structures as the exposure-outcome relationship of interest, but lack a plausible causal mechanism -If association is greater for the relationship of interest than for the negative control, a causal relationship is likely; if not, suggests confounding/other shared biases responsible -May take the form of a negative control exposure or a negative control outcome | ✓ | ✓ (specifically, immortal time bias) | -Can identify when confounding (or assumed shared bias) is responsible for apparent causal effects -Does not require observation/measurement of covariates | -Relies on assumption of no plausible causal mechanism in the negative control relationship -Relies on assumption that same confounding structure shared by relationship of interest and negative control relationship | ||
Fig. 2PRISMA flow chart
Characteristics of included papers (n = 16)
| First author (year) | Health outcome/s and interval to follow-up | Study design | Participant characteristics | Cohort/s name | Sample size | Statistical methodology | Explicit testing for non-linearity? | Main results (statistically significant findings bolded) | Conclusion |
|---|---|---|---|---|---|---|---|---|---|
| Dickerman (2016) [ | Prostate cancer (PC) and prostate cancer mortality -median 30 yrs. of follow-up | Twin | Male twins; mean age 40.1 at baseline | Older Finnish Twin Cohort | 11,372; 225 and 43 discordant twin pairs for PC and PC-mortality respectively | -Co-twin (discordant for both alcohol consumption level and either time to diagnosis among outcome-concordant pairs concordant or time to event vs death/end of follow-up among outcome-discordant pairs) and pooled cohort Cox analyses to examine risk for PC and PC-mortality -Alcohol consumption measured twice (6 years apart) and averaged; categories: abstainers, | ϰ | -PC-risk: HR (95% CI) Cohort MZ twins DZ twins All twins Abstainers 1.27 (.94,1.71) 2.85 (.67,12.1) Moderate 1.2 (.99,1.46) 1.28 (.6,2.74) 1.54 (0.92,2.57) 1.36 (.91,2.04) Heavy -PC-mortality: HR (95% CI) Cohort MZ twins DZ twins All twins Abstainers Moderate 1.22 (.76,1.97) 9.13 (.70,119) 1.43 (.37,5.62) 2.44 (.79, 7.52) Heavy 1.32 (.66,2.62) - 2.39 (.33,17.3) 7.31 (1.3, 41) | |
| Carlsson (2003) [ | Type 2 diabetes (T2D) −20 yrs. of follow-up | Twin | Same-sex twins without diabetes at baseline; mean age 34.3 (M) and 35.4 (F) at baseline | Finnish Twin Cohort | 22,788; 27 discordant twin pairs analysed | -Co-twin (discordant for alcohol category) and pooled cohort Cox analyses to examine risk for T2D -Categories for pooled cohort (based on 3 questionnaires over 15 years): abstainers, | ✓ | RR (95% CI) Cohort Men Cohort Women OR (95% CI) Twin Abstainers 1.1 (.7,1.5) 1.1 (.9,1.5) Moderate .5 (.2,1.3) 5–29.9 g/day (m); 5–19.9 g/day (f) .8 (.6,1.1) .7 (.4,1.1) High 1.2 (.4,3.9) ≥ 30 g/day (m); ≥ 20 g/day (f) .9 (.6,1.4) 1.6 (.8,3.5) | |
Peng (2019) [ *See also cardiovascular outcomes | Diabetes-related biomarkers (FBG, P2hBG, HbA1c, HOMA-IR, HOMA-beta) -cross-sectional | MR | Chinese adults living in the Yi-Ling district of Yichang; mean age 55 (SD 0.1) at baseline | One community from the Risk Evaluation of cAncers in Chinese diabeTic Individuals: a LONgitudinal (REACTION) study | 4536 | −1-sample MR using a single variant and standard IV analysis (2SLS); local average treatment effects (LATEs) computed for subgroups of observed alcohol consumption (non-zero LATE slopes indicate non-linearity) -Only standard, linear IV analysis was performed for categorical diabetes risk, not allowing for detection of non-linearity -Analysis performed in women as a negative control due to their lack of alcohol consumption -No conventional analyses conducted for comparison | ✓ | -Non-linear analyses: the LATE slopes for all diabetes-related biomarkers were not sig. Different to zero, indicating no non-linear relationships Unstandardizedβ (95% CI) Using log-transformed alcohol intake FBG (log-transformed) -.01 (−.04,.01) P2hBG (log-transformed) .01 (−.04,.06) HbA1c (log-transformed) -.01 (−.03,.00) HOMA-IR (log-transformed) -.00 (−.09,.08) HOMA-beta (log-transformed) .03 (−.04, .11) -Standard IV analyses: Unstandardizedβ (95% CI)Per 1-unit increase in log-transformed genetically-predicted alcohol consumption FBG (log-transformed) P2hBG(log-transformed) HbA1c (log-transformed).00 (−.01,.01) HOMA-IR(log-transformed) HOMA-beta (log-transformed).00 (−.05,.06) | |
| Handing (2015) [ | Dementia −43 yrs. of follow-up | Twin | Twins without dementia prior to baseline; <=65 at baseline and > =60 at study end; mean age 54.2 (SD 5.9) at baseline; reported drinking <=100 g/day of alcohol | Swedish Twin Registry | 12,362; 576 dementia discordant pairs (177 MZ); 396 concordant pairs (160 MZ) | -Co-twin (discordant for dementia) logistic regressions and pooled cohort Cox models used to examine risk for dementia; co-twin (concordant for dementia) mixed-effects analyses used to examine age at onset -Categories for pooled cohort + twin age of onset analyses: abstainers, | ✓ | -Dementia risk: HR (95% CI) Cohort OR (95% CI) MZ twins All twins Abstainers 1.05 (1.00,1.11) Abstainers 1.37 (.6,3.16) 1.39 (.89,2.16) Moderate .98 (.92–1.04) Moderate-to-very heavy Heavy Very heavy -Dementia age at onset: Mean difference in years between twin diagnosed first vs twin diagnosed later ( Abstainers − 5.37 (.83) -5.49 (.68) Light −6.28 (NA – reference group) -6.79 (NA – reference group) Moderate − 5.41 (.13) -7.00 (.98) Heavy − 6.33 (.99) -6.73 (.32) Very heavy − 12.67 (.09) | |
| Gemes (2019) [ | Depression − 7-9 yrs. of follow-up | MSM | General population aged 20–64 at recruitment; mean age 43.3 (SD 12.2) at baseline | Psykisk Hälsa–Arbete–Relationer (PART) cohort (Sweden) | 5087 | -MSM (weighted logistic regression) + standard logistic regression for comparison -Alcohol consumption measured pre-baseline (for use as time-variant confounder) and baseline; categories: no consumption, | ✓ | -Adjusting for baseline MDI score: RR (95% CI) Non-MSM MSM Abstainers 1.10 (.69,1.74) Moderate .54 (.28,1.04) 1.05 (.83,1.32) Excessive .61 (.21,3.07) -Excluding those with baseline depression (MDI > 26): RR (95% CI) Non-MSM MSM Abstainers 1.24 (.72,2.14) Moderate .63 (.29,1.41) .75 (.55,1.03) Excessive 2.72 (.61,12.19) | |
| Samuelsson 2013 [ | Disability pension (DP) due to mental health diagnoses (MHD) -median 10 yrs. of follow-up from prior study (baseline) | Twin | Twins with data from a prior study, and at time of that study were living in Sweden, < 65, and without DP/old age pension; mean age 52.9 (SD 5.6) | Swedish Twin Study of Disability Pension and Sickness Absence (STODS), drawn from the Swedish Twin Registry | 28,613; 229 DP-discordant twin pairs (95 MZ pairs) | -Co-twin (discordant for dementia) Cox regressions + pooled cohort Cox regressions performed -Differentiated between frequent and infrequent drinkers (have/not consumed alcohol in previous two months); categories for pooled cohort + twin analyses: abstainers, | ✓ | HR (95% CI) Cohort MZ twins DZ twins All twins Abstainers Moderate frequent 1.07 (.78,1.49) .46 (.11,1.87) 2.70 (.81–9.02) 1.24 (.53,2.91) Heavy frequent .98 (.61,1.54) - .46 (.09,2.36) .48 (.12,1.90) Light infrequent 1.09 (.69,1.73) 2.80 (.24,32.6) 3.98 (.71,22.4) 3.67 (.91,14.8) Moderate infrequent 1.18 (.91,1.54) .52 (.18,1.49) 1.71 (.75,3.91) 1.03 (.55,1.93) Heavy infrequent 1.20 (.92,1.57) 1.09 (.33,3.53) | |
| Ilomaki (2011) [ | Myocardial infarction (MI) − 12-14 yrs. of follow-up | MSM | General population males; mean age 52 (SD 6.7) at initial exam (4 yrs. before ‘baseline’) | Kuopio Ischaemic Heart Disease Risk Factor Study (KIHD); Finland | 1030 | −5 discrete-time hazard models run and compared: M1 (baseline alcohol consumption with no covariate adjustment or inclusion of t-4 consumption), M2 (baseline alcohol consumption adjusted for covariates and t-4 consumption), M3 (baseline and t + 7 alcohol consumption with no covariate adjustment or inclusion of t-4 consumption), M4 (baseline and t + 7 alcohol consumption adjusted for covariates and t-4 consumption), M5 (MSM with baseline and t + 7 alcohol consumption with stabilized IP weights at t and t + 7) -Baseline alcohol categories: < 12 g/wk., | ✓ | RR (95% CI) M1 M2 M3 M4 M5 (MSM) < 12 g/wk. 1.20 (.86,1.67) 1.01 (.70,1.45) 84-167 g/wk. 1.05 (.71,1.56) 1.13 (.75,1.72) 1.20 (.78,1.84) 1.27 (.81,2.00) 1.18 (.75,1.87) ≥168 g/wk. .98 (.60,1.58) 1.20 (.68,2.12) 1.40 (.91,2.18) | |
| Kadlecova (2015) [ | Stroke and transient ischaemic attack (TIA) events − 43 yrs. of follow-up | Twin | Same-sex twins ≤60 with no history of stroke at baseline and with ≥5 yrs. of follow-up; mean age 50.5 (SD 5.29) at baseline | Swedish Twin Registry | 11,644; 370 stroke/TIA discordant pairs (all MZ); 167 stroke/TIA concordant pairs (all MZ) | -Co-twin (discordant for stroke) logistic regressions and pooled cohort Cox models used to examine risk for stroke/TIA; co-twin (concordant for stroke) mixed-effects analyses used to examine time to stroke/TIA -Categories for pooled cohort + twin analyses: abstainers, | ✓ | -Stroke/TIA risk: HR (95% CI) Cohort OR ( Abstainers 1.11 (.98,1.23) Abstainers 2.22 (.058) Light .98 (.85,1.15) Light 1.56 (.17) Moderate .99 (.85,1.15) Moderate 1.59 (.26) Heavy -Time to stroke/TIA: Mean difference in yrs. to stroke/TIA ( Abstainers 2.07 (.11) Light .77 (.54) Moderate 1.15 (.47) Heavy | |
| Millwood 2019 [ | Ischaemic stroke, intracerebral haemorrhage (ICH), total stroke, acute myocardial infarction (AMI), total coronary heart disease (CHD) -roughly 10 yrs. of follow-up | MR | Permanent residents from 10 Chinese regions aged roughly 35–74 and without major disabilities, (those with a history of CVD were excluded from analyses of disease incidence); mean age 52 (SD 11) at baseline | China Kadoorie Biobank | 512,715; 161,498 of which had genotype data (male and female combined) | -1-sample MR using Cox models -Instrument composed of 9 combinations of 2 SNPs (ALDH2 rs671 and ADH1B rs1229984) within each of 10 geographic areas, producing 90 combinations overall; then, based on mean ‘usual’ alcohol consumption within each of those combinations (incorporating repeat measurements to account for measurement error), six final categories were produced, aligned with increasing genetically-predicted alcohol consumption: -Conventional analyses using observed alcohol consumption conducted for comparison using Cox models, with consumption categories (for men): ex-drinker, non-drinker, occasional drinker (less than weekly), | ✓ | -Ischaemic stroke: Log RR (95% CI) MR Conventional C2 1.00 (.91,1.10) Ex-drinker C3 1.03 (.96,1.11) Non-drinker C4 C5 C6 ≥420 g/wk. Per 280 g/wk. (assuming linearity) -ICH: Log RR (95% CI) MR Conventional C2 1.01 (.88,1.14) Ex-drinker C3 1.02 (.88,1.14) Non-drinker C4 1.08 (.96,1.22) Occasional 1.02 (.96,1.09) C5 C6 ≥420 g/wk. Per 280 g/wk. (assuming linearity) -Total stroke: Log RR (95% CI) MR Conventional C2 1.02 (.95,1.10) Ex-drinker C3 1.05 (.95,1.10) Non-drinker C4 C5 C6 ≥420 g/wk. Per 280 g/wk. (assuming linearity) -AMI: Log RR (95% CI) MR Conventional C2 1.02 (.87,1.19) Ex-drinker C3 1.05 (.93,1.19) Non-drinker C4 .93 (.81,1.07) Occasional C5 .94 9.81,1.09) 140-279 g/wk 1.11 (.97,1.26) C6 .97 (.83,1.15) 280-419 g/wk ≥420 g/wk 1.14 (.95,1.37) -Total CHD: Log RR (95% CI) MR Conventional C2 1.03 (.93,1.13) Ex-drinker C3 C4 .94 (.87,1.02) Occasional C5 1.04 (.97,1.11) 140-279 g/wk 1.03 (.97,1.09) C6 1.06 (.98,1.15) 280-419 g/wk ≥420 g/wk | |
Ropponen 2014 [ *See also musculoskeletal health | DP due to circulatory system diagnoses −5 − 10 years of follow-up | Twin | Twins with data from a prior study, and at time of that study were living in Sweden, < 65, working and without DP/old age pension; mean age at baseline 53.7 (SD 5.7) a | Swedish Twin Registry | 31,206; 216 DP due to circulatory system diagnoses-discordant pairs (of which 95 are MZ) | -Co-twin (discordant for DP due to MSD) Cox models and pooled cohort Cox models used to examine risk (stratified for sex in pooled model) -Categories for pooled cohort + twin analyses: abstainers, | ✓ | HR (95% CI) Cohort MZ twins DZ twins All twins Abstainers 1.22 (.91,1.64) .97 (.31,3.01) 1.55 (.70,3.44) 1.3 (.69,2.45) Moderate .85 (.63,1.15) - .86 (.39,1.91) 1.54 (.78,3.04) Heavy | |
Peng (2019) 28 *See also diabetic outcomes | Lipids: HDL-C, non-HDL-C, triglycerides (TG), total cholesterol (TC); blood pressure: systolic blood pressure (SBP), diastolic blood pressure (DBP); obesity anthropometric measures: BMI, waist circumference (WC), hip circumference (HC), waist-to-hip ratio (WHR) -cross-sectional | MR | Chinese adults living in the Yi-Ling district of Yichang; mean age 55 (SD 0.1) at baseline | One community selected from the Risk Evaluation of cAncers in Chinese diabeTic Individuals: a LONgitudinal (REACTION) study | 4536 | −1-sample MR using ALDH2 to instrument for alcohol consumption with standard IV analysis (2SLS); local average treatment effects (LATEs) computed for subgroups of observed alcohol consumption (non-zero LATE slopes indicate non-linearity) -Analysis performed in women as a negative control due to their lack of alcohol consumption -No conventional analyses conducted for comparison | ✓ | -Non-linear analyses: the LATE slopes for all lipids, blood pressure and obesity parameters in both sexes were not sig. Different to zero, indicating no presence of non-linear relationships Unstandardizedβ (95% CI) Using log-transformed alcohol intake HDL-C −.01 (−.06,.04) Non-HDL-C −.01 (−.13,.10) TG (log-transformed) .03 (−.05,.10) TC −.03 (−.16,.08) SBP −.85 (− 3.15,1.3) DBP −.70 (− 2.32,.71) BMI −.15 (−.51,.21) WC −.02 (− 1.09,1.09) HC .34 (−.37,1.07) WHR −.00 (−.01,.00) -Standard IV analyses: Unstandardizedβ (95% CI) Per 1-unit increase in log-transformed genetically-predicted alcohol consumption HDL-C .04 (−.00,.08) Non-HDL-C TG (log-transformed) TC SBP DBP BMI WC HC WHR .02 (.00,.02) | |
| Silverwood (2014) [ | Lipids: non-HDL-C, HDL-C, TG; blood pressure: SBP; obesity anthropometric measures: BMI, WC; inflammatory markers: CRP, interleukin 6 (IL-6) -cross-sectional | MR | Individuals of European descent from Europe and North America; mean age 56.75 (calculated from Holmes et al. [cite]) | 22 individual studies (18 cohorts, 2 nested case-control, 1 RCT, 1 case-control) from the Alcohol-ADH1B consortium | 80,057 individuals total; 78,172 for SBP, 60,140 for non-HDL-C, 60,227 for HDL-C, 79,454 for BMI, 57,172 for WC, 63,367 for CRP, 23,535 for IL-6, 63,667 for TG | -1-sample MR using the rs1229984 polymorphism in ADH1B to instrument for alcohol consumption and standard IV analysis (2SLS); local average treatment effects (LATEs) computed for subgroups of observed alcohol consumption (non-zero LATE slopes indicate non-linearity) -Where non-linearity is present, the difference in outcome between no alcohol and median observed consumption in low (> 0–7 units/wk), moderate (7–21 units/wk), heavy (21–70 units/wk) and very heavy (> 70 units/wk) groups is predicted, as well as curve nadir, difference in outcome between nadir and abstinence, and level of consumption matching outcome for abstinence -No conventional analyses conducted for comparison | ✓ | -Non-linear analyses: the LATE slopes for SBP, non-HDL-C, BMI, WC and CRP were all sig. Different to zero, indicating non-linearity Unstandardizedβ (95% CI) Using log-transformed alcohol intake HDL-C .00 (−.06,.06) Non-HDL-C TG (log-transformed) -.02 (−.10,.06) SBP BMI WC CRP (log-transformed) IL-6 (log-transformed) -.13 (−.34,.29) -Predicted differences in outcome between category medians and abstinence (unstandardized): 3.04 units/wk. 12.15 units/wk. 31.90 units/wk. 84.52 units/wk. Non-HDL-C −.39 (−.79,.06) -.15 (−.72,.47) .40 (−.28,1.10) SBP .1 (− 5.5,6.1) 5.2 (− 2.6,13.9) BMI −.6 (− 2.2,.8) .2 (− 2.0,2.1) 1.6 (−.8,3.8) WC −.6 (− 4.7,3.5) 1.9 (− 3.9,7.8) 5.7(−.6,12.5) CRP (log-transformed) -.29 (−.68,.15) -.15 (−.68,.5) .22 (−.37,.95) -Predicted curve features for those outcomes with evidence of non-linearity (unstandardized): Nadir (units/wk.; 95% CI) Difference in outcome at nadir Units/wk. (95% CI) with outcome equivalent to abstinence Non-HDL-C SBP 1.00 (.0,3.6) −.7(− 5.4,.0) 2.8 (.0,19.6) BMI 2.3 (.0,6.0) −.6 (− 2.3,.0) 10.1 (.0,48.4) WC 1.5 (.0,5.4) −.8 (− 4.9,.0) 5.3 (.0,37.4) CRP 3.5 (.0,7.2) −.30 (−.75,.00) 19.4 (.0,66.0) -Standard IV analyses for those outcomes with no evidence of non-linearity: β (95% CI) Per 1-unit increase in log-transformed genetically-predicted alcohol consumption HDL-C −.02 (−.07,.03) TG (log-transformed) −.01 (−.06,.07) IL-6 (log-transformed) | |
| Vu (2016) [ | Lipids: TG, total cholesterol, HDL-C, HDL2-C, HDL3-C, LDL-C, sdLDL-C, apoB, Lp(a) -cross-sectional | MR | European Americans; mean age 54.3 (SD 5.7) at baseline | Atherosclerosis Risk in Communities (ARIC); USA | 10,893 individuals total; 9911 for TG, 9751 for total cholesterol and LDL-C, 10,132 for HDL-C, 10,120 for HDL2-C and HDL3-C, 8102 for sdLDL-C, 7663 for apoB, 9924 for Lp(a) | − 1-sample MR using a genetic risk score composed of 5 SNPs (rs2066702, rs1693457, rs1789891, rs698, and rs1126671) and standard IV analysis (2SLS); model fitted separately for quartiles of genetically-predicted alcohol consumption ( -Conventional analyses regressing outcomes on observed alcohol consumption were also conducted, using consumption categories: | ✓ | -TG (log-transformed): Unstandardized β (95% CI) MR Using log-transformed alcohol intake Conventional Q2– Q3– Q4– -TC: Unstandardized β (95% CI) MR Using log-transformed alcohol intake Conventional Q2– Q3– Q4–4.56 (− 11.36,2.25) Heavy -HDL-C (log-transformed): Unstandardized β (95% CI) MR Using log-transformed alcohol intake Conventional Q2 .01 (−.01,.03) Former/ infrequent Q3 .04 (.00,.07) Low-to-moderate Q4 .03 (−.02,.07) Heavy -HDL2-C (log-transformed): Unstandardized β (95% CI) MR Using log-transformed alcohol intake Conventional Q2 Q3 Q4 .06 (−.03,.15) Heavy -HDL3-C: Unstandardized β (95% CI) MR Conventional Q2–.19(−.87,.49) Former/ infrequent .40 (−.14,.94) Q3 .08 (− 1.23,1.38) Low-to-moderate Q4 .11 (− 1.51,1.74) Heavy -LDL-C: Unstandardized β (95% CI) MR Using log-transformed alcohol intake Conventional Q2 Q3 Q4 − 4.57 (− 11.11,1.96) Heavy -sdLDL-C (log-transformed): Unstandardized β (95% CI) MR Using log-transformed alcohol intake Conventional Q2 Q3 Q4 −.08 (−.17,.01) Heavy .01 (−.04,.06) -apoB (log-transformed): Unstandardized β (95% CI) MR Using log-transformed alcohol intake Conventional Q2 Q3 −.04 (−.07,.00) Low-to-moderate −.01 (−.03,.01) Q4 −.04 (−.08,.01) Heavy −.02 (−.04,.01) -Lp(a) (log-transformed): Unstandardized β (95% CI) MR Using log-transformed alcohol intake Conventional Q2 −.02 (−.09,.06) Former/ infrequent −.03 (−.10,.03) Q3 −.05 (−.20,.10) Low-to-moderate .01 (−.06,.08) Q4 −.01 (−.20,.18) Heavy −.06 (−.16,.03) | |
| Sipila 2016 [ | All-cause mortality -median 30.2 yrs. of follow-up | Twin | Same-sex twins aged 18–54 at baseline and free of chronic disease 6 years post-baseline; mean age 35.9 at baseline | Older Finnish Twin Cohort | 14,787; 3389 drinking-discordant pairs (of which 926 pairs are MZ) a | -Co-twin (discordant for alcohol consumption) Cox models and pooled cohort Cox models used to examine risk for mortality -Categories for pooled cohort + twin analyses based on average of two measurements 6 yrs. apart: abstainers, | ϰ | HR (95% CI) Cohort MZ twins All twins 0 g/mnth 1.02 (.85,1.22) .43 (.17,1.11) .96 (.63,1.45) 70–139 g/mnth .95 (.81,1.10) .64 (.29,1.40) .96 (.68,1.36) 140–209 g/mnth 1.08 (.91,1.29) 1.12 (.47,2.65) .85 (.57,1.26) 210–419 g/mth 420–839 g/mnth 840–1199 g/mnth ≥1200 g/mnth | |
| Sander (2013) [ | HIV seroconversion -median 10.5 yrs. of follow-up | MSM | Men who have sex with men and who were sexually active and HIV-seronegative at baseline; median age 33.4 at baseline | Multicenter AIDS Cohort Study (MACS); USA | 3752 | -MSM (Cox models) + standard Cox models for comparison -Categories for analyses based on average of two measurements 1 yr apart: | ϰ | RR (95% CI) Non-MSM MSM Moderate .91 (.65,1.27) 1.10 (.78,1.54) Heavy 1.19 (.83,1.70) | |
| Pietikainen 2011 [ | DP due to low back disorders (LBD) − 29 years of follow-up | Twin | Same-sex twins aged 18–64 and not receiving pension at baseline; mean age 33.2 (SD 12) at baseline | Finnish Twin Cohort | 24,043; 504 (284 M) pairs discordant for DP due to LBD | -Co-twin (discordant for DP due to LBD) Cox models and pooled cohort Cox models used to examine risk for mortality -Categories for pooled cohort + twin analyses: abstainers, l | ϰ | HR (95% CI) Cohort All twins Abstainer .85 (.61,1.20) .79 (.49,1.27) Moderate 1.07 (.79,1.43) .94 (.62,1.42) Heavy 1.08 (.80,1.46) 1.07 (.69,1.66) | |
Ropponen 2014 [ *see also CVD section | DP due to musculoskeletal diagnoses (MSD) − 5-10 years of follow-up | Twin | Twins with data from a prior study, and at time of that study were living in Sweden, < 65, working and without DP/old age pension; mean age at baseline 53.7 (SD 5.7) b | Swedish Twin Registry | 31,206; 922 DP due to MSD-discordant pairs (of which 357 are MZ) | -Co-twin (discordant for DP due to MSD) Cox models and pooled cohort Cox models used to examine risk (stratified for sex in pooled model) -Categories for pooled cohort + twin analyses: abstainers, | ✓ | HR (95% CI) Cohort MZ twins DZ twins All twins Abstainers .93 (.80,1.07) 1.49 (.98,2.26) .8 (.54,1.19) 1.07 (.81,1.42) Moderate Heavy | |
| Ropponen 2011 [ | DP due to musculoskeletal disorder (MSD) and osteoarthritis specifically (OA) −29 years of follow-up | Twin | MZ and same-sex DZ twins aged ≥18 and working at baseline; mean age 33.2 (SD 12) at baseline | Finnish Twin Cohort | 24,043; 1317 pairs discordant for DP due to MSD, 461 pairs discordant for DP due to OA | -Co-twin (discordant for DP due to MSD/OA) Cox models and pooled cohort Cox models used to examine risk for MSD and OA in men and women separately -Categories for pooled cohort + twin analyses: | ϰ | -For DP due to MSD: HR (95% CI) Cohort (M) Cohort (F) All twins (M) All twins (F) Light .91 (.59,1.40) 1.15 (.93,1.42) Moderate .95 (.70,1.30) 1.23 (1.00,1.51) 1.50 (.94,2.38) .97 (.74,1.27) Heavy 1.01 (.73,1.39) 1.08 (.84,1.39) 1.58 (.99,2.53) 1.37 (.98,1.91) -For DP due to OA specifically: HR (95% CI) Cohort (M) Cohort (F) All twins (M) All twins (F) Light .99 (.50,1.96) 1.19 (.86,1.63) Moderate .78 (.48,1.28) .96 (.69, 1.32) 2.01 (.82,4.93) 1.12 (.70,1.82) Heavy 1.04 (.64,1.71) .86 (.60,1.25) 2.39 (.97,5.89) | |
a Ropponen et al. (2014) also used this cohort to examine the relationship between alcohol and disability pension due to mental health diagnoses – not reported on here as Samuelsson et al. used the more informative exposure categorization
b From correspondence with authors
Note: Where multiple models are reported in the original paper, the most adjusted analyses are reflected in the above table if (excluding for MR studies)
Triangulation of included studies with reviews of the broader observational literature
| Outcome | Study/ies first author (year) | Simplified summary of findings | Consistent with broader literature? | Details |
|---|---|---|---|---|
| Dickerman (2018) | -Reverse J-shaped relationship | No | A 2016 meta-analysis found a monotonically increasing relationship [ | |
| Sipila (2016) | -Monotonically increasing | Yes | A 2016 meta-analysis found a monotonically increasing relationship [ | |
| Peng (2016) | -Positive linear relationship | Yes | A 2017 review found results consistent with a positive linear relationship [ | |
| Peng (2016) | -Positive linear relationships for P2hBG (glucose tolerance) and HOMA-IR (insulin sensitivity) -Lack of relationship for HbA1c (glycated haemoglobin) and HOMA-beta (insulin sensitivity) | N/A | A 2017 review found that, for the other diabetes biomarkers included in this review, there was not enough/consistent evidence from which to draw conclusions [ | |
Silverwood (2014) Peng (2019) | -Non-linear relationship (small protective effect of light drinking) for SBP -Positive linear relationships for both SBP and DBP | No Yes | A 2018 meta-analysis found no protective effect for hypeternsion [ | |
Silverwood (2014) Peng (2019) | -Non-linear relationship (small protective effect of light drinking) for BMI and waist circumference -Positive linear relationships for BMI, waist circumference, and hip circumference, with no relationship for waist-to-hip ratio | Mixed Mixed | A 2011 review found mixed evidence on the relationship between alcohol and weight/measures of abdominal adiposity, with heavy drinking generally associated with increased values, but moderate consumption either associated with lower values or not associated at all (suggested that discrepancies may depend on type of alcohol consumed) [ | |
| Silverwood (2014) | -Non-linear relationship (small protective effect of light drinking) for CRP but a positive linear relationship for IL-6 | N/A | A 2021 review of studies from the previous decade found no relevant longitudinal observational studies that were capable of detecting non-linear relationships [ | |
Silverwood (2014) Peng (2019) Vu (2016) | -No relationship for HDL-C -No relationship for HDL-C -No relationship for HDL-C, or HDL-3C, but an inverted reverse J-shaped relationship for HDL-2C (a beneficial effect) | No No Mixed | A 2021 review of studies from the previous decade found just one longitudinal observational study, which reported a reverse J-shaped relationship in decreases in HDL-C from baseline [ | |
Silverwood (2014) Peng (2019) Vu (2016 | -No relationship for TG, but a non-linear relationship (small protective effect of light drinking) for non-HDL-C -Positive linear relationships for TG, non-HDL-C, and TC -Reverse J-shaped relationships for LDL-C, TG and TC, no relationship for Lp(a), and monotonically decreasing relationships for sdLDL-C and apoB | N/A | A 2021 review of studies from the previous decade found no relevant longitudinal observational studies [ | |
Ilomaki (2011) Millwood (2019) | -J-shaped relationship -No relationship | No | A 2017 review found alcohol was detrimental for myocarditis [ were found | |
Kadlecova (2015) Millwood (2019) | -Reverse J-shaped relationship for stroke and TIA -Monotonically increasing relationships for ischaemic stroke, intracerebral haemorrhage and total stroke | Mixed | A 2016 meta-analysis found J-shaped relationships for ischaemic stroke, but monotonically increasing relationships for intracerebral haemorrhage and subarachnoid haemorrhage [ | |
| Millwood (2019) | -No relationship | No | A 2016 meta-analysis found a U-shaped relationship [ | |
| Ropponen (2014) | -No clear functional form | N/A | Most reviews evaluate CVD sub-conditions, finding divergent functional forms | |
| Handing (2015) | -J-shaped relationship | Yes | A 2021 review found most recent evidence is consistent with a J-shaped relationship [ | |
Gemes (2019) Samuelsson (2013) | -U−/J-shaped relationship -Non-linear relationship with abstainers at increased risk over light frequent drinkers | Yes | A 2020 meta-analysis found a J-shaped relationship with depressive symptoms [ | |
| Sander (2013) | -Monotonically increasing relationship | N/A | There is a lack of dose-response information on alcohol’s relationship with HIV [ | |
Pietikainen (2011) Ropponen (2014) Ropponen (2011) | -Monotonically increasing relationship -No clear functional form -No clear functional form, but abstainers have lowest risk | N/A | No reviews on the relationship between alcohol and musculoskeletal disorders were found | |
Note: Risk for type 2 diabetes (Carlsson et al.) is excluded from this table as the analyses were critically underpowered, and thus the results are not appropriate for inclusion in the triangulation process. Findings from RCTs were not integrated into triangulation as they are almost exclusively conducted in short-term time frames, which is not of interest to this review