| Literature DB >> 31506063 |
John Ferguson1, Neil O'Leary2, Fabrizio Maturo2, Salim Yusuf3, Martin O'Donnell2.
Abstract
BACKGROUND: Population attributable fractions (PAF) measure the proportion of disease prevalence that would be avoided in a hypothetical population, similar to the population of interest, but where a particular risk factor is eliminated. They are extensively used in epidemiology to quantify and compare disease burden due to various risk factors, and directly influence public policy regarding possible health interventions. In contrast to individual specific metrics such as relative risks and odds ratios, attributable fractions depend jointly on both risk factor prevalence and relative risk. The relative contributions of these two components is important, and usually needs to be presented in summary tables that are presented together with the attributable fraction calculation. However, representing PAF in an accessible graphical format, that captures both prevalence and relative risk, may assist interpretation. <br> METHODS: Taylor-series approximations to PAF in terms of risk factor prevalence and log-odds ratio are derived that facilitate simultaneous representation of PAF, risk factor prevalence and risk-factor/disease log-odds ratios on a single co-ordinate axis. Methods are developed for binary, multi-category and continuous exposure variables. <br> RESULTS: The methods are demonstrated using INTERSTROKE, a large international case control dataset focused on risk factors for stroke. <br> CONCLUSIONS: The described methods could be used as a complement to tables summarizing prevalence, odds ratios and PAF, and may convey the same information in a more intuitive and visually appealing manner. The suggested nomogram can also be used to visually estimate the effects of health interventions which only partially reduce risk factor prevalence. Finally, in the binary risk factor case, the approximations can also be used to quickly convert logistic regression coefficients for a risk factor into approximate PAFs.Entities:
Mesh:
Substances:
Year: 2019 PMID: 31506063 PMCID: PMC6737608 DOI: 10.1186/s12874-019-0827-4
Source DB: PubMed Journal: BMC Med Res Methodol ISSN: 1471-2288 Impact factor: 4.615
Definitions, assumptions and approximations for PAF when the exposure is binary, multi-category and logistic
| Binary | Multicategory | Continuous | |
|---|---|---|---|
| Counterfactual definition of PAF |
|
|
|
| Assumptions: | 1. Standard causal inference assumptions • Conditional exchangeability (counterfactual outcome • Consistency of counterfactuals: • Positivity 0 < 2. No interactions ( 3. Rare disease assumption (P(Y = 1) small) | ||
| Re-expression of PAF (given assumptions 1. and 2.) | |||
aCorresponding logistic model (Given assumption 3.) | = | ||
Logistic Approximation for PAF (Given assumptions 1,2 and 3) |
|
| |
| Graphical Approximation |
|
| |
| “Average” estimated log-odds ratio: |
|
|
|
*Here β0 = 0 by definition for the Binary and Multicategory exposures and β(j0) = 0 for continuous exposures. Estimates could be found via generalized additive models with a logistic link, where the confounders and possibly the exposure are modelled non-parametrically
**Note that RR = P(Y = 1| A = j, C = c)/P(Y = 1| A = 0, C = c) and RR(j) = P(Y = 1| A = j, C = c)/P(Y = 1| A = j0, C = c)
***f(j| 1) is the conditional density of A when Y = 1; similarly f(j| 0) is the conditional density of A when Y = 0
****Note that when A is continuous, the probability of a non-reference level of the exposure: is 1
Illustration of the approximations on the INTERSTROKE dataset. For binary risk factors, , for multicategory risk factors is a kind of weighted average log odds ratio summarizing the increase in risk of non-reference levels of the risk factor compared to the reference level. Confidence intervals for exact PAF are given at 99% level and calculated using Bootstrap
| Risk factor |
|
| prevalence exposure in controls | Approximate | Exact calculation |
|---|---|---|---|---|---|
| High blood pressure (Y/N) | 1.093 | 2.98 | 47.4% | 51.8% | 47.9% (45.1–50.6) |
| Lack of physical activity | 0.501 | 1.65 | 83.7% | 41.9% | 35.5% (27.7–44.7) |
| ApoA, ApoB ratio (in tertiles) | 0.428 | 1.53 | 66.9% | 28.6% | 26.9% (22.2–31.9) |
| Diet score (in tertiles) | 0.378 | 1.46 | 67.0% | 25.3% | 23.0% (18.2–28.9) |
| Waist hip ratio (in tertiles) | 0.294 | 1.34 | 67.0% | 19.7% | 18.8% (13.3–25.3) |
| Smoking (Y/N) | 0.513 | 1.67 | 22.4% | 11.5% | 12.4% (10.2–14.9) |
| Cardiac causes (Y/N) | 1.156 | 3.18 | 4.9% | 5.7% | 9.1% (8.0–10.2) |
| Frequency of alcohol consumption (3 levels) | 0.186 | 1.20 | 27.7% | 5.2% | 5.9% (3.4–9.7) |
| Global stress (Y/N) | 0.301 | 1.35 | 14.4% | 4.3% | 5.0% (2.6–7.3) |
| Diabetes (Y/N) | 0.148 | 1.16 | 12.9% | 1.9% | 2.4% (0.1–4.9) |
Fig. 1Graphical representation of estimates of approximate PAF (left-hand y-axis), prevalence (x-axis) and odds ratios (right-hand y-axis) for 10 risk factors from the INTERSTROKE dataset. Approximate PAF is represented both by the slope of the black dashed line, and also the left y-axis intercept of the same line. Prevalence and Odds Ratio information is displayed as on a usual scatterplot (although prevalence decreases from left to right). Risk factors are ranked 1–10 according to approximate PAF
Fig. 2Attributable fraction nomogram displaying estimates of risk factor prevalence, average Odds Ratio and approximate PAF for the 10 INTERSTROKE risk factors. The prevalence, OR and approximate PAF for a particular risk factor are identified on the same line. Again risk factors are ranked according to approximate PAF, recorded on the right-most axis
Fig. 3a/b Alternative formatting of attributable fraction nomogram displaying estimates of risk factor prevalence, average Odds Ratio and approximate PAF for the 10 INTERSTROKE risk factors. The prevalence, OR and approximate PAF for a particular risk factor are identified on the same line. Here the left hand axis records estimated average odds ratios and the middle axis records estimated prevalence. Differing interventions that might reduce risk factor prevalence might be compared by rotating the line for a given risk factor using the left axis intercept as a pivot. For example, in the bottom pane, the difference in approximate PAF between the red dashed line and the solid line estimates the % reduction in the prevalence of stroke from an intervention that halved the prevalence of hypertension. The blue dashed line estimates the PAF for smoking in China (where the prevalence of smoking is higher than the global average)
Fig. 4Absolute and relative bias from approximations as functions of the estimated prevalence and estimated odds ratios of the risk factor. Babs is defined as PAFa -PAF, with PAF being the usual estimate of PAF and PAFa the estimated approximate PAF defined in this manuscript. Br is defined as PAFa/PAF