| Literature DB >> 31807416 |
John A Gallis1,2, Elizabeth L Turner1,2.
Abstract
Background: Binary outcomes-which have two distinct levels (e.g., disease yes/no)-are commonly collected in global health research. The relative association of an exposure (e.g., a treatment) and such an outcome can be quantified using a ratio measure such as a risk ratio or an odds ratio. Although the odds ratio is more frequently reported than the risk ratio, many researchers, policymakers, and the general public frequently interpret it as a risk ratio. This is particularly problematic when the outcome is common because the magnitude of association is larger on the odds ratio scale than the risk ratio scale. Some recently published global health studies included misinterpretation of the odds ratio, which we hypothesize is because statistical methods for risk ratio estimation are not well known in the global health research community.Entities:
Year: 2019 PMID: 31807416 PMCID: PMC6873895 DOI: 10.5334/aogh.2581
Source DB: PubMed Journal: Ann Glob Health ISSN: 2214-9996 Impact factor: 2.462
Example of two relative measures of association, adapted from results of a randomized controlled trial in febrile individuals published in BMJ Global Health.a
| Exposure group | Outcomeb | Relative Measure of Associationc | |||
|---|---|---|---|---|---|
| Tested for malaria | Did not test for malaria | “Risk” of malaria testing | Risk Ratio | Odds Ratio | |
| Intervention | 76 | 27 | 76/103 = 73.8% | 73.8/51.0 = 1.45 | |
| Control | 51 | 49 | 51/100 = 51.0% | ||
a RCT by O’Meara et al. (2016) [5] was a 2 × 2 factorial design of two interventions for febrile individuals. Here we have adapted the example to focus on one of those interventions, namely a subsidy for a rapid diagnostic test, where “intervention” denotes the group that received the subsidy and “control” denotes the group that did not receive the subsidy. Specifically, we have extracted outcome data from Table 2 of O’Meara et al. (2016) [5] for the two groups which did not receive the second intervention.
b Row counts correspond to the number of participants with each level of the outcome within each exposure group.
c For intervention group vs. control group, where we note that O’Meara et al. (2016) [5] reported neither of these results in their Table 2 because they instead reported absolute measures of effect.
Unadjusted measures of relative association from three articles in the global health literature.
| Exposure | Outcome | Unexposed group outcome proportion | Risk Ratioa | Odds Ratiob | Magnitude of odds ratio relative to risk ratioc | |
|---|---|---|---|---|---|---|
| 1 | Surviving Ebola virus [ | Safe sexual behavior | 14% | 2.71 | 3.67 | 35% |
| 2 | Point-of-care testing [ | Antibiotic use | 78% | 0.82 | 0.50 | 178% |
| 3 | Drinking [ | Feelings of aggression | 20% | 3.1 | 6.7 | 116% |
a Risk ratio (for “exposed” vs. “unexposed”) computed directly from outcome proportions reported in the article as none of the three articles used the risk ratio as a measure of relative association.
b Odds ratio is obtained from unadjusted logistic regression [32] or directly from outcome proportions reported [3334].
c In these examples where the outcome is relatively common (i.e., >10%), if the odds ratio were to be incorrectly interpreted as a risk ratio, this is the magnitude of overstatement of relative association.
Figure 1Relationship between the odds ratio and risk ratio at various levels of the reference risk.
Brief summary of four methods of obtaining risk ratios for binary outcomes.
| Name of method | Type of method | Background literature | Some advantages | Some disadvantages | Example of use in the global health literature | Exposure | Binary Outcome |
|---|---|---|---|---|---|---|---|
| Log-binomial | Direct | Wacholder (1986) [ | Easy to implement. | May not converge; may estimate individual-level probabilities (and/or the upper bound of their 95% confidence intervals) above 1. | Gibson et al. (2017) [ | Mobile phone based intervention to improve immunization rates, in a cluster-randomized trial | Full immunization by 12 months of age. |
| Modified log-Poisson | Direct | Zou (2004) [ | Easy to implement; almost always converges. | May estimate individual-level probabilities (and/or the upper bound of their 95% confidence intervals) above 1. | Chan et al. (2017) [ | AIDS-related stigma | Probable depression (PHQ-9 score ≥10 or recent suicidal thoughts). |
| Substitution | Indirect | Zhang and Yu (1998) [ | Easy to implement. Uses output from logistic regression. | Generally produces biased estimates and 95% confidence intervals are expected to be too narrow, on average [ | Agweyu et al. (2018) [ | Various demographics and health-related exposures | Mortality. |
| Marginal or Conditional Standardization | Indirect | Localio et al. (2007) [ | Uses output from logistic regression. | May be more difficult to implement and interpret than other methods, especially in certain software packages. | Weobong et al. (2017) [ | Psychological intervention for depression, in a randomized trial | Remission from depression as measured by the PHQ-9. |
Abbreviation: PHQ-9 – Patient Health Questionnaire 9-item [36], a screening tool for depression.
Code to fit the log-binomial and modified log-Poisson models in four commonly used statistical software packages, and to use the marginal standardization method in two of the packages.
| Software Program | Data Structure | Log-binomial codea | Modified log-Poisson codeb | Marginal standardization codec |
|---|---|---|---|---|
| Statae | Ind | |||
| Clustd | ||||
| SAS | Ind | |||
| Clust | ||||
| Rf | Ind | |||
| Clust | ||||
| SPSS | Ind | |||
| Clust | ||||
Abbreviations: Ind = Independent (i.e., non-clustered); Clust = Clustered.
Variables: binaryoutcome = the binary outcome; exposure = exposure (e.g., treatment group indicator), assumed to be categorical; participantID = participant identifier; cluster = cluster identifier.
a The log-binomial code for direct estimation of the risk ratio in the clustered setting is only shown in the generalized estimating equations (GEE) framework. A generalized linear mixed model (GLMM) could also be used.
b For the log-Poisson approach, a robust standard error is needed to account for misspecification of the outcome distribution (i.e., Poisson instead of binomial); GEE is the natural approach to obtain this robust standard error, in both the non-clustered and clustered setting.
c To our knowledge, the marginal standardization method is not as straightforward to implement in SAS or SPSS, so no code is provided. In addition, we are unaware of an easy-to-implement function in R to perform marginal standardization in a clustered setting.
d In the context of GEE to analyze clustered outcome data, we have used an exchangeable working correlation matrix as an example. It is natural to use such a working correlation matrix when the outcome data are measured at a single point in time and the clustering arises through some natural grouping of individuals (e.g., in schools or hospitals). But, if the clustering arises from longitudinal data, other working correlation structures may be preferred.
e The standard errors from Stata may be slightly larger than that obtained from the other programs. This is because Stata multiplies the robust standard errors by K/(K–1), where K is the number of clusters, whereas other programs do not do this.
f The cbind R code illustrated here works only for a single binary exposure variable. It will need to be modified for more complex scenarios. Additionally, the gee function requires that the outcome be set up as a numeric variable, rather than a factor variable, when specifying the modified log-Poisson model.