| Literature DB >> 35637431 |
Yilin Ning1,2, Anastasia Lam3,4, Marie Reilly5.
Abstract
BACKGROUND: Despite the ease of interpretation and communication of a risk ratio (RR), and several other advantages in specific settings, the odds ratio (OR) is more commonly reported in epidemiological and clinical research. This is due to the familiarity of the logistic regression model for estimating adjusted ORs from data gathered in a cross-sectional, cohort or case-control design. The preservation of the OR (but not RR) in case-control samples has contributed to the perception that it is the only valid measure of relative risk from case-control samples. For cohort or cross-sectional data, a method known as 'doubling-the-cases' provides valid estimates of RR and an expression for a robust standard error has been derived, but is not available in statistical software packages.Entities:
Keywords: Doubling-of-cases; Expanded data logistic regression; Log-binomial regression; Poisson regression; Relative risk; Weighted analysis
Mesh:
Year: 2022 PMID: 35637431 PMCID: PMC9150348 DOI: 10.1186/s12874-022-01636-3
Source DB: PubMed Journal: BMC Med Res Methodol ISSN: 1471-2288 Impact factor: 4.612
Fig. 1Doubling the cases in a cohort of N subjects, where N.1 subjects are cases. The first subscript indicates the exposure status (e for exposed and for unexposed) and the second subscript indicates the outcome (1 for cases and 0 for non-cases). A dot (“.”) for either suffix denotes the total (i.e., no stratification on that variable)
Equivalence of the crude RR computed from a cohort of N subjects and the crude OR computed from the expanded cohort with N+N.1 records, where is the total number of cases in the original cohort
| Total | Prevalence | Crude RR | |||
| Odds | Crude OR | ||||
Equivalence of the adjusted RR assessed in a log-binomial regression model of the original cohort with N subjects and the adjusted OR assessed in a logistic regression model of the expanded cohort with N+N.1 records, where is the total number of cases in the original cohort
| Expected | Expected | ||
| Expected | Expected | Odds | |
| exp{ | |||
| exp{ |
Fig. 2Estimated RR across 2000 simulations using different levels of prevalence and true RR values. Estimates were computed using the Mantel-Haenszel (M-H) RR (clear boxes) and expanded data M-H OR methods (shaded boxes)
Fig. 3Estimated RR from expanded data logistic regression across 2000 simulations scenarios with different levels of prevalence and true RR values, compared to estimates from a naive logistic regression and the (true) log-binomial model. Estimates are presented for the full cohort data (panel A) and for matched case-control samples (panel B) with 1:1 (clear boxes) and 1:2 (shaded boxes) case:control ratio
Fig. 4Ratio of the estimated and true RR (panel A) and the coverage (panel B), type I error (panel C) and power (panel D) of the RR estimated by naive and expanded data logistic regression analysis of case-control samples in simulation studies. 2000 simulation iterations were repeated in each simulation scenario. The results from log-binomial regression of the simulated cohort data are displayed for comparison: for a prevalence of 40%, this model failed to converge for 2 simulation cycles for RR=1.5 and 1432 cycles for RR=2, and these were excluded. Panel B excludes scenarios with RR > 1 where the naive/weighted logistic regression model had coverage lower than 50%
Adjusted ORs estimated using naive logistic regression, and adjusted RRs from log-binomial and expanded data logistic regression analysis, using data from a cohort study of neonatal jaundice. In addition to covariates in the table, estimates are adjusted for sex of infant, maternal age and smoking status
| Variables | Naive logistic | Log-binomial | Expanded data |
|---|---|---|---|
| logistic | |||
| OR (95% CI) | RR (95% CI) | RR (95% CI) | |
| Preterm: nulliparous | 23.5 (22.4, 24.5) | 12.9 (12.5, 13.3) | 13.0 (12.6, 13.4) |
| Preterm: multiparous | 32.5 (30.8, 34.2) | 20.1 (19.4, 20.9) | 20.4 (19.6, 21.2) |
| Overweight: BMI ≥ 25 | 1.30 (1.26, 1.34) | 1.20 (1.17, 1.23) | 1.26 (1.23, 1.30) |
| Multiparous | 0.50 (0.48, 0.52) | 0.51 (0.50, 0.53) | 0.51 (0.49, 0.53) |
Adjusted ORs estimated from weighted logistic regression and adjusted RRs estimated from expanded data weighted logistic regression models, using data sampled in a 1:2 case-control design from the infant cohort, matched on infant sex and maternal age. In addition to covariates in the table, estimates are adjusted for smoking status
| Variables | Weighted logistic | Expanded data |
|---|---|---|
| weighted logistic | ||
| OR (95% CI) | RR (95% CI) | |
| Preterm: nulliparous | 23.8 (22.7, 24.9) | 13.1 (12.3, 13.9) |
| Preterm: multiparous | 32.5 (30.9, 34.3) | 20.5 (19.1, 21.9) |
| Overweight: BMI ≥ 25 | 1.32 (1.26, 1.39) | 1.28 (1.23, 1.33) |
| Multiparous | 0.50 (0.48, 0.52) | 0.51 (0.49, 0.53) |
Bias, empirical SE (Emp. SE), mean SE, coverage, type I error and power of lnRR from log-binomial and expanded data logistic regression analysis of simulated cohort, with 2000 simulation iterations in each scenario
| Prevalence | True ln | Method | Bias | Emp. | Mean | Coverage | Type I/ |
|---|---|---|---|---|---|---|---|
| SE | SE | Power1 | |||||
| 0.1 | 0 | Log-binomial | -0.001 | 0.213 | 0.211 | 95.4 | 4.6 |
| Expanded data logistic | -0.002 | 0.214 | 0.211 | 95.4 | 4.6 | ||
| 0.223 | Log-binomial | -0.001 | 0.202 | 0.201 | 95.2 | 21.4 | |
| Expanded data logistic | -0.001 | 0.202 | 0.201 | 95.2 | 21.7 | ||
| 0.405 | Log-binomial | 0.002 | 0.194 | 0.194 | 95.0 | 56.5 | |
| Expanded data logistic | 0.002 | 0.194 | 0.194 | 95.2 | 56.4 | ||
| 0.693 | Log-binomial | 0.000 | 0.185 | 0.185 | 95.8 | 96.0 | |
| Expanded data logistic | 0.000 | 0.185 | 0.185 | 95.6 | 96.0 | ||
| 0.2 | 0 | Log-binomial | -0.005 | 0.136 | 0.139 | 95.8 | 4.2 |
| Expanded data logistic | -0.006 | 0.137 | 0.139 | 95.6 | 4.4 | ||
| 0.223 | Log-binomial | -0.002 | 0.131 | 0.131 | 95.6 | 39.1 | |
| Expanded data logistic | -0.002 | 0.132 | 0.132 | 95.3 | 39.1 | ||
| 0.405 | Log-binomial | 0.004 | 0.126 | 0.126 | 95.3 | 89.3 | |
| Expanded data logistic | 0.004 | 0.126 | 0.126 | 95.3 | 89.2 | ||
| 0.693 | Log-binomial | 0.002 | 0.122 | 0.120 | 94.9 | 100 | |
| Expanded data logistic | 0.002 | 0.122 | 0.120 | 94.6 | 100 | ||
| 0.3 | 0 | Log-binomial | -0.004 | 0.102 | 0.104 | 95.3 | 4.7 |
| Expanded data logistic | -0.004 | 0.103 | 0.105 | 95.3 | 4.6 | ||
| 0.223 | Log-binomial | -0.001 | 0.096 | 0.098 | 94.8 | 63.1 | |
| Expanded data logistic | -0.001 | 0.098 | 0.099 | 95.0 | 61.3 | ||
| 0.405 | Log-binomial | -0.003 | 0.094 | 0.094 | 95.2 | 98.9 | |
| Expanded data logistic | -0.003 | 0.095 | 0.095 | 94.7 | 98.7 | ||
| 0.693 | Log-binomial | -0.002 | 0.092 | 0.088 | 93.4 | 100 | |
| Expanded data logistic | -0.002 | 0.093 | 0.089 | 93.7 | 100 | ||
| 0.4 | 0 | Log-binomial | -0.001 | 0.081 | 0.082 | 95.6 | 4.4 |
| Expanded data logistic | -0.002 | 0.082 | 0.084 | 95.3 | 4.7 | ||
| 0.223 | Log-binomial | -0.005 | 0.074 | 0.076 | 95.8 | 81.6 | |
| Expanded data logistic | -0.005 | 0.077 | 0.078 | 95.2 | 79.4 | ||
| 0.405 | Log-binomial2 | -0.002 | 0.074 | 0.072 | 93.7 | 100 | |
| Expanded data logistic | -0.002 | 0.076 | 0.073 | 93.8 | 100 | ||
| 0.693 | Log-binomial3 | -0.017 | 0.067 | 0.067 | 95.2 | 100 | |
| Expanded data logistic | 0.002 | 0.068 | 0.068 | 95.3 | 100 |
1Values reported in this column are the type I error when true lnRR=0 and power otherwise.
2Based on 1998 simulation cycles where the log-binomial regression converged.
3Based on 568 simulation cycles where the log-binomial regression converged
Bias, empirical SE (Emp. SE), mean SE, coverage, type I error and power of lnRR from expanded data weighted logistic regression analysis of simulated case-control data, with 2000 simulation iterations in each scenario
| Design | Prevalence | True ln | Bias | Emp. | Mean | Coverage | Type I/ |
|---|---|---|---|---|---|---|---|
| SE | SE | Power1 | |||||
| Simple | 0.1 | 0.000 | 0.015 | 0.299 | 0.289 | 94.7 | 5.2 |
| case- | 0.223 | 0.006 | 0.284 | 0.280 | 95.0 | 12.7 | |
| control | 0.405 | 0.011 | 0.282 | 0.273 | 93.7 | 34.5 | |
| 0.693 | 0.010 | 0.269 | 0.26 | 94.1 | 76.4 | ||
| 0.2 | 0.000 | -0.004 | 0.174 | 0.178 | 95.8 | 4.3 | |
| 0.223 | 0.001 | 0.171 | 0.170 | 95.0 | 26.8 | ||
| 0.405 | 0.004 | 0.165 | 0.164 | 94.8 | 70.9 | ||
| 0.693 | 0.002 | 0.156 | 0.155 | 95.0 | 99.0 | ||
| 0.3 | 0.000 | -0.003 | 0.123 | 0.125 | 95.5 | 4.5 | |
| 0.223 | -0.002 | 0.117 | 0.118 | 95.0 | 46.7 | ||
| 0.405 | -0.004 | 0.115 | 0.113 | 94.6 | 93.9 | ||
| 0.693 | -0.001 | 0.108 | 0.104 | 93.8 | 100 | ||
| 0.4 | 0.000 | -0.002 | 0.089 | 0.091 | 95.2 | 4.8 | |
| 0.223 | -0.004 | 0.084 | 0.085 | 95.6 | 72.9 | ||
| 0.405 | -0.003 | 0.082 | 0.080 | 94.4 | 99.8 | ||
| 0.693 | 0.002 | 0.072 | 0.072 | 94.8 | 100 | ||
| Matched | 0.1 | 0.000 | 0.007 | 0.287 | 0.283 | 95.2 | 4.7 |
| case- | 0.223 | 0.000 | 0.280 | 0.274 | 95.6 | 13.9 | |
| control | 0.405 | 0.011 | 0.262 | 0.268 | 95.7 | 33.9 | |
| 0.693 | 0.008 | 0.256 | 0.257 | 95.5 | 78.4 | ||
| 0.2 | 0.000 | -0.004 | 0.173 | 0.175 | 95.3 | 4.7 | |
| 0.223 | -0.004 | 0.167 | 0.168 | 95.2 | 26.3 | ||
| 0.405 | 0.007 | 0.163 | 0.162 | 94.6 | 73.2 | ||
| 0.693 | 0.008 | 0.158 | 0.154 | 94.8 | 99.5 | ||
| 0.3 | 0.000 | -0.003 | 0.121 | 0.123 | 95.3 | 4.7 | |
| 0.223 | 0.001 | 0.115 | 0.117 | 95.0 | 48.7 | ||
| 0.405 | 0.001 | 0.113 | 0.112 | 94.6 | 94.2 | ||
| 0.693 | 0.001 | 0.107 | 0.105 | 95.0 | 100 | ||
| 0.4 | 0.000 | -0.001 | 0.090 | 0.091 | 95.0 | 4.9 | |
| 0.223 | -0.005 | 0.084 | 0.085 | 95.8 | 72.4 | ||
| 0.405 | -0.002 | 0.083 | 0.081 | 93.5 | 99.8 | ||
| 0.693 | 0.003 | 0.074 | 0.075 | 96.0 | 100 |
1Values reported in this column are the type I error when true lnRR=0 and power otherwise