| Literature DB >> 32341699 |
Abstract
This study empirically illustrates the mechanism by which epidemiological effect measures and statistical evidence can be misleading in the presence of Simpson's paradox and identify possible alternative methods of analysis to manage the paradox. Three scenarios of observational study designs, including cross-sectional, cohort, and case-control approaches, are simulated. In each scenario, data are generated, and various methods of epidemiological and statistical analyses are undertaken to obtain empirical results that illustrate Simpson's paradox and mislead conclusions. Rational methods of analysis are also performed to illustrate how to avoid pitfalls and obtain valid results. In the presence of Simpson's paradox, results from analyses in overall data contradict the findings from all subgroups of the same data. This paradox occurs when distributions of confounding characteristics are unequal in the groups being compared. Data analysis methods which do not take confounding factor into account, including epidemiological 2×2 table analysis, independent samples t-test, Wilcoxon rank-sum test, chi-square test, and univariable regression analysis, cannot manage the problem of Simpson's paradox and mislead research conclusions. Mantel-Haenszel procedure and multivariable regression methods are examples of rational analysis methods leading to valid results. Therefore, Simpson's paradox arises as a consequence of extreme unequal distributions of a specific inherent characteristic in groups being compared. Analytical methods which take control of confounding effect must be applied to manage the paradox and obtain valid research evidence regarding the causal association. ©Carol Davila University Press.Entities:
Keywords: Simpson's Paradox; bias; confounding variable; epidemiology; regression analysis
Mesh:
Substances:
Year: 2020 PMID: 32341699 PMCID: PMC7175433 DOI: 10.25122/jml-2019-0120
Source DB: PubMed Journal: J Med Life ISSN: 1844-122X
Figure 1:Numerical example of Simpson’s paradox.
Average monthly incomes of dentists by gender and linear regression analyses of difference in average monthly incomes by gender (N=240).
| n = 120 | n = 120 | ||
| Mean ± SD | 19,111.3 ± 8,780.2 | 21,722.3 ± 8,466.9 | 0.020† |
| Median (IQR) | 13,516.7 (18,208.3) | 26,166.7 (16,510.0) | 0.451* |
| Min. – Max. | 9,033.3 – 33,266.7 | 9,250 – 32,666.7 | |
| n = 75 (62.5%) | n = 45 (37.5%) | ||
| Mean ± SD | 12,648.7 ± 2,368.2 | 11,143.8 ± 1,555.7 | <0.001‡ |
| Median (IQR) | 12,700.0 (1,966.7) | 11,333.3 (3,006.7) | <0.001* |
| Min. – Max. | 9,033.3 – 20,000 | 9,250 – 13,333.3 | |
| n = 45 (37.5%) | n = 75 (62.5%) | ||
| Mean ± SD | 29,882.2 ± 3,037.3 | 28,069.3 ± 2,228.1 | <0.001‡ |
| Median (IQR) | 31,000.0 (5,400.0) | 28,186.7 (3,150.0) | <0.001* |
| Min. – Max. | 23,366.7 – 33,266.7 | 24,666.7 – 32,666.7 | |
SD, standard deviation; IQR, interquartile range; Min., minimum; Max., maximum;
CI, confidence interval; %, percentage by column
†Independent samples t-test with equal variances
‡Independent samples t-test with unequal variances
*Two-sample Wilcoxon rank-sum test.
Influenza-related acute respiratory infection in the overall groups and COPD severity subgroups in patients with and without influenza vaccination (N=320).
| Acute respiratory infection | p-value‡ | Incidence | RD | RR | Effectiveness [1-RR]×100 (%) | ||
|---|---|---|---|---|---|---|---|
| Yes n (%)† | No n (%)† | ||||||
| Overall | |||||||
| Vaccine | 76 (47.5) | 84 (52.5) | 0.176 | 0.48 | 0.08 | 1.19 | N/A |
| No vaccine | 64 (40.0) | 96 (60.0) | 0.40 | ||||
| Subgroups: | |||||||
| Low severity* | |||||||
| Vaccine | 6 (15.0) | 34 (85.0) | 0.062 | 0.15 | - 0.15 | 0.50 | 50 |
| No vaccine | 36 (30.0) | 84 (70.0) | 0.30 | ||||
| High severity* | |||||||
| Vaccine | 70 (58.3) | 50 (41.7) | 0.190 | 0.58 | - 0.12 | 0.83 | 17 |
| No vaccine | 28 (70.0) | 12 (30.0) | 0.70 | ||||
| 0.73 | 27 | ||||||
| 0.169 | |||||||
| 63.0 | |||||||
RD, risk difference; RR, risk ratio; N/A, not applicable
M-H adjusted RR, Mantel-Haenszel adjusted RR
M-H test of homogeneity, Mantel-Haenszel test of homogeneity of stratum-specific RRs
†Percentage by row
‡Chi-square test
*Severity of COPD
**Calculated by [(RRcrude – RRadjusted) / RRadjusted]×100%.
Univariable and multivariable analyses of influenza-related acute respiratory infection; risk difference and risk ratio in COPD patients with and without influenza vaccination (N=320).
| No | Reference | Reference | ||||
| Yes | 0.08 | - 0.03, 0.18 | 0.175 | - 0.14 | - 0.24, - 0.03 | 0.012 |
| Low | - | Reference | ||||
| High | - | 0.42 | 0.31, 0.53 | <0.001 | ||
| No | Reference | Reference | ||||
| Yes | 1.19 | 0.92, 1.53 | 0.179 | 0.74‡ | 0.59, 0.94 | 0.015 |
| Low | - | Reference | ||||
| High | - | 2.70 | 2.01, 3.64 | <0.001 | ||
CI, confidence interval
†Severity of COPD
‡Vaccine effectiveness adjusted for severity of COPD is 26% [from (1- RRadjusted)×100%].
Coffee consumption and lung cancer in the overall groups and smoking status subgroups, as well as logistic regression analyses of the association (N=500).
| Lung cancer | Odds ratio (OR) | p-value‡ | ||
|---|---|---|---|---|
| Yes [n=250] n (%)† | No [n=250] n (%)† | |||
| Coffee (+) | 200 (80.0) | 164 (65.6) | 2.10 | <0.001 |
| Coffee (–) | 50 (20.0) | 86 (34.4) | ||
| Coffee (+) | 24 (64.9) | 125 (61.9) | 1.14 | 0.731 |
| Coffee (–) | 13 (35.1) | 77 (38.1) | ||
| Coffee (+) | 176 (82.6) | 39 (81.2) | 1.10 | 0.821 |
| Coffee (–) | 37 (17.4) | 9 (18.8) | ||
| 1.12 | ||||
| 0.949 | ||||
| 87.5 | ||||
Coffee (+), ≥ 1 cup a day; Coffee (–), <1 cup a day
CI, confidence interval; M-H adjusted OR, Mantel-Haenszel adjusted OR;
M-H test of homogeneity, Mantel-Haenszel test of homogeneity of stratum-specific ORs;
cOR, crude odds ratio; aOR, adjusted odds ratio
† Percentage by column
‡ Chi-square test
* Calculated by [(ORcrude – ORadjusted) / ORadjusted]×100%