| Literature DB >> 35202396 |
Abstract
The analysis of contingency tables is a powerful statistical tool used in experiments with categorical variables. This study improves parts of the theory underlying the use of contingency tables. Specifically, the linkage disequilibrium parameter as a measure of two-way interactions applied to three-way tables makes it possible to quantify Simpson's paradox by a simple formula. With tests on three-way interactions, there is only one that determines whether the partial interactions of all variables agree or whether there is at least one variable whose partial interactions disagree. To date, there has been no test available that determines whether the partial interactions of a certain variable agree or disagree, and the presented work closes this gap. This work reveals the relation of the multiplicative and the additive measure of a three-way interaction. Another contribution addresses the question of which cells in a contingency table are fixed when the first- and second-order marginal totals are given. The proposed procedure not only detects fixed zero counts but also fixed positive counts. This impacts the determination of the degrees of freedom. Furthermore, limitations of methods that simulate contingency tables with given pairwise associations are addressed.Entities:
Mesh:
Year: 2022 PMID: 35202396 PMCID: PMC8870532 DOI: 10.1371/journal.pone.0262502
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Success × treatment sub-tables for the sexes with given one- and two-way marginals.
| Treatment 1 | Treatment 2 | ||
| Male | Success |
|
|
| No Success |
|
| |
| Female | Success |
|
|
| No Success |
|
|
Cell counts for given zero-, one-, and two-way marginal totals for Table 6 of Fienberg and Rinaldo [2].
(The original table is obtained by inserting and ).
| Cell counts | |||||
|---|---|---|---|---|---|
| 0 | 0 | 0 | 4 | ||
|
| 0 | 0 |
| ||
|
|
| 0 |
| ||
|
|
| 3 | 2 | ||
| 0 | 0 |
|
| ||
|
| 0 |
|
| ||
|
| 3 | 4 |
| ||
| 0 | 0 | 2 | 0 | ||
| 0 |
|
|
| ||
| 5 | 6 |
|
| ||
| 0 | 2 | 0 | 0 | ||
| 0 |
|
| 0 | ||
| 5 |
|
| 3 | ||
| 1 | 0 | 0 | 0 | ||
|
|
| 0 | 0 | ||
|
|
|
| 0 | ||
A: Cell probabilities for given one- and two-way marginal totals.
B: Table for association parameters Γ1,2 = −1, Γ1,3 = 1, and Γ2,3 = 0.714.
| A) Bona fide table satisfying the restraints | B) Table for extreme associations | ||||||
|---|---|---|---|---|---|---|---|
|
| 0.2− | 0 | 0.2 | ||||
| − |
| 0 | 0 | ||||
| 0.4− | 0.3 | 0 | |||||
|
| 0.5− | 0.1 | 0.4 | ||||
3×3 sub-tables for association parameters Γ1,2 = −1, Γ1,3 = 1, and Γ2,3 = 1.
| Γ1,2 = −1 | Γ1,3 = 1 | Γ2,3 = 1 | ||||||
| 0 | 0 | 0.1 | 0.1 | 0 | 0 | 0.2 | 0 | 0 |
| 0 | 0 | 0.3 | 0.2 | 0.1 | 0 | 0.1 | 0.3 | 0 |
| 0.2 | 0.4 | 0 | 0 | 0.2 | 0.4 | 0 | 0 | 0.4 |
3×3 sub-tables for independent variables.
| Γ1,2 = 0 | Γ1,3 = 0 | Γ2,3 = 0 | ||||||
|---|---|---|---|---|---|---|---|---|
| 0.02 | 0.04 | 0.04 | 0.03 | 0.03 | 0.04 | 0.06 | 0.06 | 0.08 |
| 0.06 | 0.12 | 0.12 | 0.09 | 0.09 | 0.12 | 0.12 | 0.12 | 0.16 |
| 0.12 | 0.24 | 0.24 | 0.18 | 0.18 | 0.24 | 0.12 | 0.12 | 0.16 |
3×3 sub-tables for association parameters −Γ1,2 = Γ1,3 = Γ2,3 = 0.6023.
| Γ1,2 = −0.6023 | Γ1,3 = 0.6023 | Γ2,3 = 0.6023 | ||||||
|---|---|---|---|---|---|---|---|---|
| 0.0107 | 0.0213 | 0.0680 | 0.0657 | 0.0147 | 0.0196 | 0.1282 | 0.0308 | 0.0410 |
| 0.0320 | 0.0640 | 0.2041 | 0.1461 | 0.0951 | 0.0588 | 0.1103 | 0.2077 | 0.0821 |
| 0.1574 | 0.3147 | 0.1279 | 0.0882 | 0.1902 | 0.3216 | 0.0615 | 0.0616 | 0.2769 |
×3×3 table satisfying 0.6023 = −Γ1,2 = Γ1,3 = Γ2,3 and one-way marginal totals p1 = (0.1, 0.3, 0.6)′, p2 = (0.2, 0.4, 0.4)′, and p3 = (0.3, 0.3, 0.4)′.
| 0 | 0.0262 | 0.0417 | ||
| 0 | 0 | 0.0321 | ||
| 0 | 0 | 0 | ||
| 0.0072 | 0.1193 | 0 | ||
| 0 | 0.0181 | 0.0820 | ||
| 0 | 0 | 0.0738 | ||
| 0.1056 | 0 | 0 | ||
| 0.0872 | 0.0806 | 0 | ||
| 0 | 0.1558 | 0.1708 |
Two-way sub-tables from Table 7.
| Γ1,2 = −0.6023 | Γ1,3 = 0.6023 | Γ2,3 = 0.6023 | ||||||
|---|---|---|---|---|---|---|---|---|
| 0 | 0.0262 | 0.0738 | 0.0679 | 0.0321 | 0 | 0.1128 | 0.0872 | 0 |
| 0.0072 | 0.1374 | 0.1554 | 0.1265 | 0.1001 | 0.0734 | 0.1454 | 0.0987 | 0.1558 |
| 0.1928 | 0.2364 | 0.1708 | 0.1056 | 0.1678 | 0.3266 | 0.0417 | 0.1141 | 0.2442 |
Numbers of denied and admitted applications at six departments as part of the study [41].
Variable 1 is sex (men—women), variable 2 is admittance (denied–admitted), and variable 3 is the department (1 to 6). is the LD between variable 1 and variable 3 (which is now dichotomous: Department i versus the rest). is the LD between variable 2 and variable 3 (which is again dichotomous: Department i versus the rest). Parameter stands for the frequency of applications to department i. is the LD between the first category of variable 1 and the first category of variable 2 within Department i, and is the corresponding correlation coefficient.
| Dept. | Men | Women |
|
|
|
|
| ||
|---|---|---|---|---|---|---|---|---|---|
|
| Denied | Admitted (%) | Denied | Admitted (%) | |||||
| 1 | 313 | 512 (62) | 19 | 89 (82) | 0.060 | −0.053 | 0.206 | 0.021 | 0.14 |
| 2 | 207 | 353 (63) | 8 | 17 (68) | 0.047 | −0.032 | 0.129 | 0.002 | 0.02 |
| 3 | 205 | 120 (37) | 391 | 202 (34) | −0.049 | 0.008 | 0.203 | −0.007 | −0.03 |
| 4 | 279 | 138 (33) | 244 | 131 (35) | −0.012 | 0.008 | 0.175 | 0.005 | 0.02 |
| 5 | 138 | 53 (28) | 299 | 94 (24) | −0.035 | 0.018 | 0.129 | −0.008 | −0.04 |
| 6 | 351 | (22 (6) | 317 | 24 (7) | −0.011 | 0.051 | 0.158 | 0.003 | 0.02 |
| Total | 1493 | 1198 (44.5) | 1278 | 557 (30.4) | 0 | 0 | 1 | 0.0034 | 0.03 |
Numbers of men and women with application to department 1 and numbers of denied and admitted applicants at department 1.
“Rest” means departments two to six.
| Men | Women | Denied | Admitted | |
|---|---|---|---|---|
| Department 1 | 825 | 108 | 332 | 601 |
| Rest | 1866 | 1727 | 2439 | 1154 |
| Total | 2691 | 1835 | 2771 | 1755 |
Fitted counts of the Berkeley data.
Five free variables were fitted in three ways. A: First variable n1,1,1 taken from Table 9, four from maximizing entropy. B: Five from D1,1, = 0, i = 2,3,…,6. C: Four from agreeing , with the fifth the log-likelihood estimate.
| Dep. | Men | Women | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Denied | Admitted | Denied | Admitted | |||||||||
|
| A | B | C | A | B | C | A | B | C | A | B | C |
| 1 | 313.0 | 308.9 | 312.7 | 512.0 | 516.1 | 512.3 | 19.0 | 23.1 | 19.3 | 89.0 | 84.9 | 88.7 |
| 2 | 205.6 | 205.8 | 205.5 | 354.4 | 354.2 | 354.5 | 9.4 | 9.2 | 9.5 | 15.6 | 15.8 | 15.5 |
| 3 | 209.5 | 211.0 | 209.8 | 115.5 | 114.0 | 115.2 | 386.5 | 385.0 | 386.2 | 206.5 | 208.0 | 206.8 |
| 4 | 274.0 | 275.4 | 274.3 | 143.0 | 141.6 | 142.7 | 249.0 | 247.6 | 248.7 | 126.0 | 127.4 | 126.3 |
| 5 | 142.2 | 142.9 | 142.2 | 48.8 | 48.1 | 48.8 | 294.8 | 294.1 | 294.8 | 98.2 | 98.9 | 98.2 |
| 6 | 348.6 | 349.0 | 348.5 | 24.4 | 24.0 | 24.5 | 319.4 | 319.0 | 319.5 | 21.6 | 22.0 | 21.5 |
The χ2 values for certain interaction hypotheses concerning the data of Mood (1950).
| Method | Null hypothesis | ||
|---|---|---|---|
| Mutual independence | Zero three-way interaction | ||
| Mood [ | 110.1 | 86.7 | - |
| Lancaster [ | 132.0 | 107.9 | 7.80 |
| Snedecor [ | 132.0 | 93.7 | 19.57 |
| Cheng [ | 120.6 | 96.4 | 6.82 |
| Maximal entropy | 132.0 | 93.7 | 6.80 |
Marked cells contain (nearly) correct results.
Fig 1The log-likelihood function ln L and the entropy H for the free parameter x of two 2×2 tables.
The broken lines correspond to the asymptotic expression (57).
| Γ1,2 = −1 | Γ1,3 = 1 | Γ2,3 = 1 | ||||
| 0 | 0.2 | 0.2 | 0 | 0.4 | 0 | (41) |
| 0.4 | 0.4 | 0.3 | 0.5 | 0.1 | 0.5 | |
| Γ2,3 = 0.714 | |
| 0.3 | 0.1 |
| 0.2 | 0.4 |
| 0.1 | 0.7 | 0.08 | 0.72 | 0 | 0.8 |
| 0 | 0.2 | 0.02 | 0.18 | 0.1 | 0.1 |