| Literature DB >> 27322305 |
Ionut Bebu1, George Luta2, Thomas Mathew3, Brian K Agan4.
Abstract
For binary outcome data from epidemiological studies, this article investigates the interval estimation of several measures of interest in the absence or presence of categorical covariates. When covariates are present, the logistic regression model as well as the log-binomial model are investigated. The measures considered include the common odds ratio (OR) from several studies, the number needed to treat (NNT), and the prevalence ratio. For each parameter, confidence intervals are constructed using the concepts of generalized pivotal quantities and fiducial quantities. Numerical results show that the confidence intervals so obtained exhibit satisfactory performance in terms of maintaining the coverage probabilities even when the sample sizes are not large. An appealing feature of the proposed solutions is that they are not based on maximization of the likelihood, and hence are free from convergence issues associated with the numerical calculation of the maximum likelihood estimators, especially in the context of the log-binomial model. The results are illustrated with a number of examples. The overall conclusion is that the proposed methodologies based on generalized pivotal quantities and fiducial quantities provide an accurate and unified approach for the interval estimation of the various epidemiological measures in the context of binary outcome data with or without covariates.Entities:
Keywords: common odds ratio; fiducial quantity; generalized pivotal quantity; log-binomial model; logistic regression
Mesh:
Year: 2016 PMID: 27322305 PMCID: PMC4924062 DOI: 10.3390/ijerph13060605
Source DB: PubMed Journal: Int J Environ Res Public Health ISSN: 1660-4601 Impact factor: 3.390
Empirical coverage probability and (mean,median) length of different confidence intervals for the common odds ratio for five studies, for a 95% nominal level.
| n1 | n2 | OR | MH | SMH | GPQ | F1 | F2 | |||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 15 | 15 | 1.0 | 0.9520 | (1.70,1.56) | 0.9437 | (1.69,1.55) | 0.9489 | (1.68,1.53) | 0.9446 | (1.58,1.45) | 0.9467 | (1.53,1.41) |
| 15 | 15 | 3.5 | 0.9530 | (7.24,6.12) | 0.9533 | (7.07,6.06) | 0.9469 | (7.79,6.34) | 0.9507 | (6.50,5.49) | 0.9382 | (5.94,5.14) |
| 15 | 15 | 6.5 | 0.9580 | (16.72,12.89) | 0.9503 | (16.65,12.83) | 0.9526 | (19.53,14.42) | 0.9485 | (15.22,11.84) | 0.9352 | (12.51,10.36) |
| 20 | 10 | 1.0 | 0.9552 | (1.85,1.69) | 0.9490 | (1.82,1.65) | 0.9480 | (1.83,1.64) | 0.9464 | (1.71,1.55) | 0.9453 | (1.63,1.50) |
| 20 | 10 | 3.5 | 0.9556 | (8.42,6.82) | 0.9513 | (7.96,6.50) | 0.9511 | (9.35,7.19) | 0.9497 | (7.29,5.99) | 0.9401 | (6.43 ,5.45) |
| 20 | 10 | 6.5 | 0.9570 | (22.04,14.80) | 0.9521 | (20.82,14.14) | 0.9534 | (25.00,17.23) | 0.9500 | (17.78,13.17) | 0.9377 | (13.79,11.06) |
| 20 | 20 | 1.0 | 0.9527 | (1.41,1.33) | 0.9502 | (1.39,1.31) | 0.9510 | (1.36,1.27) | 0.9501 | (1.31,1.23) | 0.9502 | (1.29,1.21) |
| 20 | 20 | 3.5 | 0.9544 | (5.73,5.14) | 0.9516 | (5.67 ,5.07) | 0.9466 | (5.78,5.03) | 0.9474 | (5.10,4.56) | 0.9381 | (4.87,4.40) |
| 20 | 20 | 6.5 | 0.9541 | (12.89,10.84) | 0.9524 | (12.59,10.66) | 0.9469 | (14.12,11.35) | 0.9465 | (11.34,9.59) | 0.9363 | (10.37,8.97) |
| 30 | 30 | 1.0 | 0.9492 | (1.10,1.06) | 0.9511 | (1.10,1.05) | 0.9473 | (1.06,1.02) | 0.9512 | (1.04,1.00) | 0.9460 | (1.03,0.99) |
| 30 | 30 | 3.5 | 0.9506 | (4.35,4.07) | 0.9505 | (4.32,4.01) | 0.9480 | (4.18,3.87) | 0.9471 | (3.89,3.63) | 0.9425 | (3.82,3.59) |
| 30 | 30 | 6.5 | 0.9552 | (9.38,8.49) | 0.9506 | (9.30,8.36) | 0.9457 | (9.39,8.28) | 0.9419 | (8.27,7.44) | 0.9357 | (7.94,7.22) |
OR = odds ratio, MH = Mantel-Haentzel, SMH = Sato-Mantel-Haentzel, GPQ = generalized pivotal quantity, F1 = Equation (2), F2 = Equation (3).
Empirical coverage probability and (mean,median) length of different confidence intervals for the prevalence ratio in the log-binomial model mentioned in Section 4.2, for a 95% nominal level.
| GPQ | F1 | F2 | ML | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| –1.4 | 0.7 | –0.2 | 28 | 28 | 26 | 26 | 0.9540 | (1.34,1.27) | 0.9438 | (1.28,1.23) | 0.9412 | (1.26,1.21) | 0.9526 | (1.19,1.16) |
| –1.4 | 0.9 | –0.2 | 28 | 28 | 26 | 26 | 0.9552 | (1.28,1.22) | 0.9445 | (1.23,1.18) | 0.9417 | (1.21,1.16) | 0.9534 | (1.14,1.11) |
| –2.0 | 0.7 | –0.2 | 28 | 28 | 26 | 26 | 0.9607 | (2.22,2.01) | 0.9419 | (2.09,1.88) | 0.9365 | (1.97,1.83) | 0.9652 | (5.90,1.74) |
| –2.0 | 0.9 | –0.2 | 28 | 28 | 26 | 26 | 0.9590 | (2.15,1.94) | 0.9361 | (2.03,1.81) | 0.9311 | (1.91,1.77) | 0.9653 | (21.94,1.67) |
| –2.0 | 0.5 | –0.2 | 28 | 28 | 26 | 26 | 0.9557 | (2.35,2.12) | 0.9357 | (2.21,1.97) | 0.9308 | (2.07,1.93) | 0.9640 | (11.14,1.82) |
| –1.4 | 0.7 | –0.2 | 56 | 56 | 52 | 52 | 0.9514 | (0.87,0.86) | 0.9452 | (0.86,0.85) | 0.9453 | (0.85,0.84) | 0.9496 | (0.83,0.82) |
| –1.4 | 0.9 | –0.2 | 56 | 56 | 52 | 52 | 0.9529 | (0.84,0.82) | 0.9475 | (0.82,0.81) | 0.9462 | (0.82,0.81) | 0.9516 | (0.79,0.79) |
| –1.4 | 0.5 | –0.2 | 56 | 56 | 52 | 52 | 0.9515 | (0.91,0.90) | 0.9464 | (0.90,0.89) | 0.9447 | (0.90,0.88) | 0.9506 | (0.87,0.86) |
| –2.0 | 0.7 | –0.2 | 56 | 56 | 52 | 52 | 0.9539 | (1.36,1.30) | 0.9447 | (1.32,1.27) | 0.9426 | (1.30,1.26) | 0.9568 | (1.24,1.21) |
| –2.0 | 0.9 | –0.2 | 56 | 56 | 52 | 52 | 0.9509 | (1.31,1.25) | 0.9429 | (1.27,1.22) | 0.9407 | (1.25,1.21) | 0.9540 | (1.20,1.17) |
| –2.0 | 0.5 | –0.2 | 56 | 56 | 52 | 52 | 0.9538 | (1.42,1.37) | 0.9454 | (1.38,1.33) | 0.9429 | (1.36,1.32) | 0.9583 | (1.94,1.27) |
GPQ = generalized pivotal quantity, F1 = Equation (2), F2 = Equation (3), ML = maximum likelihood.
Cross-classification of depression and insomnia cases.
| Insomnia | |||
|---|---|---|---|
| No | Yes | ||
| 97 | 56 | ||
| 7 | 33 | ||
95% confidence intervals for the number needed to expose for the depression and insomnia example.
| Wald-Yates | Agresti-Caffo | GPQ | F1 | F2 | |
|---|---|---|---|---|---|
| Lower limit | 1.63 | 1.71 | 1.71 | 1.73 | 1.73 |
| Upper limit | 3.30 | 3.35 | 3.26 | 3.33 | 3.32 |
GPQ = generalized pivotal quantity, F1 = Equation (2), F2 = Equation (3).
Number of subjects virally suppressed (Y) and not suppressed (N) stratified by race (AA = African-American, C = Caucasian) and site.
| Site1 | Site2 | Site3 | ||||
|---|---|---|---|---|---|---|
| Y | N | Y | N | Y | N | |
| AA | 212 | 90 | 11 | 2 | 352 | 272 |
| C | 271 | 108 | 17 | 3 | 293 | 165 |
95% confidence intervals for the common odds ratio for the viral suppression example.
| MH | SMH | GPQ | F1 | F2 | |
|---|---|---|---|---|---|
| Lower limit | 0.656 | 0.653 | 0.655 | 0.657 | 0.655 |
| Upper limit | 0.973 | 0.926 | 0.974 | 0.970 | 0.974 |
MH = Mantel-Haentzel, SMH = Sato-Mantel-Haentzel, GPQ = generalized pivotal quantity, F1 = Equation (2), F2 = Equation (3).
Counts on the presence or absence of symptoms among AIDS patients who are on the antiretroviral drug AZT, categorized by race.
| Symptoms | |||
|---|---|---|---|
| Race | AZT | Yes | No |
| White (W) | Yes (Y) | 14 | 93 |
| No (N) | 32 | 81 | |
| Black (B) | Yes (Y) | 11 | 52 |
| No (N) | 12 | 43 | |
95% confidence intervals for the prevalence ratio in the migraine headaches example.
| ML | GPQ | F1 | F2 | |
|---|---|---|---|---|
| Lower limit | 0.223 | 0.221 | 0.233 | 0.245 |
| Upper limit | 1.344 | 1.412 | 1.395 | 1.379 |
ML = maximum likelihood, GPQ = generalized pivotal quantity, F1 = Equation (2), F2 = Equation (3).