| Literature DB >> 17302984 |
Nicholas J Horton1, Eugenia Kim, Richard Saitz.
Abstract
BACKGROUND: Alcohol consumption is commonly used as a primary outcome in randomized alcohol treatment studies. The distribution of alcohol consumption is highly skewed, particularly in subjects with alcohol dependence.Entities:
Mesh:
Year: 2007 PMID: 17302984 PMCID: PMC1810542 DOI: 10.1186/1471-2288-7-9
Source DB: PubMed Journal: BMC Med Res Methodol ISSN: 1471-2288 Impact factor: 4.615
Figure 1Observed value of drinks per day for the control group of the ASAP study at 3 months, plus the estimated Poisson fit to these data (1 = 4.98).
0 where I(k = 0) is equal to 1 when k = 0, and equal to 0 otherwise. By distinguishing Always-0 (with probability p) and Not Always-0 group (with probability (1 - p) * exp(-λ)) for abstainers and drinkers who didn't drink during the reporting period, respectively, it can incorporate an overabundance of zeros [8]. Conditional on being a Not Always-0, counts are given by the Poisson distribution. This approach has been generalized to a regression framework, and implemented in general purpose statistical software (e.g. zip in Stata).
Figure 2Graphical display of the five distributions, all with rate parameter 5, used in the simulations (Poisson [Var = 5], negative binomial [NB13, Var = 13], negative binomial [NB40, Var = 40], negative binomial [NB 70, Var = 70] and zero-inflated Poisson [ZIP, p = 0.2, Var = 8]).
Estimated probability (and 99% CI) of rejecting the null hypothesis when there is no true difference between groups for a variety of statistical models and underlying distributions (results that do not include the alpha level of 0.05 are bolded)
| Analysis model fit | ||||
| True Distribution: | Poisson | ODP | NB | ZIP |
| Poisson (Var = 5) | .053 (.041,.064) | .054 (.042,.066) | .047 (.036,.058) | .055* (.043,.067) |
| NB (Var = 13) | .049 (.038,.060) | .049 (.038,.060) | .050 (.039,.061) | |
| NB (Var = 40) | .047 (.036,.058) | .044 (.033,.055) | .046 (.036,.057) | |
| NB (Var = 70) | .052 (.041,.063) | .048 (.037,.059) | .062 (.049,.074) | |
| ZIP (Var = 8) | .058 (.046,.070) | .051 (.040,.063) | ||
all distributions except ZIP have E[Y] = λ = 5, for ZIP E[Y] = 0.8 * 5 = 4.
ODP (over-dispersed Poisson); NB (negative binomial); ZIP (zero-inflated Poisson)
* For the true distribution under the Poisson, the ZIP model failed to converge for n = 672 of the simulations.
Distribution of drinking outcome by timepoint and randomization group
| Base line | 3 Months | |||
| C (n = 169) | T (n = 72) | C (n = 141) | T (n = 130) | |
| MIN | 0.17 | 0 | 0 | 0 |
| 25th percentile | 1.14 | 1.32 | 0.17 | 0.13 |
| MEDIAN | 3.47 | 3.85 | 1.8 | 1.6 |
| 75th percentile | 8.23 | 9.12 | 6.1 | 5.7 |
| MAX | 61.77 | 60 | 48.6 | 38.43 |
| mean (SD) | 6.95 (9.58) | 6.68 (8.44) | 4.98 (8.47) | 4.36 (6.47) |
p-values for the ASAP randomization group effect at 3 months for a variety of count models
| MODEL | p-value |
| Poisson | .018 |
| over-dispersed Poisson | .489 |
| Negative binomial | .458 |
| zero-inflated Poisson | .542 |
| zero-inflated negative binomial | .489 |
| t-test | .495 |
| Wilcoxon | .805 |
| Permutation | .746 |
Figure 3Observed and predicted values from the ASAP study at 3 months for control and treatment groups for each of four models: Wilcoxon, Poisson, negative binomial and zero-inflated Poisson.
Figure 4Observed minus expected values from the ASAP study at 3 months as a function of count for the Poisson, negative binomial and zero-inflated Poisson.