Literature DB >> 24957791

Marginalized multilevel hurdle and zero-inflated models for overdispersed and correlated count data with excess zeros.

Wondwosen Kassahun1, Thomas Neyens, Geert Molenberghs, Christel Faes, Geert Verbeke.   

Abstract

Count data are collected repeatedly over time in many applications, such as biology, epidemiology, and public health. Such data are often characterized by the following three features. First, correlation due to the repeated measures is usually accounted for using subject-specific random effects, which are assumed to be normally distributed. Second, the sample variance may exceed the mean, and hence, the theoretical mean-variance relationship is violated, leading to overdispersion. This is usually allowed for based on a hierarchical approach, combining a Poisson model with gamma distributed random effects. Third, an excess of zeros beyond what standard count distributions can predict is often handled by either the hurdle or the zero-inflated model. A zero-inflated model assumes two processes as sources of zeros and combines a count distribution with a discrete point mass as a mixture, while the hurdle model separately handles zero observations and positive counts, where then a truncated-at-zero count distribution is used for the non-zero state. In practice, however, all these three features can appear simultaneously. Hence, a modeling framework that incorporates all three is necessary, and this presents challenges for the data analysis. Such models, when conditionally specified, will naturally have a subject-specific interpretation. However, adopting their purposefully modified marginalized versions leads to a direct marginal or population-averaged interpretation for parameter estimates of covariate effects, which is the primary interest in many applications. In this paper, we present a marginalized hurdle model and a marginalized zero-inflated model for correlated and overdispersed count data with excess zero observations and then illustrate these further with two case studies. The first dataset focuses on the Anopheles mosquito density around a hydroelectric dam, while adolescents' involvement in work, to earn money and support their families or themselves, is studied in the second example. Sub-models, which result from omitting zero-inflation and/or overdispersion features, are also considered for comparison's purpose. Analysis of the two datasets showed that accounting for the correlation, overdispersion, and excess zeros simultaneously resulted in a better fit to the data and, more importantly, that omission of any of them leads to incorrect marginal inference and erroneous conclusions about covariate effects.
Copyright © 2014 John Wiley & Sons, Ltd.

Keywords:  clustering; hurdle model; marginal model; multilevel model; overdispersion; zero-inflated model

Mesh:

Year:  2014        PMID: 24957791     DOI: 10.1002/sim.6237

Source DB:  PubMed          Journal:  Stat Med        ISSN: 0277-6715            Impact factor:   2.373


  9 in total

1.  Features of the bronchial bacterial microbiome associated with atopy, asthma, and responsiveness to inhaled corticosteroid treatment.

Authors:  Juliana Durack; Susan V Lynch; Snehal Nariya; Nirav R Bhakta; Avraham Beigelman; Mario Castro; Anne-Marie Dyer; Elliot Israel; Monica Kraft; Richard J Martin; David T Mauger; Sharon R Rosenberg; Tonya Sharp-King; Steven R White; Prescott G Woodruff; Pedro C Avila; Loren C Denlinger; Fernando Holguin; Stephen C Lazarus; Njira Lugogo; Wendy C Moore; Stephen P Peters; Loretta Que; Lewis J Smith; Christine A Sorkness; Michael E Wechsler; Sally E Wenzel; Homer A Boushey; Yvonne J Huang
Journal:  J Allergy Clin Immunol       Date:  2016-11-10       Impact factor: 10.793

2.  Marginalized zero-altered models for longitudinal count data.

Authors:  Loni Philip Tabb; Eric J Tchetgen Tchetgen; Greg A Wellenius; Brent A Coull
Journal:  Stat Biosci       Date:  2015-09-22

3.  A semiparametric marginalized zero-inflated model for analyzing healthcare utilization panel data with missingness.

Authors:  Tian Chen; Hui Zhang; Bo Zhang
Journal:  J Appl Stat       Date:  2019-05-22       Impact factor: 1.404

4.  A joint model for multivariate hierarchical semicontinuous data with replications.

Authors:  Wondwosen Kassahun-Yimer; Paul S Albert; Leah M Lipsky; Tonja R Nansel; Aiyi Liu
Journal:  Stat Methods Med Res       Date:  2017-11-08       Impact factor: 3.021

5.  Randomized Trial of Motivational Interviewing to Prevent Early Childhood Caries in Public Housing.

Authors:  M M Henshaw; B Borrelli; S E Gregorich; B Heaton; E M Tooley; W Santo; N F Cheng; M Rasmussen; S Helman; S Shain; R I Garcia
Journal:  JDR Clin Trans Res       Date:  2018-08-22

6.  Randomized Trial of Motivational Interviewing to Prevent Early Childhood Caries in American Indian Children.

Authors:  T S Batliner; T Tiwari; W G Henderson; A R Wilson; S E Gregorich; K A Fehringer; A G Brega; E Swyers; T Zacher; M M Harper; K Plunkett; W Santo; N F Cheng; S Shain; M Rasmussen; S M Manson; J Albino
Journal:  JDR Clin Trans Res       Date:  2018-07-12

7.  Influence of Community-Led Total Sanitation and Water Coverages in the Control of Cholera in Madarounfa, Niger (2018).

Authors:  Julien Graveleau; Maria Eleanor Reserva; Alama Keita; Roberto Molinari; Guillaume Constantin De Magny
Journal:  Front Public Health       Date:  2021-04-29

8.  Straightening Beta: Overdispersion of Lethal Chromosome Aberrations following Radiotherapeutic Doses Leads to Terminal Linearity in the Alpha-Beta Model.

Authors:  Igor Shuryak; Bradford D Loucas; Michael N Cornforth
Journal:  Front Oncol       Date:  2017-12-21       Impact factor: 6.244

9.  Determining the Efficacy of a Hybridizing Agent in Wheat (Triticum aestivum L.).

Authors:  Amanda C Easterly; Walter W Stroup; Nicholas Garst; Vikas Belamkar; Jean-Benoit Sarazin; Thierry Moittié; Amir M H Ibrahim; Jackie C Rudd; Edward Souza; P Stephen Baenziger
Journal:  Sci Rep       Date:  2019-12-27       Impact factor: 4.379

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.