Literature DB >> 31080303

Analyzing clustered count data with a cluster specific random effect zero-inflated Conway-Maxwell-Poisson distribution.

Hyoyoung Choo-Wosoba1, Somnath Datta2.   

Abstract

In recent years, data analysis techniques have been developed in biological and medical research areas with different types of count distributions. In particular, zero-inflated versions of parametric count distributions have been used to model excessive zeros that are often present in these assays. Perhaps, the most common count distribution which has been used for analyzing such data is the Poisson distribution. However, a Poisson distribution, having a single underlying parameter, cannot cope with any other data dispersion pattern besides equidispersion. A negative binomial distribution is capable of modeling overdispersed, but not underdispersed data. However, a Conway-Maxwell-Poisson (CMP) distribution (Conway, R. W., and Maxwell, W. L., 1962) can handle not only overdispersion but also underdispersion. We show with an illustrative data set on next generation sequencing of maize hybrids that both underdispersion and overdispersion can be present in genetic data. Furthermore, if count data consists of clustered observations, one of the most efficient statistical technique is to introduce a cluster specific random effect term. Once again, the maize hybrids data presents such a situation. We develop inference procedures for a zero-inflated CMP regression that incorporates a cluster specific random effect term. Unlike, the Gaussian models, the underlying likelihood is computationally challenging. We use a numerical approximation via a Gaussian quadrature to circumvent this issues. A test for checking zero-inflation has also been developed in our setting. Finite sample properties of our estimators and test have been investigated by extensive simulations. Finally, the statistical methodology has been applied to analyze the maize data mentioned before.

Entities:  

Keywords:  Gaussian-Hermite (G-H) quadrature; Mixed effects model; Next- generation sequencing (NGS) data; Poisson distribution; Under- and over-dispersions

Year:  2017        PMID: 31080303      PMCID: PMC6510495          DOI: 10.1080/02664763.2017.1312299

Source DB:  PubMed          Journal:  J Appl Stat        ISSN: 0266-4763            Impact factor:   1.404


  3 in total

1.  On some aspects of a zero-inflated overdispersed model and its applications.

Authors:  C Satheesh Kumar; Rakhi Ramachandran
Journal:  J Appl Stat       Date:  2019-07-24       Impact factor: 1.416

2.  Analyzing longitudinal clustered count data with zero inflation: Marginal modeling using the Conway-Maxwell-Poisson distribution.

Authors:  Tong Kang; Steven M Levy; Somnath Datta
Journal:  Biom J       Date:  2021-01-04       Impact factor: 1.715

3.  A longitudinal Bayesian mixed effects model with hurdle Conway-Maxwell-Poisson distribution.

Authors:  Tong Kang; Jeremy Gaskins; Steven Levy; Somnath Datta
Journal:  Stat Med       Date:  2020-12-23       Impact factor: 2.497

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.