Literature DB >> 28983398

A Review of Multivariate Distributions for Count Data Derived from the Poisson Distribution.

David Inouye1, Eunho Yang2, Genevera Allen3, Pradeep Ravikumar4.   

Abstract

The Poisson distribution has been widely studied and used for modeling univariate count-valued data. Multivariate generalizations of the Poisson distribution that permit dependencies, however, have been far less popular. Yet, real-world high-dimensional count-valued data found in word counts, genomics, and crime statistics, for example, exhibit rich dependencies, and motivate the need for multivariate distributions that can appropriately model this data. We review multivariate distributions derived from the univariate Poisson, categorizing these models into three main classes: 1) where the marginal distributions are Poisson, 2) where the joint distribution is a mixture of independent multivariate Poisson distributions, and 3) where the node-conditional distributions are derived from the Poisson. We discuss the development of multiple instances of these classes and compare the models in terms of interpretability and theory. Then, we empirically compare multiple models from each class on three real-world datasets that have varying data characteristics from different domains, namely traffic accident data, biological next generation sequencing data, and text data. These empirical experiments develop intuition about the comparative advantages and disadvantages of each class of multivariate distribution that was derived from the Poisson. Finally, we suggest new research directions as explored in the subsequent discussion section.

Entities:  

Keywords:  Copulas; Graphical Models; High Dimensional; Multivariate; Poisson

Year:  2017        PMID: 28983398      PMCID: PMC5624559          DOI: 10.1002/wics.1398

Source DB:  PubMed          Journal:  Wiley Interdiscip Rev Comput Stat        ISSN: 1939-0068


  11 in total

1.  Highway accident severities and the mixed logit model: an exploratory empirical analysis.

Authors:  John C Milton; Venky N Shankar; Fred L Mannering
Journal:  Accid Anal Prev       Date:  2007-07-16

2.  A Local Poisson Graphical Model for inferring networks from sequencing data.

Authors:  Genevera I Allen; Zhandong Liu
Journal:  IEEE Trans Nanobioscience       Date:  2013-08-15       Impact factor: 2.935

3.  Collision prediction models using multivariate Poisson-lognormal regression.

Authors:  Karim El-Basyouny; Tarek Sayed
Journal:  Accid Anal Prev       Date:  2009-05-03

4.  A multivariate Poisson-lognormal regression model for prediction of crash counts by severity, using Bayesian methods.

Authors:  Jianming Ma; Kara M Kockelman; Paul Damien
Journal:  Accid Anal Prev       Date:  2007-12-18

5.  FastMMD: Ensemble of Circular Discrepancy for Efficient Two-Sample Test.

Authors:  Ji Zhao; Deyu Meng
Journal:  Neural Comput       Date:  2015-03-16       Impact factor: 2.026

6.  Square Root Graphical Models: Multivariate Generalizations of Univariate Exponential Families that Permit Positive Dependencies.

Authors:  David I Inouye; Pradeep Ravikumar; Inderjit S Dhillon
Journal:  JMLR Workshop Conf Proc       Date:  2016-06

7.  A copula-based mixed Poisson model for bivariate recurrent events under event-dependent censoring.

Authors:  Richard J Cook; Jerald F Lawless; Ker-Ai Lee
Journal:  Stat Med       Date:  2010-03-15       Impact factor: 2.373

8.  Graphical Models via Univariate Exponential Family Distributions.

Authors:  Eunho Yang; Pradeep Ravikumar; Genevera I Allen; Zhandong Liu
Journal:  J Mach Learn Res       Date:  2015-12       Impact factor: 3.654

9.  TCGA2STAT: simple TCGA data access for integrated statistical analysis in R.

Authors:  Ying-Wooi Wan; Genevera I Allen; Zhandong Liu
Journal:  Bioinformatics       Date:  2015-11-14       Impact factor: 6.937

10.  Estimation of sparse directed acyclic graphs for multivariate counts data.

Authors:  Sung Won Han; Hua Zhong
Journal:  Biometrics       Date:  2016-02-05       Impact factor: 2.571

View more
  10 in total

1.  ADJUSTED REGULARIZATION IN LATENT GRAPHICAL MODELS: APPLICATION TO MULTIPLE-NEURON SPIKE COUNT DATA.

Authors:  Giuseppe Vinci; Valérie Ventura; Matthew A Smith; Robert E Kass
Journal:  Ann Appl Stat       Date:  2018-07-28       Impact factor: 2.083

2.  MODEL-BASED FEATURE SELECTION AND CLUSTERING OF RNA-SEQ DATA FOR UNSUPERVISED SUBTYPE DISCOVERY.

Authors:  David K Lim; Naim U Rashid; Joseph G Ibrahim
Journal:  Ann Appl Stat       Date:  2021-03-18       Impact factor: 2.083

3.  A new bivariate Poisson distribution via conditional specification: properties and applications.

Authors:  Indranil Ghosh; Filipe Marques; Subrata Chakraborty
Journal:  J Appl Stat       Date:  2020-07-14       Impact factor: 1.416

4.  Prioritizing Autism Risk Genes using Personalized Graphical Models Estimated from Single Cell RNA-seq Data.

Authors:  Jianyu Liu; Haodong Wang; Wei Sun; Yufeng Liu
Journal:  J Am Stat Assoc       Date:  2021-07-21       Impact factor: 4.369

5.  ESCO: single cell expression simulation incorporating gene co-expression.

Authors:  Jinjin Tian; Jiebiao Wang; Kathryn Roeder
Journal:  Bioinformatics       Date:  2021-02-24       Impact factor: 6.937

6.  scDesign2: a transparent simulator that generates high-fidelity single-cell gene expression count data with gene correlations captured.

Authors:  Tianyi Sun; Dongyuan Song; Wei Vivian Li; Jingyi Jessica Li
Journal:  Genome Biol       Date:  2021-05-25       Impact factor: 13.583

7.  Modelling the neural code in large populations of correlated neurons.

Authors:  Sacha Sokoloski; Amir Aschner; Ruben Coen-Cagli
Journal:  Elife       Date:  2021-10-05       Impact factor: 8.140

8.  Nonparametric graphical model for counts.

Authors:  Arkaprava Roy; David B Dunson
Journal:  J Mach Learn Res       Date:  2020-12       Impact factor: 5.177

9.  Sufficient dimension reduction for compositional data.

Authors:  Diego Tomassi; Liliana Forzani; Sabrina Duarte; Ruth M Pfeiffer
Journal:  Biostatistics       Date:  2021-10-13       Impact factor: 5.899

10.  Approximate inference of gene regulatory network models from RNA-Seq time series data.

Authors:  Thomas Thorne
Journal:  BMC Bioinformatics       Date:  2018-04-11       Impact factor: 3.169

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.