Literature DB >> 23104841

Empirical bayesian selection of hypothesis testing procedures for analysis of sequence count expression data.

Stanley B Pounds1, Cuilan L Gao, Hui Zhang.   

Abstract

Differential expression analysis of sequence-count expression data involves performing a large number of hypothesis tests that compare the expression count data of each gene or transcript across two or more biological conditions. The assumptions of any specific hypothesis-testing method will probably not be valid for each of a very large number of genes. Thus, computational evaluation of assumptions should be incorporated into the analysis to select an appropriate hypothesis-testing method for each gene. Here, we generalize earlier work to introduce two novel procedures that use estimates of the empirical Bayesian probability (EBP) of overdispersion to select or combine results of a standard Poisson likelihood ratio test and a quasi-likelihood test for each gene. These EBP-based procedures simultaneously evaluate the Poisson-distribution assumption and account for multiple testing. With adequate power to detect overdispersion, the new procedures select the standard likelihood test for each gene with Poisson-distributed counts and the quasi-likelihood test for each gene with overdispersed counts. The new procedures outperformed previously published methods in many simulation studies. We also present a real-data analysis example and discuss how the framework used to develop the new procedures may be generalized to further enhance performance. An R code library that implements the methods is freely available at www.stjuderesearch.org/depts/biostats/software.

Mesh:

Year:  2012        PMID: 23104841     DOI: 10.1515/1544-6115.1773

Source DB:  PubMed          Journal:  Stat Appl Genet Mol Biol        ISSN: 1544-6115


  3 in total

1.  The most informative spacing test effectively discovers biologically relevant outliers or multiple modes in expression.

Authors:  Iwona Pawlikowska; Gang Wu; Michael Edmonson; Zhifa Liu; Tanja Gruber; Jinghui Zhang; Stan Pounds
Journal:  Bioinformatics       Date:  2014-01-22       Impact factor: 6.937

Review 2.  A comparison study on modeling of clustered and overdispersed count data for multiple comparisons.

Authors:  Jochen Kruppa; Ludwig Hothorn
Journal:  J Appl Stat       Date:  2020-07-03       Impact factor: 1.416

3.  De novo transcript sequence reconstruction from RNA-seq using the Trinity platform for reference generation and analysis.

Authors:  Brian J Haas; Alexie Papanicolaou; Moran Yassour; Manfred Grabherr; Philip D Blood; Joshua Bowden; Matthew Brian Couger; David Eccles; Bo Li; Matthias Lieber; Matthew D MacManes; Michael Ott; Joshua Orvis; Nathalie Pochet; Francesco Strozzi; Nathan Weeks; Rick Westerman; Thomas William; Colin N Dewey; Robert Henschel; Richard D LeDuc; Nir Friedman; Aviv Regev
Journal:  Nat Protoc       Date:  2013-07-11       Impact factor: 13.491

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.