Literature DB >> 23502340

Higher order asymptotics for negative binomial regression inferences from RNA-sequencing data.

Yanming Di1, Sarah C Emerson, Daniel W Schafer, Jeffrey A Kimbrel, Jeff H Chang.   

Abstract

RNA sequencing (RNA-Seq) is the current method of choice for characterizing transcriptomes and quantifying gene expression changes. This next generation sequencing-based method provides unprecedented depth and resolution. The negative binomial (NB) probability distribution has been shown to be a useful model for frequencies of mapped RNA-Seq reads and consequently provides a basis for statistical analysis of gene expression. Negative binomial exact tests are available for two-group comparisons but do not extend to negative binomial regression analysis, which is important for examining gene expression as a function of explanatory variables and for adjusted group comparisons accounting for other factors. We address the adequacy of available large-sample tests for the small sample sizes typically available from RNA-Seq studies and consider a higher-order asymptotic (HOA) adjustment to likelihood ratio tests. We demonstrate that 1) the HOA-adjusted likelihood ratio test is practically indistinguishable from the exact test in situations where the exact test is available, 2) the type I error of the HOA test matches the nominal specification in regression settings we examined via simulation, and 3) the power of the likelihood ratio test does not appear to be affected by the HOA adjustment. This work helps clarify the accuracy of the unadjusted likelihood ratio test and the degree of improvement available with the HOA adjustment. Furthermore, the HOA test may be preferable even when the exact test is available because it does not require ad hoc library size adjustments.

Entities:  

Mesh:

Substances:

Year:  2013        PMID: 23502340      PMCID: PMC3683603          DOI: 10.1515/sagmb-2012-0071

Source DB:  PubMed          Journal:  Stat Appl Genet Mol Biol        ISSN: 1544-6115


  18 in total

1.  Statistical significance for genomewide studies.

Authors:  John D Storey; Robert Tibshirani
Journal:  Proc Natl Acad Sci U S A       Date:  2003-07-25       Impact factor: 11.205

2.  Moderated statistical tests for assessing differences in tag abundance.

Authors:  Mark D Robinson; Gordon K Smyth
Journal:  Bioinformatics       Date:  2007-09-19       Impact factor: 6.937

3.  Detecting differential expression in RNA-sequence data using quasi-likelihood with shrunken dispersion estimates.

Authors:  Steven P Lund; Dan Nettleton; Davis J McCarthy; Gordon K Smyth
Journal:  Stat Appl Genet Mol Biol       Date:  2012-10-22

4.  Bacterial blight of soybean: regulation of a pathogen gene determining host cultivar specificity.

Authors:  T V Huynh; D Dahlbeck; B J Staskawicz
Journal:  Science       Date:  1989-09-22       Impact factor: 47.728

Review 5.  Closing the circle on the discovery of genes encoding Hrp regulon members and type III secretion system effectors in the genomes of three model Pseudomonas syringae strains.

Authors:  Magdalen Lindeberg; Samuel Cartinhour; Christopher R Myers; Lisa M Schechter; David J Schneider; Alan Collmer
Journal:  Mol Plant Microbe Interact       Date:  2006-11       Impact factor: 4.171

6.  The complete genome sequence of the Arabidopsis and tomato pathogen Pseudomonas syringae pv. tomato DC3000.

Authors:  C Robin Buell; Vinita Joardar; Magdalen Lindeberg; Jeremy Selengut; Ian T Paulsen; Michelle L Gwinn; Robert J Dodson; Robert T Deboy; A Scott Durkin; James F Kolonay; Ramana Madupu; Sean Daugherty; Lauren Brinkac; Maureen J Beanan; Daniel H Haft; William C Nelson; Tanja Davidsen; Nikhat Zafar; Liwei Zhou; Jia Liu; Qiaoping Yuan; Hoda Khouri; Nadia Fedorova; Bao Tran; Daniel Russell; Kristi Berry; Teresa Utterback; Susan E Van Aken; Tamara V Feldblyum; Mark D'Ascenzo; Wen-Ling Deng; Adela R Ramos; James R Alfano; Samuel Cartinhour; Arun K Chatterjee; Terrence P Delaney; Sondra G Lazarowitz; Gregory B Martin; David J Schneider; Xiaoyan Tang; Carol L Bender; Owen White; Claire M Fraser; Alan Collmer
Journal:  Proc Natl Acad Sci U S A       Date:  2003-08-19       Impact factor: 11.205

7.  GENE-counter: a computational pipeline for the analysis of RNA-Seq data for gene expression differences.

Authors:  Jason S Cumbie; Jeffrey A Kimbrel; Yanming Di; Daniel W Schafer; Larry J Wilhelm; Samuel E Fox; Christopher M Sullivan; Aron D Curzon; James C Carrington; Todd C Mockler; Jeff H Chang
Journal:  PLoS One       Date:  2011-10-06       Impact factor: 3.240

8.  Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation.

Authors:  Cole Trapnell; Brian A Williams; Geo Pertea; Ali Mortazavi; Gordon Kwan; Marijke J van Baren; Steven L Salzberg; Barbara J Wold; Lior Pachter
Journal:  Nat Biotechnol       Date:  2010-05-02       Impact factor: 54.908

9.  Differential expression analysis for sequence count data.

Authors:  Simon Anders; Wolfgang Huber
Journal:  Genome Biol       Date:  2010-10-27       Impact factor: 13.583

10.  Differential expression analysis of multifactor RNA-Seq experiments with respect to biological variation.

Authors:  Davis J McCarthy; Yunshun Chen; Gordon K Smyth
Journal:  Nucleic Acids Res       Date:  2012-01-28       Impact factor: 16.971

View more
  7 in total

1.  Robustly detecting differential expression in RNA sequencing data using observation weights.

Authors:  Xiaobei Zhou; Helen Lindsay; Mark D Robinson
Journal:  Nucleic Acids Res       Date:  2014-04-20       Impact factor: 16.971

2.  Single-gene negative binomial regression models for RNA-Seq data with higher-order asymptotic inference.

Authors:  Yanming Di
Journal:  Stat Interface       Date:  2015       Impact factor: 0.582

Review 3.  On the study of microbial transcriptomes using second- and third-generation sequencing technologies.

Authors:  Sang Chul Choi
Journal:  J Microbiol       Date:  2016-08-02       Impact factor: 3.422

4.  Testing for differentially expressed genetic pathways with single-subject N-of-1 data in the presence of inter-gene correlation.

Authors:  A Grant Schissler; Walter W Piegorsch; Yves A Lussier
Journal:  Stat Methods Med Res       Date:  2017-05-29       Impact factor: 3.021

5.  The level of residual dispersion variation and the power of differential expression tests for RNA-Seq data.

Authors:  Gu Mi; Yanming Di
Journal:  PLoS One       Date:  2015-04-07       Impact factor: 3.240

6.  Identifying stably expressed genes from multiple RNA-Seq data sets.

Authors:  Bin Zhuo; Sarah Emerson; Jeff H Chang; Yanming Di
Journal:  PeerJ       Date:  2016-12-20       Impact factor: 2.984

7.  Differential Expression of Genes Involved in Host Recognition, Attachment, and Degradation in the Mycoparasite Tolypocladium ophioglossoides.

Authors:  C Alisha Quandt; Yanming Di; Justin Elser; Pankaj Jaiswal; Joseph W Spatafora
Journal:  G3 (Bethesda)       Date:  2016-01-22       Impact factor: 3.154

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.