Literature DB >> 22127579

Finding consistent patterns: a nonparametric approach for identifying differential expression in RNA-Seq data.

Jun Li1, Robert Tibshirani.   

Abstract

We discuss the identification of features that are associated with an outcome in RNA-Sequencing (RNA-Seq) and other sequencing-based comparative genomic experiments. RNA-Seq data takes the form of counts, so models based on the normal distribution are generally unsuitable. The problem is especially challenging because different sequencing experiments may generate quite different total numbers of reads, or 'sequencing depths'. Existing methods for this problem are based on Poisson or negative binomial models: they are useful but can be heavily influenced by 'outliers' in the data. We introduce a simple, non-parametric method with resampling to account for the different sequencing depths. The new method is more robust than parametric methods. It can be applied to data with quantitative, survival, two-class or multiple-class outcomes. We compare our proposed method to Poisson and negative binomial-based methods in simulated and real data sets, and find that our method discovers more consistent patterns than competing methods.

Entities:  

Keywords:  FDR; RNA-Seq; differential expression; nonparametric; resampling

Mesh:

Year:  2011        PMID: 22127579      PMCID: PMC4605138          DOI: 10.1177/0962280211428386

Source DB:  PubMed          Journal:  Stat Methods Med Res        ISSN: 0962-2802            Impact factor:   3.021


  28 in total

1.  Analysis of variance for gene expression microarray data.

Authors:  M K Kerr; M Martin; G A Churchill
Journal:  J Comput Biol       Date:  2000       Impact factor: 1.479

2.  DNA arrays for analysis of gene expression.

Authors:  M B Eisen; P O Brown
Journal:  Methods Enzymol       Date:  1999       Impact factor: 1.600

3.  Normalization, testing, and false discovery rate estimation for RNA-sequencing data.

Authors:  Jun Li; Daniela M Witten; Iain M Johnstone; Robert Tibshirani
Journal:  Biostatistics       Date:  2011-10-14       Impact factor: 5.899

4.  RNA-seq: an assessment of technical reproducibility and comparison with gene expression arrays.

Authors:  John C Marioni; Christopher E Mason; Shrikant M Mane; Matthew Stephens; Yoav Gilad
Journal:  Genome Res       Date:  2008-06-11       Impact factor: 9.043

5.  Mapping and quantifying mammalian transcriptomes by RNA-Seq.

Authors:  Ali Mortazavi; Brian A Williams; Kenneth McCue; Lorian Schaeffer; Barbara Wold
Journal:  Nat Methods       Date:  2008-05-30       Impact factor: 28.547

6.  baySeq: empirical Bayesian methods for identifying differential expression in sequence count data.

Authors:  Thomas J Hardcastle; Krystyna A Kelly
Journal:  BMC Bioinformatics       Date:  2010-08-10       Impact factor: 3.169

7.  Comprehensive identification of cell cycle-regulated genes of the yeast Saccharomyces cerevisiae by microarray hybridization.

Authors:  P T Spellman; G Sherlock; M Q Zhang; V R Iyer; K Anders; M B Eisen; P O Brown; D Botstein; B Futcher
Journal:  Mol Biol Cell       Date:  1998-12       Impact factor: 4.138

Review 8.  RNA-Seq: a revolutionary tool for transcriptomics.

Authors:  Zhong Wang; Mark Gerstein; Michael Snyder
Journal:  Nat Rev Genet       Date:  2009-01       Impact factor: 53.242

9.  Differential expression analysis for sequence count data.

Authors:  Simon Anders; Wolfgang Huber
Journal:  Genome Biol       Date:  2010-10-27       Impact factor: 13.583

10.  edgeR: a Bioconductor package for differential expression analysis of digital gene expression data.

Authors:  Mark D Robinson; Davis J McCarthy; Gordon K Smyth
Journal:  Bioinformatics       Date:  2009-11-11       Impact factor: 6.937

View more
  177 in total

1.  Robustly detecting differential expression in RNA sequencing data using observation weights.

Authors:  Xiaobei Zhou; Helen Lindsay; Mark D Robinson
Journal:  Nucleic Acids Res       Date:  2014-04-20       Impact factor: 16.971

2.  SimSeq: a nonparametric approach to simulation of RNA-sequence datasets.

Authors:  Sam Benidt; Dan Nettleton
Journal:  Bioinformatics       Date:  2015-02-26       Impact factor: 6.937

3.  rSeqNP: a non-parametric approach for detecting differential expression and splicing from RNA-Seq data.

Authors:  Yang Shi; Arul M Chinnaiyan; Hui Jiang
Journal:  Bioinformatics       Date:  2015-02-24       Impact factor: 6.937

4.  Statistical Modeling of High Dimensional Counts.

Authors:  Michael I Love
Journal:  Methods Mol Biol       Date:  2021

5.  Interpretation of differential gene expression results of RNA-seq data: review and integration.

Authors:  Adam McDermaid; Brandon Monier; Jing Zhao; Bingqiang Liu; Qin Ma
Journal:  Brief Bioinform       Date:  2019-11-27       Impact factor: 11.622

6.  Quantitative proteome profiling of lymph node-positive vs. -negative colorectal carcinomas pinpoints MX1 as a marker for lymph node metastasis.

Authors:  Roland S Croner; Michael Stürzl; Tilman T Rau; Gergana Metodieva; Carol I Geppert; Elisabeth Naschberger; Berthold Lausen; Metodi V Metodiev
Journal:  Int J Cancer       Date:  2014-05-12       Impact factor: 7.396

7.  Nonparametric expression analysis using inferential replicate counts.

Authors:  Anqi Zhu; Avi Srivastava; Joseph G Ibrahim; Rob Patro; Michael I Love
Journal:  Nucleic Acids Res       Date:  2019-10-10       Impact factor: 16.971

8.  Swimming downstream: statistical analysis of differential transcript usage following Salmon quantification.

Authors:  Michael I Love; Charlotte Soneson; Rob Patro
Journal:  F1000Res       Date:  2018-06-27

9.  DiPhiSeq: robust comparison of expression levels on RNA-Seq data with large sample sizes.

Authors:  Jun Li; Alicia T Lamere
Journal:  Bioinformatics       Date:  2019-07-01       Impact factor: 6.937

10.  Bayesian Framework for Detecting Gene Expression Outliers in Individual Samples.

Authors:  John Vivian; Jordan M Eizenga; Holly C Beale; Olena M Vaske; Benedict Paten
Journal:  JCO Clin Cancer Inform       Date:  2020-02
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.