Literature DB >> 22003245

Normalization, testing, and false discovery rate estimation for RNA-sequencing data.

Jun Li1, Daniela M Witten, Iain M Johnstone, Robert Tibshirani.   

Abstract

We discuss the identification of genes that are associated with an outcome in RNA sequencing and other sequence-based comparative genomic experiments. RNA-sequencing data take the form of counts, so models based on the Gaussian distribution are unsuitable. Moreover, normalization is challenging because different sequencing experiments may generate quite different total numbers of reads. To overcome these difficulties, we use a log-linear model with a new approach to normalization. We derive a novel procedure to estimate the false discovery rate (FDR). Our method can be applied to data with quantitative, two-class, or multiple-class outcomes, and the computation is fast even for large data sets. We study the accuracy of our approaches for significance calculation and FDR estimation, and we demonstrate that our method has potential advantages over existing methods that are based on a Poisson or negative binomial model. In summary, this work provides a pipeline for the significance analysis of sequencing data.

Mesh:

Substances:

Year:  2011        PMID: 22003245      PMCID: PMC3372940          DOI: 10.1093/biostatistics/kxr031

Source DB:  PubMed          Journal:  Biostatistics        ISSN: 1465-4644            Impact factor:   5.899


  31 in total

1.  Moderated statistical tests for assessing differences in tag abundance.

Authors:  Mark D Robinson; Gordon K Smyth
Journal:  Bioinformatics       Date:  2007-09-19       Impact factor: 6.937

2.  RNA-seq: an assessment of technical reproducibility and comparison with gene expression arrays.

Authors:  John C Marioni; Christopher E Mason; Shrikant M Mane; Matthew Stephens; Yoav Gilad
Journal:  Genome Res       Date:  2008-06-11       Impact factor: 9.043

3.  The beginning of the end for microarrays?

Authors:  Jay Shendure
Journal:  Nat Methods       Date:  2008-07       Impact factor: 28.547

4.  The transcriptional landscape of the yeast genome defined by RNA sequencing.

Authors:  Ugrappa Nagalakshmi; Zhong Wang; Karl Waern; Chong Shou; Debasish Raha; Mark Gerstein; Michael Snyder
Journal:  Science       Date:  2008-05-01       Impact factor: 47.728

5.  Mapping and quantifying mammalian transcriptomes by RNA-Seq.

Authors:  Ali Mortazavi; Brian A Williams; Kenneth McCue; Lorian Schaeffer; Barbara Wold
Journal:  Nat Methods       Date:  2008-05-30       Impact factor: 28.547

Review 6.  RNA-Seq: a revolutionary tool for transcriptomics.

Authors:  Zhong Wang; Mark Gerstein; Michael Snyder
Journal:  Nat Rev Genet       Date:  2009-01       Impact factor: 53.242

7.  Transcript length bias in RNA-seq data confounds systems biology.

Authors:  Alicia Oshlack; Matthew J Wakefield
Journal:  Biol Direct       Date:  2009-04-16       Impact factor: 4.540

8.  Identifying differential expression in multiple SAGE libraries: an overdispersed log-linear model approach.

Authors:  Jun Lu; John K Tomfohr; Thomas B Kepler
Journal:  BMC Bioinformatics       Date:  2005-06-29       Impact factor: 3.169

9.  Measuring differential gene expression by short read sequencing: quantitative comparison to 2-channel gene expression microarrays.

Authors:  Joshua S Bloom; Zia Khan; Leonid Kruglyak; Mona Singh; Amy A Caudy
Journal:  BMC Genomics       Date:  2009-05-12       Impact factor: 3.969

10.  Deep sequencing-based expression analysis shows major advances in robustness, resolution and inter-lab portability over five microarray platforms.

Authors:  Peter A C 't Hoen; Yavuz Ariyurek; Helene H Thygesen; Erno Vreugdenhil; Rolf H A M Vossen; Renée X de Menezes; Judith M Boer; Gert-Jan B van Ommen; Johan T den Dunnen
Journal:  Nucleic Acids Res       Date:  2008-10-15       Impact factor: 16.971

View more
  123 in total

1.  Detection of Test Speededness Using Change-Point Analysis.

Authors:  Can Shao; Jun Li; Ying Cheng
Journal:  Psychometrika       Date:  2015-08-25       Impact factor: 2.500

2.  Zero-Inflated Beta Regression for Differential Abundance Analysis with Metagenomics Data.

Authors:  Xiaoling Peng; Gang Li; Zhenqiu Liu
Journal:  J Comput Biol       Date:  2015-12-16       Impact factor: 1.479

3.  The Brassicaceae Family Displays Divergent, Shoot-Skewed NLR Resistance Gene Expression.

Authors:  David Munch; Vikas Gupta; Asger Bachmann; Wolfgang Busch; Simon Kelly; Terry Mun; Stig Uggerhøj Andersen
Journal:  Plant Physiol       Date:  2017-11-29       Impact factor: 8.340

4.  The Sirt1 activator SRT3025 expands hematopoietic stem and progenitor cells and improves hematopoiesis in Fanconi anemia mice.

Authors:  Qing-Shuo Zhang; Matthew Deater; Kathryn Schubert; Laura Marquez-Loza; Carl Pelz; David A Sinclair; Markus Grompe
Journal:  Stem Cell Res       Date:  2015-05-22       Impact factor: 2.020

Review 5.  Human Wound-Healing Research: Issues and Perspectives for Studies Using Wide-Scale Analytic Platforms.

Authors:  Kristo Nuutila; Shintaro Katayama; Jyrki Vuola; Esko Kankuri
Journal:  Adv Wound Care (New Rochelle)       Date:  2014-03-01       Impact factor: 4.730

6.  Clustering and classification methods for single-cell RNA-sequencing data.

Authors:  Ren Qi; Anjun Ma; Qin Ma; Quan Zou
Journal:  Brief Bioinform       Date:  2020-07-15       Impact factor: 11.622

7.  Selecting between-sample RNA-Seq normalization methods from the perspective of their assumptions.

Authors:  Ciaran Evans; Johanna Hardin; Daniel M Stoebel
Journal:  Brief Bioinform       Date:  2018-09-28       Impact factor: 11.622

8.  Single-gene negative binomial regression models for RNA-Seq data with higher-order asymptotic inference.

Authors:  Yanming Di
Journal:  Stat Interface       Date:  2015       Impact factor: 0.582

9.  Quantitative proteome profiling of lymph node-positive vs. -negative colorectal carcinomas pinpoints MX1 as a marker for lymph node metastasis.

Authors:  Roland S Croner; Michael Stürzl; Tilman T Rau; Gergana Metodieva; Carol I Geppert; Elisabeth Naschberger; Berthold Lausen; Metodi V Metodiev
Journal:  Int J Cancer       Date:  2014-05-12       Impact factor: 7.396

10.  Graphical Models via Univariate Exponential Family Distributions.

Authors:  Eunho Yang; Pradeep Ravikumar; Genevera I Allen; Zhandong Liu
Journal:  J Mach Learn Res       Date:  2015-12       Impact factor: 3.654

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.