Literature DB >> 23961961

Calculating sample size estimates for RNA sequencing data.

Steven N Hart1, Terry M Therneau, Yuji Zhang, Gregory A Poland, Jean-Pierre Kocher.   

Abstract

BACKGROUND: Given the high technical reproducibility and orders of magnitude greater resolution than microarrays, next-generation sequencing of mRNA (RNA-Seq) is quickly becoming the de facto standard for measuring levels of gene expression in biological experiments. Two important questions must be taken into consideration when designing a particular experiment, namely, 1) how deep does one need to sequence? and, 2) how many biological replicates are necessary to observe a significant change in expression?
RESULTS: Based on the gene expression distributions from 127 RNA-Seq experiments, we find evidence that 91% ± 4% of all annotated genes are sequenced at a frequency of 0.1 times per million bases mapped, regardless of sample source. Based on this observation, and combining this information with other parameters such as biological variation and technical variation that we empirically estimate from our large datasets, we developed a model to estimate the statistical power needed to identify differentially expressed genes from RNA-Seq experiments.
CONCLUSIONS: Our results provide a needed reference for ensuring RNA-Seq gene expression studies are conducted with the optimally sample size, power, and sequencing depth. We also make available both R code and an Excel worksheet for investigators to calculate for their own experiments.

Mesh:

Substances:

Year:  2013        PMID: 23961961      PMCID: PMC3842884          DOI: 10.1089/cmb.2012.0283

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  11 in total

1.  RNA-sequence analysis of human B-cells.

Authors:  Jonathan M Toung; Michael Morley; Mingyao Li; Vivian G Cheung
Journal:  Genome Res       Date:  2011-05-02       Impact factor: 9.043

2.  Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks.

Authors:  Cole Trapnell; Adam Roberts; Loyal Goff; Geo Pertea; Daehwan Kim; David R Kelley; Harold Pimentel; Steven L Salzberg; John L Rinn; Lior Pachter
Journal:  Nat Protoc       Date:  2012-03-01       Impact factor: 13.491

3.  Ultrafast and memory-efficient alignment of short DNA sequences to the human genome.

Authors:  Ben Langmead; Cole Trapnell; Mihai Pop; Steven L Salzberg
Journal:  Genome Biol       Date:  2009-03-04       Impact factor: 13.583

4.  Technical and biological variance structure in mRNA-Seq data: life in the real world.

Authors:  Ann L Oberg; Brian M Bot; Diane E Grill; Gregory A Poland; Terry M Therneau
Journal:  BMC Genomics       Date:  2012-07-07       Impact factor: 3.969

Review 5.  From RNA-seq reads to differential expression results.

Authors:  Alicia Oshlack; Mark D Robinson; Matthew D Young
Journal:  Genome Biol       Date:  2010-12-22       Impact factor: 13.583

6.  Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation.

Authors:  Cole Trapnell; Brian A Williams; Geo Pertea; Ali Mortazavi; Gordon Kwan; Marijke J van Baren; Steven L Salzberg; Barbara J Wold; Lior Pachter
Journal:  Nat Biotechnol       Date:  2010-05-02       Impact factor: 54.908

7.  Evaluation of the coverage and depth of transcriptome by RNA-Seq in chickens.

Authors:  Ying Wang; Noushin Ghaffari; Charles D Johnson; Ulisses M Braga-Neto; Hui Wang; Rui Chen; Huaijun Zhou
Journal:  BMC Bioinformatics       Date:  2011-10-18       Impact factor: 3.169

8.  Transcript length bias in RNA-seq data confounds systems biology.

Authors:  Alicia Oshlack; Matthew J Wakefield
Journal:  Biol Direct       Date:  2009-04-16       Impact factor: 4.540

9.  edgeR: a Bioconductor package for differential expression analysis of digital gene expression data.

Authors:  Mark D Robinson; Davis J McCarthy; Gordon K Smyth
Journal:  Bioinformatics       Date:  2009-11-11       Impact factor: 6.937

10.  TopHat: discovering splice junctions with RNA-Seq.

Authors:  Cole Trapnell; Lior Pachter; Steven L Salzberg
Journal:  Bioinformatics       Date:  2009-03-16       Impact factor: 6.937

View more
  99 in total

1.  Monocyte Polarization is Altered by Total-Body Irradiation in Male Rhesus Macaques: Implications for Delayed Effects of Acute Radiation Exposure.

Authors:  Kristofer T Michalson; Andrew N Macintyre; Gregory D Sempowski; J Daniel Bourland; Timothy D Howard; Gregory A Hawkins; Gregory O Dugan; J Mark Cline; Thomas C Register
Journal:  Radiat Res       Date:  2019-06-04       Impact factor: 2.841

2.  Simulation, power evaluation and sample size recommendation for single-cell RNA-seq.

Authors:  Kenong Su; Zhijin Wu; Hao Wu
Journal:  Bioinformatics       Date:  2020-12-08       Impact factor: 6.937

3.  Biological Perspectives of RNA-Sequencing Experimental Design.

Authors:  Metsada Pasmanik-Chor
Journal:  Methods Mol Biol       Date:  2021

4.  Power and sample size calculations for high-throughput sequencing-based experiments.

Authors:  Chung-I Li; David C Samuels; Ying-Yong Zhao; Yu Shyr; Yan Guo
Journal:  Brief Bioinform       Date:  2018-11-27       Impact factor: 11.622

Review 5.  Tools and resources for analyzing gene expression changes in glaucomatous neurodegeneration.

Authors:  Robert W Nickells; Heather R Pelzel
Journal:  Exp Eye Res       Date:  2015-05-19       Impact factor: 3.467

6.  PROPER: comprehensive power evaluation for differential expression using RNA-seq.

Authors:  Hao Wu; Chi Wang; Zhijin Wu
Journal:  Bioinformatics       Date:  2014-10-01       Impact factor: 6.937

Review 7.  Metatranscriptome of the Oral Microbiome in Health and Disease.

Authors:  J Solbiati; J Frias-Lopez
Journal:  J Dent Res       Date:  2018-03-08       Impact factor: 6.116

8.  The Expanding Toolkit of Translating Ribosome Affinity Purification.

Authors:  Joseph D Dougherty
Journal:  J Neurosci       Date:  2017-12-13       Impact factor: 6.167

9.  Differentially expressed genes from RNA-Seq and functional enrichment results are affected by the choice of single-end versus paired-end reads and stranded versus non-stranded protocols.

Authors:  Susan M Corley; Karen L MacKenzie; Annemiek Beverdam; Louise F Roddam; Marc R Wilkins
Journal:  BMC Genomics       Date:  2017-05-23       Impact factor: 3.969

10.  Use of RNA-seq to identify cardiac genes and gene pathways differentially expressed between dogs with and without dilated cardiomyopathy.

Authors:  Steven G Friedenberg; Lhoucine Chdid; Bruce Keene; Barbara Sherry; Alison Motsinger-Reif; Kathryn M Meurs
Journal:  Am J Vet Res       Date:  2016-07       Impact factor: 1.156

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.