Literature DB >> 23615947

A model based criterion for gene expression calls using RNA-seq data.

Günter P Wagner1, Koryu Kin, Vincent J Lynch.   

Abstract

The power of deep sequencing technology to reliably detect single RNA reads leads to a paradoxical problem of high sensitivity. In hybridization or PCR based methods for RNA quantification, the concern is low sensitivity, i.e., the problem that the signal from truly expressed genes might not be distinguishable from noise. In contrast, the problem with RNA-seq is that it is not clear whether genes with very low read counts are from low expressed genes or merely transcriptional noise. The frequency distribution for read counts does not show a clear separation in two classes of genes, which makes the decision whether a gene is to be considered expressed or not seemingly arbitrary. Here we address this problem by suggesting a statistical model that considers the number of transcripts detected in a RNA-seq study as a mixture of two distributions: one is a exponential distribution for transcripts from inactive genes, and a negative binomial distribution for actively transcribed genes. We apply this model to a number of RNA-seq data sets and find that the model fits the data very well. The calculated criteria for distinguishing between expressed and non-expressed gene is remarkably consistent among data sets, suggesting genes with more than two transcripts per million transcripts (TPM) are highly likely from actively transcribed genes. This criterion is consistent with the criterion of 1 RPKM proposed by Hebenstreit et al. Mol Sys Biol 7:497 (2011), based on chromatin modification and per cell RNA expression data. Hence, the regression model correctly identifies the not actively expressed class of genes and thus, provides an operational criterion for classifying genes in expressed and non-expressed sets, facilitating the interpretation of RNA-seq data.

Entities:  

Mesh:

Substances:

Year:  2013        PMID: 23615947     DOI: 10.1007/s12064-013-0178-3

Source DB:  PubMed          Journal:  Theory Biosci        ISSN: 1431-7613            Impact factor:   1.919


  11 in total

1.  Transcriptional noise and the fidelity of initiation by RNA polymerase II.

Authors:  Kevin Struhl
Journal:  Nat Struct Mol Biol       Date:  2007-02       Impact factor: 15.369

2.  The transcriptional landscape of the yeast genome defined by RNA sequencing.

Authors:  Ugrappa Nagalakshmi; Zhong Wang; Karl Waern; Chong Shou; Debasish Raha; Mark Gerstein; Michael Snyder
Journal:  Science       Date:  2008-05-01       Impact factor: 47.728

3.  Mapping and quantifying mammalian transcriptomes by RNA-Seq.

Authors:  Ali Mortazavi; Brian A Williams; Kenneth McCue; Lorian Schaeffer; Barbara Wold
Journal:  Nat Methods       Date:  2008-05-30       Impact factor: 28.547

4.  Measurement of mRNA abundance using RNA-seq data: RPKM measure is inconsistent among samples.

Authors:  Günter P Wagner; Koryu Kin; Vincent J Lynch
Journal:  Theory Biosci       Date:  2012-08-08       Impact factor: 1.919

5.  Three abundance classes in HeLa cell messenger RNA.

Authors:  J O Bishop; J G Morton; M Rosbash; M Richardson
Journal:  Nature       Date:  1974-07-19       Impact factor: 49.962

6.  ChIP-seq accurately predicts tissue-specific activity of enhancers.

Authors:  Axel Visel; Matthew J Blow; Zirong Li; Tao Zhang; Jennifer A Akiyama; Amy Holt; Ingrid Plajzer-Frick; Malak Shoukry; Crystal Wright; Feng Chen; Veena Afzal; Bing Ren; Edward M Rubin; Len A Pennacchio
Journal:  Nature       Date:  2009-02-12       Impact factor: 49.962

7.  A scaling normalization method for differential expression analysis of RNA-seq data.

Authors:  Mark D Robinson; Alicia Oshlack
Journal:  Genome Biol       Date:  2010-03-02       Impact factor: 13.583

8.  RNA-Seq gene expression estimation with read mapping uncertainty.

Authors:  Bo Li; Victor Ruotti; Ron M Stewart; James A Thomson; Colin N Dewey
Journal:  Bioinformatics       Date:  2009-12-18       Impact factor: 6.937

Review 9.  RNA-Seq: a revolutionary tool for transcriptomics.

Authors:  Zhong Wang; Mark Gerstein; Michael Snyder
Journal:  Nat Rev Genet       Date:  2009-01       Impact factor: 53.242

10.  Differential expression analysis for sequence count data.

Authors:  Simon Anders; Wolfgang Huber
Journal:  Genome Biol       Date:  2010-10-27       Impact factor: 13.583

View more
  66 in total

1.  A multi-omic screening approach for the discovery of thermoactive glycoside hydrolases.

Authors:  Philip Busch; Marcel Suleiman; Christian Schäfers; Garabed Antranikian
Journal:  Extremophiles       Date:  2021-01-08       Impact factor: 2.395

2.  Allele-specific miRNA-binding analysis identifies candidate target genes for breast cancer risk.

Authors:  Ana Jacinta-Fernandes; Joana M Xavier; Ramiro Magno; Joel G Lage; Ana-Teresa Maia
Journal:  NPJ Genom Med       Date:  2020-02-13       Impact factor: 8.617

3.  MIPPIE: the mouse integrated protein-protein interaction reference.

Authors:  Gregorio Alanis-Lobato; Jannik S Möllmann; Martin H Schaefer; Miguel A Andrade-Navarro
Journal:  Database (Oxford)       Date:  2020-01-01       Impact factor: 3.451

4.  Redefining the Small Regulatory RNA Transcriptome in Streptococcus pneumoniae Serotype 2 Strain D39.

Authors:  Dhriti Sinha; Kurt Zimmer; Todd A Cameron; Douglas B Rusch; Malcolm E Winkler; Nicholas R De Lay
Journal:  J Bacteriol       Date:  2019-06-21       Impact factor: 3.490

Review 5.  Old cell, new trick? Cnidocytes as a model for the evolution of novelty.

Authors:  Leslie S Babonis; Mark Q Martindale
Journal:  Integr Comp Biol       Date:  2014-04-25       Impact factor: 3.326

6.  An endogenous retroviral envelope syncytin and its cognate receptor identified in the viviparous placental Mabuya lizard.

Authors:  Guillaume Cornelis; Mathis Funk; Cécile Vernochet; Francisca Leal; Oscar Alejandro Tarazona; Guillaume Meurice; Odile Heidmann; Anne Dupressoir; Aurélien Miralles; Martha Patricia Ramirez-Pinilla; Thierry Heidmann
Journal:  Proc Natl Acad Sci U S A       Date:  2017-11-21       Impact factor: 11.205

7.  Polymorphism and Divergence of Novel Gene Expression Patterns in Drosophila melanogaster.

Authors:  Julie M Cridland; Alex C Majane; Hayley K Sheehy; David J Begun
Journal:  Genetics       Date:  2020-07-31       Impact factor: 4.562

Review 8.  Quantitative bacterial transcriptomics with RNA-seq.

Authors:  James P Creecy; Tyrrell Conway
Journal:  Curr Opin Microbiol       Date:  2014-12-05       Impact factor: 7.934

9.  RNA-seq of HaHV-1-infected abalones reveals a common transcriptional signature of Malacoherpesviruses.

Authors:  Chang-Ming Bai; Umberto Rosani; Ya-Nan Li; Shu-Min Zhang; Lu-Sheng Xin; Chong-Ming Wang
Journal:  Sci Rep       Date:  2019-01-30       Impact factor: 4.379

10.  A Needle in A Haystack: Tracing Bivalve-Associated Viruses in High-Throughput Transcriptomic Data.

Authors:  Umberto Rosani; Maxwell Shapiro; Paola Venier; Bassem Allam
Journal:  Viruses       Date:  2019-03-01       Impact factor: 5.048

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.