Literature DB >> 23888185

Simultaneous isoform discovery and quantification from RNA-seq.

David Hiller1, Wing Hung Wong.   

Abstract

RNA sequencing is a recent technology which has seen an explosion of methods addressing all levels of analysis, from read mapping to transcript assembly to differential expression modeling. In particular the discovery of isoforms at the transcript assembly stage is a complex problem and current approaches suffer from various limitations. For instance, many approaches use graphs to construct a minimal set of isoforms which covers the observed reads, then perform a separate algorithm to quantify the isoforms, which can result in a loss of power. Current methods also use ad-hoc solutions to deal with the vast number of possible isoforms which can be constructed from a given set of reads. Finally, while the need of taking into account features such as read pairing and sampling rate of reads has been acknowledged, most existing methods do not seamlessly integrate these features as part of the model. We present Montebello, an integrated statistical approach which performs simultaneous isoform discovery and quantification by using a Monte Carlo simulation to find the most likely isoform composition leading to a set of observed reads. We compare Montebello to Cufflinks, a popular isoform discovery approach, on a simulated data set and on 46.3 million brain reads from an Illumina tissue panel. On this data set Montebello appears to offer a modest improvement over Cufflinks when considering discovery and parsimony metrics. In addition Montebello mitigates specific difficulties inherent in the Cufflinks approach. Finally, Montebello can be fine-tuned depending on the type of solution desired.

Entities:  

Keywords:  Algorithms; Alternative Splicing; Isoform Discovery; Monte Carlo; RNA-Seq

Year:  2013        PMID: 23888185      PMCID: PMC3718502          DOI: 10.1007/s12561-012-9069-2

Source DB:  PubMed          Journal:  Stat Biosci        ISSN: 1867-1764


  35 in total

1.  Using Poisson mixed-effects model to quantify transcript-level gene expression in RNA-Seq.

Authors:  Ming Hu; Yu Zhu; Jeremy M G Taylor; Jun S Liu; Zhaohui S Qin
Journal:  Bioinformatics       Date:  2011-11-08       Impact factor: 6.937

2.  A powerful and flexible approach to the analysis of RNA sequence count data.

Authors:  Yi-Hui Zhou; Kai Xia; Fred A Wright
Journal:  Bioinformatics       Date:  2011-08-02       Impact factor: 6.937

3.  Analysis and design of RNA sequencing experiments for identifying isoform regulation.

Authors:  Yarden Katz; Eric T Wang; Edoardo M Airoldi; Christopher B Burge
Journal:  Nat Methods       Date:  2010-11-07       Impact factor: 28.547

4.  baySeq: empirical Bayesian methods for identifying differential expression in sequence count data.

Authors:  Thomas J Hardcastle; Krystyna A Kelly
Journal:  BMC Bioinformatics       Date:  2010-08-10       Impact factor: 3.169

5.  Detection of splice junctions from paired-end RNA-seq data by SpliceMap.

Authors:  Kin Fai Au; Hui Jiang; Lan Lin; Yi Xing; Wing Hung Wong
Journal:  Nucleic Acids Res       Date:  2010-04-05       Impact factor: 16.971

6.  IsoformEx: isoform level gene expression estimation using weighted non-negative least squares from mRNA-Seq data.

Authors:  Hyunsoo Kim; Yingtao Bi; Sharmistha Pal; Ravi Gupta; Ramana V Davuluri
Journal:  BMC Bioinformatics       Date:  2011-07-27       Impact factor: 3.169

7.  RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome.

Authors:  Bo Li; Colin N Dewey
Journal:  BMC Bioinformatics       Date:  2011-08-04       Impact factor: 3.307

8.  An expectation-maximization algorithm for probabilistic reconstructions of full-length isoforms from splice graphs.

Authors:  Yi Xing; Tianwei Yu; Ying Nian Wu; Meenakshi Roy; Joseph Kim; Christopher Lee
Journal:  Nucleic Acids Res       Date:  2006-06-06       Impact factor: 16.971

9.  edgeR: a Bioconductor package for differential expression analysis of digital gene expression data.

Authors:  Mark D Robinson; Davis J McCarthy; Gordon K Smyth
Journal:  Bioinformatics       Date:  2009-11-11       Impact factor: 6.937

10.  SPACE: an algorithm to predict and quantify alternatively spliced isoforms using microarrays.

Authors:  Miguel A Anton; Dorleta Gorostiaga; Elizabeth Guruceaga; Victor Segura; Pedro Carmona-Saez; Alberto Pascual-Montano; Ruben Pio; Luis M Montuenga; Angel Rubio
Journal:  Genome Biol       Date:  2008-02-29       Impact factor: 13.583

View more
  8 in total

1.  Characterization of the human ESC transcriptome by hybrid sequencing.

Authors:  Kin Fai Au; Vittorio Sebastiano; Pegah Tootoonchi Afshar; Jens Durruthy Durruthy; Lawrence Lee; Brian A Williams; Harm van Bakel; Eric E Schadt; Renee A Reijo-Pera; Jason G Underwood; Wing Hung Wong
Journal:  Proc Natl Acad Sci U S A       Date:  2013-11-26       Impact factor: 11.205

2.  On the complexity of Minimum Path Cover with Subpath Constraints for multi-assembly.

Authors:  Romeo Rizzi; Alexandru I Tomescu; Veli Mäkinen
Journal:  BMC Bioinformatics       Date:  2014-09-10       Impact factor: 3.169

3.  Integrative gene set enrichment analysis utilizing isoform-specific expression.

Authors:  Lie Li; Xinlei Wang; Guanghua Xiao; Adi Gazdar
Journal:  Genet Epidemiol       Date:  2017-06-04       Impact factor: 2.135

4.  IsoSCM: improved and alternative 3' UTR annotation using multiple change-point inference.

Authors:  Sol Shenker; Pedro Miura; Piero Sanfilippo; Eric C Lai
Journal:  RNA       Date:  2014-11-18       Impact factor: 4.942

Review 5.  Radiogenomic Analysis of Oncological Data: A Technical Survey.

Authors:  Mariarosaria Incoronato; Marco Aiello; Teresa Infante; Carlo Cavaliere; Anna Maria Grimaldi; Peppino Mirabelli; Serena Monti; Marco Salvatore
Journal:  Int J Mol Sci       Date:  2017-04-12       Impact factor: 5.923

6.  Computational approaches for isoform detection and estimation: good and bad news.

Authors:  Claudia Angelini; Daniela De Canditiis; Italia De Feis
Journal:  BMC Bioinformatics       Date:  2014-05-09       Impact factor: 3.169

7.  CIDANE: comprehensive isoform discovery and abundance estimation.

Authors:  Stefan Canzar; Sandro Andreotti; David Weese; Knut Reinert; Gunnar W Klau
Journal:  Genome Biol       Date:  2016-01-30       Impact factor: 13.583

Review 8.  A survey of best practices for RNA-seq data analysis.

Authors:  Ana Conesa; Pedro Madrigal; Sonia Tarazona; David Gomez-Cabrero; Alejandra Cervera; Andrew McPherson; Michał Wojciech Szcześniak; Daniel J Gaffney; Laura L Elo; Xuegong Zhang; Ali Mortazavi
Journal:  Genome Biol       Date:  2016-01-26       Impact factor: 13.583

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.