Literature DB >> 23734627

A novel min-cost flow method for estimating transcript expression with RNA-Seq.

Alexandru I Tomescu1, Anna Kuosmanen, Romeo Rizzi, Veli Mäkinen.   

Abstract

BACKGROUND: Through transcription and alternative splicing, a gene can be transcribed into different RNA sequences (isoforms), depending on the individual, on the tissue the cell is in, or in response to some stimuli. Recent RNA-Seq technology allows for new high-throughput ways for isoform identification and quantification based on short reads, and various methods have been put forward for this non-trivial problem.
RESULTS: In this paper we propose a novel radically different method based on minimum-cost network flows. This has a two-fold advantage: on the one hand, it translates the problem as an established one in the field of network flows, which can be solved in polynomial time, with different existing solvers; on the other hand, it is general enough to encompass many of the previous proposals under the least sum of squares model. Our method works as follows: in order to find the transcripts which best explain, under a given fitness model, a splicing graph resulting from an RNA-Seq experiment, we find a min-cost flow in an offset flow network, under an equivalent cost model. Under very weak assumptions on the fitness model, the optimal flow can be computed in polynomial time. Parsimoniously splitting the flow back into few path transcripts can be done with any of the heuristics and approximations available from the theory of network flows. In the present implementation, we choose the simple strategy of repeatedly removing the heaviest path.
CONCLUSIONS: We proposed a new very general method based on network flows for a multiassembly problem arising from isoform identification and quantification with RNA-Seq. Experimental results on prediction accuracy show that our method is very competitive with popular tools such as Cufflinks and IsoLasso. Our tool, called Traph (Transcrips in gRAPHs), is available at: http://www.cs.helsinki.fi/gsa/traph/.

Entities:  

Mesh:

Substances:

Year:  2013        PMID: 23734627      PMCID: PMC3622638          DOI: 10.1186/1471-2105-14-S5-S15

Source DB:  PubMed          Journal:  BMC Bioinformatics        ISSN: 1471-2105            Impact factor:   3.169


  11 in total

1.  Splicing graphs and EST assembly problem.

Authors:  Steffen Heber; Max Alekseyev; Sing-Hoi Sze; Haixu Tang; Pavel A Pevzner
Journal:  Bioinformatics       Date:  2002       Impact factor: 6.937

2.  The multiassembly problem: reconstructing multiple transcript isoforms from EST fragment mixtures.

Authors:  Yi Xing; Alissa Resch; Christopher Lee
Journal:  Genome Res       Date:  2004-02-12       Impact factor: 9.043

3.  Sparse linear modeling of next-generation mRNA sequencing (RNA-Seq) data for isoform discovery and abundance estimation.

Authors:  Jingyi Jessica Li; Ci-Ren Jiang; James B Brown; Haiyan Huang; Peter J Bickel
Journal:  Proc Natl Acad Sci U S A       Date:  2011-12-01       Impact factor: 11.205

4.  IsoLasso: a LASSO regression approach to RNA-Seq based transcriptome assembly.

Authors:  Wei Li; Jianxing Feng; Tao Jiang
Journal:  J Comput Biol       Date:  2011-09-27       Impact factor: 1.479

5.  Mapping and quantifying mammalian transcriptomes by RNA-Seq.

Authors:  Ali Mortazavi; Brian A Williams; Kenneth McCue; Lorian Schaeffer; Barbara Wold
Journal:  Nat Methods       Date:  2008-05-30       Impact factor: 28.547

Review 6.  Computation for ChIP-seq and RNA-seq studies.

Authors:  Shirley Pepke; Barbara Wold; Ali Mortazavi
Journal:  Nat Methods       Date:  2009-11       Impact factor: 28.547

7.  The clonal and mutational evolution spectrum of primary triple-negative breast cancers.

Authors:  Sohrab P Shah; Andrew Roth; Rodrigo Goya; Arusha Oloumi; Gavin Ha; Yongjun Zhao; Gulisa Turashvili; Jiarui Ding; Kane Tse; Gholamreza Haffari; Ali Bashashati; Leah M Prentice; Jaswinder Khattra; Angela Burleigh; Damian Yap; Virginie Bernard; Andrew McPherson; Karey Shumansky; Anamaria Crisan; Ryan Giuliany; Alireza Heravi-Moussavi; Jamie Rosner; Daniel Lai; Inanc Birol; Richard Varhol; Angela Tam; Noreen Dhalla; Thomas Zeng; Kevin Ma; Simon K Chan; Malachi Griffith; Annie Moradian; S-W Grace Cheng; Gregg B Morin; Peter Watson; Karen Gelmon; Stephen Chia; Suet-Feung Chin; Christina Curtis; Oscar M Rueda; Paul D Pharoah; Sambasivarao Damaraju; John Mackey; Kelly Hoon; Timothy Harkins; Vasisht Tadigotla; Mahvash Sigaroudinia; Philippe Gascard; Thea Tlsty; Joseph F Costello; Irmtraud M Meyer; Connie J Eaves; Wyeth W Wasserman; Steven Jones; David Huntsman; Martin Hirst; Carlos Caldas; Marco A Marra; Samuel Aparicio
Journal:  Nature       Date:  2012-04-04       Impact factor: 49.962

8.  MetaVelvet: an extension of Velvet assembler to de novo metagenome assembly from short sequence reads.

Authors:  Toshiaki Namiki; Tsuyoshi Hachiya; Hideaki Tanaka; Yasubumi Sakakibara
Journal:  Nucleic Acids Res       Date:  2012-07-19       Impact factor: 16.971

9.  Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation.

Authors:  Cole Trapnell; Brian A Williams; Geo Pertea; Ali Mortazavi; Gordon Kwan; Marijke J van Baren; Steven L Salzberg; Barbara J Wold; Lior Pachter
Journal:  Nat Biotechnol       Date:  2010-05-02       Impact factor: 54.908

10.  TopHat: discovering splice junctions with RNA-Seq.

Authors:  Cole Trapnell; Lior Pachter; Steven L Salzberg
Journal:  Bioinformatics       Date:  2009-03-16       Impact factor: 6.937

View more
  22 in total

1.  Accurate assembly of transcripts through phase-preserving graph decomposition.

Authors:  Mingfu Shao; Carl Kingsford
Journal:  Nat Biotechnol       Date:  2017-11-13       Impact factor: 54.908

2.  On the complexity of Minimum Path Cover with Subpath Constraints for multi-assembly.

Authors:  Romeo Rizzi; Alexandru I Tomescu; Veli Mäkinen
Journal:  BMC Bioinformatics       Date:  2014-09-10       Impact factor: 3.169

3.  StringTie enables improved reconstruction of a transcriptome from RNA-seq reads.

Authors:  Mihaela Pertea; Geo M Pertea; Corina M Antonescu; Tsung-Cheng Chang; Joshua T Mendell; Steven L Salzberg
Journal:  Nat Biotechnol       Date:  2015-02-18       Impact factor: 54.908

4.  Leveraging transcript quantification for fast computation of alternative splicing profiles.

Authors:  Gael P Alamancos; Amadís Pagès; Juan L Trincado; Nicolás Bellora; Eduardo Eyras
Journal:  RNA       Date:  2015-07-15       Impact factor: 4.942

5.  A rank-based sequence aligner with applications in phylogenetic analysis.

Authors:  Liviu P Dinu; Radu Tudor Ionescu; Alexandru I Tomescu
Journal:  PLoS One       Date:  2014-08-18       Impact factor: 3.240

6.  Letting the data speak for themselves: a fully Bayesian approach to transcriptome assembly.

Authors:  Marcel H Schulz
Journal:  Genome Biol       Date:  2014       Impact factor: 13.583

7.  Bayesian transcriptome assembly.

Authors:  Lasse Maretty; Jonas Andreas Sibbesen; Anders Krogh
Journal:  Genome Biol       Date:  2014       Impact factor: 13.583

8.  Efficient RNA isoform identification and quantification from RNA-Seq data with network flows.

Authors:  Elsa Bernard; Laurent Jacob; Julien Mairal; Jean-Philippe Vert
Journal:  Bioinformatics       Date:  2014-05-09       Impact factor: 6.937

9.  Transcriptome assembly and quantification from Ion Torrent RNA-Seq data.

Authors:  Serghei Mangul; Adrian Caciula; Sahar Al Seesi; Dumitru Brinza; Ion Mӑndoiu; Alex Zelikovsky
Journal:  BMC Genomics       Date:  2014-07-14       Impact factor: 3.969

10.  A convex formulation for joint RNA isoform detection and quantification from multiple RNA-seq samples.

Authors:  Elsa Bernard; Laurent Jacob; Julien Mairal; Eric Viara; Jean-Philippe Vert
Journal:  BMC Bioinformatics       Date:  2015-08-19       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.