Literature DB >> 25690850

StringTie enables improved reconstruction of a transcriptome from RNA-seq reads.

Mihaela Pertea1, Geo M Pertea1, Corina M Antonescu1, Tsung-Cheng Chang2, Joshua T Mendell3, Steven L Salzberg4.   

Abstract

Methods used to sequence the transcriptome often produce more than 200 million short sequences. We introduce StringTie, a computational method that applies a network flow algorithm originally developed in optimization theory, together with optional de novo assembly, to assemble these complex data sets into transcripts. When used to analyze both simulated and real data sets, StringTie produces more complete and accurate reconstructions of genes and better estimates of expression levels, compared with other leading transcript assembly programs including Cufflinks, IsoLasso, Scripture and Traph. For example, on 90 million reads from human blood, StringTie correctly assembled 10,990 transcripts, whereas the next best assembly was of 7,187 transcripts by Cufflinks, which is a 53% increase in transcripts assembled. On a simulated data set, StringTie correctly assembled 7,559 transcripts, which is 20% more than the 6,310 assembled by Cufflinks. As well as producing a more complete transcriptome assembly, StringTie runs faster on all data sets tested to date compared with other assembly software, including Cufflinks.

Entities:  

Mesh:

Substances:

Year:  2015        PMID: 25690850      PMCID: PMC4643835          DOI: 10.1038/nbt.3122

Source DB:  PubMed          Journal:  Nat Biotechnol        ISSN: 1087-0156            Impact factor:   54.908


  30 in total

Review 1.  Alternative splicing: new insights from global analyses.

Authors:  Benjamin J Blencowe
Journal:  Cell       Date:  2006-07-14       Impact factor: 41.582

Review 2.  Computational methods for transcriptome annotation and quantification using RNA-seq.

Authors:  Manuel Garber; Manfred G Grabherr; Mitchell Guttman; Cole Trapnell
Journal:  Nat Methods       Date:  2011-05-27       Impact factor: 28.547

3.  Integrative annotation of human large intergenic noncoding RNAs reveals global properties and specific subclasses.

Authors:  Moran N Cabili; Cole Trapnell; Loyal Goff; Magdalena Koziol; Barbara Tazon-Vega; Aviv Regev; John L Rinn
Journal:  Genes Dev       Date:  2011-09-02       Impact factor: 11.361

4.  Biases in Illumina transcriptome sequencing caused by random hexamer priming.

Authors:  Kasper D Hansen; Steven E Brenner; Sandrine Dudoit
Journal:  Nucleic Acids Res       Date:  2010-04-14       Impact factor: 16.971

5.  Oases: robust de novo RNA-seq assembly across the dynamic range of expression levels.

Authors:  Marcel H Schulz; Daniel R Zerbino; Martin Vingron; Ewan Birney
Journal:  Bioinformatics       Date:  2012-02-24       Impact factor: 6.937

6.  Optimizing de novo transcriptome assembly from short-read RNA-Seq data: a comparative study.

Authors:  Qiong-Yi Zhao; Yi Wang; Yi-Meng Kong; Da Luo; Xuan Li; Pei Hao
Journal:  BMC Bioinformatics       Date:  2011-12-14       Impact factor: 3.169

7.  Streaming fragment assignment for real-time analysis of sequencing experiments.

Authors:  Adam Roberts; Lior Pachter
Journal:  Nat Methods       Date:  2012-11-18       Impact factor: 28.547

8.  iReckon: simultaneous isoform discovery and abundance estimation from RNA-seq data.

Authors:  Aziz M Mezlini; Eric J M Smith; Marc Fiume; Orion Buske; Gleb L Savich; Sohrab Shah; Sam Aparicio; Derek Y Chiang; Anna Goldenberg; Michael Brudno
Journal:  Genome Res       Date:  2012-11-29       Impact factor: 9.043

9.  The UCSC Genome Browser database: 2014 update.

Authors:  Donna Karolchik; Galt P Barber; Jonathan Casper; Hiram Clawson; Melissa S Cline; Mark Diekhans; Timothy R Dreszer; Pauline A Fujita; Luvina Guruvadoo; Maximilian Haeussler; Rachel A Harte; Steve Heitner; Angie S Hinrichs; Katrina Learned; Brian T Lee; Chin H Li; Brian J Raney; Brooke Rhead; Kate R Rosenbloom; Cricket A Sloan; Matthew L Speir; Ann S Zweig; David Haussler; Robert M Kuhn; W James Kent
Journal:  Nucleic Acids Res       Date:  2013-11-21       Impact factor: 16.971

10.  Assessment of transcript reconstruction methods for RNA-seq.

Authors:  Josep F Abril; Pär G Engström; Felix Kokocinski; Tamara Steijger; Tim J Hubbard; Roderic Guigó; Jennifer Harrow; Paul Bertone
Journal:  Nat Methods       Date:  2013-11-03       Impact factor: 28.547

View more
  2000 in total

1.  The First Draft Genome Assembly of Snow Sheep (Ovis nivicola).

Authors:  Maulik Upadhyay; Andreas Hauser; Elisabeth Kunz; Stefan Krebs; Helmut Blum; Arsen Dotsev; Innokentiy Okhlopkov; Vugar Bagirov; Gottfried Brem; Natalia Zinovieva; Ivica Medugorac
Journal:  Genome Biol Evol       Date:  2020-08-01       Impact factor: 3.416

2.  Transcriptome sequencing and analysis for the pigmentation of scale and skin in common carp (Cyprinus carpio).

Authors:  Yu-Jie Zhao; Jun Xiao; Mei-Di Huangyang; Ran Zhao; Qi Wang; Yan Zhang; Jiong-Tang Li
Journal:  Mol Biol Rep       Date:  2021-03-19       Impact factor: 2.316

3.  RASER: reads aligner for SNPs and editing sites of RNA.

Authors:  Jaegyoon Ahn; Xinshu Xiao
Journal:  Bioinformatics       Date:  2015-08-30       Impact factor: 6.937

4.  Spontaneous Tumor Regression in Tasmanian Devils Associated with RASL11A Activation.

Authors:  Mark J Margres; Manuel Ruiz-Aravena; Rodrigo Hamede; Kusum Chawla; Austin H Patton; Matthew F Lawrance; Alexandra K Fraik; Amanda R Stahlke; Brian W Davis; Elaine A Ostrander; Menna E Jones; Hamish McCallum; Patrick J Paddison; Paul A Hohenlohe; David Hockenbery; Andrew Storfer
Journal:  Genetics       Date:  2020-06-18       Impact factor: 4.562

5.  A Prognostic Signature for Lower Grade Gliomas Based on Expression of Long Non-Coding RNAs.

Authors:  Manjari Kiran; Ajay Chatrath; Xiwei Tang; Daniel Macrae Keenan; Anindya Dutta
Journal:  Mol Neurobiol       Date:  2018-11-03       Impact factor: 5.590

6.  Transcriptional Pause Sites Delineate Stable Nucleosome-Associated Premature Polyadenylation Suppressed by U1 snRNP.

Authors:  Anthony C Chiu; Hiroshi I Suzuki; Xuebing Wu; Dig B Mahat; Andrea J Kriz; Phillip A Sharp
Journal:  Mol Cell       Date:  2018-02-01       Impact factor: 17.970

Review 7.  Identification and Quantification of Proteoforms by Mass Spectrometry.

Authors:  Leah V Schaffer; Robert J Millikin; Rachel M Miller; Lissa C Anderson; Ryan T Fellers; Ying Ge; Neil L Kelleher; Richard D LeDuc; Xiaowen Liu; Samuel H Payne; Liangliang Sun; Paul M Thomas; Trisha Tucholski; Zhe Wang; Si Wu; Zhijie Wu; Dahang Yu; Michael R Shortreed; Lloyd M Smith
Journal:  Proteomics       Date:  2019-05       Impact factor: 3.984

8.  Necklace: combining reference and assembled transcriptomes for more comprehensive RNA-Seq analysis.

Authors:  Nadia M Davidson; Alicia Oshlack
Journal:  Gigascience       Date:  2018-05-01       Impact factor: 6.524

9.  Flexible expressed region analysis for RNA-seq with derfinder.

Authors:  Leonardo Collado-Torres; Abhinav Nellore; Alyssa C Frazee; Christopher Wilks; Michael I Love; Ben Langmead; Rafael A Irizarry; Jeffrey T Leek; Andrew E Jaffe
Journal:  Nucleic Acids Res       Date:  2016-09-29       Impact factor: 16.971

10.  A molecular map of lung neuroendocrine neoplasms.

Authors:  Aurélie A G Gabriel; Emilie Mathian; Lise Mangiante; Catherine Voegele; Vincent Cahais; Akram Ghantous; James D McKay; Nicolas Alcala; Lynnette Fernandez-Cuesta; Matthieu Foll
Journal:  Gigascience       Date:  2020-10-30       Impact factor: 6.524

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.