Literature DB >> 27592709

Rail-RNA: scalable analysis of RNA-seq splicing and coverage.

Abhinav Nellore1,2,3, Leonardo Collado-Torres2,3,4, Andrew E Jaffe2,3,4,5, José Alquicira-Hernández2,6, Christopher Wilks1,3, Jacob Pritt1,3, James Morton7, Jeffrey T Leek2,3, Ben Langmead1,2,3.   

Abstract

MOTIVATION: RNA sequencing (RNA-seq) experiments now span hundreds to thousands of samples. Current spliced alignment software is designed to analyze each sample separately. Consequently, no information is gained from analyzing multiple samples together, and it requires extra work to obtain analysis products that incorporate data from across samples.
RESULTS: We describe Rail-RNA, a cloud-enabled spliced aligner that analyzes many samples at once. Rail-RNA eliminates redundant work across samples, making it more efficient as samples are added. For many samples, Rail-RNA is more accurate than annotation-assisted aligners. We use Rail-RNA to align 667 RNA-seq samples from the GEUVADIS project on Amazon Web Services in under 16 h for US$0.91 per sample. Rail-RNA outputs alignments in SAM/BAM format; but it also outputs (i) base-level coverage bigWigs for each sample; (ii) coverage bigWigs encoding normalized mean and median coverages at each base across samples analyzed; and (iii) exon-exon splice junctions and indels (features) in columnar formats that juxtapose coverages in samples in which a given feature is found. Supplementary outputs are ready for use with downstream packages for reproducible statistical analysis. We use Rail-RNA to identify expressed regions in the GEUVADIS samples and show that both annotated and unannotated (novel) expressed regions exhibit consistent patterns of variation across populations and with respect to known confounding variables.
AVAILABILITY AND IMPLEMENTATION: Rail-RNA is open-source software available at http://rail.bio. CONTACTS: anellore@gmail.com or langmea@cs.jhu.edu. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com

Entities:  

Mesh:

Year:  2017        PMID: 27592709      PMCID: PMC5860083          DOI: 10.1093/bioinformatics/btw575

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  36 in total

1.  OSA: a fast and accurate alignment tool for RNA-Seq.

Authors:  Jun Hu; Huanying Ge; Matt Newman; Kejun Liu
Journal:  Bioinformatics       Date:  2012-05-15       Impact factor: 6.937

2.  Reproducibility of high-throughput mRNA and small RNA sequencing across laboratories.

Authors:  Peter A C 't Hoen; Marc R Friedländer; Jonas Almlöf; Michael Sammeth; Irina Pulyakhina; Seyed Yahya Anvar; Jeroen F J Laros; Henk P J Buermans; Olof Karlberg; Mathias Brännvall; Johan T den Dunnen; Gert-Jan B van Ommen; Ivo G Gut; Roderic Guigó; Xavier Estivill; Ann-Christine Syvänen; Emmanouil T Dermitzakis; Tuuli Lappalainen
Journal:  Nat Biotechnol       Date:  2013-09-15       Impact factor: 54.908

3.  The case for cloud computing in genome informatics.

Authors:  Lincoln D Stein
Journal:  Genome Biol       Date:  2010-05-05       Impact factor: 13.583

4.  Fast gapped-read alignment with Bowtie 2.

Authors:  Ben Langmead; Steven L Salzberg
Journal:  Nat Methods       Date:  2012-03-04       Impact factor: 28.547

5.  Differential expression analysis of RNA-seq data at single-base resolution.

Authors:  Alyssa C Frazee; Sarven Sabunciyan; Kasper D Hansen; Rafael A Irizarry; Jeffrey T Leek
Journal:  Biostatistics       Date:  2014-01-06       Impact factor: 5.899

6.  Cloud computing and the DNA data race.

Authors:  Michael C Schatz; Ben Langmead; Steven L Salzberg
Journal:  Nat Biotechnol       Date:  2010-07       Impact factor: 54.908

7.  Detection of splice junctions from paired-end RNA-seq data by SpliceMap.

Authors:  Kin Fai Au; Hui Jiang; Lan Lin; Yi Xing; Wing Hung Wong
Journal:  Nucleic Acids Res       Date:  2010-04-05       Impact factor: 16.971

Review 8.  RNA-Seq: a revolutionary tool for transcriptomics.

Authors:  Zhong Wang; Mark Gerstein; Michael Snyder
Journal:  Nat Rev Genet       Date:  2009-01       Impact factor: 53.242

9.  The Subread aligner: fast, accurate and scalable read mapping by seed-and-vote.

Authors:  Yang Liao; Gordon K Smyth; Wei Shi
Journal:  Nucleic Acids Res       Date:  2013-04-04       Impact factor: 16.971

10.  TopHat2: accurate alignment of transcriptomes in the presence of insertions, deletions and gene fusions.

Authors:  Daehwan Kim; Geo Pertea; Cole Trapnell; Harold Pimentel; Ryan Kelley; Steven L Salzberg
Journal:  Genome Biol       Date:  2013-04-25       Impact factor: 13.583

View more
  27 in total

1.  Improving the value of public RNA-seq expression data by phenotype prediction.

Authors:  Shannon E Ellis; Leonardo Collado-Torres; Andrew Jaffe; Jeffrey T Leek
Journal:  Nucleic Acids Res       Date:  2018-05-18       Impact factor: 16.971

2.  Reproducible RNA-seq analysis using recount2.

Authors:  Leonardo Collado-Torres; Abhinav Nellore; Kai Kammers; Shannon E Ellis; Margaret A Taub; Kasper D Hansen; Andrew E Jaffe; Ben Langmead; Jeffrey T Leek
Journal:  Nat Biotechnol       Date:  2017-04-11       Impact factor: 54.908

3.  Targeting Tumor-Stromal IL6/STAT3 Signaling through IL1 Receptor Inhibition in Pancreatic Cancer.

Authors:  Austin R Dosch; Samara Singh; Xizi Dai; Siddharth Mehra; Iago De Castro Silva; Anna Bianchi; Supriya Srinivasan; Zhen Gao; Yuguang Ban; Xi Chen; Sulagna Banerjee; Nagaraj S Nagathihalli; Jashodeep Datta; Nipun B Merchant
Journal:  Mol Cancer Ther       Date:  2021-09-13       Impact factor: 6.261

4.  Bias-invariant RNA-sequencing metadata annotation.

Authors:  Hannes Wartmann; Sven Heins; Karin Kloiber; Stefan Bonn
Journal:  Gigascience       Date:  2021-09-22       Impact factor: 6.524

5.  Flexible expressed region analysis for RNA-seq with derfinder.

Authors:  Leonardo Collado-Torres; Abhinav Nellore; Alyssa C Frazee; Christopher Wilks; Michael I Love; Ben Langmead; Rafael A Irizarry; Jeffrey T Leek; Andrew E Jaffe
Journal:  Nucleic Acids Res       Date:  2016-09-29       Impact factor: 16.971

6.  SeQuiLa-cov: A fast and scalable library for depth of coverage calculations.

Authors:  Marek Wiewiórka; Agnieszka Szmurło; Wiktor Kuśmirek; Tomasz Gambin
Journal:  Gigascience       Date:  2019-08-01       Impact factor: 6.524

Review 7.  Cloud computing for genomic data analysis and collaboration.

Authors:  Ben Langmead; Abhinav Nellore
Journal:  Nat Rev Genet       Date:  2018-01-30       Impact factor: 53.242

8.  Epigenomic and Transcriptomic Dynamics During Human Heart Organogenesis.

Authors:  Jennifer VanOudenhove; Tara N Yankee; Andrea Wilderman; Justin Cotney
Journal:  Circ Res       Date:  2020-08-09       Impact factor: 17.367

9.  The Lair: a resource for exploratory analysis of published RNA-Seq data.

Authors:  Harold Pimentel; Pascal Sturmfels; Nicolas Bray; Páll Melsted; Lior Pachter
Journal:  BMC Bioinformatics       Date:  2016-12-01       Impact factor: 3.169

10.  Single cell derived mRNA signals across human kidney tumors.

Authors:  Matthew D Young; Thomas J Mitchell; Lars Custers; Thanasis Margaritis; Francisco Morales-Rodriguez; Kwasi Kwakwa; Eleonora Khabirova; Gerda Kildisiute; Thomas R W Oliver; Ronald R de Krijger; Marry M van den Heuvel-Eibrink; Federico Comitani; Alice Piapi; Eva Bugallo-Blanco; Christine Thevanesan; Christina Burke; Elena Prigmore; Kirsty Ambridge; Kenny Roberts; Felipe A Vieira Braga; Tim H H Coorens; Ignacio Del Valle; Anna Wilbrey-Clark; Lira Mamanova; Grant D Stewart; Vincent J Gnanapragasam; Dyanne Rampling; Neil Sebire; Nicholas Coleman; Liz Hook; Anne Warren; Muzlifah Haniffa; Marcel Kool; Stefan M Pfister; John C Achermann; Xiaoling He; Roger A Barker; Adam Shlien; Omer A Bayraktar; Sarah A Teichmann; Frank C Holstege; Kerstin B Meyer; Jarno Drost; Karin Straathof; Sam Behjati
Journal:  Nat Commun       Date:  2021-06-23       Impact factor: 14.919

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.