Literature DB >> 26821742

RiboGalaxy: A browser based platform for the alignment, analysis and visualization of ribosome profiling data.

Audrey M Michel1, James P A Mullan1, Vimalkumar Velayudhan1, Patrick B F O'Connor1, Claire A Donohue1, Pavel V Baranov1.   

Abstract

Ribosome profiling (ribo-seq) is a technique that uses high-throughput sequencing to reveal the exact locations and densities of translating ribosomes at the entire transcriptome level. The technique has become very popular since its inception in 2009. Yet experimentalists who generate ribo-seq data often have to rely on bioinformaticians to process and analyze their data. We present RiboGalaxy ( http://ribogalaxy.ucc.ie ), a freely available Galaxy-based web server for processing and analyzing ribosome profiling data with the visualization functionality provided by GWIPS-viz ( http://gwips.ucc.ie ). RiboGalaxy offers researchers a suite of tools specifically tailored for processing ribo-seq and corresponding mRNA-seq data. Researchers can take advantage of the published workflows which reduce the multi-step alignment process to a minimum of inputs from the user. Users can then explore their own aligned data as custom tracks in GWIPS-viz and compare their ribosome profiles to existing ribo-seq tracks from published studies. In addition, users can assess the quality of their ribo-seq data, determine the strength of the triplet periodicity signal, generate meta-gene ribosome profiles as well as analyze the relative impact of mRNA sequence features on local read density. RiboGalaxy is accompanied by extensive documentation and tips for helping users. In addition we provide a forum ( http://gwips.ucc.ie/Forum ) where we encourage users to post their questions and feedback to improve the overall RiboGalaxy service.

Entities:  

Keywords:  Galaxy; RNA-seq; gene expression; mRNA; protein synthesis; ribo-seq; ribosome footprinting; ribosome profiling; translation; translatomics

Mesh:

Substances:

Year:  2016        PMID: 26821742      PMCID: PMC4829337          DOI: 10.1080/15476286.2016.1141862

Source DB:  PubMed          Journal:  RNA Biol        ISSN: 1547-6286            Impact factor:   4.652


Introduction

Ribosome profiling, also known as ribo-seq, is a high through-put technique where mRNA fragments protected by ribosomes are isolated and sequenced. This allows the ribosome density along all mRNA transcripts present in the cell to be measured providing genome-wide information on protein synthesis (GWIPS) in vivo. The usages and adaptations of the technique have grown rapidly in recent years (see reviews by ). The pre-processing and alignment of ribo-seq data requires tools which are primarily Linux based and are designed for command-line usage. To visually explore the ribo-seq alignment profiles additional software may be required. If the researcher wishes to compare their ribo-seq alignments to other published ribo-seq data, they would need to download, pre-process and align these data as well. In addition, researchers may wish to carry out analysis that is specific to ribo-seq data, e.g. determine the strength of the triplet periodicity signal and use it for detecting the translated reading frame. Furthermore, the analysis would likely require access to a robust computational infrastructure with a relatively large storage capacity, facilities that are not always readily available. Even if the relevant resources were at hand, the results by different groups may not be always comparable because of differences in the data processing across studies. To address these challenges, we have developed RiboGalaxy, a Galaxy-based web server where researchers can align, analyze and visually explore their ribo-seq data using an internet browser without the need to install numerous software tools and construct analysis pipelines.

Results

Usage of RiboGalaxy is free and does not require user registration. However, to take advantage of certain features such as the published workflows and pages, or upload large datasets using FTP, users will need to create a RiboGalaxy account (a procedure where the user only has to provide a username and password).

Using RiboGalaxy

The typical ribo-seq alignment pipeline requires pre-processing the sequence reads to remove adaptor sequences and filter out any sequence reads that correspond to ribosomal RNA (rRNA). The Pre-processing tool suite on RiboGalaxy hosts Cutadapt for adaptor sequence removal and Bowtie for rRNA removal. RiboGalaxy currently provides pre-built rRNA indices for 10 model organisms. More rRNA indices will be added over time and users may also build their own indices. As ribosome footprints are derived from mRNA, alignment to a transcriptome is usually appropriate. However, there are cases where alignment to a genome is of interest, for example if a transcriptome is not well characterized. We provide both options on RiboGalaxy. The Align to transcriptome using Bowtie option is available under the Transcriptome Mapping suite. Note that for alignments to a transcriptome, a reference transcriptome FASTA file is required. One option is to obtain a reference FASTA file from the UCSC Main table browser option in Get Data (we explain how to do this on the RiboGalaxy Help page). The Align to genome using Bowtie is available under the GWIPS-viz Mapping suite. Pre-built indices are currently provided for 13 genome assemblies. Genomic ribosome profiles can be created for either RNase I or micrococcal nuclease (MNase) generated data and the profiles for the entire genome can be viewed directly as a custom track in GWIPS-viz. If the researcher wishes to generate mRNA (sub-codon) ribosome profiles, we recommend mapping with Bowtie in the Transcriptome Mapping suite followed by riboplot (see ). Running the ribocount tool will provide sub-codon profile counts in spreadsheet format for all transcripts to which ribosome footprints reads were mapped. Triplet periodicity and metagene analysis can also be carried out using the riboSeqR suite of tools (see for examples). The RUST suite of tools allows researchers to check the quality of their ribo-seq data as well as determine the relative impact of mRNA features such as codons, amino acids, dipeptides and tripeptides on the local ribosome profiling read density (see ). The RiboTools suite provides functionality for exploring stop codon readthrough events () and translation in alternative reading frames. Differential translation expression analysis can be carried out on ribo-seq and corresponding mRNA-seq data using the baySeq tool which is available in the riboSeqR suite on RiboGalaxy.
Figure 1.

(A) A sub-codon ribosome footprint along with the open reading frame (ORF) organization for the rat Ldha gene (NM_017025). The footprint reads are color coded (red, green, blue (see the color version of the figure online)) according to the frame alignment (1, 2, 3). The background gray alignments represent mRNA-seq data for the corresponding transcript. In this profile, the majority of the footprint reads are green indicating that they originate from an ORF in the second reading frame. (B) A triplet periodicity plot generated for protein coding regions of the zebrafish transcriptome showing the frequency of footprint 5′ ends mapping across the 3 frames depending on their nucleotide sequence length. (C) A metagene profile of ribosome density relative to the annotated start and stop codons in the zebrafish transcriptome. (D) A RUST metafootprint profile that reveals the influence of mRNA codons on the read density relative to the decoding center (the A-site codon coordinate is shown as 0). The relative entropy (Kullback-Leibler divergence) across these sites is also provided. (E) The relative enrichment of 61 codons in the A-site assessed as the RUST ratio. Codons are grouped by encoded amino acids that are colored according to their physicochemical properties. (F) A ribo-seq profile for the yeast WWM1 gene (YFL010C). The gray bar shows the location of the gene on chromosome VI. The top red panel shows the footprint alignments across the entire WWM1 gene region while the lower red panel is a zoom into the area around the annotated stop codon showing footprint alignments downstream of the annotated stop codon to the next in-frame stop codon. Panel A was generated using the RiboPlot suite on RiboGalaxy using data from the Andreev DE et al. study. Panels B and C were generated using the riboSeqR suite on RiboGalaxy using ribo-seq data from the Bazzini AA et al. study. Panels C and D were generated using the RUST suite on RiboGalaxy using data from the Andreev DE et al. study. Panel F was generated using the RiboTools suite on RiboGalaxy using ribo-seq data from the Baudin-Baillieu A et al. study.

(A) A sub-codon ribosome footprint along with the open reading frame (ORF) organization for the rat Ldha gene (NM_017025). The footprint reads are color coded (red, green, blue (see the color version of the figure online)) according to the frame alignment (1, 2, 3). The background gray alignments represent mRNA-seq data for the corresponding transcript. In this profile, the majority of the footprint reads are green indicating that they originate from an ORF in the second reading frame. (B) A triplet periodicity plot generated for protein coding regions of the zebrafish transcriptome showing the frequency of footprint 5′ ends mapping across the 3 frames depending on their nucleotide sequence length. (C) A metagene profile of ribosome density relative to the annotated start and stop codons in the zebrafish transcriptome. (D) A RUST metafootprint profile that reveals the influence of mRNA codons on the read density relative to the decoding center (the A-site codon coordinate is shown as 0). The relative entropy (Kullback-Leibler divergence) across these sites is also provided. (E) The relative enrichment of 61 codons in the A-site assessed as the RUST ratio. Codons are grouped by encoded amino acids that are colored according to their physicochemical properties. (F) A ribo-seq profile for the yeast WWM1 gene (YFL010C). The gray bar shows the location of the gene on chromosome VI. The top red panel shows the footprint alignments across the entire WWM1 gene region while the lower red panel is a zoom into the area around the annotated stop codon showing footprint alignments downstream of the annotated stop codon to the next in-frame stop codon. Panel A was generated using the RiboPlot suite on RiboGalaxy using data from the Andreev DE et al. study. Panels B and C were generated using the riboSeqR suite on RiboGalaxy using ribo-seq data from the Bazzini AA et al. study. Panels C and D were generated using the RUST suite on RiboGalaxy using data from the Andreev DE et al. study. Panel F was generated using the RiboTools suite on RiboGalaxy using ribo-seq data from the Baudin-Baillieu A et al. study. We provide Published pages (available under the Shared Data tab) to illustrate how to use the tools hosted on RiboGalaxy. Data sets from the Andreev et al. study were used for this purpose and the raw ribo-seq FASTQ files are available as Data Libraries under the Shared Data tab for testing purposes.

Using the RiboGalaxy published workflows

In RiboGalaxy we provide Published workflows for genome mapping (available under the Shared Data tab). Apart from being time efficient, these published workflows should particularly help researchers who are not familiar with alignment pipelines for ribo-seq data, or who are not familiar with the usage of the individual tools. If the researcher wishes to explore their ribo-seq data as a custom track in GWIPS-viz, with the advantage of comparing their alignment profiles to publicly available ribo-seq and mRNA-seq tracks, then the easiest option is to use the corresponding genome workflow. This will reduce the multi-step pre-processing and alignment process to a minimum of inputs from the user: only the adaptor sequence and the name of the custom track for visualization in GWIPS-viz need to be specified. We currently provide published workflows for the following 13 genome assemblies: Homo sapiens (hg19 and hg38), Mus musculus (mm10), Rattus norvegicus (rn6), Danio rerio (danRer7), Caenorhabditis elegans (ce10), Arabidopsis thaliana (araTha1), Drosophila melanogaster (dm3), Saccharomyces cerevisiae (sacCer3), Escherichia coli (eschcoli_k12), Bacillus subtilis (baciSubt2), Human herpesvirus 5 strain Merlin (HHV5) and Bacteriophage lambda (NC_001416). On successful completion of an in-built workflow, an email notification is sent to the user. The custom track is visible only to the researcher and not to other public users of GWIPS-viz. All of the alignment results, coverage and profile information can be downloaded from RiboGalaxy while snapshot images of the ribosome profiles can be generated in GWIPS-viz. We do not provide published workflows for transcriptome alignments. The reason is transcript annotations are updated regularly and it is advisable to obtain the latest reference annotations. We explain on our Help page how to get annotations in FASTA file format using the UCSC Main Table Browser from within RiboGalaxy. The user may also wish to use their own data-specific transcriptome assemblies as reference sequences.

RiboGalalxy help pages and forum

RiboGalaxy is publicly available at http://riboseq.org/ and is accompanied by extensive documentation and tips for helping users. We provide help pages specifically for RiboGalaxy usage under the Help section. For each individual tool, we provide short Tips at the top of each tool page to help the user get started as well as more extensive help support on each tool page. In addition we provide a RiboGalaxy forum (http://gwips.ucc.ie/Forum) where we encourage users to post their questions and feedback to improve the overall RiboGalaxy service.

Discussion

Up to recently, software for the analysis of ribo-seq data were not publicly available. Now, however, several tools have been developed specifically for ribo-seq. We plan to expand the existing repertoire of tools on RiboGalaxy by integrating such tools (e.g., riboDiff) as well as developing more software for the analysis of ribo-seq data.

Materials and methods

RiboGalaxy uses the Galaxy framework for the pre-processing, alignment and analysis pipelines. GWIPS-viz is used for the ribo-seq data visualization as the alignment profiles can be explored in conjunction with publicly available ribo-seq and mRNA-seq tracks. Galaxy Trackster is also available for data visualization. RiboGalaxy runs on Ubuntu 14.04.3 LTS, with Apache 2.4.7 and PostgreSQL 9.3.10. The version of each of the tools hosted on RiboGalaxy is provided at the top of the page when you click on the tool. As RiboGalaxy is a platform dedicated to ribo-seq data analysis, and not a general Galaxy server platform, only tools that are related to ribo-seq data alignment pipelines and analysis are hosted. The pre-processing and alignment software were downloaded from the Galaxy toolshed. In house python scripts were used for developing the RiboPlot suite as well as for generating ribosome profiles for visualization in GWIPS-viz while xml wrappers were written for the integration of the riboSeqR and the RUST suites of tools. The RiboGalaxy implementations of RiboPlot, riboSeqR and RUST are now available in the Galaxy toolshed.
  19 in total

1.  Molecular biology. Translation goes global.

Authors:  Robert B Weiss; John F Atkins
Journal:  Science       Date:  2011-12-16       Impact factor: 47.728

2.  RiboTools: a Galaxy toolbox for qualitative ribosome profiling analysis.

Authors:  Rachel Legendre; Agnès Baudin-Baillieu; Isabelle Hatin; Olivier Namy
Journal:  Bioinformatics       Date:  2015-03-25       Impact factor: 6.937

3.  Online resources for genomic analysis using high-throughput sequencing.

Authors:  Daniel Blankenberg; James Taylor; Anton Nekrutenko
Journal:  Cold Spring Harb Protoc       Date:  2015-02-05

4.  baySeq: empirical Bayesian methods for identifying differential expression in sequence count data.

Authors:  Thomas J Hardcastle; Krystyna A Kelly
Journal:  BMC Bioinformatics       Date:  2010-08-10       Impact factor: 3.169

5.  Genome-wide ribosome profiling reveals complex translational regulation in response to oxidative stress.

Authors:  Maxim V Gerashchenko; Alexei V Lobanov; Vadim N Gladyshev
Journal:  Proc Natl Acad Sci U S A       Date:  2012-10-08       Impact factor: 11.205

6.  Ultrafast and memory-efficient alignment of short DNA sequences to the human genome.

Authors:  Ben Langmead; Cole Trapnell; Mihai Pop; Steven L Salzberg
Journal:  Genome Biol       Date:  2009-03-04       Impact factor: 13.583

7.  Genome-wide analysis in vivo of translation with nucleotide resolution using ribosome profiling.

Authors:  Nicholas T Ingolia; Sina Ghaemmaghami; John R S Newman; Jonathan S Weissman
Journal:  Science       Date:  2009-02-12       Impact factor: 47.728

8.  Oxygen and glucose deprivation induces widespread alterations in mRNA translation within 20 minutes.

Authors:  Dmitry E Andreev; Patrick B F O'Connor; Alexander V Zhdanov; Ruslan I Dmitriev; Ivan N Shatsky; Dmitri B Papkovsky; Pavel V Baranov
Journal:  Genome Biol       Date:  2015-05-06       Impact factor: 13.583

9.  The UCSC Genome Browser database: 2015 update.

Authors:  Kate R Rosenbloom; Joel Armstrong; Galt P Barber; Jonathan Casper; Hiram Clawson; Mark Diekhans; Timothy R Dreszer; Pauline A Fujita; Luvina Guruvadoo; Maximilian Haeussler; Rachel A Harte; Steve Heitner; Glenn Hickey; Angie S Hinrichs; Robert Hubley; Donna Karolchik; Katrina Learned; Brian T Lee; Chin H Li; Karen H Miga; Ngan Nguyen; Benedict Paten; Brian J Raney; Arian F A Smit; Matthew L Speir; Ann S Zweig; David Haussler; Robert M Kuhn; W James Kent
Journal:  Nucleic Acids Res       Date:  2014-11-26       Impact factor: 19.160

10.  The use of duplex-specific nuclease in ribosome profiling and a user-friendly software package for Ribo-seq data analysis.

Authors:  Betty Y Chung; Thomas J Hardcastle; Joshua D Jones; Nerea Irigoyen; Andrew E Firth; David C Baulcombe; Ian Brierley
Journal:  RNA       Date:  2015-08-18       Impact factor: 4.942

View more
  41 in total

1.  Codon Resolution Analysis of Ribosome Profiling Data.

Authors:  Alexander Bartholomäus; Zoya Ignatova
Journal:  Methods Mol Biol       Date:  2021

2.  Processive Recoding and Metazoan Evolution of Selenoprotein P: Up to 132 UGAs in Molluscs.

Authors:  Janinah Baclaocos; Didac Santesmasses; Marco Mariotti; Katarzyna Bierła; Michael B Vetick; Sharon Lynch; Rob McAllen; John J Mackrill; Gary Loughran; Roderic Guigó; Joanna Szpunar; Paul R Copeland; Vadim N Gladyshev; John F Atkins
Journal:  J Mol Biol       Date:  2019-08-20       Impact factor: 5.469

3.  Polyamine Control of Translation Elongation Regulates Start Site Selection on Antizyme Inhibitor mRNA via Ribosome Queuing.

Authors:  Ivaylo P Ivanov; Byung-Sik Shin; Gary Loughran; Ioanna Tzani; Sara K Young-Baird; Chune Cao; John F Atkins; Thomas E Dever
Journal:  Mol Cell       Date:  2018-04-19       Impact factor: 17.970

4.  Selective ribosome profiling to study interactions of translating ribosomes in yeast.

Authors:  Carla V Galmozzi; Dorina Merker; Ulrike A Friedrich; Kristina Döring; Günter Kramer
Journal:  Nat Protoc       Date:  2019-07-22       Impact factor: 13.491

5.  Transcriptome-wide measurement of translation by ribosome profiling.

Authors:  Nicholas J McGlincy; Nicholas T Ingolia
Journal:  Methods       Date:  2017-06-01       Impact factor: 3.608

6.  Multimapping confounds ribosome profiling analysis: A case-study of the Hsp90 molecular chaperone.

Authors:  Jackson C Halpin; Radhika Jangi; Timothy O Street
Journal:  Proteins       Date:  2019-07-19

7.  RiboDiPA: a novel tool for differential pattern analysis in Ribo-seq data.

Authors:  Keren Li; C Matthew Hope; Xiaozhong A Wang; Ji-Ping Wang
Journal:  Nucleic Acids Res       Date:  2020-12-02       Impact factor: 16.971

8.  REPARATION: ribosome profiling assisted (re-)annotation of bacterial genomes.

Authors:  Elvis Ndah; Veronique Jonckheere; Adam Giess; Eivind Valen; Gerben Menschaert; Petra Van Damme
Journal:  Nucleic Acids Res       Date:  2017-11-16       Impact factor: 16.971

9.  Monitoring mammalian mitochondrial translation with MitoRiboSeq.

Authors:  Sophia Hsin-Jung Li; Michel Nofal; Lance R Parsons; Joshua D Rabinowitz; Zemer Gitai
Journal:  Nat Protoc       Date:  2021-05-05       Impact factor: 13.491

10.  RiboDoc: A Docker-based package for ribosome profiling analysis.

Authors:  Pauline François; Hugo Arbes; Stéphane Demais; Agnès Baudin-Baillieu; Olivier Namy
Journal:  Comput Struct Biotechnol J       Date:  2021-05-07       Impact factor: 7.271

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.