| Literature DB >> 35751589 |
Emil K Gustavsson1,2, David Zhang1,2, Regina H Reynolds1,2, Sonia Garcia-Ruiz1,3,2, Mina Ryten1,3,2.
Abstract
MOTIVATION: The advent of long-read sequencing technologies has increased demand for the visualisation and interpretation of transcripts. However, tools that perform such visualizations remain inflexible and lack the ability to easily identify differences between transcript structures. Here, we introduce ggtranscript, an R package that provides a fast and flexible method to visualize and compare transcripts. As a ggplot2 extension, ggtranscript inherits the functionality and familiarity of ggplot2 making it easy to use. AVAILABILITY: ggtranscript is an R package available at https://github.com/dzhang32/ggtranscript (DOI: https://doi.org/10.5281/zenodo.6374061) via an open-source MIT license. Further is available at https://dzhang32.github.io/ggtranscript/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.Entities:
Year: 2022 PMID: 35751589 PMCID: PMC9344834 DOI: 10.1093/bioinformatics/btac409
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.931
Fig. 1.ggtranscript enables a fast and flexible method to visualize and compare transcript isoforms. ggtranscript is a ggplot2 extension that introduces five new geoms and a set of helper functions: (A) geom_range() and geom_intron() enable the plotting of exons and introns, the core components of transcript annotation. In addition, geom_range() has been used to visually distinguish coding regions from untranslated regions. (B) geom_half_range() enables users to plot only half of a range on the top or bottom of a transcript structure; one use case of which is to visualize the differences between two transcripts (SOD201 and SOD202). (C) geom_junction() enables the plotting of junction curves, which can be overlaid across transcript structures to annotate them with supporting short-read RNA-sequencing data. The number represent junction usage. (D) Longer, more complex transcripts, with small differences between exons of interest, can be more difficult to visualize. (E) For this reason, ggtranscript includes a helper function shorten_gaps() which shortens regions that do not overlap an exon to a fixed, user-inputted width. Transcripts in D and E are coloured by their transcript biotype. (F) In addition, the function to_diff() facilitates visualization of longer transcripts by highlighting differences in comparison to a reference transcript