| Literature DB >> 29982439 |
Breon M Schmidt1, Nadia M Davidson1,2, Anthony D K Hawkins1, Ray Bartolo1, Ian J Majewski3,4, Paul G Ekert1, Alicia Oshlack1,2.
Abstract
Background: Genomic profiling efforts have revealed a rich diversity of oncogenic fusion genes. While there are many methods for identifying fusion genes from RNA-sequencing (RNA-seq) data, visualizing these transcripts and their supporting reads remains challenging. Findings: Clinker is a bioinformatics tool written in Python, R, and Bpipe that leverages the superTranscript method to visualize fusion genes. We demonstrate the use of Clinker to obtain interpretable visualizations of the RNA-seq data that lead to fusion calls. In addition, we use Clinker to explore multiple fusion transcripts with novel breakpoints within the P2RY8-CRLF2 fusion gene in B-cell acute lymphoblastic leukemia. Conclusions: Clinker is freely available software that allows visualization of fusion genes and the RNA-seq data used in their discovery.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29982439 PMCID: PMC6065480 DOI: 10.1093/gigascience/giy079
Source DB: PubMed Journal: Gigascience ISSN: 2047-217X Impact factor: 6.524
Figure 2.: (A)KMT2A-MLLT3 fusion gene visualized in IGV after alignment to the human genome. The backgrounds of the IGV tracks are colored to distinguish between the coverage (purple), aligned reads (white), and annotation (blue), with green reads indicating that its partner is on a different chromosome. Such alignments may support the existence of a fusion. (B) Clinker output of the KMT2A-MLLT3 gene fusion, visualized in IGV and (C) the GViz visualization. The tracks in the Clinker GViz visualization are (top to bottom): a superTranscript scale axis, a read coverage track, a gene boundary track, a protein domain track, a transcript (with exons annotation) track, and a sashimi plot that indicates the fusion breakpoints (dark purple). The breakpoints are also indicated by the vertical lines. In addition to the Clinker tracks, the IGV visualization includes a read support track.
Figure 1.: A visual representation of the Clinker pipeline. Users can choose to stop at step 2, inspect fusion genes of interest in IGV, and then commence figure production for a refined list of fusion genes. The Fusion Caller (pink) and IGV (light blue) steps are external to Clinker.
Figure 3.Visualization of six samples containing the P2RY8-CRLF2 fusion. We combined the Clinker output (mapped reads, fusion superTranscript, and annotation track) for the six samples using Gviz in R. From top to bottom: six coverage tracks with annotated breakpoints demonstrating read support, gene track, protein domains, and gene transcripts. Each sample contains the canonical transcript (navy vertical line) as well as a novel upstream splicing occurring within the 5′UTR exon ofCRLF2 (annotated with the red vertical line over the CRLF2 gene) along with two other transcripts that are not in-frame. The read support for this Clinker output can be compared to that of JAFFA's in Supplementary Table S1.