| Literature DB >> 29259323 |
Jin Zhang1, Teng Gao2,3, Christopher A Maher4,5,6,7.
Abstract
Despite the increasing quantity of tools for accurately predicting gene fusion candidates from sequencing data, we are still faced with the critical challenge of visualizing the corresponding gene fusion products to infer their biological consequence (i.e. novel protein and increased gene expression). This is currently accomplished by manually inspecting and inferring the biological consequence of top scoring gene fusion candidates. This labor-intensive process could be made easier by automating the annotation of gene fusion products and generating easily interpretable visualizations. We developed a gene fusion visualization tool, called INTEGRATE-Vis, that generates comprehensive, highly customizable, publication-quality graphics focused on annotating each gene fusion at the transcript- and protein-level and assessing expression within an individual sample or across a patient cohort. INTEGRATE-Vis is the first comprehensive gene fusion visualization tool to help a user infer the potential consequence of a gene fusion event. It has potential utility in both research and clinical settings. INTEGRATE-Vis is available at https://github.com/ChrisMaherLab/INTEGRATE-Vis .Entities:
Mesh:
Year: 2017 PMID: 29259323 PMCID: PMC5736641 DOI: 10.1038/s41598-017-18257-2
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Overview of the INTEGRATE-Vis pipeline. INTEGRATE-Vis contains four major modules to plot four types of figures for gene fusions: isoform structure, domain, exon expression, and gene partner expression. The first three modules are for individual samples, and the last module is for cohort data. Only minimal inputs in standard formats, including BEDPE, TSV, FASTA, BAM, and GTF are needed to make the plots.
Figure 2INTEGRATE-Vis output illustrated using the TMPRSS2-ERG gene fusion in prostate cancer. INTEGRATE-Vis outputs four visualizations including: (A) gene fusion transcript isoforms, (B) the predicted protein structure of the gene fusion, (C) RNA-Seq read coverage across each gene fusion partner to reveal changes in exon expression (A red line is plotted at the fusion junctions at both gene partners. Exon boundaries are represented by blue lines. A marked expression change occurs between exons 3 and 4 of ERG.), and (D) expression of each gene fusion partner across the TCGA PRAD cohort. Blue is used to represent supporting reads, exons, transcript, and genomic locations for the 5′ gene partner (TMPRSS2), while red is for those of the 3′ gene partner (ERG).