
ProtAnnot: an App for Integrated Genome Browser to display how alternative splicing and transcription affect proteins.

Tarun Mall, John Eckstein, David Norris, Hiral Vora, Nowlan H Freese, Ann E Loraine.

Abstract

One gene can produce multiple transcript variants encoding proteins with different functions. To facilitate visual analysis of transcript variants, we developed ProtAnnot, which shows protein annotations in the context of genomic sequence. ProtAnnot searches InterPro and displays profile matches (protein annotations) alongside gene models, exposing how alternative promoters, splicing and 3' end processing add, remove, or remodel functional motifs. To draw attention to these effects, ProtAnnot color-codes exons by frame and displays a cityscape graphic summarizing exonic sequence at each position. These techniques make visual analysis of alternative transcripts faster and more convenient for biologists.
AVAILABILITY AND IMPLEMENTATION: ProtAnnot is a plug-in App for Integrated Genome Browser, an open source desktop genome browser available from http://www.bioviz.org
CONTACT: aloraine@uncc.edu
© The Author 2016. Published by Oxford University Press.

Year:  2016        PMID: 27153567      PMCID: PMC4978921          DOI: 10.1093/bioinformatics/btw068

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


1 Introduction

Many genes produce multiple transcript variants due to alternative splicing, alternative promoters and alternative 3′ end processing. Often these transcript variants encode proteins with different amino acid sequences and thus different functions. We and other groups have often used protein annotation methods to detect when this occurs. For example, we used BLOCKS, InterPro and TM-HMM to show that alternative transcription frequently remodels or deletes conserved regions and trans-membrane spans in human and mouse proteins (Cline et al., 2004; Loraine et al., 2002). However, it remains difficult for biologists to perform similar analyses. Using Web tools (Rodriguez et al., 2015), biologists can identify conserved regions in proteins encoded by different splice variants, but mapping those regions back onto gene structures is time-consuming and error-prone. To address this, we developed ProtAnnot as a new plug-in extension for the Integrated Genome Browser (IGB). IGB is a highly interactive desktop genome browser that helps biologists explore and analyze experimental genomics data, especially RNA-Seq data (Nicol et al., 2009). Using ProtAnnot together with IGB, users can achieve deeper insight into how alternative transcription affects protein sequence and function.

2 Results

ProtAnnot enables fast, efficient visual analysis of the impact of alternative transcription on proteins by extending standard genome browser iconography, in which linked blocks represent transcript structures and block thickness indicates translated regions. ProtAnnot improves on this in three ways. First, it uses exon fill colors to show the frame of translation, revealing frame shifts across transcript variants. By comparing exon colors between transcripts, a user can quickly determine if they encode the same protein without having to zoom in to see the amino acid sequence. Second, ProtAnnot introduces an exon summary graphic, a series of blocks at the bottom of the display whose heights indicate the number of exons overlapping each position. Height differences between blocks signal where models differ. By scanning the exon summary, users can easily identify difference regions (English et al., 2010), sequences that are differentially included in transcripts due to alternative splicing, promoters, or 3′-end processing. The exon summary graphic draws attention to these regions by exploiting our native ability to notice discontinuities in a horizon (Fig. 1).
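
The frame coloring and exon summary described above can be illustrated with a minimal Python sketch. This is not ProtAnnot's code (ProtAnnot is written in Java); the function names, the half-open coordinate convention and the forward-strand restriction are all assumptions made for illustration, and each transcript is represented only by its coding exons.

```python
from collections import Counter


def exon_frames(cds_exons):
    """Genomic frame (0, 1 or 2) of each coding exon of one transcript.

    Assumes forward-strand, half-open (start, end) coding exons sorted by
    position. Overlapping exons from two transcripts encode the same
    peptide only if they share the same genomic frame.
    """
    frames = []
    consumed = 0  # coding bases translated before the current exon
    for start, end in cds_exons:
        frames.append((start - consumed) % 3)
        consumed += end - start
    return frames


def exon_summary(transcripts):
    """Blocks (start, end, depth), where depth counts exons covering the block."""
    events = Counter()
    for exons in transcripts:
        for start, end in exons:
            events[start] += 1  # an exon opens here
            events[end] -= 1    # an exon closes here
    blocks = []
    depth = 0
    prev = None
    for pos in sorted(events):
        if prev is not None and depth > 0:
            if blocks and blocks[-1][2] == depth and blocks[-1][1] == prev:
                # Same height as the adjacent block: extend it.
                blocks[-1] = (blocks[-1][0], pos, depth)
            else:
                blocks.append((prev, pos, depth))
        depth += events[pos]
        prev = pos
    return blocks


def difference_regions(blocks, n_transcripts):
    """Exonic regions present in some, but not all, transcripts."""
    return [(s, e) for s, e, d in blocks if d < n_transcripts]


# Two hypothetical variants that differ in the start of their second exon.
variant_a = [(0, 100), (200, 300)]
variant_b = [(0, 100), (150, 300)]
summary = exon_summary([variant_a, variant_b])
print(summary)
print(difference_regions(summary, 2))
print(exon_frames(variant_a), exon_frames(variant_b))
```

In this toy example the extra 50 bases in the second variant are not a multiple of three, so the shared part of the second exon falls in different frames in the two variants, which is the kind of frame shift the exon colors are meant to reveal.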
Fig. 1. ProtAnnot visualization of Arabidopsis thaliana gene AT4G36690 encoding splicing regulator U2AF65 shows how alternative splicing deletes a nucleotide-binding domain. Different colors for coding region exons indicate different frames of translation.

Third, ProtAnnot exposes how different regions of a gene may encode different functions by displaying protein annotations next to their respective transcripts. In ProtAnnot, these protein annotations appear as single- or multi-span linked blocks beneath the transcripts that encode them. A thin line links spans from the same motif; note that these matches often span introns.

To use ProtAnnot, users select ProtAnnot in the IGB App Manager (available from the Plug-ins tab in IGB 8.5); this downloads ProtAnnot to a local cache. A new menu item labeled ‘Start ProtAnnot’ then appears in the IGB Tools menu. Next, the user selects one or more gene models within the IGB main display window and selects ‘Tools > Start ProtAnnot’. This opens ProtAnnot in a new window, which shows the selected gene models with color-coded exons and the exon summary graphic.

To search InterPro using ProtAnnot, users select the InterProScan tabbed panel and click a button labeled ‘Run InterProScan,’ which opens a new window with search options available from the InterProScan Web service hosted at the European Bioinformatics Institute (EBI). Users then select one or more databases to search, enter an email address and run the search. Note that the InterProScan Web service maintainers require the user’s contact information to run the search service. When the search finishes, ProtAnnot adds newly found protein annotations to the display, below their respective transcripts. ProtAnnot also updates the status message with a link to an XML file hosted on the EBI Web site containing the ‘raw’ results; this is mainly a convenience for developers. Clicking a protein annotation opens the Properties tab, which lists information about that particular profile or motif. Depending on the selected item, this can include its InterPro identifier, name, description and a link to the InterPro Web site. Users can also shift-click to select multiple annotations, which places all of the available information side by side for direct comparison.

Figure 1 shows an example visualization that highlights how ProtAnnot can lead to new discoveries. This example shows a gene from the model plant Arabidopsis thaliana encoding a homolog of U2AF65, part of the U2AF dimer that recruits the U2 snRNP complex to the branchpoint adenosine residue during the early steps of splicing. Coding region exons are color-coded to indicate frame of translation, making it easy to notice when overlapping exons encode the same or different peptides. Likewise, the exon summary graphic highlights differences in splicing between gene models. Previously, we analyzed RNA-Seq data from pollen and leaves and found that in pollen, AT4G36690.3 is the dominant isoform, but in leaves, AT4G36690.4 predominates (Loraine et al., 2013). As shown in Figure 1, visualizing the gene models in ProtAnnot exposes potential functional consequences of pollen-specific splicing of U2AF65A. The pollen-specific isoform (lower) lacks a region encoding a nucleotide-binding alpha-beta domain, which is involved in RNA-binding. Thus, alternative splicing of U2AF65A is likely to have important functional consequences.

As with IGB, we developed ProtAnnot using the GenoViz SDK, a Java toolkit for building genome browsers (Helt et al., 2009). By using GenoViz, we were able to implement advanced visualization techniques familiar to IGB users with minimal effort. These include: user-settable zoom focus indicated by a zoom stripe graphic, fast animated zooming, edge matching of selected items and selectable Glyphs.

Search results can be saved and reopened later, which saves time for the user and also reduces load on the InterProScan Web service. ProtAnnot saves results to an XML format file (extension ‘.paxml’) that contains a slice of genomic sequence surrounding the gene models, an offset indicating the relationship between the slice and the larger reference, transcript structures using coordinates relative to the slice and protein annotations in protein sequence coordinates.
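
Because the saved file keeps protein annotations in protein coordinates and transcripts in slice-relative coordinates, drawing an annotation beneath its transcript requires a coordinate conversion. The following Python sketch shows one way such a conversion could work. It is an illustration under stated assumptions, not the ‘.paxml’ reader itself: the function name and arguments are hypothetical, coordinates are assumed to be 0-based and half-open, and only forward-strand transcripts are handled.

```python
def protein_span_to_genomic(cds_exons, aa_start, aa_end, slice_offset=0):
    """Map a protein interval onto reference genomic spans.

    cds_exons    -- sorted, half-open (start, end) coding exons of one
                    forward-strand transcript, relative to the slice
    aa_start/end -- 0-based, half-open residue interval of the annotation
    slice_offset -- position of the slice start in the larger reference
                    (assumed to be a simple additive offset)
    """
    nt_start, nt_end = aa_start * 3, aa_end * 3  # residues -> coding bases
    spans = []
    consumed = 0  # coding bases contributed by earlier exons
    for start, end in cds_exons:
        length = end - start
        lo = max(nt_start, consumed)
        hi = min(nt_end, consumed + length)
        if lo < hi:
            # Part of the annotation falls within this exon.
            spans.append((start + (lo - consumed) + slice_offset,
                          start + (hi - consumed) + slice_offset))
        consumed += length
    return spans


# A hypothetical two-exon coding region (30 and 60 coding bases) and a motif
# covering residues 5 to 14, which crosses the intron between the exons.
print(protein_span_to_genomic([(10, 40), (100, 160)], 5, 15, slice_offset=1000))
```

A motif that crosses an exon boundary yields two spans, which corresponds to the multi-span linked blocks described above for matches that span introns.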

3 Conclusion

ProtAnnot benefits users by exposing how gene structures affect protein sequence and function. As such, ProtAnnot complements the MI Bundle, another IGB extension that links genomic features to protein interaction and structure viewers (Céol and Müller, 2015). Like MI Bundle, ProtAnnot highlights relationships between the language of DNA (exons, introns, codons) and the more structure-oriented language of protein sequence, thus helping biologists achieve deeper understanding of gene function.

References

1.  The effects of alternative splicing on transmembrane proteins in the mouse genome.

Authors:  M S Cline; R Shigeta; R L Wheeler; M A Siani-Rose; D Kulp; A E Loraine
Journal:  Pac Symp Biocomput       Date:  2004

2.  Protein-based analysis of alternative splicing in the human genome.

Authors:  Ann E Loraine; Gregg A Helt; Melissa S Cline; Michael A Siani-Rose
Journal:  Proc IEEE Comput Soc Bioinform Conf       Date:  2002

3.  Prevalence of alternative splicing choices in Arabidopsis thaliana.

Authors:  Adam C English; Ketan S Patel; Ann E Loraine
Journal:  BMC Plant Biol       Date:  2010-06-04       Impact factor: 4.215

4.  The Integrated Genome Browser: free software for distribution and exploration of genome-scale datasets.

Authors:  John W Nicol; Gregg A Helt; Steven G Blanchard; Archana Raja; Ann E Loraine
Journal:  Bioinformatics       Date:  2009-08-04       Impact factor: 6.937

5.  RNA-seq of Arabidopsis pollen uncovers novel transcription and alternative splicing.

Authors:  Ann E Loraine; Sheila McCormick; April Estrada; Ketan Patel; Peng Qin
Journal:  Plant Physiol       Date:  2013-04-16       Impact factor: 8.340

6.  APPRIS WebServer and WebServices.

Authors:  Jose Manuel Rodriguez; Angel Carro; Alfonso Valencia; Michael L Tress
Journal:  Nucleic Acids Res       Date:  2015-05-18       Impact factor: 16.971

7.  The MI bundle: enabling network and structural biology in genome visualization tools.

Authors:  Arnaud Céol; Heiko Müller
Journal:  Bioinformatics       Date:  2015-07-25       Impact factor: 6.937

8.  Genoviz Software Development Kit: Java tool kit for building genomics visualization applications.

Authors:  Gregg A Helt; John W Nicol; Ed Erwin; Eric Blossom; Steven G Blanchard; Stephen A Chervitz; Cyrus Harmon; Ann E Loraine
Journal:  BMC Bioinformatics       Date:  2009-08-25       Impact factor: 3.169
