| Literature DB >> 27085803 |
Peter Jehl1, Jean Manguy1, Denis C Shields1, Desmond G Higgins1, Norman E Davey2.
Abstract
Low-throughput experiments and high-throughput proteomic and genomic analyses have created enormous quantities of data that can be used to explore protein function and evolution. The ability to consolidate these data into an informative and intuitive format is vital to our capacity to comprehend these distinct but complementary sources of information. However, existing tools to visualize protein-related data are restricted by their presentation, sources of information, functionality or accessibility. We introduce ProViz, a powerful browser-based tool to aid biologists in building hypotheses and designing experiments by simplifying the analysis of functional and evolutionary features of proteins. Feature information is retrieved in an automated manner from resources describing protein modular architecture, post-translational modification, structure, sequence variation and experimental characterization of functional regions. These features are mapped to evolutionary information from precomputed multiple sequence alignments. Data are displayed in an interactive and information-rich yet intuitive visualization, accessible through a simple protein search interface. This allows users with limited bioinformatic skills to rapidly access data pertinent to their research. Visualizations can be further customized with user-defined data either manually or using a REST API. ProViz is available at http://proviz.ucd.ie/.Entities:
Mesh:
Year: 2016 PMID: 27085803 PMCID: PMC4987877 DOI: 10.1093/nar/gkw265
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Schema describing data retrieval, data processing and data display for the ProViz protein visualization tool. A user inputs a protein search term or user-defined sequence data (a protein sequence or protein multiple sequence alignment). Search terms (and, if possible, user-defined sequence data) are mapped to a UniProt accession. On the server-side, the UniProt accession is used to retrieve feature data from various resources and sequence-based prediction tools are applied to the protein sequences. All data is processed and returned to the browser-based ProViz front end for visualization. Blue boxes denote functions, red boxes denote external data sources, yellow boxes denote local data sources, grey boxes denote local bioinformatic tools and green boxes denote processed data.
Figure 2.(A) ProViz visualization for Cyclin-dependent kinase inhibitor 1A (CDKN1A) showing selected features of CDKN1A and a GeneTree alignment of CDKN1A orthologues. Key aspects of the visualization are numbered: (1) Protein name and species; (2) options sidebar; (3) data information sidebar; (4) data select, hide and help buttons; (5) information hover tooltip; (6) options toolbar; (7) protein architecture overview; (8) protein sequence data; (9) protein feature data. The visualization in the example can be viewed at http://proviz.ucd.ie/proviz.php?uniprot_acc=P38936. (B) A zoomed view of a section of the visualization from panel A labelled with the types of data that are present in each section of the protein feature data. (C) Examples of the available track types.