| Literature DB >> 26329268 |
Christian Stolte, Kenneth S Sabir, Julian Heinrich, Christopher J Hammang, Andrea Schafferhans, Seán I O'Donoghue.
Abstract
BACKGROUND: To understand the molecular mechanisms that give rise to a protein's function, biologists often need to (i) find and access all related atomic-resolution 3D structures, and (ii) map sequence-based features (e.g., domains, single-nucleotide polymorphisms, post-translational modifications) onto these structures.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26329268 PMCID: PMC4547178 DOI: 10.1186/1471-2105-16-S11-S7
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Common questions about protein structures biologists seek to answer and the visual encoding and/or panel used in the Aquaria user interface to address these.
| Question | Visual Encoding and/or Panel | |
|---|---|---|
| 1 | For a given protein sequence, how many related 3D structures are known? | Matching Structures (= MS) |
| 2 | Do any structures match exactly? If not, what is the best match? | vertical position & colour-code (MS) |
| 3 | Which structure spans most of the given sequence? | horizontal position (MS) |
| 4 | Which molecular states occur amongst all related structures? | thumbnails (MS) |
| 5 | For a given domain, how many related structures are known? | horizontal position (MS) |
| 6 | For a given domain, do any structures match exactly? If not, what is the best match? | position & colour-code (MS) |
| 7 | For a given domain, which molecular configurations occur in the related structures? | tree level 1 (MS) |
| 8 | For a given domain and molecular configuration, which PDB entries are related? | tree level 2 (MS) |
| 9 | In a large molecular assembly, where is a given protein? How does it interact with others? | semi-transparency (3D Structure) |
| 10 | How well does a structure match to a given protein sequence? | colour-code (3D Structure) |
| 11 | Which residues in the structure differ from the specified protein sequence? | colour-code (3D Structure) |
| 12 | Which residues in the structure differ in a related organism (e.g., in mouse)? | colour-code (3D Structure) |
| 13 | For a given residue in the structure, where is it in the sequence? | 3D Structure |
| 14 | For a given residue in the sequence, where is it in the structure? | 3D Structure |
| 15 | Where in the structure is the N-terminus? Where is the C-terminus? | 3D Structure |
| 16 | Which residues make contact with ligands? Or with other proteins? | 3D Structure |
| 17 | Which kinds of features are available, and how many? | Features |
| 18 | For a given feature (e.g., a domain, PTM, or SNP), where is it in the sequence? | Features |
| 19 | For a given feature (e.g., a domain, PTM, or SNP), where is it in the structure? | Features & 3D Structure |
| 20 | For a given structure, where are a set of features located (e.g., all domains, PTMs, or SNPs)? | Features & 3D Structure |
| 21 | For each partner of a given protein, ... [repeat questions 1-20]? | 3D Structure |
Figure 1The Aquaria user interface comprises five panels: the 3D structure panel (A) shows the currently selected 3D structure with various rendering modes using the same colour scheme as applied for all structures and aggregates (E) that match a given user query entered through the search panel (B). A white background is used to visually connect the sequence being rendered in the 3D view and its cluster in the matching structures panel (E). Panels on either side give information about the Uniprot entry corresponding to the query (C) as well as details of the structure (D) being shown in the 3D structure panel.
Figure 2Sequence variation features for P53. The features panel shows a collection of annotated tracks derived from various sources that can be used to map features onto the 3D structure displayed in the 3D viewer. Here we show a large number of natural variant sites (mutations) for the tumor suppressor protein P53. Overlapping features are drawn in separate lanes, thus enabling users to identify the residues that have largest number of distinct mutations, and map these onto structures.
Figure 3Structures and features for human BLK protein. (A) BLK contains three independently evolved sequence domains, each with its own function. By mapping the UniProt "domain" feature set (B) onto the structure automatically selected by Aquaria, the user is able to clearly see which parts of the structure correspond to which domain. (C) The Matching Structure panel shows that there is no PDB structure exactly matching the BLK sequence, while many (over 3,000) related structures exist, thus providing a wealth of detail on the molecular processes of related proteins. Most of the related structures match to the kinase domain; many match to either the SH2 or SH3 domain, while a small number contain all three domains in the same order as BLK.
Figure 4Atomic structure of an amyloid fibre formed from APP. The structure (PDB 2yti) has been mapped with the UniProt "polypeptide region" feature lane, which indicates the location of enzymatic cleavage sites. If the indicated cleavages by gamma secretase (at residue 711) or alpha secretase (at residues 687-688) occur, the APP amyloid fibre could not form. Thus, this figure shows the power of combining features with structures, as it suggests potential research directions that could be pursued as potential treatments for Alzheimer's.
Figure 5Screen capture from APP animation. This shows the final model for the overall structure of APP assembled using Aquaria (yellow). The protein is enzymatically cleaved at several positions; the region of the protein that eventually forms part of the amyloid fibre is shown in orange.