| Literature DB >> 16836757 |
Elaine C Meng1, Eric F Pettersen, Gregory S Couch, Conrad C Huang, Thomas E Ferrin.
Abstract
BACKGROUND: Comparing related structures and viewing the structures in the context of sequence alignments are important tasks in protein structure-function research. While many programs exist for individual aspects of such work, there is a need for interactive visualization tools that: (a) provide a deep integration of sequence and structure, far beyond mapping where a sequence region falls in the structure and vice versa; (b) facilitate changing data of one type based on the other (for example, using only sequence-conserved residues to match structures, or adjusting a sequence alignment based on spatial fit); (c) can be used with a researcher's own data, including arbitrary sequence alignments and annotations, closely or distantly related sets of proteins, etc.; and (d) interoperate with each other and with a full complement of molecular graphics features. We describe enhancements to UCSF Chimera to achieve these goals.Entities:
Mesh:
Substances:
Year: 2006 PMID: 16836757 PMCID: PMC1570152 DOI: 10.1186/1471-2105-7-339
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Protein structure and associated sequence alignment. The structure of proclavaminate amidino hydrolase [PDB:1gq6] and the seed alignment from Pfam [33] for its family. Only a portion of the alignment is depicted. One chain of the trimer, shown as a ribbon, is associated with the first sequence in the alignment. The other two chains are shown as light blue backbone traces. The ribbon is colored from blue to pink to yellow with increasing conservation. Gray ribbon segments represent residues in columns for which conservation was not calculated due to a high proportion of gaps (0.5 or higher). Conservation values, also shown as histogram bars above the alignment, were obtained with the entropy-based measure in AL2CO [8] and "independent counts" weighting. Active site ions are shown in cyan. Residues in the structure that are completely conserved in the alignment have their side chains displayed and are indicated with blue boxes on the alignment. The part of the alignment that is shown includes conserved (boxed) ion-binding residues in a loop to the lower right of the ions. Highly conserved residues are primarily involved in the active site or inter-subunit interactions. Residues in the protein core are moderately conserved. In the sequence alignment, Clustal X coloring is used for the residue one-letter codes. The alignment part of the figure was exported directly from Multalign Viewer as Encapsulated PostScript.
Chimera results for difficult structure pairs
| pair | PDB ID | PDB ID | MM-default | MM-noSS | MM-allSS |
| 1 | 3chy | 2fox | 21/1.2, 88/2.2 | NR | 22/1.1, 88/2.4 |
| 2 | 2aza A | 1paz | 34/1.2, 79/2.0 | 8/1.5, 19/2.7 | 16/1.4, 74/2.3 |
| 3 | 1cew I | 1mol A | 29/1.0, 73/1.9 | 6/0.8, 7/1.5 | 26/0.9, 70/2.0 |
| 4 | 1cid | 2rhe | 38/1.1, 90/2.0 | 10/1.3, 21/2.4 | 11/1.3, 57/2.5 |
| 5 | 1crl | 1ede | 32/1.3, 184/2.5 | 6/1.3, 26/3.2 | 13/1.3, 166/2.6 |
| 6 | 2sim | 1nsb A | 33/1.2, 257/2.6 | 4/1.5, 28/3.2 | 20/1.2, 174/2.9 |
| 7 | 1ten | 3hhr B | 53/1.1, 82/1.3 | 4/1.0, 16/2.9 | 5/0.9, 22/2.9 |
| 8 | 1tie | 4fgf | 21/1.3, 73/2.2 | 6/1.1, 22/2.7 | 15/1.2, 97/2.2 |
| 9 | 2snv | 5ptp | 32/1.3, 118/2.3 | 8/1.5, 22/2.4 | 12/1.4, 103/2.4 |
| 10 | 1gp1 A | 2trx A | 33/1.0, 89/1.8 | 5/0.5, 17/2.7 | 30/1.2, 88/1.7 |
Results are of the form N/RMSD, where N is the number of residue pairs matched and RMSD is the corresponding alpha-carbon root-mean-square deviation. The first set per column represents the final MatchMaker iteration (after pruning), while the second represents all equivalences from subsequent use of Match -> Align (cutoff 5.0 angstroms). MM-default: MatchMaker with default parameters (BLOSUM-62, 30% secondary structure weighting, prior secondary structure calculation, iteration cutoff 2.0 angstroms; further details are available in the supplementary information [see supplementary.doc]). MM-noSS: same as MM-default except without secondary structure scoring. MM-allSS: same as MM-default except with only the secondary structure term (100% weighting). NR: no results; MatchMaker iteration ended prematurely when the number of residue pairs fell below four.
Figure 2Comparison of matched orientations (pair 8). Comparison of matched orientations from MatchMaker (default settings) and CE [26] (web server [34] with default settings). Each matched structure [PDB:4fgf] is rainbow-colored from blue at the N-terminus to red at the C-terminus. The reference structure [PDB:1tie] is shown in tan on the right.
Results from different programs for difficult structure pairs
| pair | PDB ID | PDB ID | Chimera | TOPOFIT | CE | CE/MA |
| 1 | 3chy | 2fox | 88/2.2 | 19/0.7 | 108/3.6 | 93/2.6 |
| 2 | 2aza A | 1paz | 79/2.0 | 72/1.7 | 84/2.9 | 81/2.4 |
| 3 | 1cew I | 1mol A | 73/1.9 | 73/1.6 | 81/2.3 | 78/1.9 |
| 4 | 1cid | 2rhe | 90/2.0 | 69/1.5 | 97/2.9 | 88/1.9 |
| 5 | 1crl | 1ede | 184/2.5 | 143/1.9 | 219/3.8 | 183/2.3 |
| 6 | 2sim | 1nsb A | 257/2.6 | 207/2.0 | 275/3.0 | 266/2.5 |
| 7 | 1ten | 3hhr B | 82/1.3 | 81/1.4 | 87/1.9 | 85/1.6 |
| 8 | 1tie | 4fgf | 73/2.2 | 88/1.6 | 116/2.9 | 105/2.1 |
| 9 | 2snv | 5ptp | 118/2.3 | 83/1.8 | 130/3.1 | 118/2.4 |
| 10 | 1gp1 A | 2trx A | 89/1.8 | 96/1.6 | 64/5.2 | 54/2.7 |
Results are of the form N/RMSD, where N is the number of residue pairs matched and RMSD is the corresponding alpha-carbon root-mean-square deviation. Chimera: results from using MatchMaker and then Match -> Align with default settings, as reported under MM-default in Table 1. TOPOFIT [31] results were obtained using the web server [35]. CE [26] results were obtained using the web server [34] with default settings. CE/MA: equivalences obtained by using Match -> Align with cutoff 5.0 angstroms on the superpositions from CE.