| Literature DB >> 23514271 |
Zhouxi Wang1, Pengcheng Yin, Joslynn S Lee, Ramya Parasuram, Srinivas Somarowthu, Mary Jo Ondrechen.
Abstract
BACKGROUND: The prediction of biochemical function from the 3D structure of a protein has proved to be much more difficult than was originally foreseen. A reliable method to test the likelihood of putative annotations and to predict function from structure would add tremendous value to structural genomics data. We report on a new method, Structurally Aligned Local Sites of Activity (SALSA), for the prediction of biochemical function based on a local structural match at the predicted catalytic or binding site.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23514271 PMCID: PMC3584854 DOI: 10.1186/1471-2105-14-S3-S13
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Sequence identity matrix for five previously characterized OMPDCs (structures 1-5) and the SG protein PY01515 (PDB ID 2aqw).
| PDB ID: | |||||||
|---|---|---|---|---|---|---|---|
| 0.240 | 0.260 | 0.600 | 0.060 | 0.240 | |||
| 0.240 | 0.280 | 0.280 | 0.120 | 0.220 | |||
| 0.260 | 0.280 | 0.300 | 0.080 | 0.280 | |||
| 0.600 | 0.280 | 0.300 | 0.060 | 0.200 | |||
| 0.060 | 0.120 | 0.080 | 0.060 | 0.020 | |||
| (SG) | 0.240 | 0.220 | 0.280 | 0.200 | 0.020 | ||
Local structural alignment of the consensus signature residues for the OMPDCs.
| Structurally aligned signature active site residues for OMPDC | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| PDB | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | |
| D11 | H88 | P182 | R215 | ||||||
| D20 | H98 | P180 | R203 | ||||||
| Protein | D37 | H122 | P202 | R235 | |||||
| D22 | H99 | P189 | R222 | ||||||
| D23 | n165 | P264 | R294 | ||||||
| SG | D23 | K105 | D139 | K141 | D144 | n168 | P267 | R297 | |
The first five rows represent previously-characterized OMPDCs. The sixth row is a putative OMPDC from Structural Genomics. The columns represent spatially coincident positions in the structural alignment; these positions are numbered 1-8. Known catalytic residues are shown in boldface. POOL-predicted residues are shown in uppercase; residues not predicted by POOL are shown in lowercase. The beta strand on which each position is located is given at the top of the column, above the position number. The good match between the SG protein and the known OMPDCs suggests common function.
Figure 1Structural alignment of predicted residues for YP_001304206.1 (gray) with those of endo-1,3-1,4-beta-D-glucan 4-glucanohydrolase (pink).
Local structural alignment of the residues in the GH16 consensus signature positions for the known representative GH16, endo-1,3-1,4-beta-D-glucan 4-glucanohydrolase, with the SG protein YP_001304206.1.
| Spatial Positions→ | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 2ayh (GH16) | F92 | Y94 | W103 | d104 | L111 | Q119 | N121 | Y123 | Y147 | W158 | W184 | |||
| 3h3l (SG) | F125 | - | Y138 | - | s140 | E142 | q144 | L146 | A166 | Y168 | - | t187 | H198 | H256 |
Previously reported active site residues are shown in boldface. POOL-predicted residues (top 8%) are shown in uppercase; residues not predicted are shown in lowercase. The poor match suggests different functions.
Figure 2Structural alignment of the POOL-predicted residues for the structural genomics protein YP_001304206.1 (gray) with those of a beta-1,4-xylanase from . The overlap of the three catalytic residues, E87, Y89, and E176 of the xylanase with the aligned, predicted residues from YP_001304206.1 is highlighted in the blue box and shown in close-up in the large box on the right.
Local structural alignment of the predicted active site residues by SALSA for a known ECH from Rattus norvegicus (PDB ID 1ey3) with predicted residues for a Structural Genomics protein from Mycobacterium avium (PDB ID 3q1t), reported to be an ECH.
| Spatial Positions→ | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 |
|---|---|---|---|---|---|---|---|---|---|
| Known ECH (1ey3) | C149 | D150 | R178 | k241 | N245 | ||||
| SG protein "ECH" (3q1t) | G76 | A123 | V126 | a131 | D132 | H146 | C160 | k223 | n227 |
Known catalytic residues are shown in boldface. Residues predicted by POOL are in uppercase; residues not predicted are in lowercase. Note the poor match between the residues of the SG protein with those of the representative ECH.
Local structural alignment of the predicted residues for the SG protein from Mycobacterium avium (PDB ID 3q1t) with the corresponding residues of ABDH from Cyanobacterium anabaena.
| POOL Ranking→ | 1 | 2 | 3 | 4 | 5 | 6 | 7 |
|---|---|---|---|---|---|---|---|
| SG protein "ECH" (3q1t) | D155 | H146 | E244 | D144 | H44 | C160 | H156 |
| Known ABDH (2j5s) | D141 | l158 | g154 | ||||
The spatial positions 1 through 7 correspond to the ordinal values for the top seven residues in the POOL rank order for 3q1t.
Known catalytic residues for ABDH are shown in boldface. Residues predicted by POOL are in uppercase; residues not predicted are in lowercase. Note that the match between the residues of the SG protein and of ABDH is better than that of Table 4.
Figure 3Structural alignment of the top POOL-predicted residues for the SG protein (purple; PDB 3q1t), reported to be an enoyl-CoA hydratase, with those of ABDH (green). H43, H144, D153, and E243 are known catalytic residues in ABDH.