| Literature DB >> 16844973 |
M Tyagi1, P Sharma, C S Swamy, F Cadet, N Srinivasan, A G de Brevern, B Offmann.
Abstract
Encoding protein 3D structures into 1D string using short structural prototypes or structural alphabets opens a new front for structure comparison and analysis. Using the well-documented 16 motifs of Protein Blocks (PBs) as structural alphabet, we have developed a methodology to compare protein structures that are encoded as sequences of PBs by aligning them using dynamic programming which uses a substitution matrix for PBs. This methodology is implemented in the applications available in Protein Block Expert (PBE) server. PBE addresses common issues in the field of protein structure analysis such as comparison of proteins structures and identification of protein structures in structural databanks that resemble a given structure. PBE-T provides facility to transform any PDB file into sequences of PBs. PBE-ALIGNc performs comparison of two protein structures based on the alignment of their corresponding PB sequences. PBE-ALIGNm is a facility for mining SCOP database for similar structures based on the alignment of PBs. Besides, PBE provides an interface to a database (PBE-SAdb) of preprocessed PB sequences from SCOP culled at 95% and of all-against-all pairwise PB alignments at family and superfamily levels. PBE server is freely available at http://bioinformatics.univ-reunion.fr/PBE/.Entities:
Mesh:
Year: 2006 PMID: 16844973 PMCID: PMC1538797 DOI: 10.1093/nar/gkl199
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Mining SCOP for similar structures using PB alignment
| True class versus hit class | ALPHA | BETA | ALPHABETA | APLUSB | MULTIDOM | MEMBRANE | SMALL | Total |
|---|---|---|---|---|---|---|---|---|
| ALPHA | 245 (88.1%) | 1 | 12 | 9 | 1 | 5 | 5 | 278 |
| BETA | 2 | 404 (94.4%) | 5 | 10 | 0 | 1 | 6 | 428 |
| ALPHABETA | 3 | 5 | 255 (89.5%) | 18 | 3 | 0 | 1 | 285 |
| APLUSB | 16 | 23 | 27 | 240 (76.2%) | 0 | 1 | 8 | 315 |
| MULTIDOM | 0 | 0 | 5 | 2 | 11 (61.1%) | 0 | 0 | 18 |
| MEMBRANE | 10 | 5 | 0 | 1 | 1 | 12 (41.3%) | 0 | 29 |
| SMALL | 2 | 15 | 0 | 8 | 0 | 0 | 122 (84.7%) | 144 |
| 1500 |
Confusion matrix between true (vertical) and predicted (horizontal) SCOP classes.
Figure 1Mining SCOP for similar structures using PB alignment. Distribution of number of hits in top 10 ranking alignments. If a given query and extracted alignment have same FOLD, a hit is counted at that position.
Mining SCOP for similar structures using PB alignment
| SCOP class | Top10 (%) | Top5 (%) | Top1 (%) |
|---|---|---|---|
| Alpha (1312) | 86.1 (1130) | 82.6 (1087) | 75.0 (985) |
| Beta (2076) | 92.9 (1930) | 91.4 (1897) | 87.2 (1811) |
| AlphaBeta (1386) | 93.6 (1298) | 92.0 (1275) | 88.4 (1226) |
| AplusB (1500) | 88.3 (1325) | 86.3 (1294) | 81.3 (1219) |
| Small (700) | 87.7 (614) | 84.3 (590) | 70.3 (492) |
| Membrane (139) | 91.4 (127) | 89.2 (124) | 81.3 (113) |
| MultiDomain (82) | 85.4 (70) | 84.1 (69) | 81.7 (67) |
Hit rates (in percentage) for identifying similar fold within each SCOP classes. Are given rates that take into account Top10, Top5 and Top1 ranking alignments. Exact numbers for each case are given within parentheses.