| Literature DB >> 20920230 |
Paola Bertolazzi1, Concettina Guerra, Giampaolo Liuzzi.
Abstract
BACKGROUND: A relevant problem in drug design is the comparison and recognition of protein binding sites. Binding sites recognition is generally based on geometry often combined with physico-chemical properties of the site since the conformation, size and chemical composition of the protein surface are all relevant for the interaction with a specific ligand. Several matching strategies have been designed for the recognition of protein-ligand binding sites and of protein-protein interfaces but the problem cannot be considered solved.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20920230 PMCID: PMC2957401 DOI: 10.1186/1471-2105-11-488
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Distance matrix. The matrix shows the results of all-to-all comparisons. The 9 ligands are indicated along the rows and columns and the proteins binding each ligand are grouped together. The grid of horizontal and vertical black lines separates different groups of proteins. The matrix is color-coded from 0 (blue) to 1 (red), with red corresponding to high number of aligned atoms and therefore high similarity in the shape of the binding sites and blue to the lowest degree of similarity.
Figure 2ROC curves. The curves reported in the figure show the fraction of true positives or correct answers vs. the fraction of false positives for all positions of the ranked solutions. Each curve in the figure shows the average values obtained on all query proteins of a group. As expected, the curve corresponding to PO4 (in green) deviates the most from the diagonal line being the closest to the top-left corner of the ROC square. Thus CO has a good success in predicting a PO4 binding site. By contrast, the worst performance is achieved for NAD binding proteins with the associated curve in dotted black.
Comparison of CO with MolLoc.
| 1 | atpE-1hck | 62 | 1.2 | 1.94 | 45 | 1.3 | 2.89 |
| 2 | 1atpE-1phk | 57 | 0.91 | 1.6 | 63 | 0.9 | 1.43 |
| 3 | 1atpE-1csn | 50 | 1.18 | 2.36 | 55 | 0.9 | 1.64 |
| 4 | 1atpE-1nsf | 34 | 2.11 | 6.21 | 11 | 1.4 | 12.73 |
| 5 | 1atpE-1j7k | 25 | 1.81 | 7.24 | 25 | 1.6 | 6.4 |
| 6 | 1atpE-1e8xA | 24 | 1.74 | 7.25 | 20 | 1.7 | 8.5 |
| 7 | 1atpE-1f9aC | 21 | 2.17 | 10.33 | 18 | 1.6 | 8.89 |
| 8 | 1atpE-1kay | 20 | 1.9 | 9.5 | 8 | 1.7 | 21.25 |
| 9 | 1atpE-1yag | 20 | 1.92 | 9.6 | 17 | 1.6 | 9.41 |
| 10 | 1atpE-1a82 | 19 | 2.02 | 10.63 | 13 | 1.9 | 14.62 |
| 11 | 1atpE-1jjv | 18 | 1.76 | 9.78 | 10 | 1.8 | 18 |
| 12 | 1atpE-1gn8A | 17 | 2.37 | 13.94 | 14 | 1.6 | 11.43 |
| 13 | 1atpE-1b8aA | 16 | 2.05 | 12.81 | 10 | 2 | 20 |
| 14 | 1atpE-1mjhA | 16 | 2.28 | 14.25 | 14 | 1.9 | 13.57 |
| 15 | 1atpE-1e2q | 15 | 1.39 | 9.27 | 5 | 1.8 | 36 |
| 16 | 1atpE-1kp2A | 13 | 1.51 | 11.62 | 15 | 1.9 | 12.67 |
| 17 | 1atpE-1ayl | 12 | 1.21 | 10.08 | 16 | 2 | 12.5 |
| 18 | 1atpE-1g5t | 7 | 2.26 | 32.29 | 8 | 1.6 | 20 |
| avg.SAS | 10.04 | avg.SAS | 12.88 | ||||
Pairwise comparisons of the binding site of protein 1atp with other 18 proteins all binding ATP (columns 2). The results of CO (columns 3-5) and MolLoc (columns 6-8). For a de_nition of SAS see the text. The comparisons are ranked based on the number of corresponding atoms in CO (column 3).
Figure 3Example of a computed superimposition. Comparison of CO with MolLoc. Pairwise comparisons of the binding site of protein 1atp with other 18 proteins all binding ATP (columns 2). The results of CO (columns 3-5) and MolLoc (columns 6-8). For a definition of SAS see the text. The comparisons are ranked based on the number of corresponding atoms in CO (column 3).