| Literature DB >> 20594344 |
Ratna R Thangudu1, Manoj Tyagi, Benjamin A Shoemaker, Stephen H Bryant, Anna R Panchenko, Thomas Madej.
Abstract
BACKGROUND: The study of protein-small molecule interactions is vital for understanding protein function and for practical applications in drug discovery. To benefit from the rapidly increasing structural data, it is essential to improve the tools that enable large scale binding site prediction with greater emphasis on their biological validity.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20594344 PMCID: PMC2909224 DOI: 10.1186/1471-2105-11-365
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Statistics of small molecules and their binding sites observed in protein structure complexes. a) Number of small molecules and binding sites observed per protein chain, b) size of the observed binding sites, c) histogram showing the number of observed and inferred binding sites with plotted versus the fraction (%) of protein chains having these sites.
Figure 2Biological validity of the IBIS inferred binding sites. a) Histogram showing the frequency of protein chains as function of their biological relevancy as suggested by overlap of the inferred binding sites with CDD conserved feature annotation. b) Percentage of proteins with their inferred sites having their 1st and 2nd rank clusters with CD annotations; 165 proteins have only one predicted site.
Prediction sensitivity (%) of the top three predictions by different geometric approaches and their comparison to IBIS.
| Method* | Top1 | Top2 | Top3 |
|---|---|---|---|
| IBIS100 | 73 | 89 | 89 |
| IBIS90 | 75 | 91 | 91 |
| IBIS80 | 72 | 88 | 88 |
| LIGSITEcs | 71 | 79 | 85 |
| PASS | 58 | 67 | 75 |
| Q-SiteFinder | 52 | 60 | 75 |
| SURFNET | 42 | 58 | 62 |
*IBIS100, IBIS90, IBIS80 dataset contains unbound query proteins for which the average sequence identity between the unbound protein and members of the binding site clusters containing the bound homolog is no more than 100%, 90% and 80% respectively.
Figure 3Tyrosine kinase homologoues with varying degrees of sequence conservation with different small molecules in their ATP-binding pocket. (a) Ephb2 Receptor Kinase domain with ADP. (b) Syk Tyrosine Kinase Domain in complex with Gleevec. (c) Ephb2 Receptor Tyrosine Kinase with Adenine.
Figure 4Mapping of the inferred binding site. Inferred binding site of peptide deformylase P.aeruginosa (PDB:1IX1) mapped onto the sequence of Helicobacter pyroli and its agreement with the observed binding site in N-trans-caffeoyltyramin-PDF complex (PDB: 1EW5). MMDB residue numbering is used which starts from the beginning of the corresponding GenBank protein sequence.
Figure 5Overview of the IBIS binding site annotation procedure.