| Literature DB >> 18940825 |
Abstract
MOTIVATION: Specific non-covalent binding of metal ions and ligands, such as nucleotides and cofactors, is essential for the function of many proteins. Computational methods are useful for predicting the location of such binding sites when experimental information is lacking. Methods that use structural information, when available, are particularly promising since they can potentially identify non-contiguous binding motifs that cannot be found using only the amino acid sequence. Furthermore, a prediction method that can utilize low-resolution models is advantageous because high-resolution structures are available for only a relatively small fraction of proteins.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18940825 PMCID: PMC2639300 DOI: 10.1093/bioinformatics/btn543
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Number of ligand binding proteins and sites in the non-redundant datasets for each ligand
| Ligand | Number of proteins in the training set | Number of sites in the training set |
|---|---|---|
| Ca | 273(355) | 434(536) |
| Cu | 33(45) | 51(67) |
| Fe | 84(95) | 105(123) |
| Mg | 433(479) | 549(575) |
| Mn | 148(172) | 232(239) |
| Zn | 401(541) | 517(687) |
| AMP | 48(71) | 48(83) |
| ATP, ADP+analogs | 172(269) | 173(299) |
| FAD | 52(85) | 52(105) |
| Heme | 53(123) | 57(208) |
| NAD, NADP, NADPH | 107(170) | 107(199) |
The values in parentheses are the numbers of proteins or sites before removing sites that contact multiple proteins.
Area under the ROC curve for 10-fold cross-validation prediction of ligand binding sites using ligand-bound and ligand-unbound protein structures
| AUC | ||||
|---|---|---|---|---|
| Ligand | Bound structure | Only top 20 variables (bound structure) | Without conservation (bound structure) | Unbound structure |
| Ca | 0.861 | 0.850 | 0.856 | 0.813 |
| Cu | 0.952 | 0.784 | 0.952 | |
| Fe | 0.960 | 0.948 | 0.953 | |
| Mg | 0.823 | 0.809 | 0.794 | 0.763 |
| Mn | 0.897 | 0.884 | 0.879 | 0.895 |
| Zn | 0.964 | 0.958 | 0.958 | 0.913 |
| AMP | 0.799 | 0.842 | 0.797 | |
| ATP, ADP+analogs | 0.884 | 0.901 | 0.852 | 0.836 |
| FAD | 0.941 | 0.928 | 0.941 | |
| Heme | 0.971 | 0.955 | 0.971 | |
| NAD, NADP, | 0.959 | 0.959 | 0.959 | 0.876 |
| NADPH | ||||
AUC values are also given for predictions using only the top 20 most important variables and without evolutionary conservation.
Fig. 1.An example illustrating the successful discrimination between two different metal ions, which in this case bind to the same protein (fungal lysyl oxidase, PDB entry 1N9E). Calcium and copper binding site predictions are shown in the top and bottom figures, respectively. Predicted binding residues, with R>20, are colored red, predicted non-binding residues, with R<10, are colored blue, and residues with intermediate ambiguous values are colored yellow. The bound copper ion is orange and the calcium ions are green. The insets show a detailed view of the binding sites.