| Literature DB >> 23895374 |
Adrià Cereto-Massagué1, María José Ojeda, Robbie P Joosten, Cristina Valls, Miquel Mulero, M Josepa Salvado, Anna Arola-Arnal, Lluís Arola, Santiago Garcia-Vallvé, Gerard Pujadas.
Abstract
BACKGROUND: Many Protein Data Bank (PDB) users assume that the deposited structural models are of high quality but forget that these models are derived from the interpretation of experimental data. The accuracy of atom coordinates is not homogeneous between models or throughout the same model. To avoid basing a research project on a flawed model, we present a tool for assessing the quality of ligands and binding sites in crystallographic models from the PDB.Entities:
Year: 2013 PMID: 23895374 PMCID: PMC3733808 DOI: 10.1186/1758-2946-5-36
Source DB: PubMed Journal: J Cheminform ISSN: 1758-2946 Impact factor: 5.514
Figure 1Automatic ligand and binding site classification. This diagram shows the process by which the ligands and binding sites of each PDB/PDB_REDO model are classified based on how well the model fits the ED.
Figure 2Example of a Good ligand with a Dubious binding site. Here, we can see a ligand (S14 B1002 in PDB entry 2FJP [23]) and its binding site, from the analysis of DPP4_HUMAN using the Default (PDB) profile. The only dubious residue from the binding site is the one with the yellow ED represented as ball and stick and colored by B-factor.
Figure 3Example of a dubious ligand with a bad binding site. Here, we can see a ligand (AZV A 1 in PDB entry 3Q8W [24]) and its binding site from the same analysis as in Figure 2. As can be seen, some residues from this binding site hardly fit their ED (in yellow). The ligand mostly fits its ED, but it still has some discrepancies.
Average validation scores for ligands in PDB and PDB_REDO
| RSR c | 0.120 | 0.104 |
| RSCC c | 0.90 | 0.92 |
| Heat of formation (kJ/mol) d | −1011 | −1067 |
| Hydrogen bonding energy (kJ/mol) d | −57.7 | −58.8 |
| Hydrophobic contact strength d,e | 16.20 | 16.43 |
| π-π interaction strength d,e | 1.26 | 1.28 |
| Cation-π interaction strength d,e | 1.17 | 1.19 |
| Number of atomic clashes d | 9.1 | 7.9 |
a A smaller value is better for RSR, heat of formation (strained ligand conformations give higher values), hydrogen bonding energy and number of atomic clashes. A larger value is better for RSCC, hydrophobic contact strength, π-π interaction strength and cation-π interaction strength.
b Average over 16,904 ligands (13,703 for heat of formation) in 5,932 structure models.
c Calculated using EDSTATS [27].
d Calculated using YASARA [32] using the atomic coordinates as is.
e The average reported is the average sum of all interactions for a single ligand.
Figure 4The guanosine-5′-monophosphate binding site in chain C of PDB entry 1A97 [[34]] is an example of a ligand and binding site flagged as dubious and bad in the PDB, respectively (left panel: upper with cyan ED for the binding site and red ED for the ligand; lower with default view: yellow ED for and residues), but scored as in PDB_REDO (right panel: upper with cyan ED for the binding site and red ED for the ligand; lower with previously bad or dubious residues with orange ED). The RSR and RSCC of the ligand improve from 0.154 to 0.065 and from 0.86 to 0.97, respectively. Two extra hydrogen bonds are introduced, improving the total hydrogen bonding energy from −157 kJ/mol to −199 kJ/mol. The all-atom root mean square deviation of the ligand is 0.6 Å. Of the residues in the binding site, arginine 69 and the boric acid molecule improve most significantly in terms of fit to the ED.
Analysis of all binding sites present in both PDB and PDB_REDO
| EDS | Good | 5,145 | 1,600 | 926 | 7,671 |
| Bad | 5,500 | 3,727 | 8,395 | 17,622 | |
| Dubious | 3,659 | 2,953 | 7,915 | 14,527 | |
| 14,304 | 8,280 | 17,236 | 39,820 | ||
This table shows how binding sites were classified when coming from the EDS or from the PDB_REDO databank.
Analysis of all ligands present in both PDB and PDB_REDO
| EDS | Good | 11,741 | 16 | 662 | 12,419 |
| Bad | 9,819 | 1,206 | 6,098 | 17,123 | |
| Dubious | 3,790 | 229 | 6,259 | 10,278 | |
| 25,350 | 1,451 | 17,236 | 39,820 | ||
This table shows how ligands were classified when coming from the EDS or from the PDB_REDO databank.
Number of complexes classified as , or after applying VHELIBS to 75 ligand/DPP-IV binding site complexes using the profile
| Ligand | Good | 43 | 0 | 12 | 55 |
| Bad | 0 | 0 | 0 | 0 | |
| Dubious | 14 | 0 | 6 | 20 | |
| 57 | 0 | 18 | 75 | ||