| Literature DB >> 31459585 |
Giuseppe Sciortino1,2, Eugenio Garribba2, Jaime Rodríguez-Guerra Pedregal1, Jean-Didier Maréchal1.
Abstract
With more than a third of the genome encoding for metal-containing biomolecules, the in silico prediction of how metal ions bind to proteins is crucial in chemistry, biology, and medicine. To date, algorithms for metal-binding site prediction are mainly based on sequence analysis. Those methods have reached enough quality to predict the correct region of the protein and the coordinating residues involved in metal-binding, but they do not provide three-dimensional (3D) models. On the contrary, the prediction of accurate 3D models for protein-metal adducts by structural bioinformatics and molecular modeling techniques is still a challenge. Here, we present an update of our multipurpose molecular modeling suite, GaudiMM, to locate metal-binding sites in proteins. The approach is benchmarked on 105 X-ray structures with resolution lower than 2.0 Å. Results predict the correct binding site of the metal in the biological scaffold for all the entries in the data set. Generated 3D models of the protein-metal coordination complexes reach root-mean-square deviation values under 1.0 Å between calculated and experimental structures. The whole process is purely based on finding poses that satisfy metal-derived geometrical rules without needing sequence or fine electronic inputs. Additional post-optimizations, including receptor flexibility, have been tested and suggest that more extensive searches, required when the host structures present a low level of pre-organization, are also possible. With this new update, GaudiMM is now able to look for metal-binding sites in biological scaffolds and clearly shows how explicitly considering the geometric particularities of the first coordination sphere of the metal in a docking process provides excellent results.Entities:
Year: 2019 PMID: 31459585 PMCID: PMC6648054 DOI: 10.1021/acsomega.8b03457
Source DB: PubMed Journal: ACS Omega ISSN: 2470-1343
Figure 1Given a metal ion M, potential donors (D) are searched in its surroundings. If the distance M–D is smaller than the r threshold for more than three potential donors, the RMSD between the donor atom positions and the chosen ideal polyhedron (an octahedron in this figure) is computed. If the test is successful and D is bound to another atom (1st_neighbor), the M–D–1st_neighbor angle α is computed. If 1st_neighbor has another nonterminal neighbor (2nd_neighbor), the M–D–1st_neighbor–2nd_neighbor dihedral angle θ is also computed. The absolute sines of the differences between the calculated angles and their ideal counterparts (according to chimera.bondGeom routines) are then calculated. Since the polyhedron matching procedure is scalable, the absolute difference between the M–D distances and the expected ideal coordination distances for those elements are also computed. All the obtained terms are then summed together, and the sum should be zero for an ideal coordination geometry.
Summary of RMSD between the Experimental and the Predicted Binding Sites (First Column) and Coord_Fitness Distribution of the Data Set
| RMSD | total | Coord_Fitness ≤ 3.5 | 3.5 < Coord_Fitness ≤ 5.0 |
|---|---|---|---|
| <0.5 | 67 | 27 | 28 |
| 0.5–1.0 | 38 | 16 | 15 |
| >1.0 |
Value reported in Å.
RMSD computed via UCSF Chimera.
Value calculated by GaudiMM using eq S1 in the Supporting Information.
Figure 2Comparison between the GaudiMM solution (in orange) and the original X-ray diffraction (XRD) structure (in dots surface) for the PDB structures (a) 1EJJ, (b) 414A, and (c) 2Y12.
Summary of Reproduced Binding Sites, Donors, and RMSD Values Obtained after the Flexible Side Chain Docking Simulation
| structure (PDB) | donorsXRD | rotamers | donorscalcd | RMSD |
|---|---|---|---|---|
| 1AVW (Ca) | 3(−OOCAsp); 2(OCamide) | Glu70, Glu80, Glu77, Val75, Asn72 | –OOC(Glu80); OCamide(Val75) | 0.556 |
| 1AX1 (Mn) | 2(−OOCAsp); (−OOCGlu); (NHis) | Asp129, Asp136, Glu127, His142 | (−OOC)Asp129; (−OOC)Asp136; (−OOC)Glu127; (N)His142 | 1.238 |
| 1B8C (Mg) | 4(−OOCAsp); (OCamide) | Asp90, Asp92, Asp94, Ile96, Asp101 | (−OOC)Asp90; (−OOC)Asp92; (−OOC)Asp101; (OCamide)Lys96 | 0.792 |
| 1B71 (Fe) | 3(−OOCGlu); (NHis) | Glu94, Glu128, Glu53, His131 | (−OOC)Glu94; (−OOC)Glu128; (N)His131 | 0.268 |
| 1FX7 (Co) | 2(−OOCGlu); 2(NHis); (OCNGln) | His79, Glu83, His98, Glu172, Gln175 | (N)His79; (−OOC)Glu83; (N)His98; (−OOC)Glu172 | 0.889 |
| 1J5Y (Ni) | (−OOCGlu); 3(NHis) | His148, His79, His146, Glu87 | ||
| 1XJS (Zn) | 3(−SCys); (−OOCAsp) | Cys66, Cys128, Cys41, Asp43 | (−S)Cys128; (−S)Cys41; (−OOC)Asp43 | 1.023 |
| 3AWS (Cu) | (−OOCAsp); (NHis); (OCamide); (−Namide) | His82, Glu67, His68 | (−OOC)Glu67; (N)His82 | 0.391 |
| success rate | 87.5% |
RMSD value computed via UCSF Chimera, reported in Å.
Bidentate coordination of the carboxylate group.
The correct binding site was not identified during the calculation.