| Literature DB >> 28291754 |
Daria A Beshnova1, Joana Pereira1, Victor S Lamzin1.
Abstract
Macromolecular X-ray crystallography is one of the main experimental techniques to visualize protein-ligand interactions. The high complexity of the ligand universe, however, has delayed the development of efficient methods for the automated identification, fitting and validation of ligands in their electron-density clusters. The identification and fitting are primarily based on the density itself and do not take into account the protein environment, which is a step that is only taken during the validation of the proposed binding mode. Here, a new approach, based on the estimation of the major energetic terms of protein-ligand interaction, is introduced for the automated identification of crystallographic ligands in the indicated binding site with ARP/wARP. The applicability of the method to the validation of protein-ligand models from the Protein Data Bank is demonstrated by the detection of models that are `questionable' and the pinpointing of unfavourable interatomic contacts.Entities:
Keywords: ARP/wARP; LigEnergy; automated identification of crystallographic ligands; protein–ligand interaction energy; protein–ligand interactions; structure validation
Mesh:
Substances:
Year: 2017 PMID: 28291754 PMCID: PMC5349431 DOI: 10.1107/S2059798317003400
Source DB: PubMed Journal: Acta Crystallogr D Struct Biol ISSN: 2059-7983 Impact factor: 7.652
Figure 1The energy of the bound state for the selected protein–ligand complexes. (a) The distribution of the energy computed; outliers with highly positive values are not shown. The median, minimum and maximum values are indicated. (b) The energy of the bound state as a function of the number of non-H ligand atoms. The Pearson linear correlation coefficient (ρ) and the smooth conditional mean (computed with ggplot2; Wickham, 2009 ▸) are shown. (c) Distribution of the bound-state energy normalized by the number of non-H ligand atoms; outliers with highly positive values are not shown. The median, minimum and maximum values are indicated.
Figure 2An example of weak protein–ligand interaction: a tandem winged-helix domain of RNA polymerase I subunit A49 in complex with a fragment of polyethylene glycol PEG 4000 (PE4).
Figure 3Ligand guessing without (yellow skeleton) and with (blue skeleton) the use of the estimated energy as an additional parameter for (a) the hypoxanthine-guanine-xanthine phosphoribosyltransferase [PDB entry 1hgx; 2mF o − mF c map contoured at a 2.5σ level above the mean (0.833 e Å−3) in black mesh], (b) the Bud32–Cgi121 protein complex [PDB entry 4ww7; 2mF o − mF c map contoured at a 2.5σ level above the mean (0.709 e Å−3) in black mesh] and (c) the putative 5′-methylthioadenosine/S-adenosylhomocysteine nucleosidase from B. burgdorferi B31 [PDB entry 4l0m; 2mF o − mF c map contoured at a 2.5σ level above the mean (0.831 e Å−3) in black mesh]. Dashed black lines indicate favourable contacts; dashed red lines show interatomic clashes.