| Literature DB >> 28291758 |
Jason C Cole1, Ilenia Giangreco1, Colin R Groom1.
Abstract
The Cambridge Structural Database (CSD) is the worldwide resourceEntities:
Keywords: Cambridge Structural Database; macromolecular crystallography; scripting
Mesh:
Substances:
Year: 2017 PMID: 28291758 PMCID: PMC5349435 DOI: 10.1107/S2059798316014352
Source DB: PubMed Journal: Acta Crystallogr D Struct Biol ISSN: 2059-7983 Impact factor: 7.652
Figure 1Contour plot showing the distribution of alcohol groups around oxazole rings. The more ‘solid’ the surface, the higher the likelihood of a group being in that position. None of the handful of alcohol groups near the O atom in small-molecule structures form hydrogen bonds. Such analyses can be performed using the interaction-map capabilities within Mercury or using the program IsoStar.
Figure 2Analysis of protein–ligand interactions using CSD data for KIT kinase (PDB entry 4hvs; Zhang et al., 2013 ▸). The bound inhibitor is superimposed on the interaction maps. The statistically most likely places one would find hydrogen-bond donors are shown in blue, hydrogen-bond acceptors in red and halogens in green. The maps show that these groups in the modelled ligand match the binding prediction well: it makes plausible interactions.
Figure 3The distributions of geometrical parameters in the CSD compared with the ligand in PDB entry 4hvs. The thumbnail histograms represent observations in the CSD of bonds and valence angles between atoms identical to, and in similar environments to, the ligand. Red lines show the values observed in the ligand. The green bars of the histograms represent structures in the CSD as of August 2015, the yellow bars represent structures added between then and November 2015, and the red bars those added between November 2015 and February 2016. (a) Bond-length distributions. For the highlighted bonds, the values lie outside the range observed in small-molecule structures. (b) Bond valence angle distributions. For valence angles at the highlighted atoms, the values lie outside the range observed in small-molecule structures. Note that some CSD distributions are significantly more populated than others and also how the recent addition of many structures (i.e. red and yellow bars) better informs us of the expected valence angles of CF3 groups (c).
Figure 4Docking of the 4hvs ligand using GOLD (shown in magenta), superimposed on the modelled ligand from the PDB structure (grey). The numbers represent the contribution of hydrogen-bonding groups to the docking score. Both N atoms of the azaindole ring are positioned well to make hydrogen bonds (green dashes), reflected in a ‘perfect’ score of −1.00. However, the N atom of the pyridine ring in the modelled crystal structure is rather poorly positioned to hydrogen-bond, reflected in its score of −0.51. The docking software positions the ligand slightly differently in the binding site and improves the geometry of the ligand, such that the atom is now in a perfect position to hydrogen-bond.
Figure 5Query used to return relevant dipeptides from the CSD. (a) Key peptide query and returned data parameters. (b) An improper torsion angle used to constrain the search to l-amino-acid dipeptide fragments. To ensure that we only extract peptidic fragments from l-amino-acid systems, we constrain the results so that the improper torsion angle C7–C8–C5–N3 is in the range 0.0–180°.
Figure 6Ramachandran plot of CSD entries containing dipeptides. The plot is coloured from blue to red based on τ, the backbone valence angle subtended at the α-carbon. A small number of entries with extreme values of τ (predominantly in cyclized or metal-bound structures) were removed.