| Literature DB >> 35480897 |
Nicholas M Pearce1, Rachael Skyner2, Tobias Krojer3.
Abstract
The throughput of macromolecular X-ray crystallography experiments has surged over the last decade. This remarkable gain in efficiency has been facilitated by increases in the availability of high-intensity X-ray beams, (ultra)fast detectors and high degrees of automation. These developments have in turn spurred the development of several dedicated centers for crystal-based fragment screening which enable the preparation and collection of hundreds of single-crystal diffraction datasets per day. Crystal structures of target proteins in complex with small-molecule ligands are of immense importance for structure-based drug design (SBDD) and their rapid turnover is a prerequisite for accelerated development cycles. While the experimental part of the process is well defined and has by now been established at several synchrotron sites, it is noticeable that software and algorithmic aspects have received far less attention, as well as the implications of new methodologies on established paradigms for structure determination, analysis, and visualization. We will review three key areas of development of large-scale protein-ligand studies. First, we will look into new software developments for batch data processing, followed by a discussion of the methodological changes in the analysis, modeling, refinement and deposition of structures for SBDD, and the changes in mindset that these new methods require, both on the side of depositors and users of macromolecular models. Finally, we will highlight key new developments for the presentation and analysis of the collections of structures that these experiments produce, and provide an outlook for future developments.Entities:
Keywords: data management; data presentation and analysis; fragment screening; macromolecular crystallography; multi-state modelling
Year: 2022 PMID: 35480897 PMCID: PMC9035521 DOI: 10.3389/fmolb.2022.861491
Source DB: PubMed Journal: Front Mol Biosci ISSN: 2296-889X
FIGURE 1Outline of a generic data processing workbench. Future workbenches will likely be hosted in the cloud [e.g., European Open Science Cloud (EOSC)] and take in various (meta-) data through customizable entry points. Users can then define the sample/data relationship and connect pre-defined tasks as needed for their workflow, such as in KNIME workflows. Workflows could be saved and shared with any other interested party. All results and workflows would be stored in an internal database for each project and local programs or web-services can gain access through a dedicated “Results” node.
FIGURE 2Conventional OMIT maps do not generally produce clear unambiguous evidence of binding for low occupancy ligands, even at high resolution. Examples for four binding fragments showing different levels of support for binding from OMIT maps, where clear evidence of binding is shown by PanDDA event maps. Map coloring: 2mFo-DFc maps are shown as blue mesh and all difference maps are shown as green/red mesh; associated PanDDA event maps are shown as purple surfaces. Map type and contour are as indicated. Both types of OMIT maps are produced by phenix.polder. Maps are truncated (carved) at 3Å around relevant residues and ligands for clarity; PanDDA maps are carved at 2Å. Model coloring: To distinguish alternate conformations, carbon atoms for ligand-associated conformations are coloured orange and non-ligand-associated conformations are coloured blue; main-conformation (full-occupancy) atoms are coloured light gray; all other atoms are colored by element except waters, which are coloured as per carbon atoms. Resolutions: (A) 1.40Å, (B) 1.60Å, (C) 1.29Å, (D) 1.34Å. Refined ligand occupancies: (A) 0.41, (B) 0.50, (C) 0.38, (D) 0.22. PanDDA event map pseudo-occupancies (1-BDC): (A) 0.15, (B) 0.17, (C) 0.10, (D) 0.11. (A) Binding is not evident in the 2mFo-DFc maps at a moderate contour level, but is clearly supported by both types of OMIT map, especially when considered in combination with the extra density from the superposed water molecules from non-ligand-associated conformations, as modeled. It is debatable whether the OMIT maps alone would provide strong enough evidence to support modeling of the ligand, but the single ligand conformation is clearly evidenced in the PanDDA event map, preventing potential misinterpretation of the OMIT map as multiple conformations of the ligand. (B) Similar to (A), but with less evidence in the simple OMIT map. The polder OMIT map provides an envelope which fits well with the envelope provided by the ligand and the superposed water molecules, as modeled. It is unlikely either OMIT map would be accepted as evidence of binding, but once more, the ligand conformation is clearly identified in the event map. (C) OMIT maps show mostly features which correspond to superposed (not-ligand-associated) waters, and do not present evidence for the bound ligand, unlike the event map. (D) Ligand binding coincides with an alternate conformation of an arginine residue, which dominates the refined maps and OMIT maps.
FIGURE 3Fragalysis aims to provide immediate access to ligand-protein information without confounding crystallographic artifacts. To achieve this, a given crystal structure (top left—Crystal A) is inspected to find all of the individual ligands. These ligands are then separated into separate bound-state entities (top right—Ligand 0A and Ligand 1A) using the Fragalysis API. Ligands are subsequently separated from their respective protein (top-right Ligand 0A and Protein 0A), and presented in Fragalysis as part of an ensemble of all ligands and proteins in the same reference frame (bottom).