| Literature DB >> 34921117 |
Joseph M Paggi1,2,3,4, Julia A Belk1,2,3,4, Scott A Hollingsworth1,2,3,4, Nicolas Villanueva5, Alexander S Powers1,2,3,4,6, Mary J Clark5, Augustine G Chemparathy1,2,3,4, Jonathan E Tynan1,2,3,4, Thomas K Lau1,2,3,4, Roger K Sunahara5, Ron O Dror7,2,3,4.
Abstract
Over the past five decades, tremendous effort has been devoted to computational methods for predicting properties of ligands-i.e., molecules that bind macromolecular targets. Such methods, which are critical to rational drug design, fall into two categories: physics-based methods, which directly model ligand interactions with the target given the target's three-dimensional (3D) structure, and ligand-based methods, which predict ligand properties given experimental measurements for similar ligands. Here, we present a rigorous statistical framework to combine these two sources of information. We develop a method to predict a ligand's pose-the 3D structure of the ligand bound to its target-that leverages a widely available source of information: a list of other ligands that are known to bind the same target but for which no 3D structure is available. This combination of physics-based and ligand-based modeling improves pose prediction accuracy across all major families of drug targets. Using the same framework, we develop a method for virtual screening of drug candidates, which outperforms standard physics-based and ligand-based virtual screening methods. Our results suggest broad opportunities to improve prediction of various ligand properties by combining diverse sources of information through customized machine-learning approaches.Entities:
Keywords: antipsychotics; artificial intelligence; drug design; structural biology; virtual screening
Mesh:
Substances:
Year: 2021 PMID: 34921117 PMCID: PMC8713799 DOI: 10.1073/pnas.2112621118
Source DB: PubMed Journal: Proc Natl Acad Sci U S A ISSN: 0027-8424 Impact factor: 11.205
Fig. 1.ComBind leverages nonstructural data to improve ligand binding pose predictions. (A) Standard docking methods take as input the chemical structure of the query ligand and the 3D structure of the target protein and predict a binding pose using a per-ligand scoring function. (B) ComBind additionally considers other ligands known to bind the target protein (whose binding poses are not known), resulting in more-accurate predictions. For clarity, hydrogen and fluorine atoms are omitted from the 3D renderings.
Fig. 2.Distinct ligands that bind to a given target protein often adopt similar binding poses and do so more frequently than predicted by a state-of-the-art per-ligand docking method. (A) Chemically distinct ligands share key interactions with the mineralocorticoid receptor (Protein Data Bank IDs: 2AA2, 5L7E, and 5MWP). (B) Across a set of 3,115 ligand pairs, interaction similarities are generally higher in pairs of correct poses than in pairs of poses ranked highly by a per-ligand scoring function. Shading depicts the per-target SEM. A.U.: arbitrary units. (C) Across a set of 690 ligand pairs with a shared substructure, the substructure tends to be placed more similarly in correct poses than in other poses ranked highly by a per-ligand scoring function ().
Fig. 3.ComBind discovers and rewards key interactions shared by distinct ligands. (A) Whereas per-ligand docking considers each ligand individually, ComBind jointly selects poses for all ligands, optimizing for poses that are individually favorable according to a per-ligand scoring function and together form a coherent set of protein–ligand interactions. (B) We tested the ability of ComBind to predict the poses of 11 ligands that bind the β1-adrenergic receptor. Each dot corresponds to a single ligand, with the dot’s position indicating the error in the predicted pose (RMSD from the experimentally determined pose) for ComBind and for state-of-the art per-ligand docking software (Glide). A pose is considered correct if its RMSD is ≤2.0 Å (dashed lines). ComBind predicts a substantially more accurate pose than Glide for seven of the 11 ligands. (C) The set of residues with which each ligand forms salt bridges or hydrogen bonds when positioned in its experimentally determined pose (Top), the pose predicted by per-ligand docking (Left), and the pose predicted by ComBind (Right).
Fig. 4.ComBind outperforms per-ligand docking on a diverse benchmark set. Performance of ComBind, as compared to a per-ligand scoring function, using helper ligands selected automatically from ChEMBL. All results are for “cross-docking” (the query ligand is docked into a structure determined in the presence of a distinct ligand). (A) Performance per target protein, target protein family (GPCRs, ion channels, etc.), and overall. Green disks and black circles indicate performance for ComBind and a state-of-the art per-ligand docking software package (Glide), respectively. (B) Performance as a function of the number of helper ligands. When using no helper ligands, ComBind is equivalent to per-ligand docking.
Fig. 5.On virtual screening, ComBindVS outperforms per-ligand docking, a ligand-based method (“chemical similarity”), and a combination of the two. Performance on DUD-E benchmark set when restricting to candidate molecules that have less than 0.2 Tanimoto similarity to all helper ligands (Top) or a maximum of between 0.2 and 0.3 Tanimoto similarity to any helper ligand (Bottom). “Chemical similarity” is a state-of-the-art ligand-based method ().
Fig. 6.Prediction and validation of the binding poses of antipsychotics at D2R. (A and B) Glide (A) and ComBind (B) predict very different binding poses for pimozide (and for benperidol; ). (C) Mutagenesis experiments validate ComBind’s predictions. In ComBind’s predicted pose for pimozide, its “extra” ring is uncomfortably close to S193, such that decreasing the size of residue 193 (S193A) increases pimozide’s binding affinity and increasing the size of residue 193 (S193V and S193L) decreases pimozide’s binding affinity. WT represents the wild-type (unmutated) receptor. (D) As a control, we verified that benperidol—which lacks this “extra” ring but is otherwise identical to pimozide—does not exhibit the same trend. Error bars show SEM. Refer to for additional data. (E) Comparison of experimental binding energies and per-ligand docking scores for analogs of spiperone. The red circle denotes the experimental binding energy and per-ligand docking score for spiperone. (F) Same but for ComBindVS scores instead of per-ligand docking scores.