Literature DB >> 34652141

Improving Structure-Based Virtual Screening with Ensemble Docking and Machine Learning.

Joel Ricci-Lopez1,2, Sergio A Aguila2, Michael K Gilson3, Carlos A Brizuela1.   

Abstract

One of the main challenges of structure-based virtual screening (SBVS) is the incorporation of the receptor's flexibility, as its explicit representation in every docking run implies a high computational cost. Therefore, a common alternative to include the receptor's flexibility is the approach known as ensemble docking. Ensemble docking consists of using a set of receptor conformations and performing the docking assays over each of them. However, there is still no agreement on how to combine the ensemble docking results to obtain the final ligand ranking. A common choice is to use consensus strategies to aggregate the ensemble docking scores, but these strategies exhibit slight improvement regarding the single-structure approach. Here, we claim that using machine learning (ML) methodologies over the ensemble docking results could improve the predictive power of SBVS. To test this hypothesis, four proteins were selected as study cases: CDK2, FXa, EGFR, and HSP90. Protein conformational ensembles were built from crystallographic structures, whereas the evaluated compound library comprised up to three benchmarking data sets (DUD, DEKOIS 2.0, and CSAR-2012) and cocrystallized molecules. Ensemble docking results were processed through 30 repetitions of 4-fold cross-validation to train and validate two ML classifiers: logistic regression and gradient boosting trees. Our results indicate that the ML classifiers significantly outperform traditional consensus strategies and even the best performance case achieved with single-structure docking. We provide statistical evidence that supports the effectiveness of ML to improve the ensemble docking performance.

Entities:  

Mesh:

Substances:

Year:  2021        PMID: 34652141      PMCID: PMC8865842          DOI: 10.1021/acs.jcim.1c00511

Source DB:  PubMed          Journal:  J Chem Inf Model        ISSN: 1549-9596            Impact factor:   4.956


  86 in total

1.  Utilizing experimental data for reducing ensemble size in flexible-protein docking.

Authors:  Mengang Xu; Markus A Lill
Journal:  J Chem Inf Model       Date:  2011-12-19       Impact factor: 4.956

2.  PTRAJ and CPPTRAJ: Software for Processing and Analysis of Molecular Dynamics Trajectory Data.

Authors:  Daniel R Roe; Thomas E Cheatham
Journal:  J Chem Theory Comput       Date:  2013-06-25       Impact factor: 6.006

3.  Ensemble docking into multiple crystallographically derived protein structures: an evaluation based on the statistical analysis of enrichments.

Authors:  Ian R Craig; Jonathan W Essex; Katrin Spiegel
Journal:  J Chem Inf Model       Date:  2010-04-26       Impact factor: 4.956

4.  In Need of Bias Control: Evaluating Chemical Data for Machine Learning in Structure-Based Virtual Screening.

Authors:  Jochen Sieg; Florian Flachsenberg; Matthias Rarey
Journal:  J Chem Inf Model       Date:  2019-03-05       Impact factor: 4.956

5.  Exploring protein flexibility: incorporating structural ensembles from crystal structures and simulation into virtual screening protocols.

Authors:  David J Osguthorpe; Woody Sherman; Arnold T Hagler
Journal:  J Phys Chem B       Date:  2012-04-23       Impact factor: 2.991

6.  Improvement of virtual screening results by docking data feature analysis.

Authors:  Marcelino Arciniega; Oliver F Lange
Journal:  J Chem Inf Model       Date:  2014-05-14       Impact factor: 4.956

7.  Assessing an ensemble docking-based virtual screening strategy for kinase targets by considering protein flexibility.

Authors:  Sheng Tian; Huiyong Sun; Peichen Pan; Dan Li; Xuechu Zhen; Youyong Li; Tingjun Hou
Journal:  J Chem Inf Model       Date:  2014-09-29       Impact factor: 4.956

8.  Plasticity of the Binding Site of Renin: Optimized Selection of Protein Structures for Ensemble Docking.

Authors:  Claas Strecker; Bernd Meyer
Journal:  J Chem Inf Model       Date:  2018-05-02       Impact factor: 4.956

Review 9.  Protein flexibility in docking and surface mapping.

Authors:  Katrina W Lexa; Heather A Carlson
Journal:  Q Rev Biophys       Date:  2012-05-09       Impact factor: 5.318

10.  An improved relaxed complex scheme for receptor flexibility in computer-aided drug design.

Authors:  Rommie E Amaro; Riccardo Baron; J Andrew McCammon
Journal:  J Comput Aided Mol Des       Date:  2008-01-15       Impact factor: 3.686

View more
  3 in total

1.  Ensemble learning from ensemble docking: revisiting the optimum ensemble size problem.

Authors:  Sara Mohammadi; Zahra Narimani; Mitra Ashouri; Rohoullah Firouzi; Mohammad Hossein Karimi-Jafari
Journal:  Sci Rep       Date:  2022-01-10       Impact factor: 4.379

2.  Collaborative Approach between Explainable Artificial Intelligence and Simplified Chemical Interactions to Explore Active Ligands for Cyclin-Dependent Kinase 2.

Authors:  Tomomi Shimazaki; Masanori Tachikawa
Journal:  ACS Omega       Date:  2022-03-18

3.  Virtual Screening with Gnina 1.0.

Authors:  Jocelyn Sunseri; David Ryan Koes
Journal:  Molecules       Date:  2021-12-04       Impact factor: 4.411

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.