Literature DB >> 22658682

A scalable and accurate method for classifying protein-ligand binding geometries using a MapReduce approach.

T Estrada1, B Zhang, P Cicotti, R S Armen, M Taufer.   

Abstract

We present a scalable and accurate method for classifying protein-ligand binding geometries in molecular docking. Our method is a three-step process: the first step encodes the geometry of a three-dimensional (3D) ligand conformation into a single 3D point in the space; the second step builds an octree by assigning an octant identifier to every single point in the space under consideration; and the third step performs an octree-based clustering on the reduced conformation space and identifies the most dense octant. We adapt our method for MapReduce and implement it in Hadoop. The load-balancing, fault-tolerance, and scalability in MapReduce allow screening of very large conformation spaces not approachable with traditional clustering methods. We analyze results for docking trials for 23 protein-ligand complexes for HIV protease, 21 protein-ligand complexes for Trypsin, and 12 protein-ligand complexes for P38alpha kinase. We also analyze cross docking trials for 24 ligands, each docking into 24 protein conformations of the HIV protease, and receptor ensemble docking trials for 24 ligands, each docking in a pool of HIV protease receptors. Our method demonstrates significant improvement over energy-only scoring for the accurate identification of native ligand geometries in all these docking assessments. The advantages of our clustering approach make it attractive for complex applications in real-world drug design efforts. We demonstrate that our method is particularly useful for clustering docking results using a minimal ensemble of representative protein conformational states (receptor ensemble docking), which is now a common strategy to address protein flexibility in molecular docking.
Copyright © 2012 Elsevier Ltd. All rights reserved.

Entities:  

Mesh:

Substances:

Year:  2012        PMID: 22658682     DOI: 10.1016/j.compbiomed.2012.05.001

Source DB:  PubMed          Journal:  Comput Biol Med        ISSN: 0010-4825            Impact factor:   4.589


  4 in total

Review 1.  Crowd sourcing difficult problems in protein science.

Authors:  Nathan S Alexander; Krzysztof Palczewski
Journal:  Protein Sci       Date:  2017-08-29       Impact factor: 6.725

2.  A survey of algorithms for transforming molecular dynamics data into metadata for in situ analytics based on machine learning methods.

Authors:  Michela Taufer; Trilce Estrada; Travis Johnston
Journal:  Philos Trans A Math Phys Eng Sci       Date:  2020-01-20       Impact factor: 4.226

Review 3.  Proteins and Their Interacting Partners: An Introduction to Protein-Ligand Binding Site Prediction Methods.

Authors:  Daniel Barry Roche; Danielle Allison Brackenridge; Liam James McGuffin
Journal:  Int J Mol Sci       Date:  2015-12-15       Impact factor: 5.923

4.  Automatic generation of bioinformatics tools for predicting protein-ligand binding sites.

Authors:  Yusuke Komiyama; Masaki Banno; Kokoro Ueki; Gul Saad; Kentaro Shimizu
Journal:  Bioinformatics       Date:  2015-11-05       Impact factor: 6.937

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.