Literature DB >> 21417267

Data-driven high-throughput prediction of the 3-D structure of small molecules: review and progress.

Alessio Andronico1, Arlo Randall, Ryan W Benz, Pierre Baldi.   

Abstract

Accurate prediction of the 3-D structure of small molecules is essential in order to understand their physical, chemical, and biological properties, including how they interact with other molecules. Here, we survey the field of high-throughput methods for 3-D structure prediction and set up new target specifications for the next generation of methods. We then introduce COSMOS, a novel data-driven prediction method that utilizes libraries of fragment and torsion angle parameters. We illustrate COSMOS using parameters extracted from the Cambridge Structural Database (CSD) by analyzing their distribution and then evaluating the system's performance in terms of speed, coverage, and accuracy. Results show that COSMOS represents a significant improvement when compared to state-of-the-art prediction methods, particularly in terms of coverage of complex molecular structures, including metal-organics. COSMOS can predict structures for 96.4% of the molecules in the CSD (99.6% organic, 94.6% metal-organic), whereas the widely used commercial method CORINA predicts structures for 68.5% (98.5% organic, 51.6% metal-organic). On the common subset of molecules predicted by both methods, COSMOS makes predictions with an average speed per molecule of 0.15 s (0.10 s organic, 0.21 s metal-organic) and an average rmsd of 1.57 Å (1.26 Å organic, 1.90 Å metal-organic), and CORINA makes predictions with an average speed per molecule of 0.13s (0.18s organic, 0.08s metal-organic) and an average rmsd of 1.60 Å (1.13 Å organic, 2.11 Å metal-organic). COSMOS is available through the ChemDB chemoinformatics Web portal at http://cdb.ics.uci.edu/ .

Entities:  

Mesh:

Year:  2011        PMID: 21417267      PMCID: PMC3081951          DOI: 10.1021/ci100223t

Source DB:  PubMed          Journal:  J Chem Inf Model        ISSN: 1549-9596            Impact factor:   4.956


  31 in total

1.  The Protein Data Bank.

Authors:  H M Berman; J Westbrook; Z Feng; G Gilliland; T N Bhat; H Weissig; I N Shindyalov; P E Bourne
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

2.  Comparison of knowledge-based and distance geometry approaches for generation of molecular conformations.

Authors:  B P Feuston; M D Miller; J C Culberson; R B Nachbar; S K Kearsley
Journal:  J Chem Inf Comput Sci       Date:  2001 May-Jun

3.  Conformational sampling by self-organization.

Authors:  Huafeng Xu; Sergei Izrailev; Dimitris K Agrafiotis
Journal:  J Chem Inf Comput Sci       Date:  2003 Jul-Aug

4.  TOUCHSTONE II: a new approach to ab initio protein structure prediction.

Authors:  Yang Zhang; Andrzej Kolinski; Jeffrey Skolnick
Journal:  Biophys J       Date:  2003-08       Impact factor: 4.033

5.  ZINC--a free database of commercially available compounds for virtual screening.

Authors:  John J Irwin; Brian K Shoichet
Journal:  J Chem Inf Model       Date:  2005 Jan-Feb       Impact factor: 4.956

6.  ChemDB update--full-text search and virtual chemical space.

Authors:  Jonathan H Chen; Erik Linstead; S Joshua Swamidass; Dennis Wang; Pierre Baldi
Journal:  Bioinformatics       Date:  2007-06-28       Impact factor: 6.937

7.  Analogy in computer-assisted design.

Authors:  W T Wipke; M A Hahn
Journal:  Prog Clin Biol Res       Date:  1989

8.  DrugBank: a comprehensive resource for in silico drug discovery and exploration.

Authors:  David S Wishart; Craig Knox; An Chi Guo; Savita Shrivastava; Murtaza Hassanali; Paul Stothard; Zhan Chang; Jennifer Woolsey
Journal:  Nucleic Acids Res       Date:  2006-01-01       Impact factor: 16.971

9.  PubChem: a public information system for analyzing bioactivities of small molecules.

Authors:  Yanli Wang; Jewen Xiao; Tugba O Suzek; Jian Zhang; Jiyao Wang; Stephen H Bryant
Journal:  Nucleic Acids Res       Date:  2009-06-04       Impact factor: 16.971

10.  Frog: a FRee Online druG 3D conformation generator.

Authors:  T Bohme Leite; D Gomes; M A Miteva; J Chomilier; B O Villoutreix; P Tufféry
Journal:  Nucleic Acids Res       Date:  2007-05-07       Impact factor: 16.971

View more
  5 in total

1.  Computational Approach to Molecular Catalysis by 3d Transition Metals: Challenges and Opportunities.

Authors:  Konstantinos D Vogiatzis; Mikhail V Polynski; Justin K Kirkland; Jacob Townsend; Ali Hashemi; Chong Liu; Evgeny A Pidko
Journal:  Chem Rev       Date:  2018-10-30       Impact factor: 60.622

2.  Incorporating post-translational modifications and unnatural amino acids into high-throughput modeling of protein structures.

Authors:  Ken Nagata; Arlo Randall; Pierre Baldi
Journal:  Bioinformatics       Date:  2014-02-25       Impact factor: 6.937

3.  JTHERGAS: Thermodynamic Estimation from 2D Graphical Representations of Molecules.

Authors:  Edward Blurock; V Warth; X Grandmougin; R Bounaceur; P A Glaude; F Battin-Leclerc
Journal:  Energy (Oxf)       Date:  2012-07       Impact factor: 7.147

4.  Small-molecule 3D structure prediction using open crystallography data.

Authors:  Peter Sadowski; Pierre Baldi
Journal:  J Chem Inf Model       Date:  2013-12-10       Impact factor: 4.956

5.  Open Babel: An open chemical toolbox.

Authors:  Noel M O'Boyle; Michael Banck; Craig A James; Chris Morley; Tim Vandermeersch; Geoffrey R Hutchison
Journal:  J Cheminform       Date:  2011-10-07       Impact factor: 5.514

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.