Literature DB >> 30739238

Multi-task generative topographic mapping in virtual screening.

Arkadii Lin1,2, Dragos Horvath1, Gilles Marcou1, Bernd Beck2, Alexandre Varnek3.   

Abstract

The previously reported procedure to generate "universal" Generative Topographic Maps (GTMs) of the drug-like chemical space is in practice a multi-task learning process, in which both operational GTM parameters (example: map grid size) and hyperparameters (key example: the molecular descriptor space to be used) are being chosen by an evolutionary process in order to fit/select "universal" GTM manifolds. After selection (a one-time task aimed at optimizing the compromise in terms of neighborhood behavior compliance, over a large pool of various biological targets), for any further use the manifolds are ready to provide "fit-free" predictive models. Using any structure-activity set-irrespectively whether the associated target served at map fitting stage or not-the generation or "coloring" a property landscape enables predicting the property for any external molecule, with zero additional fitable parameters involved. While previous works have signaled the excellent behavior of such models in aggressive three-fold cross-validation assessments of their predictive power, the present work wished to explore their behavior in Virtual Screening (VS), here simulated on hand of external DUD ligand and decoy series that are fully disjoint from the ChEMBL-extracted landscape coloring sets. Beyond the rather robust results of the universal GTM manifolds in this challenge, it could be shown that the descriptor spaces selected by the evolutionary multi-task learner were intrinsically able to serve as an excellent support for many other VS procedures, starting from parameter-free similarity searching, to local (target-specific) GTM models, to parameter-rich, nonlinear Random Forest and Neural Network approaches.

Entities:  

Keywords:  Big data; ChEMBL; DUD; Generative topographic mapping; Ligand-based virtual screening; Multi-task learning; Neural networks; Universal maps

Mesh:

Substances:

Year:  2019        PMID: 30739238     DOI: 10.1007/s10822-019-00188-x

Source DB:  PubMed          Journal:  J Comput Aided Mol Des        ISSN: 0920-654X            Impact factor:   3.686


  23 in total

1.  Ligand-based combinatorial design of selective purinergic receptor (A2A) antagonists using self-organizing maps.

Authors:  Gisbert Schneider; Matthias Nettekoven
Journal:  J Comb Chem       Date:  2003 May-Jun

2.  Inductive transfer of knowledge: application of multi-task learning and feature net approaches to model tissue-air partition coefficients.

Authors:  Alexandre Varnek; Cédric Gaudin; Gilles Marcou; Igor Baskin; Anil Kumar Pandey; Igor V Tetko
Journal:  J Chem Inf Model       Date:  2009-01       Impact factor: 4.956

Review 3.  Machine-learning approaches in drug discovery: methods and applications.

Authors:  Antonio Lavecchia
Journal:  Drug Discov Today       Date:  2014-11-04       Impact factor: 7.851

4.  Chemical data visualization and analysis with incremental generative topographic mapping: big data challenge.

Authors:  Héléna A Gaspar; Igor I Baskin; Gilles Marcou; Dragos Horvath; Alexandre Varnek
Journal:  J Chem Inf Model       Date:  2014-12-19       Impact factor: 4.956

5.  GTM-Based QSAR Models and Their Applicability Domains.

Authors:  H A Gaspar; I I Baskin; G Marcou; D Horvath; A Varnek
Journal:  Mol Inform       Date:  2015-02-03       Impact factor: 3.353

6.  Generative Topographic Mapping (GTM): Universal Tool for Data Visualization, Structure-Activity Modeling and Dataset Comparison.

Authors:  N Kireeva; I I Baskin; H A Gaspar; D Horvath; G Marcou; A Varnek
Journal:  Mol Inform       Date:  2012-04-04       Impact factor: 3.353

7.  ISIDA Property-Labelled Fragment Descriptors.

Authors:  Fiorella Ruggiu; Gilles Marcou; Alexandre Varnek; Dragos Horvath
Journal:  Mol Inform       Date:  2010-12-09       Impact factor: 3.353

8.  Mapping of the Available Chemical Space versus the Chemical Universe of Lead-Like Compounds.

Authors:  Arkadii Lin; Dragos Horvath; Valentina Afonina; Gilles Marcou; Jean-Louis Reymond; Alexandre Varnek
Journal:  ChemMedChem       Date:  2018-01-29       Impact factor: 3.466

9.  Analysis of neighborhood behavior in lead optimization and array design.

Authors:  George Papadatos; Anthony W J Cooper; Visakan Kadirkamanathan; Simon J F Macdonald; Iain M McLay; Stephen D Pickett; John M Pritchard; Peter Willett; Valerie J Gillet
Journal:  J Chem Inf Model       Date:  2009-02       Impact factor: 4.956

10.  Self-organizing maps for identification of new inhibitors of P-glycoprotein.

Authors:  Dominik Kaiser; Lothar Terfloth; Stephan Kopp; Jan Schulz; Randolf de Laet; Peter Chiba; Gerhard F Ecker; Johann Gasteiger
Journal:  J Med Chem       Date:  2007-03-13       Impact factor: 7.446

View more
  3 in total

1.  Adapting the DeepSARM approach for dual-target ligand design.

Authors:  Atsushi Yoshimori; Huabin Hu; Jürgen Bajorath
Journal:  J Comput Aided Mol Des       Date:  2021-03-13       Impact factor: 3.686

2.  Parallel Generative Topographic Mapping: An Efficient Approach for Big Data Handling.

Authors:  Arkadii Lin; Igor I Baskin; Gilles Marcou; Dragos Horvath; Bernd Beck; Alexandre Varnek
Journal:  Mol Inform       Date:  2020-04-29       Impact factor: 3.353

3.  A Chemographic Audit of anti-Coronavirus Structure-activity Information from Public Databases (ChEMBL).

Authors:  Dragos Horvath; Alexey Orlov; Dmitry I Osolodkin; Aydar A Ishmukhametov; Gilles Marcou; Alexandre Varnek
Journal:  Mol Inform       Date:  2020-05-14       Impact factor: 4.050

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.