Literature DB >> 27490381

GTM-Based QSAR Models and Their Applicability Domains.

H A Gaspar1, I I Baskin1,2,3, G Marcou1, D Horvath1, A Varnek4,5.   

Abstract

In this paper we demonstrate that Generative Topographic Mapping (GTM), a machine learning method traditionally used for data visualisation, can be efficiently applied to QSAR modelling using probability distribution functions (PDF) computed in the latent 2-dimensional space. Several different scenarios of the activity assessment were considered: (i) the "activity landscape" approach based on direct use of PDF, (ii) QSAR models involving GTM-generated on descriptors derived from PDF, and, (iii) the k-Nearest Neighbours approach in 2D latent space. Benchmarking calculations were performed on five different datasets: stability constants of metal cations Ca(2+) , Gd(3+) and Lu(3+) complexes with organic ligands in water, aqueous solubility and activity of thrombin inhibitors. It has been shown that the performance of GTM-based regression models is similar to that obtained with some popular machine-learning methods (random forest, k-NN, M5P regression tree and PLS) and ISIDA fragment descriptors. By comparing GTM activity landscapes built both on predicted and experimental activities, we may visually assess the model's performance and identify the areas in the chemical space corresponding to reliable predictions. The applicability domain used in this work is based on data likelihood. Its application has significantly improved the model performances for 4 out of 5 datasets.
© 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

Entities:  

Keywords:  Activity landscape; Dimensionality reduction; GTM descriptors.; Generative topographic mapping; QSAR

Mesh:

Substances:

Year:  2015        PMID: 27490381     DOI: 10.1002/minf.201400153

Source DB:  PubMed          Journal:  Mol Inform        ISSN: 1868-1743            Impact factor:   3.353


  10 in total

1.  Mappability of drug-like space: towards a polypharmacologically competent map of drug-relevant compounds.

Authors:  Pavel Sidorov; Helena Gaspar; Gilles Marcou; Alexandre Varnek; Dragos Horvath
Journal:  J Comput Aided Mol Des       Date:  2015-11-12       Impact factor: 3.686

2.  Multi-task generative topographic mapping in virtual screening.

Authors:  Arkadii Lin; Dragos Horvath; Gilles Marcou; Bernd Beck; Alexandre Varnek
Journal:  J Comput Aided Mol Des       Date:  2019-02-09       Impact factor: 3.686

3.  Predictive cartography of metal binders using generative topographic mapping.

Authors:  Igor I Baskin; Vitaly P Solov'ev; Alexander A Bagatur'yants; Alexandre Varnek
Journal:  J Comput Aided Mol Des       Date:  2017-07-07       Impact factor: 3.686

4.  QSAR modeling and chemical space analysis of antimalarial compounds.

Authors:  Pavel Sidorov; Birgit Viira; Elisabeth Davioud-Charvet; Uko Maran; Gilles Marcou; Dragos Horvath; Alexandre Varnek
Journal:  J Comput Aided Mol Des       Date:  2017-04-03       Impact factor: 3.686

5.  From bird's eye views to molecular communities: two-layered visualization of structure-activity relationships in large compound data sets.

Authors:  Shilva Kayastha; Ryo Kunimoto; Dragos Horvath; Alexandre Varnek; Jürgen Bajorath
Journal:  J Comput Aided Mol Des       Date:  2017-10-06       Impact factor: 3.686

6.  Applied machine learning for predicting the lanthanide-ligand binding affinities.

Authors:  Suryanaman Chaube; Sriram Goverapet Srinivasan; Beena Rai
Journal:  Sci Rep       Date:  2020-08-31       Impact factor: 4.379

Review 7.  A Survey of Multi-task Learning Methods in Chemoinformatics.

Authors:  Sergey Sosnin; Mariia Vashurina; Michael Withnall; Pavel Karpov; Maxim Fedorov; Igor V Tetko
Journal:  Mol Inform       Date:  2018-11-28       Impact factor: 3.353

8.  Discovery of novel chemical reactions by deep generative recurrent neural network.

Authors:  William Bort; Igor I Baskin; Timur Gimadiev; Artem Mukanov; Ramil Nugmanov; Pavel Sidorov; Gilles Marcou; Dragos Horvath; Olga Klimchuk; Timur Madzhidov; Alexandre Varnek
Journal:  Sci Rep       Date:  2021-02-04       Impact factor: 4.379

9.  Drug enrichment and discovery from schizophrenia genome-wide association results: an analysis and visualisation approach.

Authors:  H A Gaspar; G Breen
Journal:  Sci Rep       Date:  2017-09-29       Impact factor: 4.379

10.  A Chemographic Audit of anti-Coronavirus Structure-activity Information from Public Databases (ChEMBL).

Authors:  Dragos Horvath; Alexey Orlov; Dmitry I Osolodkin; Aydar A Ishmukhametov; Gilles Marcou; Alexandre Varnek
Journal:  Mol Inform       Date:  2020-05-14       Impact factor: 4.050

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.