Literature DB >> 16563012

Benchmarking of linear and nonlinear approaches for quantitative structure-property relationship studies of metal complexation with ionophores.

Igor V Tetko1, Vitaly P Solov'ev, Alexey V Antonov, Xiaojun Yao, Jean Pierre Doucet, Botao Fan, Frank Hoonakker, Denis Fourches, Piere Jost, Nicolas Lachiche, Alexandre Varnek.   

Abstract

A benchmark of several popular methods, Associative Neural Networks (ANN), Support Vector Machines (SVM), k Nearest Neighbors (kNN), Maximal Margin Linear Programming (MMLP), Radial Basis Function Neural Network (RBFNN), and Multiple Linear Regression (MLR), is reported for quantitative-structure property relationships (QSPR) of stability constants logK1 for the 1:1 (M:L) and logbeta2 for 1:2 complexes of metal cations Ag+ and Eu3+ with diverse sets of organic molecules in water at 298 K and ionic strength 0.1 M. The methods were tested on three types of descriptors: molecular descriptors including E-state values, counts of atoms determined for E-state atom types, and substructural molecular fragments (SMF). Comparison of the models was performed using a 5-fold external cross-validation procedure. Robust statistical tests (bootstrap and Kolmogorov-Smirnov statistics) were employed to evaluate the significance of calculated models. The Wilcoxon signed-rank test was used to compare the performance of methods. Individual structure-complexation property models obtained with nonlinear methods demonstrated a significantly better performance than the models built using multilinear regression analysis (MLRA). However, the averaging of several MLRA models based on SMF descriptors provided as good of a prediction as the most efficient nonlinear techniques. Support Vector Machines and Associative Neural Networks contributed in the largest number of significant models. Models based on fragments (SMF descriptors and E-state counts) had higher prediction ability than those based on E-state indices. The use of SMF descriptors and E-state counts provided similar results, whereas E-state indices lead to less significant models. The current study illustrates the difficulties of quantitative comparison of different methods: conclusions based only on one data set without appropriate statistical tests could be wrong.

Entities:  

Mesh:

Substances:

Year:  2006        PMID: 16563012     DOI: 10.1021/ci0504216

Source DB:  PubMed          Journal:  J Chem Inf Model        ISSN: 1549-9596            Impact factor:   4.956


  9 in total

1.  Predictive cartography of metal binders using generative topographic mapping.

Authors:  Igor I Baskin; Vitaly P Solov'ev; Alexander A Bagatur'yants; Alexandre Varnek
Journal:  J Comput Aided Mol Des       Date:  2017-07-07       Impact factor: 3.686

Review 2.  The role of machine learning method in the synthesis and biological ınvestigation of heterocyclic compounds.

Authors:  Arif Mermer
Journal:  Mol Divers       Date:  2021-10-20       Impact factor: 2.943

3.  Applied machine learning for predicting the lanthanide-ligand binding affinities.

Authors:  Suryanaman Chaube; Sriram Goverapet Srinivasan; Beena Rai
Journal:  Sci Rep       Date:  2020-08-31       Impact factor: 4.379

Review 4.  Current mathematical methods used in QSAR/QSPR studies.

Authors:  Peixun Liu; Wei Long
Journal:  Int J Mol Sci       Date:  2009-04-29       Impact factor: 6.208

5.  QSPR ensemble modelling of the 1:1 and 1:2 complexation of Co²⁺, Ni²⁺, and Cu²⁺ with organic ligands: relationships between stability constants.

Authors:  Vitaly Solov'ev; Alexandre Varnek; Aslan Tsivadze
Journal:  J Comput Aided Mol Des       Date:  2014-04-16       Impact factor: 3.686

6.  Online chemical modeling environment (OCHEM): web platform for data storage, model development and publishing of chemical information.

Authors:  Iurii Sushko; Sergii Novotarskyi; Robert Körner; Anil Kumar Pandey; Matthias Rupp; Wolfram Teetz; Stefan Brandmaier; Ahmed Abdelaziz; Volodymyr V Prokopenko; Vsevolod Y Tanchuk; Roberto Todeschini; Alexandre Varnek; Gilles Marcou; Peter Ertl; Vladimir Potemkin; Maria Grishina; Johann Gasteiger; Christof Schwab; Igor I Baskin; Vladimir A Palyulin; Eugene V Radchenko; William J Welsh; Vladyslav Kholodovych; Dmitriy Chekmarev; Artem Cherkasov; Joao Aires-de-Sousa; Qing-You Zhang; Andreas Bender; Florian Nigsch; Luc Patiny; Antony Williams; Valery Tkachenko; Igor V Tetko
Journal:  J Comput Aided Mol Des       Date:  2011-06-10       Impact factor: 3.686

7.  Modeling the Biodegradability of Chemical Compounds Using the Online CHEmical Modeling Environment (OCHEM).

Authors:  Susann Vorberg; Igor V Tetko
Journal:  Mol Inform       Date:  2013-11-28       Impact factor: 3.353

8.  Machine learning-based analysis of overall stability constants of metal-ligand complexes.

Authors:  Kaito Kanahashi; Makoto Urushihara; Kenji Yamaguchi
Journal:  Sci Rep       Date:  2022-07-25       Impact factor: 4.996

9.  The development of models to predict melting and pyrolysis point data associated with several hundred thousand compounds mined from PATENTS.

Authors:  Igor V Tetko; Daniel M Lowe; Antony J Williams
Journal:  J Cheminform       Date:  2016-01-22       Impact factor: 5.514

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.