Literature DB >> 27463195

Application of Random Forest and Multiple Linear Regression Techniques to QSPR Prediction of an Aqueous Solubility for Military Compounds.

Nikolay A Kovdienko1, Pavel G Polishchuk2, Eugene N Muratov2,3, Anatoly G Artemenko2, Victor E Kuz'min2, Leonid Gorb4, Frances Hill5, Jerzy Leszczynski1,5.   

Abstract

The relationship between the aqueous solubility of more than two thousand eight hundred organic compounds and their structures was investigated using a QSPR approach based on Simplex Representation of Molecular Structure (SiRMS). The dataset consists of 2537 diverse organic compounds. Multiple Linear Regression (MLR) and Random Forest (RF) methods were used for statistical modeling at the 2D level of representation of molecular structure. Statistical characteristics of the best models are quite good (MLR method: R(2) =0.85, Q(2) =0.83; RF method: R(2) =0.99, R(2) oob =0.88). The external validation set of 301 compounds (including 47 nitro-, nitroso- and nitrogen-rich compounds of military interest) which were not included in the training set and modeling process, was used for evaluation of the models predictivity. Thus, well-fitted and robust (R(2) test (MLR)=0.76 and R(2) test (RF)=0.82) models were obtained for both statistical techniques using descriptors based on the topological structural information only. The predicted solubility values for military compounds are in good agreement with experimental ones. Developed QSPR models represent powerful and easy-to-use virtual screening tool that can be recommended for prediction of aqueous solubility.
Copyright © 2010 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

Entities:  

Keywords:  Environmental chemistry; Structure-property relationships

Year:  2010        PMID: 27463195     DOI: 10.1002/minf.201000001

Source DB:  PubMed          Journal:  Mol Inform        ISSN: 1868-1743            Impact factor:   3.353


  8 in total

Review 1.  QSAR without borders.

Authors:  Eugene N Muratov; Jürgen Bajorath; Robert P Sheridan; Igor V Tetko; Dmitry Filimonov; Vladimir Poroikov; Tudor I Oprea; Igor I Baskin; Alexandre Varnek; Adrian Roitberg; Olexandr Isayev; Stefano Curtarolo; Denis Fourches; Yoram Cohen; Alan Aspuru-Guzik; David A Winkler; Dimitris Agrafiotis; Artem Cherkasov; Alexander Tropsha
Journal:  Chem Soc Rev       Date:  2020-05-01       Impact factor: 54.564

2.  Computer-Assisted Decision Support for Student Admissions Based on Their Predicted Academic Performance.

Authors:  Eugene Muratov; Margaret Lewis; Denis Fourches; Alexander Tropsha; Wendy C Cox
Journal:  Am J Pharm Educ       Date:  2017-04       Impact factor: 2.047

3.  Maternal-fetal transfer rates of PCBs, OCPs, PBDEs, and dioxin-like compounds predicted through quantitative structure-activity relationship modeling.

Authors:  Akifumi Eguchi; Masamichi Hanazato; Norimichi Suzuki; Yoshiharu Matsuno; Emiko Todaka; Chisato Mori
Journal:  Environ Sci Pollut Res Int       Date:  2015-09-23       Impact factor: 4.223

4.  Computer-Assisted Discovery of Alkaloids with Schistosomicidal Activity.

Authors:  Renata Priscila Barros de Menezes; Jéssika de Oliveira Viana; Eugene Muratov; Luciana Scotti; Marcus Tullius Scotti
Journal:  Curr Issues Mol Biol       Date:  2022-01-15       Impact factor: 2.976

5.  Summary of 17 chemicals evaluated by OECD TG229 using Japanese Medaka, Oryzias latipes in EXTEND 2016.

Authors:  Yukio Kawashima; Yuta Onishi; Norihisa Tatarazako; Hirotaka Yamamoto; Masaaki Koshio; Tomohiro Oka; Yoshifumi Horie; Haruna Watanabe; Takashi Nakamoto; Jun Yamamoto; Hidenori Ishikawa; Tomomi Sato; Kunihiko Yamazaki; Taisen Iguchi
Journal:  J Appl Toxicol       Date:  2021-11-02       Impact factor: 3.628

6.  Three machine learning models for the 2019 Solubility Challenge.

Authors:  John B O Mitchell
Journal:  ADMET DMPK       Date:  2020-06-15

7.  Machine Learning Analysis of Essential Oils from Cuban Plants: Potential Activity against Protozoa Parasites.

Authors:  Renata Priscila Barros de Menezes; Luciana Scotti; Marcus Tullius Scotti; Jesús García; Rosalia González; Lianet Monzote; William N Setzer
Journal:  Molecules       Date:  2022-02-17       Impact factor: 4.411

8.  Amino substituted nitrogen heterocycle ureas as kinase insert domain containing receptor (KDR) inhibitors: Performance of structure-activity relationship approaches.

Authors:  Hayriye Yilmaz; Natalia Sizochenko; Bakhtiyor Rasulev; Andrey Toropov; Yahya Guzel; Viktor Kuz'min; Danuta Leszczynska; Jerzy Leszczynski
Journal:  J Food Drug Anal       Date:  2015-04-01       Impact factor: 6.157

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.