Literature DB >> 29135323

Impact assessment of the rational selection of training and test sets on the predictive ability of QSAR models.

M F Andrada1, E G Vega-Hissi2, M R Estrada1, J C Garro Martinez2.   

Abstract

This study performed an analysis of the influence of the training and test set rational selection on the quality and predictively of the quantitative structure-activity relationship (QSAR) model. The study was carried out on three different datasets of Influenza Neuraminidase (H1N1) inhibitors. The three datasets were divided into training and test sets using three rational selection methods: based on k-means, Kennard-Stone algorithm and Activity and the results were compared with Random selection. Then, a total of 31,490 mathematical models were developed and those models that presented a determination coefficient higher than: r2train > 0.8, r2loo > 0.7, r2test > 0.5 and minimum standard deviation (SD) and minimum root-mean square error (RMS) were selected. The selected models were validated using the internal leave-one-out method and the predictive capacity was evaluated by the external test set. The results indicate that random selection could lead to erroneous results. In return, a rational selection allows for obtaining more reliable conclusions. The QSAR models with major predictive power were found using the k-means algorithm and selection by activity.

Entities:  

Keywords:  Kennard–Stone; QSAR; based on activity; k-means; random selection; rational partition of dataset

Mesh:

Substances:

Year:  2017        PMID: 29135323     DOI: 10.1080/1062936X.2017.1397056

Source DB:  PubMed          Journal:  SAR QSAR Environ Res        ISSN: 1026-776X            Impact factor:   3.000


  4 in total

1.  Molecular insights on ABL kinase activation using tree-based machine learning models and molecular docking.

Authors:  Philipe Oliveira Fernandes; Diego Magno Martins; Aline de Souza Bozzi; João Paulo A Martins; Adolfo Henrique de Moraes; Vinícius Gonçalves Maltarollo
Journal:  Mol Divers       Date:  2021-06-30       Impact factor: 3.364

2.  Assessing the calibration in toxicological in vitro models with conformal prediction.

Authors:  Ola Spjuth; Andrea Volkamer; Andrea Morger; Fredrik Svensson; Staffan Arvidsson McShane; Niharika Gauraha; Ulf Norinder
Journal:  J Cheminform       Date:  2021-04-29       Impact factor: 5.514

3.  Polysaccharide determination and habitat classification for fresh Dendrobiums with hyperspectral imagery and modified RBFNN.

Authors:  Yuzhen Wei; Wenjun Hu; Feiyue Wu; Yi He
Journal:  RSC Adv       Date:  2022-01-05       Impact factor: 3.361

4.  A Comprehensive QSAR Study on Antileishmanial and Antitrypanosomal Cinnamate Ester Analogues.

Authors:  Freddy A Bernal; Thomas J Schmidt
Journal:  Molecules       Date:  2019-11-28       Impact factor: 4.411

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.