Literature DB >> 26434574

Consistency of QSAR models: Correct split of training and test sets, ranking of models and performance parameters.

A Rácz1,2, D Bajusz3, K Héberger1.   

Abstract

Recent implementations of QSAR modelling software provide the user with numerous models and a wealth of information. In this work, we provide some guidance on how one should interpret the results of QSAR modelling, compare and assess the resulting models, and select the best and most consistent ones. Two QSAR datasets are applied as case studies for the comparison of model performance parameters and model selection methods. We demonstrate the capabilities of sum of ranking differences (SRD) in model selection and ranking, and identify the best performance indicators and models. While the exchange of the original training and (external) test sets does not affect the ranking of performance parameters, it provides improved models in certain cases (despite the lower number of molecules in the training set). Performance parameters for external validation are substantially separated from the other merits in SRD analyses, highlighting their value in data fusion.

Entities:  

Keywords:  cross-validation; model selection; performance parameters; ranking; sum of ranking differences

Mesh:

Substances:

Year:  2015        PMID: 26434574     DOI: 10.1080/1062936X.2015.1084647

Source DB:  PubMed          Journal:  SAR QSAR Environ Res        ISSN: 1026-776X            Impact factor:   3.000


  7 in total

1.  A combined Fisher and Laplacian score for feature selection in QSAR based drug design using compounds with known and unknown activities.

Authors:  Mohammad Amin Valizade Hasanloei; Razieh Sheikhpour; Mehdi Agha Sarram; Elnaz Sheikhpour; Hamdollah Sharifi
Journal:  J Comput Aided Mol Des       Date:  2017-12-26       Impact factor: 3.686

2.  Chemical similarity of molecules with physiological response.

Authors:  Izudin Redžepović; Boris Furtula
Journal:  Mol Divers       Date:  2022-08-17       Impact factor: 3.364

3.  Comparison of Descriptor- and Fingerprint Sets in Machine Learning Models for ADME-Tox Targets.

Authors:  Álmos Orosz; Károly Héberger; Anita Rácz
Journal:  Front Chem       Date:  2022-06-08       Impact factor: 5.545

4.  Intercorrelation Limits in Molecular Descriptor Preselection for QSAR/QSPR.

Authors:  Anita Rácz; Dávid Bajusz; Károly Héberger
Journal:  Mol Inform       Date:  2019-04-04       Impact factor: 3.353

5.  3D-QSAR Studies of 1,2,4-Oxadiazole Derivatives as Sortase A Inhibitors.

Authors:  Neda Shakour; Farzin Hadizadeh; Prashant Kesharwani; Amirhossein Sahebkar
Journal:  Biomed Res Int       Date:  2021-12-06       Impact factor: 3.411

6.  Factor analysis, sparse PCA, and Sum of Ranking Differences-based improvements of the Promethee-GAIA multicriteria decision support technique.

Authors:  János Abonyi; Tímea Czvetkó; Zsolt T Kosztyán; Károly Héberger
Journal:  PLoS One       Date:  2022-02-25       Impact factor: 3.240

7.  Comparison of various methods for validity evaluation of QSAR models.

Authors:  Shadi Shayanfar; Ali Shayanfar
Journal:  BMC Chem       Date:  2022-08-23
  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.