Literature DB >> 22856471

GA(M)E-QSAR: a novel, fully automatic genetic-algorithm-(meta)-ensembles approach for binary classification in ligand-based drug design.

Yunierkis Pérez-Castillo1, Cosmin Lazar, Jonatan Taminau, Mathy Froeyen, Miguel Ángel Cabrera-Pérez, Ann Nowé.   

Abstract

Computer-aided drug design has become an important component of the drug discovery process. Despite the advances in this field, there is not a unique modeling approach that can be successfully applied to solve the whole range of problems faced during QSAR modeling. Feature selection and ensemble modeling are active areas of research in ligand-based drug design. Here we introduce the GA(M)E-QSAR algorithm that combines the search and optimization capabilities of Genetic Algorithms with the simplicity of the Adaboost ensemble-based classification algorithm to solve binary classification problems. We also explore the usefulness of Meta-Ensembles trained with Adaboost and Voting schemes to further improve the accuracy, generalization, and robustness of the optimal Adaboost Single Ensemble derived from the Genetic Algorithm optimization. We evaluated the performance of our algorithm using five data sets from the literature and found that it is capable of yielding similar or better classification results to what has been reported for these data sets with a higher enrichment of active compounds relative to the whole actives subset when only the most active chemicals are considered. More important, we compared our methodology with state of the art feature selection and classification approaches and found that it can provide highly accurate, robust, and generalizable models. In the case of the Adaboost Ensembles derived from the Genetic Algorithm search, the final models are quite simple since they consist of a weighted sum of the output of single feature classifiers. Furthermore, the Adaboost scores can be used as ranking criterion to prioritize chemicals for synthesis and biological evaluation after virtual screening experiments.

Mesh:

Substances:

Year:  2012        PMID: 22856471     DOI: 10.1021/ci300146h

Source DB:  PubMed          Journal:  J Chem Inf Model        ISSN: 1549-9596            Impact factor:   4.956


  4 in total

1.  Toward the computer-aided discovery of FabH inhibitors. Do predictive QSAR models ensure high quality virtual screening performance?

Authors:  Yunierkis Pérez-Castillo; Maykel Cruz-Monteagudo; Cosmin Lazar; Jonatan Taminau; Mathy Froeyen; Miguel Angel Cabrera-Pérez; Ann Nowé
Journal:  Mol Divers       Date:  2014-03-27       Impact factor: 2.943

Review 2.  Systemic QSAR and phenotypic virtual screening: chasing butterflies in drug discovery.

Authors:  Maykel Cruz-Monteagudo; Stephan Schürer; Eduardo Tejera; Yunierkis Pérez-Castillo; José L Medina-Franco; Aminael Sánchez-Rodríguez; Fernanda Borges
Journal:  Drug Discov Today       Date:  2017-03-06       Impact factor: 7.851

3.  A desirability-based multi objective approach for the virtual screening discovery of broad-spectrum anti-gastric cancer agents.

Authors:  Yunierkis Perez-Castillo; Aminael Sánchez-Rodríguez; Eduardo Tejera; Maykel Cruz-Monteagudo; Fernanda Borges; M Natália D S Cordeiro; Huong Le-Thi-Thu; Hai Pham-The
Journal:  PLoS One       Date:  2018-02-08       Impact factor: 3.240

4.  Improvement of Prediction Performance With Conjoint Molecular Fingerprint in Deep Learning.

Authors:  Liangxu Xie; Lei Xu; Ren Kong; Shan Chang; Xiaojun Xu
Journal:  Front Pharmacol       Date:  2020-12-18       Impact factor: 5.810

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.