| Literature DB >> 30688405 |
Yanmin Zhang1, Yuchen Wang1, Weineng Zhou1, Yuanrong Fan1, Junnan Zhao1, Lu Zhu1, Shuai Lu1, Tao Lu1,2, Yadong Chen1, Haichun Liu1.
Abstract
Data mining methods based on machine learning play an increasingly important role in drug design and discovery. In the current work, eight machine learning methods including decision trees, k-Nearest neighbor, support vector machines, random forests, extremely randomized trees, AdaBoost, gradient boosting trees, and XGBoost were evaluated comprehensively through a case study of ACC inhibitor data sets. Internal and external data sets were employed for cross-validation of the eight machine learning methods. Results showed that the extremely randomized trees model performed best and was adopted as the first step of virtual screening. Together with structure-based virtual screening in the second step, this combined strategy obtained desirable results. This work indicates that the combination of machine learning methods with traditional structure-based virtual screening can effectively strengthen the ability in finding potential hits from large compound database for a given target.Keywords: ACC inhibitors; extremely randomized trees; machine learning; molecular docking
Mesh:
Year: 2019 PMID: 30688405 DOI: 10.1111/cbdd.13494
Source DB: PubMed Journal: Chem Biol Drug Des ISSN: 1747-0277 Impact factor: 2.817