Literature DB >> 18533644

Evaluation of virtual screening performance of support vector machines trained by sparsely distributed active compounds.

X H Ma1, R Wang, S Y Yang, Z R Li, Y Xue, Y C Wei, B C Low, Y Z Chen.   

Abstract

Virtual screening performance of support vector machines (SVM) depends on the diversity of training active and inactive compounds. While diverse inactive compounds can be routinely generated, the number and diversity of known actives are typically low. We evaluated the performance of SVM trained by sparsely distributed actives in six MDDR biological target classes composed of a high number of known actives (983-1645) of high, intermediate, and low structural diversity (muscarinic M1 receptor agonists, NMDA receptor antagonists, thrombin inhibitors, HIV protease inhibitors, cephalosporins, and renin inhibitors). SVM trained by regularly sparse data sets of 100 actives show improved yields at substantially reduced false-hit rates compared to those of published studies and those of Tanimoto-based similarity searching method based on the same data sets and molecular descriptors. SVM trained by very sparse data sets of 40 actives (2.4%-4.1% of the known actives) predicted 17.5-39.5%, 23.0-48.1%, and 70.2-92.4% of the remaining 943-1605 actives in the high, intermediate, and low diversity classes, respectively, 13.8-68.7% of which are outside the training compound families. SVM predicted 99.97% and 97.1% of the 9.997 M PUBCHEM and 167K remaining MDDR compounds as inactive and 2.6%-8.3% of the 19,495-38,483 MDDR compounds similar to the known actives as active. These suggest that SVM has substantial capability in identifying novel active compounds from sparse active data sets at low false-hit rates.

Entities:  

Mesh:

Substances:

Year:  2008        PMID: 18533644     DOI: 10.1021/ci800022e

Source DB:  PubMed          Journal:  J Chem Inf Model        ISSN: 1549-9596            Impact factor:   4.956


  11 in total

1.  Discovery of Influenza A virus neuraminidase inhibitors using support vector machine and Naïve Bayesian models.

Authors:  Wenwen Lian; Jiansong Fang; Chao Li; Xiaocong Pang; Ai-Lin Liu; Guan-Hua Du
Journal:  Mol Divers       Date:  2015-12-21       Impact factor: 2.943

2.  Update of TTD: Therapeutic Target Database.

Authors:  Feng Zhu; BuCong Han; Pankaj Kumar; XiangHui Liu; XiaoHua Ma; Xiaona Wei; Lu Huang; YangFan Guo; LianYi Han; ChanJuan Zheng; YuZong Chen
Journal:  Nucleic Acids Res       Date:  2009-11-20       Impact factor: 16.971

3.  Consensus model for identification of novel PI3K inhibitors in large chemical library.

Authors:  Chin Yee Liew; Xiao Hua Ma; Chun Wei Yap
Journal:  J Comput Aided Mol Des       Date:  2010-02-11       Impact factor: 3.686

Review 4.  In-silico approaches to multi-target drug discovery : computer aided multi-target drug design, multi-target virtual screening.

Authors:  Xiao Hua Ma; Zhe Shi; Chunyan Tan; Yuyang Jiang; Mei Lin Go; Boon Chuan Low; Yu Zong Chen
Journal:  Pharm Res       Date:  2010-03-11       Impact factor: 4.200

5.  Exploiting PubChem for Virtual Screening.

Authors:  Xiang-Qun Xie
Journal:  Expert Opin Drug Discov       Date:  2010-12       Impact factor: 6.098

6.  Introduction to the BioChemical Library (BCL): An Application-Based Open-Source Toolkit for Integrated Cheminformatics and Machine Learning in Computer-Aided Drug Discovery.

Authors:  Benjamin P Brown; Oanh Vu; Alexander R Geanes; Sandeepkumar Kothiwale; Mariusz Butkiewicz; Edward W Lowe; Ralf Mueller; Richard Pape; Jeffrey Mendenhall; Jens Meiler
Journal:  Front Pharmacol       Date:  2022-02-21       Impact factor: 5.810

7.  Evolution of Support Vector Machine and Regression Modeling in Chemoinformatics and Drug Discovery.

Authors:  Raquel Rodríguez-Pérez; Jürgen Bajorath
Journal:  J Comput Aided Mol Des       Date:  2022-03-19       Impact factor: 4.179

8.  Fast rule-based bioactivity prediction using associative classification mining.

Authors:  Pulan Yu; David J Wild
Journal:  J Cheminform       Date:  2012-11-23       Impact factor: 5.514

9.  Development and experimental test of support vector machines virtual screening method for searching Src inhibitors from large compound libraries.

Authors:  Bucong Han; Xiaohua Ma; Ruiying Zhao; Jingxian Zhang; Xiaona Wei; Xianghui Liu; Xin Liu; Cunlong Zhang; Chunyan Tan; Yuyang Jiang; Yuzong Chen
Journal:  Chem Cent J       Date:  2012-11-23       Impact factor: 4.215

10.  The influence of negative training set size on machine learning-based virtual screening.

Authors:  Rafał Kurczab; Sabina Smusz; Andrzej J Bojarski
Journal:  J Cheminform       Date:  2014-06-11       Impact factor: 5.514

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.