Literature DB >> 16418315

Support vector machines in HTS data mining: Type I MetAPs inhibition study.

Jianwen Fang1, Yinghua Dong, Gerald H Lushington, Qi-Zhuang Ye, Gunda I Georg.   

Abstract

This article reports a successful application of support vector machines (SVMs) in mining high-throughput screening (HTS) data of a type I methionine aminopeptidases (MetAPs) inhibition study. A library with 43,736 small organic molecules was used in the study, and 1355 compounds in the library with 40% or higher inhibition activity were considered as active. The data set was randomly split into a training set and a test set (3:1 ratio). The authors were able to rank compounds in the test set using their decision values predicted by SVM models that were built on the training set. They defined a novel score PT50, the percentage of the test set needed to be screened to recover 50% of the actives, to measure the performance of the models. With carefully selected parameters, SVM models increased the hit rates significantly, and 50% of the active compounds could be recovered by screening just 7% of the test set. The authors found that the size of the training set played a significant role in the performance of the models. A training set with 10,000 member compounds is likely the minimum size required to build a model with reasonable predictive power.

Entities:  

Mesh:

Substances:

Year:  2006        PMID: 16418315     DOI: 10.1177/1087057105284334

Source DB:  PubMed          Journal:  J Biomol Screen        ISSN: 1087-0571


  3 in total

1.  kNNsim: k-nearest neighbors similarity with genetic algorithm features optimization enhances the prediction of activity classes for small molecules.

Authors:  Dariusz Plewczynski
Journal:  J Mol Model       Date:  2008-07-29       Impact factor: 1.810

2.  Artificial neural network--based analysis of high-throughput screening data for improved prediction of active compounds.

Authors:  Swapan Chakrabarti; Stan R Svojanovsky; Romana Slavik; Gunda I Georg; George S Wilson; Peter G Smith
Journal:  J Biomol Screen       Date:  2009-12

3.  Engineering proteinase K using machine learning and synthetic genes.

Authors:  Jun Liao; Manfred K Warmuth; Sridhar Govindarajan; Jon E Ness; Rebecca P Wang; Claes Gustafsson; Jeremy Minshull
Journal:  BMC Biotechnol       Date:  2007-03-26       Impact factor: 2.563

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.