| Literature DB >> 19940083 |
Swapan Chakrabarti1, Stan R Svojanovsky, Romana Slavik, Gunda I Georg, George S Wilson, Peter G Smith.
Abstract
Artificial neural networks (ANNs) are trained using high-throughput screening (HTS) data to recover active compounds from a large data set. Improved classification performance was obtained on combining predictions made by multiple ANNs. The HTS data, acquired from a methionine aminopeptidases inhibition study, consisted of a library of 43,347 compounds, and the ratio of active to nonactive compounds, R(A/N), was 0.0321. Back-propagation ANNs were trained and validated using principal components derived from the physicochemical features of the compounds. On selecting the training parameters carefully, an ANN recovers one-third of all active compounds from the validation set with a 3-fold gain in R(A/N) value. Further gains in R(A/N) values were obtained upon combining the predictions made by a number of ANNs. The generalization property of the back-propagation ANNs was used to train those ANNs with the same training samples, after being initialized with different sets of random weights. As a result, only 10% of all available compounds were needed for training and validation, and the rest of the data set was screened with more than a 10-fold gain of the original R(A/N) value. Thus, ANNs trained with limited HTS data might become useful in recovering active compounds from large data sets.Entities:
Mesh:
Year: 2009 PMID: 19940083 PMCID: PMC4080791 DOI: 10.1177/1087057109351312
Source DB: PubMed Journal: J Biomol Screen ISSN: 1087-0571