Literature DB >> 16562975

Virtual screening using binary kernel discrimination: effect of noisy training data and the optimization of performance.

Beining Chen1, Robert F Harrison, Kitsuchart Pasupa, Peter Willett, David J Wilton, David J Wood, Xiao Qing Lewell.   

Abstract

Binary kernel discrimination (BKD) uses a training set of compounds, for which structural and qualitative activity data are available, to produce a model that can then be applied to the structures of other compounds in order to predict their likely activity. Experiments with the MDL Drug Data Report database show that the optimal value of the smoothing parameter, and hence the predictive power of BKD, is crucially dependent on the number of false positives in the training set. It is also shown that the best results for BKD are achieved using one particular optimization method for the determination of the smoothing parameter that lies at the heart of the method and using the Jaccard/Tanimoto coefficient in the kernel function that is used to compute the similarity between a test set molecule and the members of the training set.

Mesh:

Year:  2006        PMID: 16562975     DOI: 10.1021/ci0505426

Source DB:  PubMed          Journal:  J Chem Inf Model        ISSN: 1549-9596            Impact factor:   4.956


  4 in total

Review 1.  Evaluation of machine-learning methods for ligand-based virtual screening.

Authors:  Beining Chen; Robert F Harrison; George Papadatos; Peter Willett; David J Wood; Xiao Qing Lewell; Paulette Greenidge; Nikolaus Stiefl
Journal:  J Comput Aided Mol Des       Date:  2007-01-05       Impact factor: 3.686

2.  Large scale study of multiple-molecule queries.

Authors:  Ramzi J Nasr; S Joshua Swamidass; Pierre F Baldi
Journal:  J Cheminform       Date:  2009-06-04       Impact factor: 5.514

3.  Virtual screening by a new Clustering-based Weighted Similarity Extreme Learning Machine approach.

Authors:  Kitsuchart Pasupa; Wasu Kudisthalert
Journal:  PLoS One       Date:  2018-04-13       Impact factor: 3.240

4.  Effect of missing data on multitask prediction methods.

Authors:  Antonio de la Vega de León; Beining Chen; Valerie J Gillet
Journal:  J Cheminform       Date:  2018-05-22       Impact factor: 5.514

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.