| Literature DB >> 25481478 |
Jie Xia1, Ermias Lemma Tilahun2, Terry-Elinor Reid2, Liangren Zhang3, Xiang Simon Wang4.
Abstract
Retrospective small-scale virtual screening (VS) based on benchmarking data sets has been widely used to estimate ligand enrichments of VS approaches in the prospective (i.e. real-world) efforts. However, the intrinsic differences of benchmarking sets to the real screening chemical libraries can cause biased assessment. Herein, we summarize the history of benchmarking methods as well as data sets and highlight three main types of biases found in benchmarking sets, i.e. "analogue bias", "artificial enrichment" and "false negative". In addition, we introduce our recent algorithm to build maximum-unbiased benchmarking sets applicable to both ligand-based and structure-based VS approaches, and its implementations to three important human histone deacetylases (HDACs) isoforms, i.e. HDAC1, HDAC6 and HDAC8. The leave-one-out cross-validation (LOO CV) demonstrates that the benchmarking sets built by our algorithm are maximum-unbiased as measured by property matching, ROC curves and AUCs.Entities:
Keywords: Analogue bias; Artificial enrichment; Benchmarking methodology; Decoy sets; Ligand-based virtual screening; Structure-based virtual screening
Mesh:
Substances:
Year: 2014 PMID: 25481478 PMCID: PMC4278665 DOI: 10.1016/j.ymeth.2014.11.015
Source DB: PubMed Journal: Methods ISSN: 1046-2023 Impact factor: 3.608