Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Do not hesitate to use Tversky-and other hints for successful active analogue searches with feature count descriptors.

Literature DB >> 23731338

Do not hesitate to use Tversky-and other hints for successful active analogue searches with feature count descriptors.

Dragos Horvath¹, Gilles Marcou, Alexandre Varnek.

Abstract

This study is an exhaustive analysis of the neighborhood behavior over a large coherent data set (ChEMBL target/ligand pairs of known Ki, for 165 targets with >50 associated ligands each). It focuses on similarity-based virtual screening (SVS) success defined by the ascertained optimality index. This is a weighted compromise between purity and retrieval rate of active hits in the neighborhood of an active query. One key issue addressed here is the impact of Tversky asymmetric weighing of query vs candidate features (represented as integer-value ISIDA colored fragment/pharmacophore triplet count descriptor vectors). The nearly a 3/4 million independent SVS runs showed that Tversky scores with a strong bias in favor of query-specific features are, by far, the most successful and the least failure-prone out of a set of nine other dissimilarity scores. These include classical Tanimoto, which failed to defend its privileged status in practical SVS applications. Tversky performance is not significantly conditioned by tuning of its bias parameter α. Both initial "guesses" of α = 0.9 and 0.7 were more successful than Tanimoto (at its turn, better than Euclid). Tversky was eventually tested in exhaustive similarity searching within the library of 1.6 M commercial + bioactive molecules at http://infochim.u-strasbg.fr/webserv/VSEngine.html , comparing favorably to Tanimoto in terms of "scaffold hopping" propensity. Therefore, it should be used at least as often as, perhaps in parallel to Tanimoto in SVS. Analysis with respect to query subclasses highlighted relationships of query complexity (simply expressed in terms of pharmacophore pattern counts) and/or target nature vs SVS success likelihood. SVS using more complex queries are more robust with respect to the choice of their operational premises (descriptors, metric). Yet, they are best handled by "pro-query" Tversky scores at α > 0.5. Among simpler queries, one may distinguish between "growable" (allowing for active analogs with additional features), and a few "conservative" queries not allowing any growth. These (typically bioactive amine transporter ligands) form the specific application domain of "pro-candidate" biased Tversky scores at α < 0.5.

Entities: Disease

Mesh：

Year: 2013 PMID： 23731338 DOI： 10.1021/ci400106g

Source DB: PubMed Journal: J Chem Inf Model ISSN： 1549-9596 Impact factor: 4.956

Keyword Cloud
Cited

10 in total

1. Mappability of drug-like space: towards a polypharmacologically competent map of drug-relevant compounds.

Authors: Pavel Sidorov; Helena Gaspar; Gilles Marcou; Alexandre Varnek; Dragos Horvath
Journal: J Comput Aided Mol Des Date: 2015-11-12 Impact factor: 3.686

2. QSAR modeling and chemical space analysis of antimalarial compounds.

Authors: Pavel Sidorov; Birgit Viira; Elisabeth Davioud-Charvet; Uko Maran; Gilles Marcou; Dragos Horvath; Alexandre Varnek
Journal: J Comput Aided Mol Des Date: 2017-04-03 Impact factor: 3.686

3. Shape similarity guided pose prediction: lessons from D3R Grand Challenge 3.

Authors: Ashutosh Kumar; Kam Y J Zhang
Journal: J Comput Aided Mol Des Date: 2018-08-06 Impact factor: 3.686

4. Maximum common substructure-based Tversky index: an asymmetric hybrid similarity measure.

Authors: Ryo Kunimoto; Martin Vogt; Jürgen Bajorath
Journal: J Comput Aided Mol Des Date: 2016-08-11 Impact factor: 3.686

5. ROCS-derived features for virtual screening.

Authors: Steven Kearnes; Vijay Pande
Journal: J Comput Aided Mol Des Date: 2016-09-08 Impact factor: 3.686

6. Assessment of tautomer distribution using the condensed reaction graph approach.

Authors: T R Gimadiev; T I Madzhidov; R I Nugmanov; I I Baskin; I S Antipin; A Varnek
Journal: J Comput Aided Mol Des Date: 2018-01-29 Impact factor: 3.686

7. Ultra-High-Throughput Structure-Based Virtual Screening for Small-Molecule Inhibitors of Protein-Protein Interactions.

Authors: David K Johnson; John Karanicolas
Journal: J Chem Inf Model Date: 2016-01-14 Impact factor: 4.956