Literature DB >> 23731338

Do not hesitate to use Tversky-and other hints for successful active analogue searches with feature count descriptors.

Dragos Horvath1, Gilles Marcou, Alexandre Varnek.   

Abstract

This study is an exhaustive analysis of the neighborhood behavior over a large coherent data set (ChEMBL target/ligand pairs of known Ki, for 165 targets with >50 associated ligands each). It focuses on similarity-based virtual screening (SVS) success defined by the ascertained optimality index. This is a weighted compromise between purity and retrieval rate of active hits in the neighborhood of an active query. One key issue addressed here is the impact of Tversky asymmetric weighing of query vs candidate features (represented as integer-value ISIDA colored fragment/pharmacophore triplet count descriptor vectors). The nearly a 3/4 million independent SVS runs showed that Tversky scores with a strong bias in favor of query-specific features are, by far, the most successful and the least failure-prone out of a set of nine other dissimilarity scores. These include classical Tanimoto, which failed to defend its privileged status in practical SVS applications. Tversky performance is not significantly conditioned by tuning of its bias parameter α. Both initial "guesses" of α = 0.9 and 0.7 were more successful than Tanimoto (at its turn, better than Euclid). Tversky was eventually tested in exhaustive similarity searching within the library of 1.6 M commercial + bioactive molecules at http://infochim.u-strasbg.fr/webserv/VSEngine.html , comparing favorably to Tanimoto in terms of "scaffold hopping" propensity. Therefore, it should be used at least as often as, perhaps in parallel to Tanimoto in SVS. Analysis with respect to query subclasses highlighted relationships of query complexity (simply expressed in terms of pharmacophore pattern counts) and/or target nature vs SVS success likelihood. SVS using more complex queries are more robust with respect to the choice of their operational premises (descriptors, metric). Yet, they are best handled by "pro-query" Tversky scores at α > 0.5. Among simpler queries, one may distinguish between "growable" (allowing for active analogs with additional features), and a few "conservative" queries not allowing any growth. These (typically bioactive amine transporter ligands) form the specific application domain of "pro-candidate" biased Tversky scores at α < 0.5.

Entities:  

Mesh:

Year:  2013        PMID: 23731338     DOI: 10.1021/ci400106g

Source DB:  PubMed          Journal:  J Chem Inf Model        ISSN: 1549-9596            Impact factor:   4.956


  10 in total

1.  Mappability of drug-like space: towards a polypharmacologically competent map of drug-relevant compounds.

Authors:  Pavel Sidorov; Helena Gaspar; Gilles Marcou; Alexandre Varnek; Dragos Horvath
Journal:  J Comput Aided Mol Des       Date:  2015-11-12       Impact factor: 3.686

2.  QSAR modeling and chemical space analysis of antimalarial compounds.

Authors:  Pavel Sidorov; Birgit Viira; Elisabeth Davioud-Charvet; Uko Maran; Gilles Marcou; Dragos Horvath; Alexandre Varnek
Journal:  J Comput Aided Mol Des       Date:  2017-04-03       Impact factor: 3.686

3.  Shape similarity guided pose prediction: lessons from D3R Grand Challenge 3.

Authors:  Ashutosh Kumar; Kam Y J Zhang
Journal:  J Comput Aided Mol Des       Date:  2018-08-06       Impact factor: 3.686

4.  Maximum common substructure-based Tversky index: an asymmetric hybrid similarity measure.

Authors:  Ryo Kunimoto; Martin Vogt; Jürgen Bajorath
Journal:  J Comput Aided Mol Des       Date:  2016-08-11       Impact factor: 3.686

5.  ROCS-derived features for virtual screening.

Authors:  Steven Kearnes; Vijay Pande
Journal:  J Comput Aided Mol Des       Date:  2016-09-08       Impact factor: 3.686

6.  Assessment of tautomer distribution using the condensed reaction graph approach.

Authors:  T R Gimadiev; T I Madzhidov; R I Nugmanov; I I Baskin; I S Antipin; A Varnek
Journal:  J Comput Aided Mol Des       Date:  2018-01-29       Impact factor: 3.686

7.  Ultra-High-Throughput Structure-Based Virtual Screening for Small-Molecule Inhibitors of Protein-Protein Interactions.

Authors:  David K Johnson; John Karanicolas
Journal:  J Chem Inf Model       Date:  2016-01-14       Impact factor: 4.956

8.  Computational chemogenomics: is it more than inductive transfer?

Authors:  J B Brown; Yasushi Okuno; Gilles Marcou; Alexandre Varnek; Dragos Horvath
Journal:  J Comput Aided Mol Des       Date:  2014-04-27       Impact factor: 3.686

9.  Analysis of drug-endogenous human metabolite similarities in terms of their maximum common substructures.

Authors:  Steve O'Hagan; Douglas B Kell
Journal:  J Cheminform       Date:  2017-03-09       Impact factor: 5.514

10.  Electrostatic-field and surface-shape similarity for virtual screening and pose prediction.

Authors:  Ann E Cleves; Stephen R Johnson; Ajay N Jain
Journal:  J Comput Aided Mol Des       Date:  2019-10-24       Impact factor: 3.686

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.