Literature DB >> 19789655

Novel Algorithms for the Identification of Biologically Informative Chemical Diversity Metrics.

Bhargav Theertham1, Jenna L Wang, Jianwen Fang, Gerald H Lushington.   

Abstract

Despite great advances in the efficiency of analytical and synthetic chemistry, time and available starting material still limit the number of unique compounds that can be practically synthesized and evaluated as prospective therapeutics. Chemical diversity analysis (the capacity to identify finite diverse subsets that reliably represent greater manifolds of drug-like chemicals) thus remains an important resource in drug discovery. Despite an unproven track record, chemical diversity has also been used to posit, from preliminary screen hits, new compounds with similar or better activity. Identifying diversity metrics that demonstrably encode bioactivity trends is thus of substantial potential value for intelligent assembly of targeted screens. This paper reports novel algorithms designed to simultaneously reflect chemical similarity or diversity trends and apparent bioactivity in compound collections. An extensive set of descriptors are evaluated within large NCI screening data sets according to bioactivity differentiation capacities, quantified as the ability to co-localize known active species into bioactive-rich K-means clusters. One method tested for descriptor selection orders features according to relative variance across a set of training compounds, and samples increasingly finer subset meshes for descriptors whose exclusion from the model induces drastic drops in relative bioactive colocalization. This yields metrics with reasonable bioactive enrichment (greater than 50% of all bioactive compounds collected into clusters or cells with significantly enriched active/inactive rates) for each of the four data sets examined herein. A second method replaces variance by an active/inactive divergence score, achieving comparable enrichment via a much more efficient search process. Combinations of the above metrics are tested in 2D rectilinear diversity models, achieving similarly successful colocalization statistics, with metrics derived from the active/inactive divergence score typically outperforming those selected from the variance criterion and computed from the DiverseSolutions software.

Year:  2008        PMID: 19789655      PMCID: PMC2753527          DOI: 10.2174/157340908783769292

Source DB:  PubMed          Journal:  Curr Comput Aided Drug Des        ISSN: 1573-4099            Impact factor:   1.606


  18 in total

1.  VolSurf: a new tool for the pharmacokinetic optimization of lead compounds.

Authors:  G Cruciani; M Pastor; W Guba
Journal:  Eur J Pharm Sci       Date:  2000-10       Impact factor: 4.384

2.  Use of alignment-free molecular descriptors in diversity analysis and optimal sampling of molecular libraries.

Authors:  Fabien Fontaine; Manuel Pastor; Hugo Gutiérrez-de-Terán; Juan J Lozano; Ferran Sanz
Journal:  Mol Divers       Date:  2003       Impact factor: 2.943

3.  Do structurally similar molecules have similar biological activity?

Authors:  Yvonne C Martin; James L Kofron; Linda M Traphagen
Journal:  J Med Chem       Date:  2002-09-12       Impact factor: 7.446

4.  Feature selection for descriptor based classification models. 1. Theory and GA-SEC algorithm.

Authors:  Jörg K Wegner; Holger Fröhlich; Andreas Zell
Journal:  J Chem Inf Comput Sci       Date:  2004 May-Jun

Review 5.  There is no such thing as 'diversity'!

Authors:  Hans-Jörg Roth
Journal:  Curr Opin Chem Biol       Date:  2005-06       Impact factor: 8.822

6.  A very large diversity space of synthetically accessible compounds for use with drug design programs.

Authors:  Sergey Nikitin; Natalia Zaitseva; Olga Demina; Vera Solovieva; Evgeny Mazin; Sergey Mikhalev; Maxim Smolov; Anatoly Rubinov; Peter Vlasov; Dmitry Lepikhin; Denis Khachko; Valery Fokin; Cary Queen; Viktor Zosimov
Journal:  J Comput Aided Mol Des       Date:  2005-01       Impact factor: 3.686

Review 7.  Molecular similarity and diversity in chemoinformatics: from theory to applications.

Authors:  Ana G Maldonado; J P Doucet; Michel Petitjean; Bo-Tao Fan
Journal:  Mol Divers       Date:  2006-02       Impact factor: 2.943

8.  Identification of biological activity profiles using substructural analysis and genetic algorithms.

Authors:  V J Gillet; P Willett; J Bradshaw
Journal:  J Chem Inf Comput Sci       Date:  1998 Mar-Apr

9.  A case study in misidentification of cancer cell lines: MCF-7/AdrR cells (re-designated NCI/ADR-RES) are derived from OVCAR-8 human ovarian carcinoma cells.

Authors:  Mordechai Liscovitch; Dana Ravid
Journal:  Cancer Lett       Date:  2006-02-28       Impact factor: 8.679

10.  Karyotypic complexity of the NCI-60 drug-screening panel.

Authors:  Anna V Roschke; Giovanni Tonon; Kristen S Gehlhaus; Nicolas McTyre; Kimberly J Bussey; Samir Lababidi; Dominic A Scudiero; John N Weinstein; Ilan R Kirsch
Journal:  Cancer Res       Date:  2003-12-15       Impact factor: 12.701

View more
  2 in total

1.  Uniqueness: skews bit occurrence frequencies in randomly generated fingerprint libraries.

Authors:  Nelson G Chen
Journal:  Mol Divers       Date:  2016-05-26       Impact factor: 2.943

2.  Design and synthesis of medium-ring lactam libraries inspired by octalactin a. A convergent-divergent approach.

Authors:  Neil Brown; Ge Gao; Machiko Minatoya; Baohan Xie; David Vandervelde; Gerald H Lushington; Jean-Pierre H Perchellet; Elisabeth M Perchellet; Kyle R Crow; Keith R Buszek
Journal:  J Comb Chem       Date:  2008-08-12
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.