Literature DB >> 16711738

Structural unit analysis identifies lead series and facilitates scaffold hopping in combinatorial chemistry.

Philippa R N Wolohan1, Lakshmi B Akella, Roman J Dorfman, Peter G Nell, Stefan M Mundt, Robert D Clark.   

Abstract

Combinatorial chemistry and high-throughput screening technologies produce huge amounts of data on a regular basis. Sieving through these libraries of compounds and their associated assay data to identify appropriate series for follow-up is a daunting task, which has created a need for computational techniques that can find coherent islands of structure-activity relationships in this sea. Structural unit analysis (SUA) examines an entire data set so as to identify the molecular substructures or fragments that distinguish compounds with high activity from those with average activity. The algorithm is iterative and follows set heuristics in order to generate the structural units. It produces graphs that represent a set of units, which become SUA rules. Finding all of the input structures that match these graphs generates clusters. The Apriori algorithm for association rule mining is adapted to explore all of the combinations of structural units that define useful series. User-defined constraints are applied toward series selection and the refinement of rules. The significance of a series is determined by applying statistical methods appropriate to each data set. Application to the NCI-H23 (DTP Human Tumor Cell Line Screen) database serves to illustrate the process by which structural series are identified. An application of the method to scaffold hopping is then discussed in connection with proprietary screening data from a lead optimization project directed toward the treatment of respiratory tract infections at Bayer Healthcare. SUA was able to successfully identify promising alternative core structures in addition to identifying compounds with above-average activity and selectivity.

Entities:  

Mesh:

Year:  2006        PMID: 16711738     DOI: 10.1021/ci050432z

Source DB:  PubMed          Journal:  J Chem Inf Model        ISSN: 1549-9596            Impact factor:   4.956


  2 in total

1.  Successful identification of key chemical structure modifications that lead to improved ADME profiles.

Authors:  Lourdes Cucurull-Sanchez
Journal:  J Comput Aided Mol Des       Date:  2010-05-09       Impact factor: 3.686

2.  Indirect similarity based methods for effective scaffold-hopping in chemical compounds.

Authors:  Nikil Wale; Ian A Watson; George Karypis
Journal:  J Chem Inf Model       Date:  2008-04-11       Impact factor: 4.956

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.