| Literature DB >> 24900303 |
Mathias Wawer1, Jürgen Bajorath1.
Abstract
We combine two graphical SAR analysis methods, Network-like Similarity Graphs (NSGs) and Similarity-Potency Trees (SPTs), to search for SAR information in a large and heterogeneous compound data set containing more than 13,000 antimalarial screening hits that was recently released by GlaxoSmithKline (GSK). The NSG-SPT approach first identifies subsets of compounds inducing local SAR discontinuity in data sets and then extracts available SAR information from these subsets in a graphically intuitive manner. Applying the NSG-SPT analysis scheme, we have identified in the GSK collection compound subsets of high local SAR information content including both known and previously unknown antimalarial chemotypes, which yielded interpretable SAR patterns. This information should be helpful to prioritize and select antimalarial candidate compounds for further chemical exploration. Furthermore, the NSG-SPT tools are publicly available, and our study also shows how to practically apply these SAR analysis methods to study large compound data sets.Entities:
Keywords: Anti-malaria screening hits; data mining; graphical SAR analysis; network-like similarity graphs; similarity-potency trees; structure−activity relationship (SAR) information
Year: 2011 PMID: 24900303 PMCID: PMC4018131 DOI: 10.1021/ml100240z
Source DB: PubMed Journal: ACS Med Chem Lett ISSN: 1948-5875 Impact factor: 4.345