| Literature DB >> 27467413 |
Sayan Ranu1, Bradley T Calhoun2, Ambuj K Singh1, S Joshua Swamidass3.
Abstract
Identifying the overrepresented substructures from a set of molecules with similar activity is a common task in chemical informatics. Existing substructure miners are deterministic, requiring the activity of all mined molecules to be known with high confidence. In contrast, we introduce pGraphSig, a probabilistic structure miner, which effectively mines structures from noisy data, where many molecules are labeled with their probability of being active. We benchmark pGraphSig on data from several small-molecule high throughput screens, finding that it can more effectively identify overrepresented structures than a deterministic structure miner.Entities:
Keywords: Chemoinformatics; Drug discovery; High throughput screening; Virtual screening
Year: 2011 PMID: 27467413 DOI: 10.1002/minf.201100058
Source DB: PubMed Journal: Mol Inform ISSN: 1868-1743 Impact factor: 3.353