| Literature DB >> 22711790 |
Jan Wildenhain1, Nicholas Fitzgerald, Mike Tyers.
Abstract
UNLABELLED: The MolClass toolkit and data portal generate computational models from user-defined small molecule datasets based on structural features identified in hit and non-hit molecules in different screens. Each new model is applied to all datasets in the database to classify compound specificity. MolClass thus defines a likelihood value for each compound entry and creates an activity fingerprint across diverse sets of screens. MolClass uses a variety of machine-learning methods to find molecular patterns and can therefore also assign a priori predictions of bioactivities for previously untested molecules. The power of the MolClass resource will grow as a function of the number of screens deposited in the database.Entities:
Mesh:
Year: 2012 PMID: 22711790 PMCID: PMC3413392 DOI: 10.1093/bioinformatics/bts349
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1MolClass features. (A) current state of data from public resources such as PubChem and ChemBank. (B) MolClass workflow from experimental data to activity likelihoods. (C) Likelihood scores for fenbendazole and aspirin in 14 different models: (1) neurosphere proliferation, +/none (Diamandis ); (2) Caco-2 permeation, +/− (Hou ); (3) flucanozole synergizer, +/none (Spitzer ); (4) Caenorhabditis elegans drug bioaccumulation, none/+ (Burns ); (5) Ames mutagenicity benchmark, none/+ (Hansen ); (6) mutagenicity prediction, +/none (Kazius ); (7) blood–brain barrier penetration, +/− (Li ); (8) PubChem AID 1828 +/none; (9) PubChem AID 595 +/− (10) ChemBank 1000423 +/− (11) ChemBank 1001644 +/− (12) ChemBank 1000359 +/− (13) autofluoresence none/+ and (14) ChEMBL TargetID CHEMBL204 none/ +. ‘+’ activating, ‘−’ inhibiting and ‘none’ no effect