| Literature DB >> 25348214 |
Felix A Kruger1, Anna Gaulton1, Michal Nowotka1, John P Overington1.
Abstract
UNLABELLED: PPDMs is a resource that maps small molecule bioactivities to protein domains from the Pfam-A collection of protein families. Small molecule bioactivities mapped to protein domains add important precision to approaches that use protein sequence searches alignments to assist applications in computational drug discovery and systems and chemical biology. We have previously proposed a mapping heuristic for a subset of bioactivities stored in ChEMBL with the Pfam-A domain most likely to mediate small molecule binding. We have since refined this mapping using a manual procedure. Here, we present a resource that provides up-to-date mappings and the possibility to review assigned mappings as well as to participate in their assignment and curation. We also describe how mappings provided through the PPDMs resource are made accessible through the main schema of the ChEMBL database.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25348214 PMCID: PMC4341065 DOI: 10.1093/bioinformatics/btu711
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.Schematic illustration of homology-based transfer of binding domain annotation. The schematic shows how a catalogue of Pfam-A domains with known small molecule interactions was mapped to protein sequences in the ChEMBL target dictionary
The table below summarizes how activities in the current release distribute over the three possible outcomes
| Outcome | # All | % All | # Active | % Active |
|---|---|---|---|---|
| i) Successful map | 750 653 | 53.5 | 269 128 | 76.2 |
| ii) Not mapped | 625 135 | 44.5 | 63 010 | 17.9 |
| iii) Conflicting map | 28 327 | 2.0 | 20 839 | 5.9 |
| Total | 1 404 115 | 100 | 352 977 | 100 |
Columns headed ‘all’ represent all activities, whereas columns headed ‘active’ represent activities from binding assays where pChEMBL is >5. %, percentage relative to total.
aTotal count.