| Literature DB >> 26464438 |
Christopher Southan1, Joanna L Sharman1, Helen E Benson1, Elena Faccenda1, Adam J Pawson1, Stephen P H Alexander2, O Peter Buneman3, Anthony P Davenport4, John C McGrath5, John A Peters6, Michael Spedding7, William A Catterall8, Doriano Fabbro9, Jamie A Davies10.
Abstract
The IUPHAR/BPS Guide to PHARMACOLOGY (GtoPdb, http://www.guidetopharmacology.org) provides expert-curated molecular interactions between successful and potential drugs and their targets in the human genome. Developed by the International Union of Basic and Clinical Pharmacology (IUPHAR) and the British Pharmacological Society (BPS), this resource, and its earlier incarnation as IUPHAR-DB, is described in our 2014 publication. This update incorporates changes over the intervening seven database releases. The unique model of content capture is based on established and new target class subcommittees collaborating with in-house curators. Most information comes from journal articles, but we now also index kinase cross-screening panels. Targets are specified by UniProtKB IDs. Small molecules are defined by PubChem Compound Identifiers (CIDs); ligand capture also includes peptides and clinical antibodies. We have extended the capture of ligands and targets linked via published quantitative binding data (e.g. Ki, IC50 or Kd). The resulting pharmacological relationship network now defines a data-supported druggable genome encompassing 7% of human proteins. The database also provides an expanded substrate for the biennially published compendium, the Concise Guide to PHARMACOLOGY. This article covers content increase, entity analysis, revised curation strategies, new website features and expanded download options.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26464438 PMCID: PMC4702778 DOI: 10.1093/nar/gkv1037
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Target class content
| Targets | UniProt ID count |
|---|---|
| 7TM receptors* | 395 |
| Nuclear hormone receptors | 48 |
| Catalytic receptors | 239 |
| Ligand-gated ion channels | 84 |
| Voltage-gated ion channels | 141 |
| Other ion channels | 47 |
| Enzymes (all) | 1164 |
| Transporters | 508 |
| Kinases | 539 |
| Proteases | 240 |
| Other proteins | 135 |
| Total number of targets | 2761 |
*Not all our 7TM receptor records are unequivocally assigned as GPCRs, but for convenience we refer to these generally as GPCRs in the text.
Figure 1.Hierarchical listing for the ion channel families and subfamilies.
Figure 2.High level Gene Ontology (GO) functional categories for three sets of human proteins. Set A was generated from the total proteome of 20,204. Set B represents the 1228 targets with quantitative ligand binding data in GtoPdb. Set C represents the 554 targets where at least one approved drug is included in the ligand binding data. Panel D provides the colour key to the top-level GO categories. The charts were generated by loading Swiss-Prot IDs from the protein sets into the PANTHER Gene List Analysis Tool (55).
Ligand category counts. SID refers to the PubChem Substance Identifier and CID the PubChem Compound Identifier
| Ligand classification | Count |
|---|---|
| Synthetic organics | 5055 |
| Metabolites | 582 |
| Endogenous peptides | 759 |
| Other peptides including synthetic peptides | 1222 |
| Natural products | 234 |
| Antibodies | 138 |
| Inorganics | 34 |
| Approved drugs | 1233 |
| Withdrawn drugs | 67 |
| Ligands with INNs | 1882 |
| Isotopically labelled ligands | 593 |
| PubChem CIDs | 6037 |
| PubChem SIDs | 8024 |
| Total number of ligands | 8024 |
Figure 3.PubChem intersects. Figures were obtained via the PubChem interface using mostly pre-existing indexing. The exceptions are custom selects (described below) for patents, INN or United States Adopted Names (USAN) and Lipinski Rule-of-Five (ROF) + 150–800 Mw. With the exception of the SIDs (Row 1) intersects are CID counts. These queries were executed at the beginning of September 2015 when the PubChem CID total was 60.8 million and our own SIDs from release 2015.2 had been processed.
Interaction counts. Primary target indicates the dominant MMOA
| Interaction type | Count |
|---|---|
| Targets with ligand interactions | 1505 |
| Targets with quantitative ligand interactions | 1228 |
| Targets with approved drug interactions | 554 |
| Primary targets with approved drug interactions | 312 |
| Ligands with target interactions | 6796 |
| Ligands with quantitative interactions (approved drugs) | 5860 (738) |
| Ligands with clinical use summaries (approved drugs) | 1724 (1231) |
| Number of binding constants | 44691 |
| Number of binding constants curated from the literature | 13484 |
Content changes since our 2014 publication (6). Only those major categories that could be normalised for comparison between 2013 and 2015 are included
| Oct 2013 | 2015 | Percentage increase | |
|---|---|---|---|
| Target protein IDs | 2485 | 2761 | 11 |
| Ligands total | 6064 | 8024 | 32 |
| Approved drugs | 559 | 1233 | 121 |
| Antibodies | 10 | 138 | 1280 |
| Peptides | 1776 | 1981 | 12 |
| Synthetic small molecules | 3504 | 5055 | 44 |
| PubChem SIDs | 3107 | 8024 | 158 |
| PubChem CIDs | 2694 | 6037 | 124 |
| Binding constants | 41076 | 44691 | 9 |
| References | 21774 | 27880 | 28 |
Figure 4.Relationship growth since 2012. The first (left-most) chart shows the number of targets with curated ligand interactions while the second chart includes only those targets that are supported by quantitative data. The third and fourth charts show the number of approved drugs with data-supported targets and those that may be considered primary targets, respectively.
Figure 5.Inhibitors table from the detailed view of the BACE2 target entry, with the inclusion of five lead compounds from patents.
Figure 6.Clinically-Relevant Mutations and Pathophysiology for Kv7.1.
Figure 7.Clinical data summary tab for the approved drug telmisartan.
Examples of links where we have direct interactions with the database teams
| Resource | Connectivity Comments | Reference |
|---|---|---|
| BindingDB | Comprehensive ligand-target database, we now cross-reference selected patent extractions from this source | ( |
| ChEMBL and UniChem | Inclusion of our target protein pointers and a ChEMBL look-up for our ligand entries loaded in UniChem | ( |
| DrugBank | Target cross-references and chemical ontology connection | ( |
| ESTER | Alpha/beta hydrolase cross-references | ( |
| GeneCards | Gene expression and functional data aggregator | ( |
| GPCRDB | Specific pointers to their detailed features, curation of mutations, sequence display toolbox and residue numbering system | ( |
| GUDMAP | Links to proteins involved in GenitoUrinary (GU) tract development | ( |
| HGNC | Long standing and frequent interactions on target family nomenclature issues | ( |
| IMGT/mAb-DB | Pointers to provenanced sequences for clinical antibodies, target interactions, display tools and residue numbering system | ( |
| MEROPS | Feature details, classification, ligand mapping, other protease-specific issues | ( |
| neXtProt | Data and features additional to Swiss-Prot, semantic mining technology | ( |
| NURSA | Detailed NHR information including transcriptome mining functionality | ( |
| Orphanet | Unique rare genetic disease curation and disease term connectivity | ( |
| PubChem | Covering aspects of chemical curation, drug naming and our submitted structures. Plans for future peptide and BioAssay Links | ( |
| UniProtKB | Maintenance of our own selectable cross-references to proteins with quantitative interactions | ( |
| Wikipedia | Updating, adding new target and ligand links, including filling in ‘chemistry boxes’ | ( |
Figure 8.Intersects and differentials for human Swiss-Prot ID cross-referenced source databases that curate chemistry-to-protein mappings. Data were generated via the UniProtKB interface and the diagram prepared using the Venny tool (http://bioinfogp.cnb.csic.es/tools/venny/). The union of all four sets is 3603, based on the Swiss-Prot ID cross-references from UniProtKB release 2015_07.