| Literature DB >> 24303301 |
Oktie Hassanzadeh1, Qian Zhu, Robert Freimuth, Richard Boyce.
Abstract
Structured Product Labels (SPLs) contain information about drugs that can be valuable to clinical and translational research, especially if it can be linked to other sources that provide data about drug targets, chemical properties, interactions, and biological pathways. Unfortunately, SPLs currently provide coarsely-structured drug information and lack the detailed annotation that is required to support computational use cases. To help address this issue we created LinkedSPLs, a Linked Data resource that extends the "web of drug identity" using information extracted from SPLs. In this paper we describe the mapping that LinkedSPLs provides between SPL active ingredients and DrugBank chemical entities. These mappings were created using three approaches: InChI chemical structure descriptors comparison, exact string matching based on the chemical name, and automatic (unsupervised) linkage identification. Comparison of the approaches found that, while these three approaches are complementary, the automatic approach performs well in terms of precision and recall.Entities:
Year: 2013 PMID: 24303301 PMCID: PMC3814463
Source DB: PubMed Journal: AMIA Jt Summits Transl Sci Proc
Figure 1.An overview of the three mapping methods
The results of three different approaches to mapping drug product active ingredients to DrugBank 3.0.
| Approach 1: InChI identifier | Approach 2: ChEBI identifier | Approach 3: Automatic | |||||||
|---|---|---|---|---|---|---|---|---|---|
| Valid | Not Valid | Total | Valid | Not Valid | Total | Valid | Not Valid | Total | |
| Active ingredients (N=2,264) | 424 | 5 | 429 | 707 | 11 | 718 | 1,162 | 17 | 1,179 |
A comparison of the overlap of validated mappings
| InChI identifier | ChEBI identifier | InChI + ChEBI | Automatic | |
|---|---|---|---|---|
| InChI identifier | 424 | 261 | 424 | 395 |
| ChEBI identifier | --- | 707 | 707 | 650 |
| InChI + ChEBI | -- | -- | 831 | 791 |
| Automatic | -- | -- | -- | 1162 |