| Literature DB >> 28296354 |
E Soysal1, H-J Lee1, Y Zhang1, L-C Huang1, X Chen1, Q Wei1, W Zheng1, J T Chang1, T Cohen1, J Sun1, H Xu1.
Abstract
Despite the existence of various databases cataloging cancer drugs, there is an emerging need to support the development and application of personalized therapies, where an integrated understanding of the clinical factors and drug mechanism of action and its gene targets is necessary. We have developed CATTLE (CAncer Treatment Treasury with Linked Evidence), a comprehensive cancer drug knowledge base providing information across the complete spectrum of the drug life cycle. The CATTLE system collects relevant data from 22 heterogeneous databases, integrates them into a unified model centralized on drugs, and presents comprehensive drug information via an interactive web portal with a download function. A total of 2,323 unique cancer drugs are currently linked to rich information from these databases in CATTLE. Through two use cases, we demonstrate that CATTLE can be used in supporting both research and practice in personalized oncology.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28296354 PMCID: PMC5351410 DOI: 10.1002/psp4.12174
Source DB: PubMed Journal: CPT Pharmacometrics Syst Pharmacol ISSN: 2163-8306
Figure 1An overview of the workflow for building CATTLE: data collection, integration, and visualization.
Data sources used in the CATTLE (CAncer Treatment Treasury with Linked Evidence) database
| Resource | Link | Contribution | # of Records |
|---|---|---|---|
| FDA drug label |
| Description, Indication | 6,825 drugs |
| NCIT |
| Description, Synonyms, External IDs | 121,794 Records |
| PubChem |
| Structure, Synonyms, External IDs, Bioassay | 1.1 million bioassays, 90 million compounds, 220 million substance |
| ChEMBL |
| Structure, External IDs | 1,3 million Chemicals |
| TTD |
| Structure, External IDs, Indication, Drug Targets | 2,025 Targets, 17,816 Drugs |
| MeSH |
| Synonyms, External IDs | 822,268 Concepts |
| DrugBank |
| External IDs, Indication, Drug interaction | 8,246 Drugs, 4,170 Targets, 15,438 Drug‐Target associations |
| KEGG |
| Pathway, Indication, Drug interaction, Drug Targets | 10 thousand Drugs, 20 million Genes, 501 Pathways |
| ChEBI |
| Synonyms, External IDs | 50 thousand Compounds |
| RxNorm |
| Synonyms, External IDs | 630 thousand Records |
| UMLS |
| Synonyms, External IDs | 7.4 million Concept |
| ATC |
| External IDs, drug class | 6,601 Drugs |
| MATADOR |
| Drug targets | 15 thousand Drug‐Target Association |
| SIDER |
| Side effects | 5,868 Side Effects of 1,430 Drugs with 139 thousand Associations. |
| FAERS |
| Pharmacovigilance | 4,070,067 Reports |
| PharmGKB |
| Pharmacogenetic variation/genes | 24,784,816 Records |
| CCLE |
| Pharmacogenetic variation/genes | 23,390 Records |
| GDSC |
| Drug sensitivity/resistance genes | Total number of IC50 values 224,510 |
| cMAP |
| Drug‐induced gene expression | 1.5M gene expression profiles from ∼5,000 small‐molecule compounds, and ∼3,000 genetic reagents |
| PubMed |
| Literature | 24.6 million |
|
|
| Clinical trials, Indications | 231,860 Trials |
| USPTO |
| Patents | 326,032 Patents |
FDA, Food and Drug Administration; NCIT, National Cancer Institute Thesaurus; ChEMBL, Chemical Database of Bioactive Molecules by European Bioinformatics Institute‐European Molecular Biology Laboratory; TTD, Theurapeutic Target Database; MeSH, Medical Subject Headings; KEGG, Kyoto Encyclopedia of Genes and Genomes; ChEBI, Chemical Entities of Biological Interest; RxNorm, Normalized naming system for generic and branded drugs; UMLS, Unified Medical Language System; ATC, Anatomical Therapeutic Chemical Classification System; MATADOR, Manually Annotated Targets and Drugs Online Resource; SIDER, Side Effect Resource; CCLE, Cancer Cell Line Encyclopedia; GDCS, Genomics of Drug Sensitivity in Cancer; cMAP, Connectivity Map O2; USPTO, United States Patent and Trademark Office.
Figure 2An example of visualization of integrated information across the full spectrum of drug development.
Distribution of CATTLE drugs in different phases of clinical trials, and clinical trials in different phases
| Drug development stage | # of drugs | % | # of trials | % | |
|---|---|---|---|---|---|
| Unknown | 31 | 1.33 | 1,691 | 3.42 | |
| Exploratory study | Phase 0 | 6 | 0.26 | 129 | 0.26 |
| Safety and effectiveness study | Phase I | 646 | 27.80 | 8,898 | 17.99 |
| Phase I/phase II | 214 | 9.21 | 4,926 | 9.96 | |
| Phase II | 682 | 29.36 | 23,872 | 48.27 | |
| Phase II/phase III | 39 | 1.68 | 717 | 1.45 | |
| Phase III | 446 | 19.20 | 8,304 | 16.79 | |
| Postmarket study | Phase IV | 258 | 11.11 | 917 | 1.85 |
| Total | 2,323 | 100.00 | 49,454 | 100.00 | |
Figure 3(a) number of drugs and genes per cancer type (by topology); (b) number of drugs and genes per cancer type (by morphology); (c) number of drugs related to each gene category.
Top 5 drugs with the most number of evidence documents for each evidence type
| Evidence type | Drug name | # of associated docs |
|---|---|---|
| Top 5 Literatures | Mycophenolate | 107,155 |
| Estrogen | 61,558 | |
| Vascular Endothelial Growth Factor Receptor 2 | 59,006 | |
| Cisplatin | 49,742 | |
| Epinephrine | 42,496 | |
| Top 5 Bioassays | Ciprofloxacin | 12,527 |
| Doxorubicin | 7,377 | |
| Indomethacin | 6,189 | |
| Fluconazole | 5,386 | |
| Amphotericin B | 5,117 | |
| Top 5 Clinical Trials | Cyclophosphamide | 1,716 |
| Cisplatin | 1,668 | |
| Paclitaxel | 1,397 | |
| Gemcitabine | 1,335 | |
| Carboplatin | 1,325 | |
| Top 5 Patents | Dimethicone | 20,553 |
| Phenobarbital | 11,720 | |
| Polyethylene Glycol | 8,688 | |
| Omeprazole | 5,753 | |
| Methylphenidate | 4,147 |