| Literature DB >> 35654801 |
Guy Shtar1, Louise Azulay2, Omer Nizri3, Lior Rokach3, Bracha Shapira3.
Abstract
In recent years, due to the complementary action of drug combinations over mono-therapy, the multiple-drugs for multiple-targets paradigm has received increased attention to treat bacterial infections and complex diseases. Although new drug combinations screening has benefited from experimental tests like automated high throughput screening, it is limited due to the large number of possible drug combinations. The task of drug combination screening can be streamlined through computational methods and models. Such models require up-to-date databases; however, existing databases are static and consist of the data collected at the time of their creation. This paper introduces the Continuous Drug Combination Database (CDCDB), a continuously updated drug combination database. The CDCDB includes over 40,795 drug combinations, of which 17,107 are unique combinations consisting of more than 4,129 individual drugs, curated from ClinicalTrials.gov, the FDA Orange Book®, and patents. To create CDCDB, we use various methods, including natural language processing techniques, to improve the process of drug combination discovery, ensuring that our database can be used for drug synergy prediction. Website: https://icc.ise.bgu.ac.il/medical_ai/CDCDB/ .Entities:
Mesh:
Substances:
Year: 2022 PMID: 35654801 PMCID: PMC9163158 DOI: 10.1038/s41597-022-01360-z
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 8.501
Fig. 1Drug combinations in clinical trials in CDCDB, version of 19th of April 2022: (A) Distribution of drug combination clinical trials over the years. (B) Duration in years of drug combination studies over the years.
Fig. 2Overview of the database creation process. CDCDB is created from various sources, named entity recognition and manual filtering techniques are used to maintain the quality of CDCDB.
MeSH terms of the clinical studies in CDCDB.
| MeSH term | Occurrences |
|---|---|
| Other | 17,795 |
| Breast Neoplasms | 1,153 |
| Lung Neoplasms | 903 |
| Lymphoma | 870 |
| Leukemia | 855 |
| Carcinoma, Non-Small-Cell Lung | 854 |
| Carcinoma | 728 |
| Neoplasms | 693 |
| Multiple Myeloma | 634 |
| Neoplasms, Plasma Cell | 568 |
| Hepatitis C | 440 |
| Hepatitis | 435 |
| Prostatic Neoplasms | 430 |
| Leukemia, Myeloid | 420 |
| Diabetes Mellitus, Type 2 | 417 |
| Leukemia, Myeloid, Acute | 408 |
| Diabetes Mellitus | 383 |
| Adenocarcinoma | 360 |
| Colorectal Neoplasms | 351 |
| Hepatitis A | 349 |
| Leukemia, Lymphoid | 336 |
| Melanoma | 329 |
| Myelodysplastic Syndromes | 321 |
| Syndrome | 299 |
| Leukemia, Lymphocytic, Chronic, B-Cell | 290 |
| Lymphoma, Non-Hodgkin | 277 |
| Lymphoma, B-Cell | 256 |
| Hepatitis C, Chronic | 253 |
Comparison of drug combination databases.
| Database | Sources | Number of Combinations | Number of Individual Drugs | Therapeutic Field | Release Year |
|---|---|---|---|---|---|
| DCDB 1.0[ | Orange Book and PubMed | 499 | 485 | Many | 2010 |
| DCDB[ | ClinicalTrials, Orange Book, and PubMed | 1,363 | 904 | Many | 2014 |
| ASDCD[ | PubMed, Google Scholar, and Web of Science | 210 | 105 | Fungal | 2014 |
| NCI-ALMANAC[ | HTS from Orange Book | 5,232 | 104 | Cancer | 2017 |
| TTD[ | FDA, ClinicalTrials and PubMed | 34,019 | unknown | Many | 2018 |
| DREAM[ | FDA, HTS experiments | 910 | 115 | Cancer | 2019 |
| DrugComb (includes mono-therapies)[ | Public datasets, publications, user uploads | 739,964 (mostly in-vitro) | 8,397 | Cancer, Malaria, COVID-19 | 2021 |
| DrugCombDB[ | HTS experiments, Orange Book, NCI-ALMANAC, DREAM, PubMed, and external databases | 448,555 (only ∼7,000 in-vivo) | 2,887 | Many | 2020 |
| O’Neil | HTS experiments | 583 | 38 | Cancer | 2016 |
| NCATS Malaria Dataset[ | HTS experiments | 14,810 | 206 | Malaria | 2015 |
| AZ-DREAM[ | HTS experiments | 910 | 85 | Cancer | 2015 |
| Antibiotic combinations[ | HTS experiments | 210 | 21 | Antibiotic | 2006 |
| CDCDB (current work) | AACT (clinical trials), FDA Orange Book, and Integrity (patents) | 40,795 (as of Jan. 2021) | 4,195 (as of Jan. 2021) | Many |
Fig. 3CDCDB’s sources. Amount of groups with a certain number of drugs combined in the group, distributed by data source.
| Measurement(s) | drug combination effect modeling • drug combination effect modeling |
| Technology Type(s) | Text mining • Clinical Trials Informatics System |
| Factor Type(s) | Medicine |
| Sample Characteristic - Organism | Homo sapiens |