| Literature DB >> 32495081 |
Ananth Punyala1, Rachana Lankapalli2, Diane Hindman3, Rebecca Racz4.
Abstract
PURPOSE: Drug indications and disease symptoms often confound adverse event reports in real-world datasets, including electronic health records and reports in the FDA Adverse Event Reporting System (FAERS). A thorough, standardized set of indications and symptoms is needed to identify these confounders in such datasets for drug research and safety assessment. The aim of this study is to create a comprehensive list of drug-indication associations and disease-symptom associations using multiple resources, including existing databases and natural language processing.Entities:
Keywords: Databases; Indications; Natural language processing; Symptoms; Text-mining
Mesh:
Year: 2020 PMID: 32495081 PMCID: PMC7419351 DOI: 10.1007/s00228-020-02898-w
Source DB: PubMed Journal: Eur J Clin Pharmacol ISSN: 0031-6970 Impact factor: 2.953
Fig. 1A graphical depiction of the methods and analyses used in this study
Fig. 2Distribution of US drug approvals by source and year. The majority of drugs included in this dataset were approved post-1982, according to FDA Orange Book data. Most approval dates pre-1982 are listed as “Approved Prior to Jan 1, 1982” in the Orange Book. Most of the drugs extracted after 2013 were retrieved from RxNorm
Precision of dataset associations
| Source | Number of pairs | Precision | Estimated number of false positives |
|---|---|---|---|
| Drug-indication associations | |||
| SIDER | 3063 | 76% | 735 |
| RxNorm | 4095 | 86% | 573 |
| SIDER + RxNorm | 986 | 99% | 10 |
| Disease-symptom associations | |||
| MedlinePlus | 1317 | 98% | 26 |
| NLP | 8901 | 57% | 3827 |
| MedlinePlus + NLP | 105 | 99% | 1 |
| Drug-indication-symptom associations | |||
| All | 149,164 | 58% | 62,649 |
Fig. 3Threshold determination of TF-IDF scores based on precision. When TF-IDF scores were used as a threshold, precision ranged from 0.54 to 0.65