| Literature DB >> 35758373 |
Hae Reong Kim1, MinDong Sung1, Ji Ae Park1, Kyeongseob Jeong1, Ho Heon Kim1, Suehyun Lee2, Yu Rang Park1.
Abstract
BACKGROUND: Adverse drug reactions (ADRs) are unintended negative drug-induced responses. Determining the association between drugs and ADRs is crucial, and several methods have been proposed to demonstrate this association. This systematic review aimed to examine the analytical tools by considering original articles that utilized statistical and machine learning methods for detecting ADRs.Entities:
Mesh:
Year: 2022 PMID: 35758373 PMCID: PMC9276413 DOI: 10.1097/MD.0000000000029387
Source DB: PubMed Journal: Medicine (Baltimore) ISSN: 0025-7974 Impact factor: 1.817
Figure 1Publications removed based on title or abstract∗; improper study subject (eg, bird, mouse, cats, and dog), improper candidate (eg, biology, genetic, gene expression, stem cell, HER2, DNA, biologics, β1, beta blockers, mutation, inhibitor, genotype, chemical, pathway, T-cell, surgical, surgery, image, MRI, alcohol, smoke, marijuana, and diet), and improper research design (eg, randomized clinical trial, RCT, clinical trial, meta-analysis, pilot, systematic review, Delphi, and social media). Full-text articles excluded, based on manual reviews∗∗; In case of lack of a clear goal, improper candidate or research design that is not filtered out of search terms, and lack of drug-induced adverse event. Figure 1. PRISMA Flow Chart.
Figure 2CDM∗ = Common data model. The use of multiple algorithms within one study may result in duplicate inclusions. Figure 2. Sankey diagram for statistical methods.
Including databases in this research.
| Category | Database | Information from the database |
| SRS | EudraVigilance[ | The database for adverse reactions to drug which have been authorised in clinical trials in the European Economic Area |
| FAERS (including VAERS)[ | Drug and ADR association for postmarketing drug safety surveillance from the Food and Drug Administration's | |
| VigiBase[ | Individual Case Safety Reports of suspected ADRs | |
| Other type of SRS[ | National specific SRS database | |
| EMR | Medical records[ | Institution specific standardized data (eg, diagnosis, medication) |
| Medical note[ | Unstructured text data (eg, nursing records, surgery, and hospitalization records) | |
| Other data sources | SIDER[ | The information of side effects and indication for marketed drugs |
| DrugBank[ | Non-redundant protein (drug target, enzyme, transporter, carrier, thus informing on drugs’ mechanism of action and metabolism) sequences | |
| PubChem[ | The chemical information of drugs, unique chemical structures, and biological activity data of chemical substances tested in assay experiments | |
| KEGG[ | Drug, Compound and Disease databases providing chemical structures, targets, metabolizing enzymes | |
| Common Data Model[ | A uniform set of metadata, allowing data and its meaning to be shared across applications (eg, OMOP CDM) | |
| Health Insurance system[ | National specific health insurance system data (eg, NHIS) | |
| App, web data[ | Data generated and collected through the app or web (eg, MedHelp) | |
| Registry[ | National Data Registry (eg, cardiovascular disease) | |
| Simulation data[ | Fake data created for specific situations for algorithm verification |
Statistical methods for ADR detection in SRS data.
| Systems Author | Category of method | Method | Source | Purpose |
| Candore et al[ | Disproportionate method | Almost all | Multiple SRS data | To compare the performance of commonly used algorithms detecting ADRs |
| Monaco et al[ | Disproportionate method | PRR | EudraVigilance | to find out suspected ADRs |
| Raschi et al[ | Disproportionate method | ROR | FAERS | To assess the hepatic safety of novel oral anticoagulants |
| Fukazawa et al[ | Disproportionate method | ROR | FAERS | To conduct a disproportionality analysis and categorized these signals into groups which are signals with statistical significance and those without signals |
| Rahman et al and Alatawi et al[ | Disproportionate method | ROR | FAERS | To compare whether adverse event reporting patterns are similar between brand and generic drugs |
| Hoffman et al[ | Disproportionate method | ROR | FAERS | To construct a list of signal ADRs |
| Takada et al[ | Disproportionate method | ROR | FAERS | To test that the use of sodium channel-blocking antiepileptic drugs are inversely associated with cancer |
| Yu et al[ | Disproportionate method | ROR | FAERS | To assess the extent of sex differences in ADRs |
| Yue et al[ | Disproportionate method | ROR | FAERS | To investigate acute kidney injury events associated with the concomitant use of oral acyclovir or valacyclovir with a Nonsteroidal anti-inflammatory drugs |
| Cai et al[ | LRT | Likelihood ratio test | VAERS | To propose a powerful testing procedure for signal detection of temporal variation in ADR |
| Tong et al[ | LRT | Likelihood ratio test based on zero-inflated poisson (ZIP-LRT) | VAERS | To identify four adverse events that are rare and have significantly different reporting rates for FLU4 vaccine |
| Zhao et al[ | LRT | The extended likelihood ratio with Poisson model and zero-inflated Poisson model (Ext-ZIP-LRT) | FAERS | To identify ADR signals that have disproportionately high reporting rates |
| Wang et al[ | LRT | Count-dependent probability mixture drug-count response model (MDRM) | FAERS and OMOP CDM | To introduce two novel mixture drug-count response models for detecting drug combinations of high dimension that induce myopathy |
| handler et al[ | Disproportionate method | PRR | VigiBase database | To explore reporting patterns for HPV vaccine |
| Sugawara et al[ | Disproportionate method | ROR | National specific SRS data | To evaluate the incidence of respiratory depression by use of opioids |
| Tan et al[ | Disproportionate method | PRR | SRS data | To explore risks injection-related ADRs |
| Trinh et al[ | Disproportionate method | PRR | National specific SRS data | To optimize signal detection investigating the interest of time-series analysis |
| Chan et al[ | LRT | Sequential Probability Ratio Test | National specific SRS data | To detect signals of disproportionate reporting with the hRRs |
| Marbac et al[ | Regression | Logistic regression with Metropolis–Hastings algorithm | SRS data | To identify a logistic regression with metropolis–hastings algorithm |
| Xu et al[ | Regression | Logistic | FAERS | To identify secondary medications for mitigating the adverse effects of a primary drug |
| Pettit et al[ | Regression | Logistic | FAERS | To figure out between posaconazole serum concentrations and toxicity |
| Lerch et al[ | Disproportionate method | Signals of disproportionate reporting | SRS data | To detect unknown causal associations between drugs and unexpected events |
ADR = adverse drug reaction.
Statistical methods for ADR detection in EMR data.
| Author | Category of method | Method | Source | Purpose |
| Uozumi et al[ | Regression | Survival | EMR | To investigate skin toxicity which is a common adverse event during cetuximab treatment |
| Jeong et al[ | Regression | Comparison of Extreme Laboratory Test results, among others | EMR and SIDER | To propose a model that enables ADR signal detection from existing algorithms based on the EHR laboratory results for inpatient |
| Nishihara et al[ | Regression | Survival | EMR | To investigate the relationships between increased blood pressure and bevacizumab administration |
| Otake et al[ | Regression | Survival | EMR | To assess whether chemotherapy-induced neutropenia could be a prognostic factor and clarify other prognostic factors with metastatic pancreatic cancer patients |
| Kucharz et al[ | Regression | Survival | EMR | To investigate cabozantinib-induced adverse events which are predictive factors of survival in case of sunitinib or axitinib |
| Dona et al[ | Regression | Survival | EMR | To confirm that nonsteroidal anti-inflammatory drugs induced urticaria/angioedema |
| Gadelha et al[ | Regression | Survival | EMR | To identify risk factors for death in patients who have suffered noninfectious ADR |
| Andrade et al[ | Regression | Survival | EMR | to identify the risk factors for ADRs in pediatric inpatients |
| Westberg et al[ | Regression | Survival | EMR | To assess the association of DTP likelihood of harm severity score, as measured by comprehensive medication management pharmacist after hospital discharge |
| Sobhonslidsuk et al[ | Regression | Survival | EMR | To confirm that toxic liver diseases are mainly caused by drug-induced liver injury |
| Cordiner et al[ | Regression | Survival | EMR | To test for Antipsychotic polypharmacy runs the risk of additional ADR and drug interactions |
| Merid et al[ | Regression | Survival | EMR | To assess incidence and predictors of major adverse drug events among drug resistant tuberculosis patients |
| Oshikoya et al[ | Regression | Survival | EMR | To determine the risk of serious ADR when oral azithromycin or intravenous/intramuscular fentanyl are used off-label compared to on-label in pediatric ICU |
| Okamoto et al[ | Regression | Survival | EMR | To examine adverse event occurrence rates by grade, deaths and the appearance of severe ADR |
| Dedefo et al[ | Regression | Logistic | EMR | To assess the incidence and determinants of medication errors and adverse drug events among hospitalized children |
| Blumenthal et al[ | Regression | Logistic | EMR | To address inpatient penicillin allergies results in more broad-spectrum antibiotic use, treatment failures, and adverse drug events |
| Sellick et al[ | Regression | Logistic | EMR | To measure the incidence and risk factors for fluoroquinolone-associated psychosis or delirium |
| Degu et al[ | Regression | Logistic | EMR | To figure out hospital admissions which are due to drug related problems |
| Mill et al[ | Regression | Logistic | EMR | To assess the accuracy and the negative predictive value of the graded provocation challenge in a cohort of children referred with suspected allergy to amoxicillin |
| Ilich et al[ | Regression | Logistic | EMR | To determine whether female colorectal cancer patients experienced a higher incidence of dose-limiting toxicity than men when treated with adjuvant capecitabine |
| Khong et al[ | Regression | Negative binomial | EMR | To affect the interleukin-2 therapy for metastatic melanoma and renal cell carcinoma |
| Daley et al[ | Regression | Conditional poisson | EMR | To evaluate the safety for influenza vaccine in children |
| Vock et al[ | other | Inverse Probability of Censoring Weighting | EMR | To propose a technique for mining right-censored time-to-event data |
ADR = adverse drug reaction.
Statistical methods for ADR detection in other data sources.
| Author | Category of method | Method | Source | Purpose |
| Wang et al[ | LRT | Maximum log likelihood ratio | Common Data Model | to propose tree- based scan statistics to detect ADR signal |
| Maura et al[ | Other | Sequence Symmetry Analysis | Health Insurance system | to assess the association between DOAC initiation and the onset of nonbleeding adverse events |
ADR = adverse drug reaction, DOAC = Direct Oral Anti-Coagulants.
Figure 3MedEffect∗ = National SRS data. The use of multiple algorithms within one study may result in duplicate inclusions. Figure 3. Sankey diagram for machine learning methods.
Machine learning methods for ADR in SRS data.
| Author | Category of method | Method | Source | Purpose |
| Xiao et al[ | Bayesian | Monte-Carlo Expectation-Maximization procedure | FAERS, MedEffect, among others | To detects exact drug safety signals from multiple data sources via Monte Carlo Expectation Maximization and signal combination step |
| Cai et al[ | Bayesian | Causal Bayesian Network | FAERS | To discover DDIs |
| Li et al[ | Other methods | Inductive matrix completion | FAERS, DrugBank, among others | To find a random matrix value which minimized the distance between the drug and the ADR by the loss function and regularization |
| Ren et al[ | Other methods | Blockmetrices with correlation | VAERS | To use correlation matrices to detect the adverse events or symptoms after vaccination |
| Liu et al[ | Other methods | Autoencoder-Based Semi-Supervised Learning Algorithm and weighted SVM | FAERS and ONC High-Priority | To propose a machine learning framework to extract useful features and identify potential highpriority DDIs |
ADR = adverse drug reaction, DDI = Drug-Drug Interaction.
Machine learning methods for ADR in EMR data.
| Author | Category of method | Method | Source | Purpose |
| Wang et al[ | Supervised | Random Forest | EMR, DrugBank, and etc. | To develop data-mining method for detection of ADRs |
| Zhao et al[ | Supervised | Random Forest | EMR | To detect the drug-induced diagnosis ADRs |
| Zhao et al[ | Supervised | Random Forest | EMR | To learn weights for ADRs detection |
| Boyce et al[ | Supervised | Random Forest and so on | Admission notes | To show the value of text mining for identifying suspected bleeding ADRs |
| Desautels et al[ | Supervised | AdaBoost | EMR | To identify patients who suffer from ICU readmission |
| Wunnava et al[ | Supervised | Bi-directional long short-term memory, among others | EMR notes | To develop rule-based tokenization techniques for ADRs detection |
| Wang et al[ | Supervised | Regularized logistic regression, linear support vector machine | EMR notes | To evaluate the feasibility of multiclass classification for ADRs |
ADR = adverse drug reaction.
Machine learning methods for ADR in other data sources.
| Author | Category of method | Method | Source | Purpose |
| Kastrin et al[ | Other methods | Unsupervised and supervised method | DrugBank, KEGG, NDF-RT, and Twosides | To represent the process of discovering potential DDIs and to evaluate performance of unsupervised and supervised machine learning methods |
| Bean et al[ | Other methods | Weighted predictive method | DrugBank, SIDER, and EMR | To use knowledge about drugs known to cause an ADR to predict new causes |
| Zhang et al[ | Other methods | Feature-derived graph regularized matrix factorization | SIDER, DrugBank, KEGG DRUG and PubChem | To predict ADRs based on known drug-side effect associations |
| Zhao et al[ | Supervised | Random forest | STITCH, KEGG, DrugBank, RDKit and SIDER | To detect the ADRs |
| Muñoz et al[ | Supervised | Feature selection-based multi-label k-nearest neighbour | PubChem and Bio2RDF dataset (DrugBank, SIDER, KEGG, etc.) | To explore effects of: using knowledge graphs as a representation of heterogeneous data; and casting ADRs prediction as a multilabel ranking problem |
| Song et al[ | Supervised | Pairwise kernel SVM classifier | DrugBank and SIDER | To predict drug pairs and check if they truly interact with each other |
| Zhang et al[ | Supervised | Feature selection-based multi-label k-nearest neighbor method | SIDER, PubChem, DrugBank, KEGG DRUG, and etc. | To build the association between feature and ADR vector for multilabel learning |
| Davazdahemami et al[ | Supervised | Gradient boosted trees | MEDLINE and DrugBank | To predict the drug and ADR associations |
| Hoang et al[ | Supervised | Sequence symmetry method | DrugBank | To assess the utility of supervised machine learning as a signal detection tool for ADRs |
| Liu et al[ | Supervised | XGBoost | Osteoarthritis Initiative dataset | To identify high-risk features of cardiovascular diseases caused by analgesics OA patients |
| Ross et al[ | Bayesian | Bayesian method | Cardiology's National Cardiovascular Data Registry | To provide insights into whether multiple methods used as an ensemble to detect all safety signals |
| Cotterill et al[ | Bayesian | Bayesian method | Simulated data | To account for a subgroup effect to ADRs by including covariates |
| Yang et al[ | Supervised | Association rule mining metrics | MedHelp | To propose a framework for drug safety signal detection by harnessing online health community data which associated ADRs and DDIs |
ADR = adverse drug reaction, DDI = Drug-Drug Interaction.