| Literature DB >> 36050484 |
Jeong-Eun Lee1, Ju Hwan Kim1, Ji-Hwan Bae1, Inmyung Song2, Ju-Young Shin3,4,5.
Abstract
There has been a growing attention on using machine learning (ML) in pharmacovigilance. This study aimed to investigate the utility of supervised ML algorithms on timely detection of safety signals in the Korea Adverse Event Reporting System (KAERS), using infliximab as a case drug, between 2009 and 2018. Input data set for ML training was constructed based on the drug label information and spontaneous reports in the KAERS. Gold standard dataset containing known AEs was randomly divided into the training and test sets. Two supervised ML algorithms (gradient boosting machine [GBM], random forest [RF]) were fitted with hyperparameters tuned on the training set by using a fivefold validation. Then, we stratified the KAERS data by calendar year to create 10 cumulative yearly datasets, in which ML algorithms were applied to detect five pre-specified AEs of infliximab identified during post-marketing surveillance. Four AEs were detected by both GBM and RF in the first year they appeared in the KAERS and earlier than they were updated in the drug label of infliximab. We further applied our models to data retrieved from the US Food and Drug Administration Adverse Event Reporting System repository and found that they outperformed existing disproportionality methods. Both GBM and RF demonstrated reliable performance in detecting early safety signals and showed promise for applying such approaches to pharmacovigilance.Entities:
Mesh:
Substances:
Year: 2022 PMID: 36050484 PMCID: PMC9436954 DOI: 10.1038/s41598-022-18522-z
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
Characteristics of the AE reports of infliximab and methotrexate in the KAERS between 2009 and 2018.
| Characteristics | Infliximab | Methotrexate | |||
|---|---|---|---|---|---|
| N = 4482 | 100.00 (%) | N = 6894 | 100.00 (%) | ||
| < .0001 | |||||
| Male | 2002 | 44.7 | 2563 | 37.2 | |
| Female | 1929 | 43.0 | 4144 | 60.1 | |
| Unknown | 551 | 12.3 | 187 | 2.7 | |
| < .0001 | |||||
| < 20 | 413 | 9.2 | 1635 | 23.7 | |
| 20–29 | 486 | 10.8 | 387 | 5.6 | |
| 30–39 | 735 | 16.4 | 611 | 8.9 | |
| 40–49 | 650 | 14.5 | 841 | 12.2 | |
| 50–59 | 666 | 14.9 | 1208 | 17.5 | |
| 60–69 | 449 | 10.0 | 960 | 13.9 | |
| ≥ 70 | 233 | 5.2 | 544 | 7.9 | |
| Unknown | 850 | 19.0 | 708 | 10.3 | |
| < .0001 | |||||
| 2009 | 16 | 0.4 | 68 | 1.0 | |
| 2010 | 77 | 1.7 | 219 | 3.2 | |
| 2011 | 136 | 3.0 | 290 | 4.2 | |
| 2012 | 122 | 2.7 | 642 | 9.3 | |
| 2013 | 264 | 5.9 | 852 | 12.4 | |
| 2014 | 521 | 11.6 | 747 | 10.8 | |
| 2015 | 631 | 14.1 | 686 | 10.0 | |
| 2016 | 1608 | 35.9 | 971 | 14.1 | |
| 2017 | 670 | 15.0 | 1137 | 16.5 | |
| 2018 | 437 | 9.8 | 1282 | 18.6 | |
| < .0001 | |||||
| Yes | 1255 | 28.0 | 1257 | 18.2 | |
| < .0001 | |||||
| Spontaneous | 1244 | 27.8 | 5605 | 81.3 | |
| Post-marketing surveillance | 2718 | 60.6 | 38 | 0.6 | |
| Literature | 404 | 9.0 | 542 | 7.9 | |
| Others | 116 | 2.6 | 709 | 10.3 | |
| < .0001 | |||||
| Physician | 3351 | 74.8 | 1696 | 24.6 | |
| Pharmacist | 67 | 1.5 | 987 | 14.3 | |
| Nurse | 493 | 11.0 | 3126 | 45.3 | |
| Consumer | 112 | 2.5 | 96 | 1.4 | |
| Healthcare professional | 6 | 0.1 | 110 | 1.6 | |
| Others | 80 | 1.8 | 438 | 6.4 | |
| Unknown | 373 | 8.3 | 441 | 6.4 | |
| < .0001 | |||||
| RPVC | 632 | 14.1 | 6124 | 88.8 | |
| Manufacturer | 3733 | 83.3 | 676 | 9.8 | |
| Medical institution | 37 | 0.8 | 85 | 1.2 | |
| Pharmacy | 1 | 0 | 5 | 0.1 | |
| Consumer | 0 | 0 | 3 | 0 | |
| Others | 79 | 1.8 | 1 | 0 | |
AE adverse event, KAERS Korea Adverse Event Reporting System, RPVC regional pharmacovigilance center.
Early signal detection results across different data mining methods in the KAERS between 2009 and 2018.
| Drug | Adverse event term | WHO-ART preferred team | Label update (year) | Signaling prior to label update | |||
|---|---|---|---|---|---|---|---|
| RF | GBM | aROR | IC | ||||
| Infliximab | Agranulocytosis | Agranulocytosis | 2017 | Y | Y | N | N |
| cervical cancer | Cervical carcinoma | 2017 | Y | Y | N | N | |
| Cerebrovascular accidents | Cerebellar infarction | 2017 | Y | Y | N | N | |
| Cerebral infarction | |||||||
| Leukemia | Leukemia acute | 2018 | Y | Y | N | N | |
| Leukemia granulocytic | |||||||
| Transient visual loss | Vision abnormal | 2010 | N | N | N | N | |
KAERS Korea Adverse Event Reporting System, WHO-ART world health organization-adverse reaction terminology, RF random forest, GBM gradient boosting machine, aROR adjusted reporting odds ratio, IC information component.
Figure 1Standardized differences of data mining methods by calendar year for each pre-specified AE updated in the labeling information of infliximab Abbreviations: AE, adverse event; GBM, gradient boosting machine; RF, random forest; ROR025, adjusted reporting odds ratio; IC05, information component.
Figure 2Receiver operating characteristic (ROC) curve illustrating the prediction performances of data mining methods used to detect safety signals of infliximab in KAERS (2009–2018) and FAERS (2014–2018). Abbreviation: KAERS, Korea adverse event reporting system; FAERS, FDA adverse event reporting system; AUROC, area under receiver operating characteristic curve; GBM, gradient boosting machine, RF, random forest; ROR, Reporting odds ratio; IC, Information component, Prob, probability. *ROR025 is the lower limit of a 95% confidence interval for estimated reporting odds ratio. †IC025 is the lower limit of a 95% confidence interval for estimated information component.
Safety signal detection among the unknown AEs of infliximab reported in the KAERS between 2009 and 2018.
| WHO-ART preferred term | Data mining methods | |||||||
|---|---|---|---|---|---|---|---|---|
| GBM | RF | aROR | IC | |||||
| Signal | Probability | Signal | Probability | Signal | ROR025a | Signal | IC05b | |
| Acne | Y | 0.92 | Y | 0.74 | N | 0.64 | N | − 0.33 |
| Alopecia | Y | 0.94 | Y | 0.77 | N | 0.08 | N | − 2.66 |
| Asthenia | Y | 1 | Y | 0.95 | N | 0.36 | N | − 1.34 |
| Bilirubinaemia | Y | 0.93 | Y | 0.71 | N | 0.02 | N | − 4.37 |
| Cytomegalovirus colitis | Y | 0.87 | Y | 0.75 | N | 0.2 | N | − 1.07 |
| Death | Y | 0.88 | Y | 0.76 | N | 0.01 | N | − 2.89 |
| Drug reaction paradoxical | Y | 0.88 | Y | 0.8 | N | < 0.01 | N | − 0.17 |
| Epistaxis | Y | 0.93 | Y | 0.81 | N | 0.78 | N | − 0.63 |
| Extravasation | Y | 0.86 | Y | 0.73 | N | 0.03 | N | − 2.11 |
| Gastroenteritis | Y | 0.97 | Y | 0.76 | N | < 0.01 | N | − 0.07 |
| Haematuria | Y | 0.99 | Y | 0.82 | N | 0.04 | N | − 1.83 |
| Hepatocellular damage | Y | 0.88 | Y | 0.67 | N | 0.06 | N | − 3.27 |
| Hypoaesthesia | Y | 0.9 | Y | 0.7 | N | 0.14 | N | − 1.39 |
| Liver fatty | Y | 0.97 | Y | 0.79 | N | < 0.01 | N | − 0.17 |
| Melaena | Y | 0.95 | Y | 0.89 | Y | 2.79 | Y | 0.62 |
| Mouth dry | Y | 0.78 | Y | 0.65 | N | 0.26 | N | − 1.06 |
| Oedema genital | Y | 0.9 | Y | 0.63 | N | < 0.01 | N | − 1.48 |
| Oedema periorbital | Y | 0.88 | Y | 0.64 | N | 0.11 | N | − 1.99 |
| Paraesthesia | Y | 0.82 | Y | 0.83 | N | 0.83 | N | − 0.16 |
| Psoriasis | Y | 0.87 | Y | 0.71 | N | < 0.01 | Y | 0.19 |
| Pulmonary infiltration | Y | 0.97 | Y | 0.82 | N | 0.32 | N | − 0.3 |
| Stomatitis ulcerative | Y | 0.89 | Y | 0.78 | N | 0.08 | N | − 1.82 |
| Stridor | Y | 0.82 | Y | 0.65 | N | < 0.01 | N | − 1.48 |
| Stupor | Y | 0.84 | Y | 0.64 | N | 0.14 | N | − 3.17 |
| Temperature changed sensation | Y | 1 | Y | 0.96 | Y | 4.23 | N | − 0.08 |
| Tremor | Y | 0.85 | Y | 0.71 | N | 0.2 | N | − 0.8 |
| Uveitis | Y | 0.66 | Y | 0.7 | N | < 0.01 | Y | 0.08 |
KAERS Korea Adverse Event Reporting System, WHO-ART world health organization−adverse reaction terminology, RF random forest, GBM gradient boosting machine, aROR adjusted reporting odds ratio, IC information component.
aLower bound of the 95% confidence interval of adjusted ROR.
bLower bound of the 90% confidence interval of IC.
Figure 3Step-by-step process from dataset construction to evaluation of the data mining methods in detecting early safety signals of infliximab in the KAERS between 2009 and 2018. Abbreviations: KAERS, Korea Adverse Event Reporting System; ATC, Anatomical Therapeutic Chemical Classification.