| Literature DB >> 35579816 |
Guillaume L Martin1,2, Julien Jouganous1, Romain Savidan1, Axel Bellec1, Clément Goehrs1, Mehdi Benkebil3, Ghada Miremont4,5, Joëlle Micallef6,7, Francesco Salvo4,5, Antoine Pariente4,5, Louis Létinier8,9,10.
Abstract
INTRODUCTION: Adverse drug reaction reports are usually manually assessed by pharmacovigilance experts to detect safety signals associated with drugs. With the recent extension of reporting to patients and the emergence of mass media-related sanitary crises, adverse drug reaction reports currently frequently overwhelm pharmacovigilance networks. Artificial intelligence could help support the work of pharmacovigilance experts during such crises, by automatically coding reports, allowing them to prioritise or accelerate their manual assessment. After a previous study showing first results, we developed and compared state-of-the-art machine learning models using a larger nationwide dataset, aiming to automatically pre-code patients' adverse drug reaction reports.Entities:
Mesh:
Substances:
Year: 2022 PMID: 35579816 PMCID: PMC9112264 DOI: 10.1007/s40264-022-01153-8
Source DB: PubMed Journal: Drug Saf ISSN: 0114-5916 Impact factor: 5.228
Fig. 1Summary of the methodological differences between tasks. ADRs adverse drug reactions, ER emergency room, LGBM Light Gradient Boosted Machine, PT Preferred Term, XLM Cross-lingual Language Model
Fig. 2Flowchart of the selection of datasets. ADR adverse drug reaction, ANSM French National Agency for the Safety of Medicines and Health Products, CRPV Regional Pharmacovigilance Centre
Patient and report characteristics
| ADR identification task | Seriousness assessment task | |||
|---|---|---|---|---|
| Development dataset | External validation dataset | Development dataset | External validation dataset | |
| Patient age, years | ||||
| Median (IQR) | 51 (37–62) | 51 (39–63) | 41 (30–58) | 39 (31–56) |
| Patient sex, | ||||
| Female | 8792 (84.5) | 1056 (85.9) | 3732 (78.0) | 313 (77.5) |
| Encoded PTs per report, | ||||
| Median (IQR) | 4 (2–6) | 5 (3–7) | 3 (1–5) | 3 (2–6) |
| Reported drugs per report, | ||||
| Median (IQR) | 1 (1–1) | 1 (1–1) | 1 (1–1) | 1 (1–1) |
| Reports about vaccines, | ||||
| Yes | 275 (2.6) | 13 (1.1) | 275 (5.7) | 13 (3.2) |
| Reports considered serious, | ||||
| Yes | 2454 (23.6) | 136 (11.1) | 1128 (23.6) | 84 (20.8) |
ADRs adverse drug reactions, IQR interquartile range, PTs Preferred Terms
Fig. 3Repartition of the most frequently reported drugs and adverse drug reactions (ADRs) coded in Medical Dictionary for Regulatory Activities Preferred Terms (PTs) across complete datasets. A Mosaic plot of the most frequently reported drugs and B mosaic plot of the most frequently reported PTs
Model comparison metrics, using the prediction threshold maximising F-measure
| Validation | Task | Models | AUC | F-measure | PPV | Sensibility | Specificity | TN | FP | FN | TP |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Internal validation | ADR identification | TF-IDF + LGBM | 0.97 (0.96–0.97) | 0.80 (0.78–0.81) | 0.85 (0.83–0.87) | 0.75 (0.73–0.78) | 1 (1–1) | 353,938 (345,353–360,293) | 703 (612–842) | 1329 (1194–1438) | 4028 (3857–4271) |
| XLM | 0.97 (0.96–0.97) | 0.78 (0.76–0.79) | 0.84 (0.82–0.86) | 0.73 (0.70–0.75) | 1 (1–1) | 353,854 (353,563–354,099) | 736 (609–883) | 1469 (1314–1592) | 3916 (3702–4131) | ||
| Seriousness assessment | FastText + LGBM | 0.85 (0.82–0.87) | 0.63 (0.59–0.68) | 0.58 (0.52–0.69) | 0.69 (0.60–0.82) | 0.85 (0.77–0.91) | 629 (559–682) | 110 (62–166) | 69 (43–94) | 156 (129–194) | |
| CamemBERT + LGBM | 0.84 (0.81–0.87) | 0.63 (0.57–0.67) | 0.56 (0.49–0.65) | 0.71 (0.57–0.81) | 0.83 (0.75–0.90) | 615 (542–672) | 126 (72–183) | 65 (44–96) | 160 (125–192) | ||
| External validation | ADR identification | TF-IDF + LGBM | 0.97 (0.97–0.97) | 0.82 (0.81–0.82) | 0.88 (0.86–0.89) | 0.76 (0.75–0.78) | 1 (1–1) | 287,770 (287,640–287,896) | 502 (444–573) | 1128 (1054–1198) | 3631 (3530–3751) |
| XLM | 0.97 (0.97–0.97) | 0.80 (0.79–0.80) | 0.87 (0.86–0.88) | 0.74 (0.73–0.75) | 1 (1–1) | 288,717 (288,604–288,837) | 530 (476–558) | 1256 (1208–1310) | 3527 (3434–3602) | ||
| Seriousness assessment | FastText + LGBM | 0.87 (0.85–0.89) | 0.65 (0.60–0.70) | 0.58 (0.49–0.69) | 0.77 (0.60–0.88) | 0.85 (0.75–0.92) | 274 (244–299) | 50 (25–80) | 21 (11–36) | 69 (54–79) | |
| CamemBERT + LGBM | 0.86 (0.83–0.89) | 0.63 (0.59–0.68) | 0.56 (0.49–0.67) | 0.74 (0.59–0.84) | 0.84 (0.76–0.91) | 271 (246–294) | 53 (30–78) | 23 (14–37) | 67 (53–76) |
ADR adverse drug reactions, AUC area under the curve, FP false positive, FN false negative, LGBM Light Gradient Boosted Machine, PPV positive predictive value, TF-IDF Term Frequency-Inverse Document Frequency, TN true negative, TP true positive, XLM Cross-lingual Language Model
Fig. 4Adverse drug reaction identification: receiver operating characteristic (ROC) and precision-recall (PR) curves. A ROC curve of internal validation, B PR curve of internal validation, C ROC curve of external validation and D PR curve of external validation
Fig. 5Seriousness assessment: receiver operating characteristic (ROC) and precision-recall (PR) curves. A ROC curve of internal validation, B PR curve of internal validation, with dataset seriousness prevalence at y = 0.24, C ROC curve of external validation and D PR curve of external validation, with dataset seriousness prevalence at y = 0.21
| Artificial intelligence models were successfully developed and showed good performance to automatically pre-code patient adverse drug reaction reports. |
| An artificial intelligence-based pharmacovigilance tool was thus nationally approved and deployed in France in January 2021, in particular to assist professionals with the monitoring of the COVID-19 vaccination campaign. |
| Further studies will be needed to validate the performance of the tool in real-life settings. |