Behrooz Davazdahemami1, Dursun Delen2. 1. Department of Management Science and Information Systems, Oklahoma State University, Stillwater, Oklahoma, USA. 2. Department of Management Science and Information Systems, Center for Health Systems Innovation, Oklahoma State University, Stillwater, Oklahoma, USA.
Abstract
Objectives: This study extends prior research by combining a chronological pharmacovigilance network approach with machine-learning (ML) techniques to predict adverse drug events (ADEs) based on the drugs' similarities in terms of the proteins they target in the human body. The focus of this research, though, is particularly centered on predicting the drug-ADE associations for a set of 8 common and high-risk ADEs. Materials and methods: large collection of annotated MEDLINE biomedical articles was used to construct a drug-ADE network, and the network was further equipped with information about drugs' target proteins. Several network metrics were extracted and used as predictors in ML algorithms to predict the existence of network edges (ie, associations or relationships). Results: Gradient boosted trees (GBTs) as an ensemble ML algorithm outperformed other prediction methods in identifying the drug-ADE associations with an overall accuracy of 92.8% on the validation sample. The prediction model was able to predict drug-ADE associations, on average, 3.84 years earlier than they were actually mentioned in the biomedical literature. Conclusion: While network analysis and ML techniques were used in separation in prior ADE studies, our results showed that they, in combination with each other, can boost the power of one another and predict better. Moreover, our results highlight the superior capability of ensemble-type ML methods in capturing drug-ADE patterns compared to the regular (ie, singular), ML algorithms.
Objectives: This study extends prior research by combining a chronological pharmacovigilance network approach with machine-learning (ML) techniques to predict adverse drug events (ADEs) based on the drugs' similarities in terms of the proteins they target in the human body. The focus of this research, though, is particularly centered on predicting the drug-ADE associations for a set of 8 common and high-risk ADEs. Materials and methods: large collection of annotated MEDLINE biomedical articles was used to construct a drug-ADE network, and the network was further equipped with information about drugs' target proteins. Several network metrics were extracted and used as predictors in ML algorithms to predict the existence of network edges (ie, associations or relationships). Results: Gradient boosted trees (GBTs) as an ensemble ML algorithm outperformed other prediction methods in identifying the drug-ADE associations with an overall accuracy of 92.8% on the validation sample. The prediction model was able to predict drug-ADE associations, on average, 3.84 years earlier than they were actually mentioned in the biomedical literature. Conclusion: While network analysis and ML techniques were used in separation in prior ADE studies, our results showed that they, in combination with each other, can boost the power of one another and predict better. Moreover, our results highlight the superior capability of ensemble-type ML methods in capturing drug-ADE patterns compared to the regular (ie, singular), ML algorithms.
Authors: Hae Reong Kim; MinDong Sung; Ji Ae Park; Kyeongseob Jeong; Ho Heon Kim; Suehyun Lee; Yu Rang Park Journal: Medicine (Baltimore) Date: 2022-06-24 Impact factor: 1.817
Authors: Rebecca N Jerome; Meghan Morrison Joly; Nan Kennedy; Jana K Shirey-Rice; Dan M Roden; Gordon R Bernard; Kenneth J Holroyd; Joshua C Denny; Jill M Pulley Journal: Drug Saf Date: 2020-06 Impact factor: 5.606
Authors: Bence Ágg; Péter Ferdinandy; Mátyás Pétervári; Bettina Benczik; Olivér M Balogh; Balázs Petrovich Journal: Drug Saf Date: 2022-10-06 Impact factor: 5.228