| Literature DB >> 35579811 |
Yiqing Zhao1, Yue Yu2, Hanyin Wang1, Yikuan Li1, Yu Deng1, Guoqian Jiang2, Yuan Luo3.
Abstract
Monitoring adverse drug events or pharmacovigilance has been promoted by the World Health Organization to assure the safety of medicines through a timely and reliable information exchange regarding drug safety issues. We aim to discuss the application of machine learning methods as well as causal inference paradigms in pharmacovigilance. We first reviewed data sources for pharmacovigilance. Then, we examined traditional causal inference paradigms, their applications in pharmacovigilance, and how machine learning methods and causal inference paradigms were integrated to enhance the performance of traditional causal inference paradigms. Finally, we summarized issues with currently mainstream correlation-based machine learning models and how the machine learning community has tried to address these issues by incorporating causal inference paradigms. Our literature search revealed that most existing data sources and tasks for pharmacovigilance were not designed for causal inference. Additionally, pharmacovigilance was lagging in adopting machine learning-causal inference integrated models. We highlight several currently trending directions or gaps to integrate causal inference with machine learning in pharmacovigilance research. Finally, our literature search revealed that the adoption of causal paradigms can mitigate known issues with machine learning models. We foresee that the pharmacovigilance domain can benefit from the progress in the machine learning field.Entities:
Mesh:
Year: 2022 PMID: 35579811 PMCID: PMC9114053 DOI: 10.1007/s40264-022-01155-6
Source DB: PubMed Journal: Drug Saf ISSN: 0114-5916 Impact factor: 5.228
Fig. 1Relationships between pharmacovigilance data sources, analytical approaches, pharmacovigilance tasks, and causal inference paradigms. Each data source is commonly analyzed by specific analytical approaches depending on the characteristics of data in those data sources. Each pharmacovigilance task is also associated with specific analytical approaches. Causal inference paradigms are integrated with different analytical approaches and applied to pharmacovigilance tasks. ADE adverse drug event, LSTM long short-term memory, NLP natural language processing, RNN recurrent neural network, RWD real-world data, SVM support vector machine
Data sources for pharmacovigilance, analytical approaches, advantages, and biases
| Analytical approaches | Pharmacovigilance tasks | Advantages and biases |
|---|---|---|
| Association rule mining [ | Drug–event pair extraction [ ADE detection [ ADE prediction (post-marketing) [ | Advantages: 1. Large volume of data worldwide. Create potentials for machine learning models to be trained 2. Provide other related information such as demographic and indication data 3. More effective at detecting rare ADEs 4. Publicly accessible Biases: 1. No population denominator who takes the medications. Could not calculate incidence rates of ADEs. Limited ability to provide causal evaluation 2. Suffer from under-reporting and stimulated reporting. May cause bias in machine learning 3. Lower reporting rates for older products 4. May have duplicate reports 5. Reporters have diverse background, such as pharmaceuticals companies, physicians, patients, and lawyers, which may pose challenges in data standardization. May undermine machine learning model transportability 6. It will take a long time for data collection, thus there may be a delay in detection of ADEs |
| Disproportionality [ | ||
| Network analysis [ | ||
| Clustering [ | ||
| SVM, Bayesian classifier, decision tree and/or Random Forest [ | ||
| Disproportionality [ | Drug–event pair extraction [ ADE detection [ ADE prediction (post-marketing) [ | Advantages: 1. Provides a population denominator who has taken the same medications, which enables adoption of study designs for causal effect estimation 2. The data quality in well-curated RWD databases is better than SRS 3. Less duplicated and missing data in well-curated RWD databases 4. Less adverse event unreported rate 5. RWD databases could provide more complete clinical information such as lab test results. Provide better causal inference ability compared with SRS Biases: 1. Less sample size than SRS. May diminish predictive power of machine learning models 2. EHRs contain protected health information of the patients. Thus, it could not be opened to the public, also difficult to share between institutions 3. EHRs mainly record drug usage information in the hospital. Thus, EHRs work better in inpatient ADE detection than outpatient. May diminish generalizability of machine learning models |
| Cohort/case-based study [ | ||
| Sequence/temporal analysis [ | ||
| SVM, Bayesian classifier, decision tree and/or Random Forest [ | ||
| NLP relation extraction [ | ||
| Neural network [ | ||
| Association rule mining [ | Drug–event pair extraction [ ADE detection [ | Advantages: 1. Huge data size with rapid growth. Create potentials for machine learning models to be trained 2. Open access 3. The content is patient centric 4. Could conduct a “real-time” ADE monitor Biases: 1. The contents are not from experts, thus it may affect data quality and reliability 2. Using NLP to extract all the ADE-related data from texts is challenging. NLP techniques are essential before applying to any machine learning task or causal inference paradigm 3. Could not calculate ADE incidence rate. Limited ability to provide causal evaluation 4. Still need to be further confirmed by other evidence or analysis 5. Ethical issues may exist |
| SVM, Bayesian classifier, decision tree and/or Random Forest [ | ||
| NLP relation extraction [ | ||
| Neural network [ | ||
| Clustering [ | Drug–event pair extraction [ ADE detection [ | Advantages: 1. Data quality and reliability are better 2. Literature is easily accessible. Biases: 1. Data size is smaller than social media. May diminish predictive power of machine learning models 2. Timeliness is worse because of the peer-review and publishing process 3. Detected ADEs still need to be further confirmed by other evidence or analysis |
| SVM, Bayesian classifier, decision tree and/or Random Forest [ | ||
| NLP relation extraction [ | ||
| Neural network [ | ||
| SVM, Bayesian classifier, decision tree and/or Random Forest [ | ADE prediction (pre-marketing) [ | Advantages: 1. Most of the databases are open to the public 2. Better data structure and data standardization level. Create potentials for machine learning models to be trained Biases: 1. Need for a complicated paradigm to integrate and analyze the data 2. The graph structures in knowledge bases lack causal components, making causal interpretation difficult 3. Many false-positive results may impact the prediction accuracy 4. ADE prediction results are based on theoretical algorithms, which needs other RWD or evidence to confirm |
| Neural network [ | ||
ADE adverse drug event, EHR electronic health record, NLP natural language processing, RWD real-world data, SRS spontaneous reporting system, SVM support vector machine
Categorization of papers reviewed regarding data sources and machine learning methods used for four causal inference paradigms
| Machine learning methods | Data source | |
|---|---|---|
| SVM, Bayesian classifier, decision tree and/or Random Forest [ | SRS [ Social media [ RWD [ | |
| Regression [ | ||
| Neural network [ | ||
| SVM, Bayesian classifier, decision tree and/or Random Forest [ | RWD [ Simulated data [ | |
| Ensemble (boosting/bagging) [ | ||
| Regression [ | ||
| Neural network [ | ||
| SVM, Bayesian classifier, decision tree and/or Random Forest | ||
| Link prediction [ | Knowledge bases [ RWD [ | |
| Recommendation systems [ | ||
| Classification [ | ||
| Graph embedding | ||
| Link prediction [ | ||
| Recommendation Systems [ | ||
| Regression | ||
| Classification [ | ||
| Neural network | ||
| Link prediction [ | ||
| Recommendation systems [ | ||
| Predictive modeling [ | ||
| Ensemble (boosting/bagging) | ||
| Classification [ | ||
| Link prediction [ | ||
| Clustering [ | RWD [ Simulated data [ Social network data [ | |
| Decision tree [ | ||
| Neural network [ | ||
| New algorithms [ | ||
RWD real-world data, SRS spontaneous reporting system, SVM support vector machine
Papers for “propensity score matching” and “instrumental variables” are not applied in the field of pharmacovigilance. Papers for “graph-based causal inference” still lacks a clear causal interpretation from a graph perspective
Fig. 2Graph representations of relationships between X, Y, Z, and U under instrumental variable assumptions
Fig. 3Year and continent distribution of 19 papers most relevant to the intersection of machine learning, causal inference, and pharmacovigilance
| Most existing data sources and tasks for pharmacovigilance were not designed for causal inference. |
| Pharmacovigilance was lagging in adopting machine learning-causal inference integrated models. |
| Adoption of causal paradigms can mitigate known issues with machine learning models, which could further enhance the use of machine learning in pharmacovigilance tasks. |