| Literature DB >> 34957391 |
Ramya Vunikili1,2, Benjamin S Glicksberg3, Kipp W Johnson4, Joel T Dudley4, Lakshminarayanan Subramanian1, Khader Shameer2,4.
Abstract
Objective: Opioids are a class of drugs that are known for their use as pain relievers. They bind to opioid receptors on nerve cells in the brain and the nervous system to mitigate pain. Addiction is one of the chronic and primary adverse events of prolonged usage of opioids. They may also cause psychological disorders, muscle pain, depression, anxiety attacks etc. In this study, we present a collection of predictive models to identify patients at risk of opioid abuse and mortality by using their prescription histories. Also, we discover particularly threatening drug-drug interactions in the context of opioid usage. Methods and Materials: Using a publicly available dataset from MIMIC-III, two models were trained, Logistic Regression with L2 regularization (baseline) and Extreme Gradient Boosting (enhanced model), to classify the patients of interest into two categories based on their susceptibility to opioid abuse. We've also used K-Means clustering, an unsupervised algorithm, to explore drug-drug interactions that might be of concern.Entities:
Keywords: addicition; digital health; machine learing; opiod abuse; predictive modeling
Year: 2021 PMID: 34957391 PMCID: PMC8702828 DOI: 10.3389/frai.2021.742723
Source DB: PubMed Journal: Front Artif Intell ISSN: 2624-8212
Statistics of subjects in different age groups.
| Age group | Age range (Years) | Total no. Of subjects | No. Of subjects with side effects | Proportion |
|---|---|---|---|---|
| 1 | <13 | 269 | 0 | 0.000 |
| 2 | 13–19 | 253 | 7 | 0.028 |
| 3 | 20–40 | 2949 | 254 | 0.114 |
| 4 | 41–50 | 3273 | 203 | 0.062 |
| 5 | 51–65 | 8507 | 251 | 0.030 |
| 6 | 66–75 | 5974 | 26 | 0.004 |
| 7 | 76–85 | 5906 | 7 | 0.001 |
| 8 | >85 | 2861 | 1 | 0.0003 |
List of ICD 9 codes used for identifying subjects with adverse events.
| Broad category | ICD 9 codes |
|---|---|
| Opioid type or combination of opioid type with other drug dependence | 30400 30401 30402 30403 30470 30471 30472 30473 30550 30551 30552 30553 96500 96501 96502 96509 |
| Psychological effects | 30410 30411 30412 30413 30540 30541 30542 30543 |
| Psychostimulant dependence | 30440 30441 30442 30443 |
| Poisoning | 96502 96509 9701 E8500 E8501 E8502 |
| Hallucinogen dependence | 30450 30451 30452 30453 |
| Miscellaneous dependence | 30420 30421 30422 30423 30430 30431 30432 30433 |
FIGURE 1Correlation of features.
FIGURE 2Cumulative explained variance across different principle components.
FIGURE 3(A) Baseline—ROC curves (before and after performing SMOTE and PCA). (B) Baseline—Precision Recall (PR) curves (before and after performing SMOTE and PCA).
Summary of best parameters for XGBoost.
| Parameter | Value |
|---|---|
| learning rate | 0.1 |
| max depth | 10 |
| n estimators | 200 |
| objective | “binary: logistic” |
| Base score | 0.5 |
| booster | “gbtree” |
| max delta step | 0 |
| colsample bylevel | 1 |
| colsample bynode | 1 |
| req alpha | 0 |
| req lambda | 1 |
| scale pos weight | 1 |
| gamma | 0 |
FIGURE 4(A) Enhanced model—ROC curves (before and after performing SMOTE and PCA). (B) Enhanced model—Precision Recall (PR) curves (before and after performing SMOTE and PCA).
Summary of performance for predictive modeling tasks.
| Model | Target variable | Precision (PPV) | Recall (%) | NPV (%) | F1 score(%) | Accuracy (%) |
|---|---|---|---|---|---|---|
| Logistic Regression | Side effects | 78.45 | 74.91 | 75.60 | 76.64 | 77.17 |
| XGBoost | Side effects | 92.64 | 95.45 | 95.30 | 94.02 | 94.35 |
| XGBoost | Mortality | 66.67 | 31.82 | 76.20 | 43.07 | 74.83 |
FIGURE 5Elbow plot for K-means clustering.
FIGURE 6Importance of features.
FIGURE 7Confusion matrix.