| Literature DB >> 33986685 |
Ze Yu1, Huanhuan Ji2, Jianwen Xiao3, Ping Wei4, Lin Song2, Tingting Tang5, Xin Hao6, Jinyuan Zhang1, Qiaona Qi1, Yuchen Zhou1, Fei Gao1, Yuntao Jia2.
Abstract
The aim of this study was to apply machine learning methods to deeply explore the risk factors associated with adverse drug events (ADEs) and predict the occurrence of ADEs in Chinese pediatric inpatients. Data of 1,746 patients aged between 28 days and 18 years (mean age = 3.84 years) were included in the study from January 1, 2013, to December 31, 2015, in the Children's Hospital of Chongqing Medical University. There were 247 cases of ADE occurrence, of which the most common drugs inducing ADEs were antibacterials. Seven algorithms, including eXtreme Gradient Boosting (XGBoost), CatBoost, AdaBoost, LightGBM, Random Forest (RF), Gradient Boosting Decision Tree (GBDT), and TPOT, were used to select the important risk factors, and GBDT was chosen to establish the prediction model with the best predicting abilities (precision = 44%, recall = 25%, F1 = 31.88%). The GBDT model has better performance than Global Trigger Tools (GTTs) for ADE prediction (precision 44 vs. 13.3%). In addition, multiple risk factors were identified via GBDT, such as the number of trigger true (TT) (+), number of doses, BMI, number of drugs, number of admission, height, length of hospital stay, weight, age, and number of diagnoses. The influencing directions of the risk factors on ADEs were displayed through Shapley Additive exPlanations (SHAP). This study provides a novel method to accurately predict adverse drug events in Chinese pediatric inpatients with the associated risk factors, which may be applicable in clinical practice in the future.Entities:
Keywords: Chinese children; adverse drug event (s); machine learning; pediatric; prediction
Year: 2021 PMID: 33986685 PMCID: PMC8111537 DOI: 10.3389/fphar.2021.659099
Source DB: PubMed Journal: Front Pharmacol ISSN: 1663-9812 Impact factor: 5.810
Characteristics of patients with and without ADEs.
| Variable | Derivation cohort (N = 1,396) | Test cohort (N = 350) |
| Total (n = 1,746) | Patients with ADEs (n = 221) | Patients with no ADEs (n = 1,525) |
|---|---|---|---|---|---|---|
| Demographics | ||||||
| Female (%) | 33.6% | 40.6% | 0.02 | 35.0% | 32.6% | 35.3% |
| Age (y) | 3.8 ± 3.89 | 3.72 ± 3.89 | 0.35 | 3.84 ± 3.89 | 3.72 ± 4.12 | 3.86 ± 3.85 |
| Weight (kg) | 16.30 ± 11.66 | 15.54 ± 10.72 | 0.39 | 16.15 ± 11.48 | 15.37 ± 11.51 | 16.26 ± 11.47 |
| Height (cm) | 95.01 ± 29.37 | 93.46 ± 29.41 | 0.36 | 94.70 ± 29.38 | 91.97 ± 31.40 | 95.10 ± 29.06 |
| BMI (kg/cm2) | 16.49 ± 3.63 | 16.27 ± 3.13 | 0.84 | 16.45 ± 3.53 | 16.17 ± 2.85 | 16.49 ± 3.62 |
| Developmental and nutritional status | ||||||
| Fine | 513 (36.7%) | 126 (36.00%) | 0.41 | 639 (36.6%) | 70 (31.7%) | 569 (37.3%) |
| Medium | 757 (54.2%) | 187 (53.43%) | 944 (54.1%) | 123 (55.7%) | 821 (53.8%) | |
| Lower middle | 99 (7.1%) | 25 (7.14%) | 124 (7.1%) | 19 (8.6%) | 105 (6.9%) | |
| Others | 27 (1.9%) | 12 (3.43%) | 39 (2.2%) | 9 (4.1%) | 26 (2.0%) | |
| Status at birth | ||||||
| Natural delivery | 382 (27.4%) | 84 (24%) | 0.22 | 1,280 (73.3%) | 171 (77.4%) | 1,110 (72.7%) |
| Cesarean | 1,014 (72.6%) | 266 (76%) | 466 (26.7%) | 50 (22.6%) | 416 (27.3%) | |
| Premature birth | 63 (4.5%) | 16 (4.57%) | 0.92 | 79 (4.5%) | 14 (6.3%) | 65 (4.3%) |
| Weight at birth | 3.22 ± 0.50 | 3.22 ± 0.51 | 0.83 | 3.22 ± 0.50 | 3.18 ± 0.49 | 3.22 ± 0.51 |
| Admission | ||||||
| Length of stay (d) | 7.90 ± 5.51 | 7.58 ± 4.32 | 0.42 | 7.83 ± 5.29 | 10.23 ± 8.03 | 7.48 ± 4.66 |
| Number of medical diagnoses | 2.94 ± 1.89 | 3.11 ± 2.02 | 0.92 | 2.97 ± 1.89 | 2.92 ± 1.98 | 2.83 ± 1.71 |
| Number of admissions | 1.80 ± 1.43 | 1.88 ± 1.39 | 0.67 | 1.81 ± 1.42 | 2.07 ± 1.60 | 1.77 ± 1.39 |
| Number of admissions in the previous 1 year | 0.47 ± 1.02 | 0.59 ± 1.12 | 0.26 | 0.49 ± 1.04 | 0.61 ± 1.01 | 0.47 ± 1.04 |
| Treatment | ||||||
| Surgical operation | 422 (30.2%) | 90 (25.7%) | 0.11 | 506 (29.0%) | 64 (30.3%) | 442 (28.8%) |
| Number of drugs | 14.14 ± 6.82 | 14.31 ± 6.53 | 0.84 | 14.18 ± 6.77 | 18.82 ± 9.02 | 13.51 ± 6.01 |
| Number of doses | 114.24 ± 109.94 | 114.10 ± 87.77 | 0.83 | 113.94 ± 104.97 | 189.94 ± 187.00 | 102.92 ± 83.30 |
| Antibacterial use | 720 (51.6%) | 194 (55.43%) | 0.22 | 914 (52.3%) | 146 (66.1%) | 768 (50.4%) |
| Sedative analgesic use | 587 (42.0%) | 122 (34.86%) | 0.02 | 709 (40.6%) | 95 (43.0%) | 614 (40.3%) |
| Anesthetic use | 798 (57.2%) | 199 (56.86%) | 0.97 | 997 (57.1%) | 117 (52.5%) | 880 (57.7%) |
| Other | ||||||
| Number of TT (+) | 1.42 ± 1.49 | 1.56 ± 1.62 | 0.88 | 1.45 ± 1.51 | 2.95 ± 2.02 | 1.23 ± 1.29 |
| ADEs | 177 (79.1%) | 44 (19.9%) | 0.99 | 221 (12.7%) | 221 (100%) | 0 |
Abbreviations: BMI, body mass index; TT, trigger true; ADEs, adverse drug events
Notes: Data for variables are presented as mean ± variance, excluding those presented as cases and percentage (%). p-value is calculated for comparing the difference between derivation and test cohorts, p < 0.05 is considered significant.
Classification of drugs leading to occurrence of ADEs.
| Classification of medicines | Types of medicines | Number of cases | Percentage (%) |
|---|---|---|---|
| Anti-infective drugs | Antibacterials | 86 | 35.9 |
| Antivirals | 3 | ||
| Anti-tuberculosis drugs | 1 | ||
| Nervous system drugs | Anti-epileptics | 12 | 27.1 |
| Anti-anxiety drugs | 1 | ||
| Sedatives | 55 | ||
| Digestive system drugs | Acid inhibitors | 7 | 4.8 |
| Antidiarrheal drugs | 5 | ||
| Hormonal and endocrine system drugs | Glucocorticoids | 6 | 4.4 |
| Insulin | 5 | ||
| Drugs to regulate water, electrolyte, and acid–base balance | Potassium chloride, glucose | 10 | 4.0 |
| Urological system drugs | Diuretics | 4 | 2.0 |
| Dehydrating agent | 1 | ||
| Antipyretic, analgesic, and anti-inflammatory drugs | Antipyretics | 4 | 1.6 |
| Cardiovascular medicines | Anti–heart failure drugs | 1 | 1.2 |
| Anti-hypertensives | 1 | ||
| Anti-shock drugs | 1 | ||
| Vitamins, minerals, amino acids, etc. | Minerals, amino acids | 3 | 1.2 |
| Hematology and hematopoietic system drugs | Anticoagulants | 3 | 1.2 |
| Anti-allergic reaction drugs | Anti-allergy drugs | 2 | 0.8 |
| Others | Immunomodulators | 14 | 15.9 |
| Chinese herbal medicine/Chinese medicine injections | 6 | ||
| Mistake intake of paraquat, acetochlor, cocklebur | 6 | ||
| Blood products | 5 | ||
| Anesthetics | 2 | ||
| Medical tapes | 1 | ||
| Unspecified | 6 |
FIGURE 1Importance score ranking for risk factors.
FIGURE 2SHAP values of the important risk factors. The dot color is redder when the feature value gets higher and bluer when the feature value gets lower. When the SHAP value gets higher, the impact of the variable on model output is larger.
Model performance using seven algorithms.
| Model | Precision | Recall | F1 |
|---|---|---|---|
| GBDT | 44.00 | 25.00 | 31.88 |
| LightGBM | 27.27 | 6.82 | 10.91 |
| AdaBoost | 41.18 | 15.91 | 22.95 |
| RF | 23.08 | 13.64 | 17.14 |
| CatBoost | 46.15 | 13.64 | 21.05 |
| TPOT | 75.00 | 13.64 | 23.08 |
| XGBoost | 34.62 | 20.45 | 25.71 |
FIGURE 3Visual presentation of model performance based on seven algorithms. (A) displays the precision–recall curve. (B) displays the ROC curve. When the area under curve is closer to “1,” the performance of model classification and prediction is better. Abbreviations: RF, Random Forest; GBDT, Gradient Boosting Decision Tree; XGBoost, eXtreme Gradient Boosting.