| Literature DB >> 35847000 |
Ni Zhang1,2, Ling-Yun Pan3, Wan-Yi Chen3, Huan-Huan Ji1, Gui-Qin Peng3, Zong-Wei Tang3, Hui-Lai Wang4, Yun-Tao Jia1,2, Jun Gong4.
Abstract
The objective of this study was to apply a machine learning method to evaluate the risk factors associated with serious adverse events (SAEs) and predict the occurrence of SAEs in cancer inpatients using antineoplastic drugs. A retrospective review of the medical records of 499 patients diagnosed with cancer admitted between January 1 and December 31, 2017, was performed. First, the Global Trigger Tool (GTT) was used to actively monitor adverse drug events (ADEs) and SAEs caused by antineoplastic drugs and take the number of positive triggers as an intermediate variable. Subsequently, risk factors with statistical significance were selected by univariate analysis and least absolute shrinkage and selection operator (LASSO) analysis. Finally, using the risk factors after the LASSO analysis as covariates, a nomogram based on a logistic model, extreme gradient boosting (XGBoost), categorical boosting (CatBoost), adaptive boosting (AdaBoost), light-gradient-boosting machine (LightGBM), random forest (RF), gradient-boosting decision tree (GBDT), decision tree (DT), and ensemble model based on seven algorithms were used to establish the prediction models. A series of indicators such as the area under the ROC curve (AUROC) and the area under the PR curve (AUPR) was used to evaluate the model performance. A total of 94 SAE patients were identified in our samples. Risk factors of SAEs were the number of triggers, length of stay, age, number of combined drugs, ADEs occurred in previous chemotherapy, and sex. In the test cohort, a nomogram based on the logistic model owns the AUROC of 0.799 and owns the AUPR of 0.527. The GBDT has the best predicting abilities (AUROC = 0.832 and AUPR = 0.557) among the eight machine learning models and was better than the nomogram and was chosen to establish the prediction webpage. This study provides a novel method to accurately predict SAE occurrence in cancer inpatients.Entities:
Keywords: Global Trigger Tool; antineoplastic drugs; machine learning; prediction; serious adverse events
Year: 2022 PMID: 35847000 PMCID: PMC9277092 DOI: 10.3389/fphar.2022.896104
Source DB: PubMed Journal: Front Pharmacol ISSN: 1663-9812 Impact factor: 5.988
FIGURE 1Overview of the study design and model development.
Trigger items and their PPV.
| No. | Trigger | ADEs | Positive trigger (n) | ADEs (n) | SAEs (n) | PPV (%) |
|---|---|---|---|---|---|---|
| Laboratory | ||||||
| L1 | Hb < 100 g L−1 | Anemia | 94 | 73 | 19 | 77.66 |
| L2 | Platelets count <100*109 L−1 | Thrombocytopenia | 70 | 49 | 4 | 70 |
| L3 | Neutrophils <1.5*109 L−1 | Neutropenia | 76 | 60 | 24 | 78.95 |
| L4 | Leukocyte count <3*109 L−1 | Leukopenia | 128 | 109 | 36 | 85.16 |
| L5 | AST or CB > 2ULN; AST, ALP, and TBI elevated at least one of them >2×baseline) | Drug-induced hepatotoxicity | 23 | 6 | 3 | 26.09 |
| L6 | GFR <60 ml/min or 50% greater than baseline* | Drug-induced renal toxicity | 2 | 1 | 0 | 50 |
| L7 | Blood pressure >140/90 mmHg | Hypertension | 63 | 2 | 0 | 3.17 |
| L8 | Blood glucose >8.9 mmol L−1 | Drug-induced hyperglycaemia | 11 | 3 | 0 | 27.27 |
| L9 | Blood glucose <3 mmol L−1 | Drug-induced hypoglycemia | 4 | 0 | 0 | 0 |
| L10 | Serum kalium >5.5 mmol L−1 | Hyperkalemia | 0 | 0 | 0 | — |
| L11 | Serum kalium <3.0 mmol L−1 | Hypopotassemia | 60 | 39 | 6 | 65 |
| L12 | Serum calcium >2.8 mmol L−1 | Hypercalcemia | 0 | 0 | 0 | — |
| L13 | Serum calcium <2.0 mmol L−1 | Hypocalcemia | 34 | 17 | 4 | 50 |
| L14 | Thyroid-stimulating hormone>4.2 mIU·L−1 | Hypothyroidism | 5 | 2 | 0 | 40 |
| L15 | Thyroid-stimulating hormone <0.34 mIU·L−1 | Hyperthyroidism | 1 | 1 | 0 | 100 |
| L16 | Serum uric acid elevated (Female>360 mol L−1, male>420 μmol L−1) | Hyperuricemia | 30 | 21 | 4 | 70 |
| L17 | Positive qualitative test of urinary protein positive or urinary protein excretion> 150 mg per 24 h | Proteinuria | 3 | 0 | 0 | 0 |
| L18 | Troponin>0.64 ng mL−1 | Myocardial infarction | 1 | 1 | 1 | 100 |
| L19 | BNP >400 pg mL−1 or NT-prBNP>2000 pg mL−1 | Cardiac failure | 13 | 3 | 1 | 23.08 |
| Symptom | ||||||
| S1 | Oral mucositis | Oral mucositis | 7 | 4 | 1 | 57.14 |
| S2 | Fever (body temperature>38.2°C) | Fever | 1 | 1 | 0 | 100 |
| S3 | Diarrhea | Diarrhea | 6 | 4 | 0 | 66.67 |
| S4 | Nausea or vomiting | Nausea or vomiting | 199 | 186 | 0 | 93.47 |
| S5 | Constipation | Constipation | 21 | 15 | 0 | 71.43 |
| S6 | Desquamation; erythema; redness | Hand–foot syndrome | 2 | 2 | 0 | 100 |
| S7 | Rash | Rash | 2 | 2 | 0 | 100 |
| S8 | Paresthesia; neuropathy; pins and needles; pain in hands and feet | Peripheral neuritis | 1 | 1 | 0 | 100 |
| S9 | Extravasation | Extravasation | 0 | 0 | 0 | — |
| Medication | ||||||
| M1 | Corticosteroid and antihistamines use | Allergy | 130 | 4 | 0 | 3.08 |
| M2 | Antithrombotic use | Thromboembolism | 60 | 10 | 1 | 16.67 |
| M3 | Leucovorin use | Methotrexate poisoning | 6 | 0 | 0 | 0 |
| Treatment | ||||||
| T1 | Unplanned emergency treatment, resuscitation, or transfer to ICU | Emergency treatment, resuscitation, or transfer to ICU due to ADEs | 0 | 0 | 0 | — |
| T2 | Unplanned adjust therapeutic regimen | Adjust therapeutic regimen due to ADEs | 17 | 4 | 0 | 23.53 |
BNP, brain natriuretic peptide; ICU, intensive care unit; TSH, thyroid-stimulating hormone; AST, aspartate amino transferase; ALP, alkaline phosphatase; ULN, upper limit of normal; ADEs, Adverse drug events; PPV, positive predictive value.
Classification of drugs leading to the occurrence of SAEs.
| Classification of drugs | Suspected drugs | Number of cases (n) | Percentage (%) | Group percentage (%) |
|---|---|---|---|---|
| Platinum metal | Oxaliplatin | 25 | 3.66 | 24.16 |
| Cisplatin | 60 | 8.78 | ||
| Nedaplatin | 57 | 8.35 | ||
| Carboplatin | 23 | 3.37 | ||
| Antimetabolic drugs | Capecitabine | 24 | 3.51 | 12.15 |
| Gemcitabine | 22 | 3.22 | ||
| Tegafur | 12 | 1.76 | ||
| Fluorouracil | 8 | 1.17 | ||
| Methotrexate | 6 | 0.88 | ||
| Cytarabine | 2 | 0.29 | ||
| Pemetrexed | 7 | 1.02 | ||
| Fludarabine | 2 | 0.29 | ||
| Antineoplastic antibiotics | Pirarubicin | 48 | 7.03 | 15.08 |
| Epirubicin | 41 | 6.00 | ||
| Bleomycin | 11 | 1.61 | ||
| Dactinomycin | 3 | 0.44 | ||
| Plant origin and other derivatives | Paclitaxel | 92 | 13.47 | 31.77 |
| Docetaxel | 33 | 4.83 | ||
| Vindesin | 44 | 6.44 | ||
| Etoposide | 35 | 5.12 | ||
| Irinotecan | 7 | 1.02 | ||
| Vinorelbine | 6 | 0.88 | ||
| Alkylating agent | Cyclophosphamide | 100 | 14.64 | 16.40 |
| Dacarbazine | 12 | 1.76 | ||
| Molecular targeted drugs | Rituximab | 1 | 0.15 | 0.44 |
| Trastuzumab | 1 | 0.15 | ||
| Bevacizumab | 1 | 0.15 |
Characteristics of patients with and without SAEs.
| Characteristic | Training cohort (N = 399) | Test cohort (N = 100) |
| Patients with no SAEs in the training cohort (N = 330) | Patients with SAEs in the training cohort (N = 69) |
|
|---|---|---|---|---|---|---|
| Sex (male) | 164 (41%) | 29 (29%) | 0.026 | 143 (43%) | 21 (30%) | 0.048 |
| Age (year) | 53 (46, 63) | 52 (48, 61) | 0.787 | 53 (47, 63) | 49 (42, 55) | 0.001 |
| Length of stay (days) | 8.0 (6.0, 12.0) | 8.0 (6.0, 10.2) | 0.702 | 7.0 (6.0, 11.0) | 10.0 (7.0, 15.0) | <0.001 |
| Weight (kg) | 58 (53, 63) | 59 (51, 63) | 0.740 | 58 (53, 63) | 57 (52, 62) | 0.800 |
| Off-label drug use (yes) | 104 (26%) | 19 (19%) | 0.143 | 86 (26%) | 18 (26%) | 0.996 |
| Cancer type | 0.508 | 0.007 | ||||
| Breast cancer | 93 (23%) | 28 (28%) | 75 (23%) | 18 (26%) | ||
| Lung cancer | 81 (20%) | 21 (21%) | 75 (23%) | 6 (8.7%) | ||
| Lymphoma | 48 (12%) | 8 (8%) | 35 (11%) | 13 (19%) | ||
| Gastrointestinal | 48 (12%) | 10 (10%) | 45 (14%) | 3 (4.3%) | ||
| Genital system | 64 (16%) | 21 (21%) | 51 (15%) | 13 (19%) | ||
| Others | 65 (16%) | 12 (12%) | 49 (15%) | 16 (23%) | ||
| Cancer stage | 0.494 | 0.873 | ||||
| Ⅰ | 43 (11%) | 13 (13%) | 37 (11%) | 6 (8.7%) | ||
| Ⅱ | 92 (23%) | 25 (25%) | 74 (22%) | 18 (26%) | ||
| Ⅲ | 120 (30%) | 34 (34%) | 100 (30%) | 20 (29%) | ||
| Ⅳ | 144 (36%) | 28 (28%) | 119 (36%) | 25 (36%) | ||
| Operation (yes) | 115 (29%) | 27 (27%) | 0.718 | 98 (30%) | 17 (25%) | 0.399 |
| Basic diseases (yes) | 93 (23%) | 22 (22%) | 0.781 | 79 (24%) | 14 (20%) | 0.514 |
| Radiation therapy (yes) | 21 (5.3%) | 15 (15%) | 0.001 | 19 (5.8%) | 2 (2.9%) | 0.333 |
| ADEs occurred in previous chemotherapy (yes) | 119 (30%) | 35 (35%) | 0.316 | 90 (27%) | 29 (42%) | 0.015 |
| Number of previous chemotherapies | 2.0 (0.0, 4.0) | 2.0 (0.0, 3.0) | 0.369 | 2.00 (0.00, 4.00) | 3.00 (1.00, 5.00) | 0.004 |
| KPS | 0.200 | 0.251 | ||||
| 60 | 3 (0.8%) | 0 (0%) | 3 (0.9%) | 0 (0%) | ||
| 70 | 43 (11%) | 4 (4.0%) | 34 (10%) | 9 (13%) | ||
| 80 | 132 (33.3%) | 34 (34%) | 115 (35.3%) | 17 (25%) | ||
| 90 | 192 (48%) | 50 (50%) | 156 (47%) | 36 (52%) | ||
| 100 | 29 (7.3%) | 12 (12%) | 22 (6.7%) | 7 (10%) | ||
| Number of antineoplastic drugs | 0.892 | 0.160 | ||||
| 1 | 50 (13%) | 10 (10%) | 42 (13%) | 8 (12%) | ||
| 2 | 254 (64%) | 69 (69%) | 214 (65%) | 40 (58%) | ||
| 3 | 53 (13%) | 13 (13%) | 44 (13%) | 9 (13%) | ||
| 4 | 26 (6.5%) | 4 (4.0%) | 20 (6.1%) | 6 (8.7%) | ||
| 5 | 15 (3.8%) | 4 (4.0%) | 9 (2.7%) | 6 (8.7%) | ||
| 6 | 1 (0.3%) | 0 (0%) | 1 (0.3%) | 0 (0%) | ||
| Number of combined drugs | 5.00 (4.00, 7.00) | 5.00 (4.00, 7.00) | 0.531 | 5.00 (4.00, 6.00) | 6.00 (4.00, 8.00) | 0.018 |
| Number of triggers | 2.00 (1.00,3.00) | 2.00 (1.00,3.00) | 0.187 | 1.00 (1.00,2.00) | 3.00 (2.00,4.00) | <0.001 |
FIGURE 2LASSO analysis after the univariate analysis.
Logistic regression for SAEs.
| Variables | B value |
| OR | 95% CI |
|---|---|---|---|---|
| Age | -0.033 | 0.004 | 0.967 | 0.945, 0.990 |
| Length of stay | 0.064 | 0.017 | 1.067 | 1.011, 1.125 |
| Number of triggers | 0.635 | <0.001 | 1.886 | 1.531, 2.323 |
FIGURE 3Nomogram based on the logistic model.
FIGURE 4Nomogram calibration curve and AUROC in the test cohort.
Eight algorithms’ model performance in the test cohort.
| Model | AUROC | SEN | SPE | AUPR | Precision | Recall | F1 |
|---|---|---|---|---|---|---|---|
| RF | 0.805 (0.705, 0.906) | 0.760 | 0.773 | 0.550 | 0.528 | 0.760 | 0.623 |
| XGBoost | 0.754 (0.645, 0.863) | 0.760 | 0.720 | 0.323 | 0.475 | 0.760 | 0.585 |
| DT | 0.650 (0.525, 0.776) | 0.480 | 0.853 | 0.150 | 0.522 | 0.480 | 0.500 |
| GBDT | 0.832 (0.744, 0.920) | 0.720 | 0.853 | 0.557 | 0.621 | 0.720 | 0.667 |
| LightGBM | 0.750 (0.635, 0.864) | 0.840 | 0.640 | 0.485 | 0.438 | 0.840 | 0.575 |
| AdaBoost | 0.782 (0.678, 0.886) | 0.640 | 0.867 | 0.538 | 0.615 | 0.640 | 0.627 |
| CatBoost | 0.817 (0.725, 0.909) | 0.720 | 0.813 | 0.462 | 0.563 | 0.720 | 0.632 |
| Ensemble learning model | 0.797 (0.694, 0.899) | 0.720 | 0.840 | 0.537 | 0.600 | 0.720 | 0.655 |
FIGURE 5ROC curve of eight models in the test cohort.
FIGURE 6PR curve of eight models in the test cohort.
FIGURE 7Ranking of variable importance in the GBDT model.