| Literature DB >> 36012968 |
Antonio Ramón1, Marta Zaragozá1, Ana María Torres2, Joaquín Cascón2, Pilar Blasco1, Javier Milara1,3,4, Jorge Mateo2.
Abstract
Among the IL-6 inhibitors, tocilizumab is the most widely used therapeutic option in patients with SARS-CoV-2-associated severe respiratory failure (SRF). The aim of our study was to provide evidence on predictors of poor outcome in patients with COVID-19 treated with tocilizumab, using machine learning (ML) techniques. We conducted a retrospective study, analyzing the clinical, laboratory and sociodemographic data of patients admitted for severe COVID-19 with SRF, treated with tocilizumab. The extreme gradient boost (XGB) method had the highest balanced accuracy (93.16%). The factors associated with a worse outcome of tocilizumab use in terms of mortality were: baseline situation at the start of tocilizumab treatment requiring invasive mechanical ventilation (IMV), elevated ferritin, lactate dehydrogenase (LDH) and glutamate-pyruvate transaminase (GPT), lymphopenia, and low PaFi [ratio between arterial oxygen pressure and inspired oxygen fraction (PaO2/FiO2)] values. The factors associated with a worse outcome of tocilizumab use in terms of hospital stay were: baseline situation at the start of tocilizumab treatment requiring IMV or supplemental oxygen, elevated levels of ferritin, glutamate-oxaloacetate transaminase (GOT), GPT, C-reactive protein (CRP), LDH, lymphopenia, and low PaFi values. In our study focused on patients with severe COVID-19 treated with tocilizumab, the factors that were weighted most strongly in predicting worse clinical outcome were baseline status at the start of tocilizumab treatment requiring IMV and hyperferritinemia.Entities:
Keywords: COVID-19; SARS-CoV-2; cytokine release syndrome; machine learning; tocilizumab
Year: 2022 PMID: 36012968 PMCID: PMC9410189 DOI: 10.3390/jcm11164729
Source DB: PubMed Journal: J Clin Med ISSN: 2077-0383 Impact factor: 4.964
Figure 1The figure shows the scheme followed in the learning and testing process of this work.
Summary of the mean values and standard deviation of balanced accuracy, recall, precision, F1 score, AUC, MCC, DYI and Kappa index of the machine learning models and the proposed method implemented in this article.
| Methods | Balanced Accuracy | Recall | Precision | |
|---|---|---|---|---|
| SVM | 82.77 ± 0.47 | 82.87 ± 0.53 | 82.18 ± 0.57 | 82.52 ± 0.62 |
| BLDA | 80.91 ± 0.81 | 81.02 ± 0.78 | 80.31 ± 0.75 | 80.66 ± 0.73 |
| DT | 83.54 ± 0.63 | 83.63 ± 0.68 | 83.01 ± 0.64 | 83.32 ± 0.62 |
| KNN | 86.86 ± 0.54 | 86.98 ± 0.51 | 86.59 ± 0.46 | 86.79 ± 0.47 |
| XGB | 93.16 ± 0.25 | 93.25 ± 0.31 | 92.49 ± 0.27 | 92.87 ± 0.29 |
|
|
|
|
|
|
| SVM | 0.82 ± 0.02 | 73.45 ± 0.58 | 82.78 ± 0.61 | 72.91 ± 0.59 |
| BLDA | 0.80 ± 0.02 | 71.79 ± 0.73 | 80.92 ± 0.72 | 71.89 ± 0.71 |
| DT | 0.83 ± 0.02 | 74.18 ± 0.67 | 83.55 ± 0.65 | 73.71 ± 0.68 |
| KNN | 0.86 ± 0.02 | 76.97 ± 0.48 | 86.87 ± 0.45 | 77.15 ± 0.46 |
| XGB | 0.93 ± 0.02 | 84.41 ± 0.25 | 93.17 ± 0.26 | 83.91 ± 0.28 |
Abbreviations: SVM: support vector machine, BLDA: Bayesian linear discriminant analysis, DT: decision tree, KNN: K-nearest neighbour, XGB: extreme gradient boost, AUC: area under curve, MCC: Matthew’s correlation coefficient, DYI: degenerated Younden index.
Figure 2ROC curves for the five assessed machine learning predictors. Abbreviations: ROC: receiver operating characteristic, XGB: extreme gradient boost, KNN: K-nearest neighbour, DT: decision tree, SVM: support vector machine, BLDA: Bayesian linear discriminant analysis.
Figure 3The figure shows the radar plot of the training phase (left) and validation (right) for the prediction of mortality in COVID-19 patients treat to tocilizumab. Abbreviations: AUC: area under curve, MCC: Matthew’s correlation coefficient, XGB: extreme gradient boost, KNN: K-nearest neighbour, DT: decision tree, SVM: support vector machine, BLDA: Bayesian linear discriminant analysis.
Parameters used for the implementation of different machine learning methods.
| Method | |
|---|---|
| SVM | C = 1.0 |
| sigma = 0.5 | |
| Numerical tolerance = 0.001 | |
| Iteration limit = 100 | |
| Kernel function: Linear kernel, Gaussian, Quadratic and Cubic | |
| BLDA | Kernel: Bayesian |
| DT | Minimum number of instances in leaves = 4 |
| Minimum number of instances in internal nodes = 6 | |
| Maximum depth = 100 | |
| KNN | Number of neighbours = 20 |
| Distance metric: Euclidean | |
| Weight: Uniform | |
| XGB | Base estimator: tree |
| Maximum number of splits = 20 | |
| Learning rate = 0.1 | |
| Number of learners = 50 |
Abbreviations: SVM: support vector machine, BLDA: Bayesian linear discriminant analysis, DT: decision tree, KNN: K-nearest neighbour, XGB: extreme gradient boost.
Basal clinical data of patients. Data are n (%) or median (IQR), unless otherwise stated.
| Variable | Cohort |
|---|---|
| Number of patients | 67 |
| Age (years) (IQR) | 65 (57–74.5) |
| Male (%) | 43 (64.2) |
| Exitus, | 24 (35.8) |
| Hospital admission (days) after tocilizumab administration (IQR) | 14 (10–29.5) |
| IMV, | 13 (19.4) |
| 7-day mortality, | 7 (10.4) |
| 21-day mortality, | 10 (14.9) |
| Antivirals drugs, | 30 (44.8) |
| Lopinavir/ritonavir, | 27 (40.3) |
| Remdesivir, | 3 (4.5) |
| Hydroxychloroquine, | 34 (50.7) |
| Interferon-beta, | 7 (10.4) |
| Anakinra, | 33 (49.2) |
| Baseline situation at the start of tocilizumab treatment requiring supplemental oxygen, | 60 (89.5) |
| Baseline situation at the start of tocilizumab treatment requiring IMV, | 7 (10.4) |
| Smoker/ex-smoker, | 14 (20.9) |
| Diabetes, | 18 (26.9) |
| COPD, | 4 (5.9) |
| Arterial hypertension, | 34 (50.7) |
| Dyslipemia, | 21 (31.3) |
| Obesity [BMI ≥ 30 kg/m2], | 4 (5.9) |
| Ischemic heart disease, | 5 (7.5) |
| Chronic kidney disease, | 2 (2.9) |
| Lymphocytes (10 × 9/L) (IQR) | 0.95 (0.5–1.3) |
| CRP (mg/L) (IQR) | 12.5 (5.8–21.1) |
| LDH (U/L) (IQR) | 759 (538–934) |
| Procalcitonin (ng/mL) (IQR) | 3.49 (0.3–10.9) |
| Ferritin (µg/L) (IQR) | 828 (412–1388) |
| FiO2 (%) (IQR) | 37.5 (28–50) |
| PaFi (IQR) | 198.5 (135–251.5) |
| GPT (U/L) (IQR) | 39 (26–66.5) |
| GOT(U/L) (IQR) | 45 (31–67) |
Abbreviations: IQR: interquartile range, IMV: invasive mechanical ventilation, COPD: chronic obstructive pulmonary disease, BMI: body mass index, CRP: C-reactive protein, LDH: lactate dehydrogenase, FiO2: inspired oxygen fraction, PaFi: ratio between arterial oxygen pressure and inspired oxygen fraction (PaO2/FiO2), GPT: glutamate-pyruvate transaminase, GOT: glutamate-oxaloacetate transaminase.