| Literature DB >> 35692595 |
Jingyi Li1, Yaling Li1,2, Yali Gao3, Xueli Niu1, Mingsui Tang1, Chang Fu1, Zihan Wang1, Jiayi Liu1, Bing Song1,2, Hongduo Chen1, Xinghua Gao1, Xiuhao Guan1.
Abstract
Background: Invasive candidiasis is a common cancer-related complication with a high fatality rate. If patients with a high risk of dying in the hospital are identified early and accurately, physicians can make better clinical judgments. However, epidemiological analyses and mortality prediction models of cancer patients with invasive candidiasis remain limited. Method: A set of 40 potential risk factors was acquired in a sample of 258 patients with both invasive candidiasis and cancer. To begin, risk factors for Candida albicans vs. non-Candida albicans infections and persistent vs. nonpersistent Candida infections were analysed using classic statistical methods. Then, we applied three machine learning models (random forest, logistic regression, and support vector machine) to identify prognostic indicators related to mortality. Prediction performance of different models was assessed by precision, recall, F1 score, accuracy, and AUC.Entities:
Mesh:
Substances:
Year: 2022 PMID: 35692595 PMCID: PMC9185171 DOI: 10.1155/2022/7896218
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.246
Figure 1Flow chart of machine learning.
Figure 2Tumour types of 258 patients.
Figure 3Candida species in 258 patients.
Figure 4The distribution of blood bacterial infection in 99 patients.
Risk factors for Candida albicans and non-Candida albicans infections.
| Candida albicans % ( | Non-Candida albicans % ( | Statistic |
| |
|---|---|---|---|---|
| Male | 23 (69.70%) | 148 (65.78) | 0.445 | 0.657 |
| Age (years)a | 61.00 (56.00, 69.00) | 63.00 (54.00, 70.00) | -0.191 | 0.848 |
| Length of stay (days)a | 30.00 (23.00, 46.00) | 39.00 (28.00, 62.00) | 2.253 | 0.024 |
| Length of stay in ICUa | 0.00 (0.00, 4.00) | 1.00 (0.00, 8.00) | 1.658 | 0.097 |
| Solid tumour | 33 (100%) | 213 (94.67%) | 1.359 | 0.174 |
| Diabetes | 7 (21.21%) | 25 (11.11%) | 1.644 | 0.100 |
| Pancreatitisb | 1 (3.03%) | 1 (0.44%) | — | 0.240 |
| Total parenteral nutrition | 25 (75.76%) | 209 (92.89%) | 3.164 | 0.002 |
| Renal failure | 1 (3.03%) | 11 (4.89%) | — | 1 |
| Recent surgery (within 2 weeks) | 20 (60.61%) | 166 (73.78%) | 1.575 | 0.115 |
| Use immunosuppressantsb | 6 (18.18%) | 26 (11.56%) | — | 0.267 |
| ICU | 17 (51.52%) | 83 (36.89%) | 1.611 | 0.107 |
| Hypoproteinemia | 27 (81.82%) | 161 (71.56%) | 1.238 | 0.216 |
| Invasive mechanical ventilation | 14 (42.42%) | 70 (31.11%) | 1.295 | 0.195 |
| Urinary catheter | 31 (93.94%) | 194 (86.22%) | 1.240 | 0.215 |
| Gastric tube | 14 (42.42%) | 144 (64.00%) | 2.376 | 0.018 |
| Central venous catheter | 20 (60.61%) | 140 (62.22%) | 0.179 | 0.858 |
| Drainage catheter | 19 (57.58%) | 189 (84.00%) | 3.586 | <0.001 |
| Endotoxic shockb | 5 (15.15%) | 22 (9.78%) | — | 0.360 |
| Multiple hospitalisations within 2 years (>2 times) | 22 (66.67%) | 140 (62.22%) | 0.493 | 0.622 |
| Persistent fungal infection | 12 (36.36%) | 73 (32.44%) | 0.447 | 0.655 |
| Serum albumin levela (g/l) | 27.80 (24.60, 30.80) | 25.90 (22.60, 29.00) | -1.659 | 0.097 |
| Serum creatinine levela ( | 57.00 (45.00, 72.00) | 67.00 (40.00, 88.00) | 0.990 | 0.322 |
| Leukocyte counta (10^9/l) | 6.54 (4.56, 9.52) | 12.26 (7.46, 14.19) | 4.829 | <0.001 |
| Total bilirubin levela ( | 14.00 (8.80, 26.70) | 17.30 (10.60, 56.10) | 0.999 | 0.318 |
| Neutrophil counta (10^9/l) | 5.30 (3.73, 7.95) | 9.22 (5.13, 11.68) | 3.591 | <0.001 |
| Lymphocyte counta (10^9/l) | 0.64 (0.44, 0.90) | 0.85 (0.53, 1.13) | 2.017 | 0.044 |
| CRPa (mg/l) | 85.70 (58.55, 124.25) | 95.40 (76.10, 182.00) | 1.322 | 0.186 |
| PCTa (ng/ml) | 0.46 (0.24, 1.09) | 0.96 (0.24, 3.46) | 1.174 | 0.240 |
| CD4 | 305.00 (150.00, 463.5) | 234.00 (159.00, 317.00) | 0.570 | 0.569 |
| CD8 | 152.00 (98.00, 227.50) | 110.00 (74.00, 289.00) | 0.297 | 0.767 |
| CD3 | 472.00 (252.50, 700.50) | 398.00 (241.00, 634.00) | 0.388 | 0.698 |
| CD4/CD8 | 1.37 (0.91, 2.49) | 1.93 (1.08, 3.13) | -0.661 | 0.509 |
Note: a is described by median and quartile, and the statistic was the Z value; other items were described as numbers (n − %), and the statistic was the χ2 value, b statistic was the Fisher χ2 value.
Risk factors in patients with persistent and nonpersistent Candida infections.
| Persistent Candida infection (%) ( | Nonpersistent Candida infection (%) ( | Statistic |
| |
|---|---|---|---|---|
| Male | 59 (69.41%) | 44 (59.46%) | 1.310 | 0.190 |
| Age (years)a | 63.00 (56.00, 70.00) | 61.00 (54.00, 66.75) | 1.575 | 0.115 |
| Length of stay (days)a | 39.00 (30.00, 62.00) | 32.00 (25.00, 44.75) | 3.048 | 0.002 |
| Length of stay in ICUa | 3.00 (0.00, 12.00) | 0.00 (0.00, 3.00) | 2.761 | 0.006 |
| Diabetes | 16 (18.82%) | 8 (10.81%) | — | 0.187 |
| Total parenteral nutrition | 78 (91.76%) | 68 (91.89%) | 0.029 | 0.977 |
| Renal failure | 7 (8.24%) | 3 (4.05%) | — | 0.340 |
| Recent surgery (within 2 weeks) | 52 (61.18%) | 56 (75.68%) | 2.102 | 0.036 |
| Use immunosuppressants within the past 30 daysb | 13 (15.29%) | 11 (14.86%) | — | 1 |
| Stay in ICU during hospitalisation | 46 (54.12%) | 27 (36.49%) | 2.225 | 0.026 |
| Hypoproteinemia | 67 (78.82%) | 51 (68.92%) | 1.424 | 0.154 |
| Invasive mechanical ventilation | 41 (48.24%) | 23 (31.08%) | 2.200 | 0.028 |
| Urinary catheter | 74 (87.06%) | 62 (83.78%) | 0.586 | 0.558 |
| Gastric tube | 54 (63.53%) | 44 (59.46%) | 0.526 | 0.599 |
| Central venous catheter | 60 (70.59%) | 45 (60.81%) | 1.299 | 0.194 |
| Drainage catheter | 70 (82.35%) | 61 (82.43%) | 0.013 | 0.990 |
| Septic shock | 15 (17.65%) | 5 (6.76%) | — | 0.054 |
| Multiple hospitalisations within 2 years (>2 times) | 58 (68.24%) | 51 (68.92%) | 0.093 | 0.926 |
| Serum albumin levela (g/l) | 27.40 (24.50, 29.30) | 28.40 (25.10, 31.63) | -1.500 | 0.133 |
| Serum creatinine levela ( | 58.00 (42.00, 82.00) | 57.50 (47.25, 68.75) | -0.040 | 0.968 |
| Leukocyte counta (10^9/l) | 7.86 (5.35, 11.12) | 5.77 (4.13, 9.02) | 3.234 | 0.001 |
| Total bilirubin levela ( | 15.50 (11.00, 33.30) | 14.20 (7.40, 24.73) | 1.877 | 0.061 |
| Neutrophil counta (10^9/l) | 6.34 (4.00, 9.11) | 4.61 (3.07, 7.39) | 2.624 | 0.009 |
| Lymphocyte counta (10^9/l) | 0.79 (0.59, 1.06) | 0.56 (0.39, 0.75) | 3.605 | <0.001 |
| CRPa (mg/ml) | 96.55 (66.88, 122.75) | 84.90 (61.60, 121.50) | 0.574 | 0.566 |
| PCTa (ng/ml) | 0.53 (0.26, 1.10) | 0.45 (0.24, 1.45) | 0.459 | 0.646 |
Note: a is described by median and quartile, and the statistic was the Z value; other items were described as numbers (n − %), and the statistic was the χ2 value. b statistic was the Fisher χ2 value.
Performance evaluation for the prediction models.
| Model | Precision | Recall |
| Accuracy | AUC |
|---|---|---|---|---|---|
| Random Forest | 0.69 | 0.75 | 0.72 | 0.89 | 0.91 |
| Logistic regression | 0.43 | 0.83 | 0.57 | 0.81 | 0.86 |
| Support vector machine | 0.24 | 0.83 | 0.38 | 0.57 | 0.67 |
Figure 5Receiver operating characteristic curve of different machine learning models.
Feature importance rank.
| Feature | Index |
|---|---|
| Endotoxic shock | 0.101668 |
| Recent surgery (within 2 weeks) | 0.067329 |
| Total parenteral nutrition | 0.058851 |
| Drainage catheter | 0.056403 |
| Length of stay in ICUa | 0.050194 |
| Stay in ICU during hospitalisation | 0.045371 |
| Fungal antigen | 0.042123 |
| Serum creatinine level | 0.037023 |
| Leukocyte count | 0.030745 |
| Total bilirubin level | 0.028947 |