| Literature DB >> 34772961 |
Shuo Feng1, Joel A Dubin2,3.
Abstract
APACHE IVa provides typically useful and accurate predictions on in-hospital mortality and length of stay for patients in critical care. However, there are factors which may preclude APACHE IVa from reaching its ceiling of predictive accuracy. Our primary aim was to determine which variables available within the first 24 h of a patient's ICU stay may be indicative of the APACHE IVa scoring system making occasional but potentially illuminating errors in predicting in-hospital mortality. We utilized the publicly available multi-institutional ICU database, eICU, available since 2018, to identify a large observational cohort for our investigation. APACHE IVa scores are provided by eICU for each patient's ICU stay. We used Lasso logistic regression in an aim to build parsimonious final models, using cross-validation to select the penalization parameter, separately for each of our two responses, i.e., errors, of interest, which are APACHE falsely predicting in-hospital death (Type I error), and APACHE falsely predicting in-hospital survival (Type II error). We then assessed the performance of the models with a random holdout validation sample. While the extremeness of the APACHE prediction led to dependable predictions for preventing either type of error, distinct variables were identified as being strongly associated with the two different types of errors occurring. These included a primary set of predictors consisting of mean SpO2 and worst lactate for predicting Type I errors, and worst albumin and mean heart rate for Type II. In addition, a secondary set of predictors including changes recorded in care limitations for the patient's treatment plan, worst pH, whether cardiac arrest occurred at admission, and whether vasopressor was provided for predicting Type I error; age, whether the patient was ventilated in day 1, mean respiratory rate, worst lactate, worst blood urea nitrogen test, and mean aperiodic vitals for Type II. The two models also differed in their performance metrics in their holdout validation samples, in large part due to the lower prevalence of Type II errors compared to Type I. The eICU database was a good resource for evaluating our objective, and important recommendations are provided, particularly identifying key variables that could lead to APACHE prediction errors when APACHE scores are sufficiently low to predict in-hospital survival.Entities:
Mesh:
Year: 2021 PMID: 34772961 PMCID: PMC8589984 DOI: 10.1038/s41598-021-01290-7
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 2(a) Hospital-level observed mortality versus APACHE IVa-based predicted mortality. (b) Observed mortality versus APACHE IVa-based predicted mortality in different-sized hospitals.
Figure 1Flow chart of cohort selection from eICU database.
Performance metrics for Type I/II errors using cutoff 0.33.
| Missing handling | Error | Pop. ErrorRate | AUROC | AUPRC | Precision | Recall | F-score | Accuracy |
|---|---|---|---|---|---|---|---|---|
| Multiple imputation | Type I | 51.56 | 0.7030 | 0.6802 | 0.6248 | 0.7754 | 0.6920 | 0.6426 |
| Type II | 5.66 | 0.8195 | 0.2185 | 0.2436 | 0.2705 | 0.2564 | 0.9109 | |
| Fill with median | Type I | 51.56 | 0.6996 | 0.6759 | 0.6246 | 0.7655 | 0.6879 | 0.6403 |
| Type II | 5.66 | 0.8172 | 0.2045 | 0.1655 | 0.6583 | 0.2645 | 0.7922 | |
| Fill with group mean | Type I | 51.56 | 0.6934 | 0.6732 | 0.6242 | 0.7699 | 0.6895 | 0.6409 |
| Type II | 5.66 | 0.8197 | 0.2082 | 0.1655 | 0.6583 | 0.2645 | 0.7922 |
Pop. ErrorRate is the true observed APACHE error rate in the entire population.
AUROC is the area under the receiver operating characteristic curve.
AUPRC is the area under the precision–recall curve.
Accuracy , where true positive, true negative, FP = false positive, and false negative.
Figure 3(a) Type I error rate against different ARC values. (b) Type II error rate against different ARC values.
Features selected for Type I error when using cutoff 0.33.
| Missing handling | Features—Type I error | Coefficients | Coefficients | SE | Z-ratio | 95% CI |
|---|---|---|---|---|---|---|
| Multiple imputation | ARC | − 0.2282 | − 0.5467 | 0.029 | 19.2 | (− 0.603, − 0.491) |
| Mean SpO2 | 0.0434 | 0.6442 | 0.047 | 13.8 | (0.552, 0.736) | |
| Worst (max) lactate | − 0.1583 | − 0.4383 | 0.033 | 13.2 | (− 0.503, − 0.373) | |
| Fill with median | ARC | − 0.3022 | − 0.6033 | 0.028 | 21.3 | (− 0.659, − 0.548) |
| Mean SpO2 | 0.0689 | 0.6167 | 0.045 | 13.6 | (0.528, 0.705) | |
| Worst (Max) Lactate | − 0.1211 | − 0.4097 | 0.033 | 12.4 | (− 0.475, − 0.345) | |
| Fill with group mean | ARC | − 0.2573 | − 0.5808 | 0.029 | 20.3 | (− 0.637, − 0.525) |
| Mean SpO2 | 0.0290 | 0.6215 | 0.045 | 13.7 | (0.533, 0.710) | |
| Worst (Max) Lactate | -0.2034 | -0.3949 | 0.033 | 12.1 | (− 0.459, − 0.331) |
Coefficients calculated from fitting the Lasso feature selection model.
Coefficients, SE, Z-ratios, and the 95% CIs calculated by ordinary logistic regression using the selected features from Lasso; features sorted by Z-ratios, where Z-ratio .
Features selected for Type II error when using cutoff 0.33.
| Missing handling | Features—type II error | Coefficients | Coefficients | SE | Z-ratio | 95% CI |
|---|---|---|---|---|---|---|
| Multiple imputation | ARC | − 0.7521 | − 1.1789 | 0.009 | 133.6 | (− 1.196, − 1.162) |
| Worst (Min) albumin | − 0.1349 | − 0.4775 | 0.008 | 61.1 | (− 0.493, − 0.462) | |
| Mean heart rate | 0.0223 | 0.3907 | 0.007 | 52.6 | (0.376, 0.405) | |
| Fill with median | ARC | − 0.7378 | − 1.2424 | 0.009 | 143.5 | (− 1.259, − 1.225) |
| Worst (Min) albumin | − 0.0444 | − 0.4332 | 0.008 | 56.5 | (− 0.448, − 0.418) | |
| Fill with group mean | ARC | − 0.6891 | − 1.1704 | 0.009 | 135.5 | (− 1.187, − 1.153) |
| Worst (Min) albumin | − 0.0667 | − 0.4388 | 0.008 | 57.0 | (− 0.454, − 0.424) |
Coefficients calculated from fitting the Lasso feature selection model.
Coefficients, SE, Z-ratios, and the 95% CIs calculated by ordinary logistic regression using the selected features from Lasso; Features sorted by Z-ratios, where Z-ratio .
Performance metrics for Type I/II errors using cutoff 0.33, with ARC removed.
| Missing handling | Error | Pop. ErrorRate | AUROC | AUPRC | Precision | Recall | F-score | Accuracy |
|---|---|---|---|---|---|---|---|---|
| Multiple imputation | Type I | 51.56 | 0.6873 | 0.6530 | 0.6064 | 0.8637 | 0.7126 | 0.6392 |
| Type II | 5.66 | 0.7888 | 0.2123 | 0.1400 | 0.7097 | 0.2339 | 0.7360 | |
| Fill with median | Type I | 51.56 | 0.6945 | 0.6642 | 0.6041 | 0.8637 | 0.7110 | 0.6364 |
| Type II | 5.66 | 0.7935 | 0.2120 | 0.1408 | 0.7360 | 0.2364 | 0.7300 | |
| Fill with group mean | Type I | 51.56 | 0.6973 | 0.6647 | 0.6065 | 0.8506 | 0.7081 | 0.6369 |
| Type II | 5.66 | 0.7929 | 0.2060 | 0.1339 | 0.7360 | 0.2266 | 0.7149 |
Pop. ErrorRate is the true observed APACHE error rate in the entire population.
AUROC is the area Under the receiver operating characteristic curve.
AUPRC is the area Under the precision–recall curve.
.
Accuracy , where true positive, true negative, FP = false positive, and FN = false negative.
Features selected for Type I error when using cutoff 0.33, with ARC removed.
| Missing handling | Features—Type I Error | Coefficients | Coefficients | SE | Z-ratio | 95% CI |
|---|---|---|---|---|---|---|
| Multiple imputation | Worst (Max) Lactate | − 0.3040 | − 0.5063 | 0.036 | 13.9 | (− 0.577, − 0.435) |
| Mean SpO2 | 0.1452 | 0.5418 | 0.045 | 12.2 | (0.454, 0.629) | |
| Care limitation | − 0.0367 | − 0.3095 | 0.031 | 10.1 | (− 0.370, − 0.249) | |
| Worst (Min) pH | 0.0210 | 0.1641 | 0.031 | 5.2 | (0.102, 0.226) | |
| Fill with median | Mean SpO2 | 0.1399 | 0.5563 | 0.044 | 12.5 | (0.469, 0.643) |
| Worst (Max) Lactate | − 0.2325 | − 0.4027 | 0.035 | 11.5 | (− 0.471, − 0.334) | |
| Care limitation | − 0.0382 | − 0.3315 | 0.031 | 10.5 | (− 0.393, − 0.270) | |
| Cardiac arrest at admission | − 0.0187 | − 0.2533 | 0.027 | 9.4 | (− 0.306, − 0.201) | |
| Vasopressor | − 0.0174 | − 0.1914 | 0.027 | 7.1 | (− 0.244, − 0.139) | |
| Worst (Min) pH | 0.0195 | 0.1003 | 0.031 | 3.2 | (0.04, 0.161) | |
| Fill with group mean | Mean SpO2 | 0.1325 | 0.5513 | 0.045 | 12.4 | (0.464, 0.639) |
| Worst (Max) Lactate | − 0.2565 | − 0.4195 | 0.035 | 12.0 | (− 0.488, − 0.351) | |
| Care limitation | − 0.0369 | − 0.3324 | 0.032 | 10.5 | (− 0.394, − 0.271) | |
| Cardiac arrest at admission | − 0.0089 | − 0.2415 | 0.027 | 9.0 | (− 0.294, − 0.189) | |
| Vasopressor | − 0.0287 | − 0.1989 | 0.027 | 7.3 | (− 0.252, − 0.146) | |
| Worst (Min) pH | 0.0231 | 0.1007 | 0.031 | 3.2 | (0.039, 0.162) |
Coefficients calculated from fitting the Lasso feature selection model.
Coefficients, SE, Z-ratios, and the 95% CIs calculated by ordinary logistic regression using the selected features from Lasso; Features sorted by Z-ratios, where Z-ratio .
Care Limitation is how often the care limitation(s) recorded in the care planning was required to change for the patient.
Whether vasopressor was provided to the patient within the first 24 h.
Features selected for Type II error when using cutoff 0.33, with ARC removed.
| Missing handling | Features—Type II error | Coefficients | Coefficients | SE | Z-ratio | 95% CI |
|---|---|---|---|---|---|---|
| Multiple imputation | Age | 0.1210 | 0.6270 | 0.008 | 76.4 | (0.611, 0.643) |
| Ventilated in day 1 | 0.0345 | 0.4702 | 0.007 | 66.5 | (0.456, 0.484) | |
| Worst (Min) albumin | − 0.2785 | − 0.4665 | 0.008 | 57.8 | (− 0.482, − 0.451) | |
| Mean respiratory rate | 0.0712 | 0.3867 | 0.008 | 50.1 | (0.372, 0.402) | |
| Worst (Max) lactate | 0.0455 | 0.4366 | 0.010 | 45.2 | (0.418, 0.456) | |
| Mean heart rate | 0.0307 | 0.3481 | 0.008 | 43.2 | (0.332, 0.364) | |
| Worst (Max) BUN | 0.0602 | 0.2983 | 0.008 | 37.8 | (0.283, 0.314) | |
| Mean aperiodic vital | − 0.0143 | − 0.2443 | 0.008 | 32.1 | (− 0.259, -0.229) | |
| Fill with median | Age | 0.1346 | 0.6137 | 0.008 | 77.5 | (0.598, 0.629) |
| Ventilated in Day 1 | 0.0342 | 0.4700 | 0.007 | 67.6 | (0.456, 0.484) | |
| Worst (Min) albumin | − 0.1891 | − 0.4120 | 0.008 | 51.8 | (− 0.428, − 0.396) | |
| Mean heart rate | 0.0428 | 0.3760 | 0.008 | 48.2 | (0.361, 0.391) | |
| Mean respiratory rate | 0.0423 | 0.3452 | 0.008 | 46.0 | (0.330, 0.360) | |
| Mean aperiodic vital | − 0.0545 | − 0.2912 | 0.007 | 39.3 | (− 0.306, − 0.277) | |
| Worst (Max) BUN | 0.0138 | 0.2465 | 0.008 | 32.1 | (0.231, 0.262) | |
| Whether Lactate test provided | − 0.0472 | − 0.1944 | 0.007 | 26.5 | (− 0.209, − 0.18) | |
| Fill with group mean | Age | 0.1344 | 0.5748 | 0.008 | 73.0 | (0.559, 0.590) |
| Ventilated in Day 1 | 0.0368 | 0.4591 | 0.007 | 66.5 | (0.446, 0.473) | |
| Worst (Min) albumin | − 0.2685 | − 0.4688 | 0.008 | 59.2 | (− 0.484, − 0.453) | |
| Mean heart rate | 0.0440 | 0.3882 | 0.008 | 50.1 | (0.373, 0.403) | |
| Mean respiratory rate | 0.0594 | 0.3667 | 0.008 | 48.8 | (0.352, 0.381) | |
| Mean aperiodic vital | − 0.0546 | − 0.3007 | 0.007 | 40.5 | (− 0.315, − 0.286) | |
| Worst (Max) BUN | 0.0318 | 0.2819 | 0.008 | 36.6 | (0.267, 0.297) |
Coefficients calculated from fitting the Lasso feature selection model.
Coefficients, SE, Z-ratios, and the 95% CIs calculated by ordinary logistic regression using the selected features from Lasso; Features sorted by Z-ratios, where Z-ratio .
BUN: Blood Urea Nitrogen test.
Whether a lactate test was provided within the first 24 h.