| Literature DB >> 35441692 |
Boran Hao1,2, Yang Hu1,2, Shahabeddin Sotudian1,3, Zahra Zad1,3, William G Adams4, Sabrina A Assoumou5, Heather Hsu4, Rebecca G Mishuris5, Ioannis C Paschalidis1,2,3,6.
Abstract
OBJECTIVE: To develop predictive models of coronavirus disease 2019 (COVID-19) outcomes, elucidate the influence of socioeconomic factors, and assess algorithmic racial fairness using a racially diverse patient population with high social needs.Entities:
Keywords: AI; COVID-19; predictive modeling; racial bias; social determinants of health
Mesh:
Year: 2022 PMID: 35441692 PMCID: PMC9129120 DOI: 10.1093/jamia/ocac062
Source DB: PubMed Journal: J Am Med Inform Assoc ISSN: 1067-5027 Impact factor: 7.942
Hospitalization prediction models
| Algorithm | AUC | F1-weighted |
|---|---|---|
|
| ||
|
| 86.5% (0.9%) | 87.0% (0.5%) |
|
| 86.3% (0.9%) | 86.8% (0.8%) |
|
| 93.1% (0.6%) | 90.1% (0.8%) |
|
| 92.4% (0.5%) | 90.0% (0.7%) |
|
| ||
|
| ||
|
| 86.1% (1.0%) | 87.3% (0.7%) |
|
| 86.0% (1.0%) | 87.1% (0.6%) |
|
| 92.5% (0.5%) | 90.2% (0.7%) |
|
| 92.2% (0.5%) | 89.9% (0.7%) |
|
| ||
|
| ||
|
| 83.5% (1.7%) | 85.3% (0.7%) |
|
| 83.4% (1.8%) | 84.7% (1.0%) |
|
| 92.0% (0.6%) | 89.6% (0.6%) |
|
| 91.3% (0.6%) | 89.4% (0.5%) |
Note: The values inside the parentheses denote the standard deviation of the corresponding metric. SVM-L1 and LR-L1 refer to the ℓ1-norm regularized SVM and LR models. We report the composition of an ℓ2-norm regularized LR model, including the coefficient of each variable (Coef), the correlation of the variable with the outcome (Y-corr), the mean of the variable (Y1-mean) in the hospitalized, and the mean of the variable (Y0-mean) in the non-hospitalized. For each variable, we also report the corresponding p-value, the odds ratio (OR), and its 95% confidence interval (CI).
SpO2: oxygen saturation; BP: blood pressure; BMI: body mass index; PMH: past medical history; CKD: chronic kidney disease; COPD: chronic obstructive pulmonary disease; CHF: congestive heart failure; SDOH: social determinants of health; Total non-COVID percentage: (Total number of non-COVID patients at the hospital/Total number of beds)×100; Total COVID percentage: (Total number of COVID patients at the hospital/Total number of beds)×100.
ICU prediction models
| 0-drop | 12 h-drop | 24 h-drop | ||||
|---|---|---|---|---|---|---|
| AUC | F1-weighted | AUC | F1-weighted | AUC | F1-weighted | |
|
| XGBoost | XGBoost | XGBoost | |||
| 94.8% (1.2%) | 89.2% (1.8%) | 86.6% (1.2%) | 82.4% (0.9%) | 78.3% (2.0%) | 76.0% (2.3%) | |
|
| XGBoost (112 features) | XGBoost (104 features) | XGBoost (95 features) | |||
| 94.3% (1.4%) | 88.0% (2.3%) | 86.3% (1.8%) | 82.2% (1.2%) | 79.9% (3.2%) | 76.8% (3.4%) | |
|
| 92.7% (1.5%) | 86.7% (1.3%) | 86.5% (1.6%) | 81.6% (2.1%) | 81.1% (2.5%) | 79.4% (0.8%) |
|
| 89.0% (2.2%) | 86.1% (2.0%) | 73.0% (2.0%) | 73.7% (1.0%) | 67.4% (1.7%) | 70.8% (1.7%) |
|
| 94.8% (0.8%) | 89.1% (1.7%) | 86.2% (1.0%) | 83.2% (1.0%) | 77.1% (2.5%) | 74.0% (1.4%) |
|
| 84.3% (2.0%) | 83.5% (1.8%) | 48.6% (2.5%) | 72.2% (1.4%) | 46.5% (2.3%) | 70.4% (1.6%) |
|
| 71.7% (3.0%) | 79.3% (1.7%) | 54.3% (2.1%) | 69.1% (1.8%) | 52.1% (2.4%) | 68.1% (2.0%) |
|
| 90.9% (2.0%) | 85.2% (1.8%) | 84.5% (2.0%) | 81.3% (1.2%) | 76.8% (3.9%) | 73.6% (1.5%) |
Note: For each full model, we only report results from the algorithm with the highest AUC out of LR, SVM, XGBoost, and RF. We present the LR coefficients of each variable (Coef), the correlation of the variable with the outcome (Y-corr), the p-value, the mean of the variable (Y1-mean) in the ICU patients, and the mean of the variable (Y0-mean) in the non-ICU patients.
LDH: lactate dehydrogenase; BUN: blood urea nitrogen; NRBC: nucleated red blood cell; CKD: chronic kidney disease; PMH: past medical history; CAD: coronary artery disease; DVT: deep vein thrombosis; HLD: hypersensitivity lung disease; Total COVID percentage: (Total number of COVID patients at the hospital/Total number of beds)×100.
Mechanical ventilation prediction models
| 0-drop | 12 h-drop | 24 h-drop | ||||
|---|---|---|---|---|---|---|
| AUC | F1-weighted | AUC | F1-weighted | AUC | F1-weighted | |
|
| XGBoost | RF | RF | |||
| 93.6% (0.9%) | 91.6% (0.8%) | 90.6% (1.2%) | 87.8% (1.0%) | 86.3% (1.1%) | 85.0% (2.1%) | |
|
| XGBoost (90 features) | XGBoost (104 features) | XGBoost (107 features) | |||
| 93.8% (1.2%) | 91.8% (1.5%) | 90.3% (1.4%) | 88.8% (0.7%) | 86.1% (1.1%) | 85.9% (1.4%) | |
|
| 91.2% (2.0%) | 90.7% (0.9%) | 90.3% (1.4%) | 87.6% (1.0%) | 84.9% (1.5%) | 84.7% (0.5%) |
|
| 82.3% (0.9%) | 85.0% (1.7%) | 63.4% (3.1%) | 79.9% (0.6%) | 56.7% (1.5%) | 79.9% (0.7%) |
|
| 90.0% (1.0%) | 88.7% (0.9%) | 88.1% (1.1%) | 88.3% (0.3%) | 84.0% (1.1%) | 80.3% (0.8%) |
|
| 66.0% (6.5%) | 84.2% (1.3%) | 65.7% (2.8%) | 80.0% (0.0%) | 67.6% (2.8%) | 80.0% (0.0%) |
|
| 63.1% (5.1%) | 80.0% (0.4%) | 52.3% (2.0%) | 80.0% (0.0%) | 52.3% (2.0%) | 80.0% (0.0%) |
|
| 90.0% (2.3%) | 88.4% (1.6%) | 85.9% (2.8%) | 86.3% (1.2%) | 80.0% (2.5%) | 79.9% (0.2%) |
Note: For each full model, we only report results from the algorithm with the highest AUC out of LR, SVM, XGBoost, and RF. We present the LR coefficients of each variable (Coef), the correlation of the variable with the outcome (Y-corr), the p-value, the mean of the variable (Y1-mean) in the intubated patients, and the mean of the variable (Y0-mean) in the nonintubated patients.
CRP: C-reactive protein; Total Elective Surgery percentage: (Total number of Elective Surgeries/Total number of beds)×100.
Mortality prediction models with features extracted within 3 days after admission
| Adm-based |
| |||
|---|---|---|---|---|
| AUC | F1-weighted | AUC | F1-weighted | |
|
| XGBoost | XGBoost | ||
| 91.4% (0.8%) | 91.9% (0.7%) | 96.2% (0.7%) | 94.6% (0.6%) | |
|
| XGBoost | XGBoost | ||
| 91.1% (2.0%) | 91.8% (1.3%) | 94.7% (1.2%) | 94.3% (1.0) | |
|
| 92.0% (2.9%) | 92.4% (1.3%) | 94.0% (0.6%) | 92.7% (0.9%) |
|
| 88.9% (4.1%) | 91.2% (0.7%) | 89.5% (2.2%) | 91.5% (0.6%) |
|
| 89.3% (2.4%) | 90.8% (0.7%) | 92.3% (1.1%) | 92.6% (1.4%) |
|
| 66.8% (5.0%) | 85.8% (0.9%) | 72.4% (2.9%) | 87.2% (1.1%) |
|
| 68.2% (3.0%) | 86.6% (0.9%) | 73.3% (5.1%) | 87.1% (0.6%) |
|
| 84.3% (4.4%) | 89.0% (0.9%) | 87.1% (2.0%) | 91.0% (1.5%) |
Note: For each full model, we only report results from the algorithm with the highest AUC out of LR, SVM, XGBoost, and RF. We present the LR coefficients of each variable (Coef), the correlation of the variable with the outcome (Y-corr), the p-value, the mean of the variable (Y1-mean) in the deceased, and the mean of the variable (Y0-mean) in the nondeceased.
PMH: past medical history; CHF: congestive heart failure; CAD: coronary artery disease; CRP: C-reactive protein; LDH: lactate dehydrogenase.