| Literature DB >> 33856514 |
Zhichao Feng1,2, Hui Shen3, Kai Gao3, Jianpo Su3, Shanhu Yao1,2, Qin Liu1, Zhimin Yan1, Junhong Duan1, Dali Yi1, Huafei Zhao1, Huiling Li1, Qizhi Yu4, Wenming Zhou5, Xiaowen Mao6, Xin Ouyang7, Ji Mei8, Qiuhua Zeng9, Lindy Williams10, Xiaoqian Ma1,2, Pengfei Rong1,2, Dewen Hu11, Wei Wang12,13.
Abstract
OBJECTIVES: To develop and validate a machine learning model for the prediction of adverse outcomes in hospitalized patients with COVID-19.Entities:
Keywords: Artificial intelligence; COVID-19; Prognosis; Tomography, X-ray computed
Year: 2021 PMID: 33856514 PMCID: PMC8046645 DOI: 10.1007/s00330-021-07957-z
Source DB: PubMed Journal: Eur Radiol ISSN: 0938-7994 Impact factor: 5.315
Fig. 1Study workflow. (I) Non-severe COVID-19 patients who underwent chest CT scan on admission were included. (II) Lung and lesion segmentation were performed using DL-based framework and texture clustering was used to distinguish between GGO and CON. CT quantitative measurements including lesion%, GGO%, and CON% were calculated. (III) The optimal machine learning classifier and feature subset were selected and used for prediction model construction. (IV) The performance of the machine learning model was determined and validated in an external cohort. CON, consolidation; COVID-19, coronavirus disease 2019; CT, computed tomography; DL, deep learning; GGO, ground-glass opacification; LR, logistic regression; RF, random forest; SVM, support vector machine; XGBoost, extreme gradient boosting
Clinical characteristics of patients in the primary and validation cohorts
| Variables | Primary ( | Validation ( | |
|---|---|---|---|
| Age (years) | 46 (36–58) | 46 (31–53) | 0.201 |
| Male gender | 210 (49.5%) | 51 (52.0%) | 0.654 |
| Comorbidities | |||
| Any | 107 (25.2%) | 21 (21.4%) | 0.430 |
| Hypertension | 59 (13.9%) | 13 (13.3%) | 0.866 |
| Diabetes | 35 (8.3%) | 9 (9.2%) | 0.765 |
| Cardiovascular or cerebrovascular disease | 19 (4.5%) | 6 (6.1%) | 0.493 |
| COPD | 13 (3.1%) | 3 (3.1%) | 0.998 |
| Clinical outcomes | |||
| Severe or critical illnesses | 37 (8.7%) | 8 (8.2%) | 0.858 |
| Requiring mechanical ventilation | 8 (1.9%) | 3 (3.1%) | 0.466 |
| ICU admission | 14 (3.3%) | 4 (4.1%) | 0.703 |
| Death | 1 (0.2%) | 1 (1.0%) | 0.341 |
COPD, chronic obstructive pulmonary disease; ICU, intensive care unit; IQR, interquartile range
Data are presented as median (IQR) or n (percentage)
Fig. 2DL-based lung and lesion segmentation and CT quantitative measurements. a The original CT images, lung segmentation, and lesion segmentation of 3 example cases. b The contours of 3 radiologists and lesion DL-based segmentation (left) and the uncertain region (right). c ROC curve of the pixel-level performance of DL-based segmentation to identify the lesion. d Unsupervised multi-scale texture feature clustering to distinguish between GGO and CON based on grey-level attenuation and LBP features. e t-SNE plot showing the pixel-level GGO or CON distribution. CON, consolidation; CT, computed tomography; DL, deep learning; GGO, ground-glass opacification; LBP, local binary pattern; ROC, receiver operating characteristic; t-SNE, t-distributed stochastic neighbour embedding
Clinical characteristics and CT quantitative measurements among patients according to whether to develop composite endpoint in the primary cohort
| Variables | Yes ( | No ( | |
|---|---|---|---|
| Age (years) | 58 (51–67) | 45 (35–56) | < 0.001 |
| Male gender | 20 (54.1%) | 190 (49.1%) | 0.564 |
| Smoking history | 7 (18.9%) | 33 (8.5%) | 0.068 |
| Comorbidities | |||
| Any | 25 (67.6%) | 82 (21.2%) | < 0.001 |
| Hypertension | 11 (29.7%) | 48 (12.4%) | 0.004 |
| Diabetes | 8 (21.6%) | 27 (7.0%) | 0.006 |
| Cardiovascular or cerebrovascular diseases | 7 (18.9%) | 12 (3.1%) | 0.001 |
| COPD | 7 (18.9%) | 6 (1.6%) | < 0.001 |
| Symptoms and signs | |||
| Fever | 28 (75.7%) | 220 (56.8%) | 0.026 |
| Cough | 24 (64.9%) | 199 (51.4%) | 0.118 |
| Fatigue or myalgia | 8 (21.6%) | 84 (21.7%) | 0.991 |
| Dyspnea | 4 (10.8%) | 17 (4.4%) | 0.100 |
| Temperature (°C) | 37.3 (36.8–38.0) | 36.9 (36.5–37.3) | 0.001 |
| Heart rate (/min) | 90 (80–105) | 86 (78–96) | 0.092 |
| Respiratory rate (/min) | 21 (20–22) | 20 (19–20) | 0.053 |
| Laboratory findings | |||
| Hemoglobin (g/L) | 126.5 (119.3–136.0) | 131.0 (120.0–143.0) | 0.300 |
| Platelet count (×109/L) | 148.0 (119.5–208.0) | 174.0 (139.0–228.0) | 0.067 |
| White blood cell count (×109/L) | 4.5 (3.6–6.0) | 4.6 (3.6–5.7) | 0.812 |
| Neutrophil count (×109/L) | 3.0 (2.4–4.5) | 2.9 (2.1–3.7) | 0.090 |
| Lymphocyte count (×109/L) | 0.9 (0.7–1.3) | 1.2 (0.9–1.6) | < 0.001 |
| Monocyte count (×109/L) | 0.4 (0.2–0.5) | 0.4 (0.3–0.5) | 0.618 |
| Total bilirubin (μmol/L) | 10.5 (7.1–14.6) | 11.9 (8.8–17.3) | 0.031 |
| ALT (U/L) | 23.0 (16.6–31.2) | 19.7 (14.5–28.4) | 0.124 |
| AST (U/L) | 33.2 (25.8–44.6) | 23.0 (18.3–28.3) | < 0.001 |
| Albumin (g/L) | 36.8 (34.2–39.8) | 39.3 (36.5–42.6) | 0.001 |
| BUN (mg/dL) | 4.7 (3.8–5.8) | 3.9 (3.1–4.8) | 0.002 |
| Creatinine (μmol/L) | 66.1 (53.8–86.0) | 56.4 (44.8–70.0) | 0.002 |
| Glucose (mmol/L) | 7.2 (5.8–9.2) | 5.7 (3.6–4.3) | < 0.001 |
| K+ (mmol/L) | 3.7 (3.5–4.0) | 4.0 (3.6–4.3) | 0.051 |
| Na+ (mmol/L) | 135.3 (133.0–137.6) | 137.5 (135.5–139.9) | < 0.001 |
| INR | 1.22 (0.99–1.33) | 1.10 (0.90–1.19) | 0.043 |
| D-dimer ≥ 0.5 mg/L | 16 (43.2%) | 47 (12.1%) | < 0.001 |
| Procalcitonin ≥ 0.05 ng/mL | 21 (56.8%) | 124 (32.0%) | 0.002 |
| Hs-cTnI ≥ 28 pg/mL | 5 (13.5%) | 11 (2.8%) | 0.008 |
| Creatine kinase (U/L) | 94.0 (40.0–213.5) | 72.0 (49.1–109.0) | 0.139 |
| LDH (U/L) | 265.0 (184.6–342.8) | 174.0 (141.3–214.1) | < 0.001 |
| CRP (mg/L) | 40.9 (22.9–61.0) | 10.4 (2.4–24.5) | < 0.001 |
| PaO2 (mmHg) | 71.1 (54.6–106.7) | 90.9 (76.0–115.8) | 0.009 |
| Radiological findings | |||
| Number of segments involved | 16 (12–18) | 9 (5–13) | < 0.001 |
| CT severity score | 12 (7–17) | 6 (3–9) | < 0.001 |
| CT quantitative measurements | |||
| Lesion% | 9.5 (3.5–26.6) | 3.1 (0.6–7.5) | < 0.001 |
| GGO% | 8.2 (3.3–18.9) | 2.8 (0.6–6.7) | < 0.001 |
| CON% | 1.3 (0.2–2.9) | 0.3 (0.0–0.7) | < 0.001 |
ALT, alanine aminotransferase; AST, aspartate aminotransferase; BUN, blood urea nitrogen; CON, consolidation; COPD, chronic obstructive pulmonary disease; CRP, C-reactive protein; CT, computed tomography; GGO, ground-glass opacification; Hs-cTnI, hypersensitive cardiac troponin I; INR, international normalized ratio; K+, potassium; LDH, lactic dehydrogenase; Na+, sodium; PaO, partial pressure of oxygen
Fig. 3Optimal machine learning classifier and feature subset selection. a The heatmap illustrating the correlations between features in the candidate feature set. b The performance of five machine learning classifiers, including LR, SVM-Linear, SVM-RBF, RF, and XGBoost, based on the candidate feature set in the primary cohort (left) and validation cohort (right). c The feature importance rank in the XGBoost classifier using fivefold cross-validation in the primary cohort. d The relationship between the feature subset size and model performance. The optimal size (red dot) was determined with the highest average AUC and a minimal number of features. The optimal feature subset contained the top 4 features, i.e. LDH, presence of comorbidity, lesion%, and hs-cTnI. AST, aspartate aminotransferase; AUC, area under the receiver operating characteristic curve; BUN, blood urea nitrogen; CRP, C-reactive protein; GGO, ground-glass opacification; hs-cTnI, hypersensitive cardiac troponin I; LDH, lactic dehydrogenase; LR, logistic regression; PaO2, partial pressure of oxygen; RF, random forest; SVM-Linear, support vector machine with a linear kernel; SVM-RBF, support vector machine with a radial basis function; XGBoost, extreme gradient boosting
Performance of each classifier based on the candidate feature set in the primary and validation cohorts
| Classifier | AUC | Sensitivity | Specificity | Accuracy |
|---|---|---|---|---|
| Primary cohort | ||||
| LR | 0.916 (0.885–0.938) | 67.6% (25/37) | 90.4% (350/387) | 0.884 (0.851–0.911) |
| SVM-Linear | 0.803 (0.760–0.838) | 51.4% (19/37) | 86.0% (333/387) | 0.830 (0.790–0.864) |
| SVM-RBF | 0.821 (0.780–0.856) | 75.7% (28/37) | 84.0% (325/387) | 0.833 (0.793–0.866) |
| RF | 0.924 (0.894–0.947) | 59.5% (22/37) | 93.0% (360/387) | 0.901 (0.867–0.927) |
| XGBoost | 0.964 (0.941–0.979) | 75.7% (28/37) | 96.4% (373/387) | 0.946 (0.919–0.965) |
| Validation cohort | ||||
| XGBoost | 0.974 (0.910–0.996) | 100% (8/8) | 85.6% (77/90) | 0.867 (0.780–0.925) |
AUC, area under the receiver operating characteristic curve; LR, logistic regression; RF, random forest; SVM-Linear, support vector machine with a linear kernel; SVM-RBF, support vector machine with a radial basis function; XGBoost, extreme gradient boosting
Fig. 4Performance of the XGBoost classifiers based on the top four features or only three clinical features. a ROC curves of the XGBoost classifiers in the primary cohort (left) and validation cohort (right). b Comparison of decision curves of the XGBoost classifiers in the whole cohort. AUC, area under the receiver operating characteristic curve; ROC, receiver operating characteristic; XGBoost, extreme gradient boosting
Performance of the XGBoost classifiers in the primary and validation cohorts
| Cohort | AUC | Sensitivity | Specificity | Accuracy |
|---|---|---|---|---|
| Primary cohort | ||||
| Top four features | 0.959 (0.936–0.976) | 89.2% (33/37) | 91.5% (354/387) | 0.913 (0.882–0.936) |
| Three clinical features | 0.913 (0.882–0.938) | 75.7% (28/37) | 90.7% (351/387) | 0.894 (0.861–0.920) |
| Validation cohort | ||||
| Top four features | 0.953 (0.891–0.986) | 100% (8/8) | 87.8% (79/90) | 0.888 (0.810–0.936) |
| Three clinical features | 0.881 (0.800–0.938) | 75.0% (6/8) | 87.8% (79/90) | 0.867 (0.786–0.921) |
AUC, area under the receiver operating characteristic curve; XGBoost, extreme gradient boosting