| Literature DB >> 32997743 |
Chuanyu Hu1, Zhenqiu Liu2,3, Yanfeng Jiang2,3, Oumin Shi4, Xin Zhang5,6, Kelin Xu7, Chen Suo5,6, Qin Wang1, Yujing Song1, Kangkang Yu8, Xianhua Mao2,3, Xuefu Wu5,6, Mingshan Wu5,6, Tingting Shi5, Wei Jiang2,3, Lina Mu9, Damien C Tully10, Lei Xu11, Li Jin2,3, Shusheng Li1, Xuejin Tao1, Tiejun Zhang5,6, Xingdong Chen2,3.
Abstract
BACKGROUND: Coronavirus disease 2019 (COVID-19), caused by severe acute respiratory syndrome coronavirus 2 infection, has been spreading globally. We aimed to develop a clinical model to predict the outcome of patients with severe COVID-19 infection early.Entities:
Keywords: COVID-19; death; fatality rate; machine learning; predictive model
Year: 2021 PMID: 32997743 PMCID: PMC7543461 DOI: 10.1093/ije/dyaa171
Source DB: PubMed Journal: Int J Epidemiol ISSN: 0300-5771 Impact factor: 7.196
Figure 1The study flow chart. bagFDA, bagged flexible discriminant analysis; PLS, partial least squares; AUROC, area under the receiver operating characteristic curve
Figure 2The top 20 important variables selected by five machine learning models (A-E) and the model performance based on the selected variables (F). NT-proBNP, N-terminal pro-brain natriuretic peptide; hsCRP, high-sensitivity C-reactive protein; CRE, creatinine; ALT, alanine aminotransferase; IL8, interleukin 8; LYM, lymphocyte count; IL6, interleukin 6; WC, white cell count; eGFR, estimated glomerular filtration rate; ALP, alkaline phosphatase; RC, red cell count; TCH, total cholesterol; FIB, fibrinogen; PLT, platelet count; TB, total bilirubin; ALB, albumin; PT, prothrombin time; IL10, interleukin 10; IL2R, interleukin-2 receptor; FER, ferritin; LDH, lactate dehydrogenase; hscTnI, high-sensitivity cardiac troponin I; Mono, monocyte count; UA, uric acid; ESR, erythrocyte sedimentation rate; No.Com., number of basic conditions
The baseline characteristics and laboratory findings at admission included in the predictive models
| Survivors ( | Non-survivors ( |
| |
|---|---|---|---|
| Baseline characteristics | |||
| Sex | |||
| Male | 57 (49.57) | 50 (73.53) | 0.002 |
| Female | 58 (50.43) | 18 (26.47) | |
| Age | 60.54 ± 13.19 | 68.44 ± 9.94 | <0.001 |
| Diabetes | |||
| Yes | 21 (18.26) | 14 (20.59) | 0.848 |
| No | 94 (81.74) | 54 (79.41) | |
| Hypertension | |||
| Yes | 43 (37.39) | 30 (44.12) | 0.458 |
| No | 72 (62.61) | 38 (55.88) | |
| Lung disease | |||
| Yes | 5 (4.35) | 10 (14.71) | 0.030 |
| No | 109 (94.78) | 58 (85.29) | |
| No. of underlying conditions | |||
| <1 | 75 (65.22) | 41 (60.29) | 0.861 |
| ≥1 | 40 (34.78) | 27 (39.71) | |
| Signs and symptoms at disease onset | |||
| Diarrhoea | |||
| Yes | 38 (33.04) | 29 (42.65) | 0.253 |
| No | 77 (66.96) | 39 (57.35) | |
| Stomach ache | |||
| Yes | 16 (13.91) | 12 (17.65) | 0.642 |
| No | 99 (86.09) | 56 (82.35) | |
| Vomiting | |||
| Yes | 20 (17.39) | 12 (17.65) | 1.000 |
| No | 95 (82.61) | 56 (82.35) | |
| Anorexia | |||
| Yes | 22 (19.13) | 32 (47.06) | 0.019 |
| No | 82 (71.30) | 36 (52.94) | |
| Fever | |||
| Yes | 101 (87.83) | 60 (88.24) | 1.000 |
| No | 14 (12.17) | 8 (11.76) | |
| Dyspnoea | |||
| Yes | 36 (31.30) | 49 (72.06) | <0.001 |
| No | 79 (68.70) | 19 (27.94) | |
| Bilateral infection (chest CT image) | |||
| Yes | 78 (67.83) | 53 (77.94) | 0.142 |
| No | 0 (0.00) | 1 (1.47) | |
| First SARS-Cov2 nuclei acid test | |||
| Positive | 72 (62.61) | 42 (61.76) | 0.659 |
| Negative | 20 (17.39) | 15 (22.06) | |
| Cough | |||
| No cough | 23 (20.00) | 11 (16.18) | <0.001 |
| Dry cough | 68 (59.13) | 22 (32.35) | |
| Productive cough | 24 (20.87) | 35 (51.47) | |
| Vital signs on admission | |||
| Count of white cells, ×109/L | 5.43 (4.30, 7.00) | 8.01 (5.61, 11.36) | <0.001 |
| Count of lymphocytes, ×109/L | 1.13 (0.77, 1.45) | 0.56 (0.42, 0.75) | <0.001 |
| Count of monocytes, ×109/L | 0.42 (0.33, 0.57) | 0.35 (0.22, 0.57) | 0.025 |
| Count of red cells, ×1012/L | 4.10 (3.69, 4.48) | 4.26 (3.84, 4.64) | 0.084 |
| Count of platelets, ×109/L | 206 (160, 267) | 165.5 (122.5, 226.0) | 0.001 |
| Glucose, mmol/L | 5.73 (5.17, 7.00) | 7.44 (6.56, 9.43) | <0.001 |
| Erythrocyte sedimentation rate, mm/h | 28 (12, 53) | 47.0 (25.5, 66.5) | 0.003 |
| Alanine aminotransferase, U/L | 23 (14, 43) | 30.0 (19.5, 41.0) | 0.074 |
| Albumin, g/L | 35.20 (31.95, 39.35) | 31.4 (28.15, 34.10) | <0.001 |
| Total bilirubin, µmol/L | 9.30 (7.25, 12.40) | 13.05 (9.70, 19.00) | <0.001 |
| Alkaline phosphatase, U/L | 66 (53, 84) | 81.5 (57.5, 107.5) | 0.001 |
| γ-Glutamyl transpeptidase, U/L | 27.00 (18.00, 72.50) | 45.5 (26.0, 82.0) | 0.005 |
| Total cholesterol, mmol/L | 3.77 (3.22, 4.51) | 3.41 (2.97, 4.00) | 0.029 |
| Lactate dehydrogenase, U/L | 264.5 (211.0, 340.0) | 480.5 (418.0, 600.0) | <0.001 |
| Urea, mmol/L | 4.3 (3.5, 5.6) | 7.95 (5.75, 10.00) | <0.001 |
| Creatinine, µmol/L | 67 (58, 84) | 85.5 (69.0, 100.0) | <0.001 |
| Uric acid, µmol/L | 251 (196, 333) | 239.5 (181.0, 336.5) | 0.645 |
| Estimated glomerular filtration rate, ml/min/1.73 | 93.35 (80.30, 102.80) | 77.85 (59.70, 90.85) | <0.001 |
| High sensitivity C-reactive protein, mg/L | 14.1 (2.8, 59.5) | 87.3 (62.0, 155.6) | <0.001 |
| Potassium, mmol/L | 4.17 (3.91, 4.55) | 4.42 (4.00, 5.06) | 0.011 |
| Sodium, mmol/L | 140.30 (137.25, 141.90) | 138.55 (136.05, 142.45) | 0.255 |
| Chlorine, mmol/L | 101.80 (99.15, 103.55) | 100.05 (97.70, 102.90) | 0.055 |
| Corrected calcium, mmol/L | 2.37 (2.28, 2.42) | 2.39 (2.36, 2.48) | 0.002 |
| Bicarbonate, mmol/L | 23.90 (22.35, 25.45) | 21.5 (19.2, 24.5) | <0.001 |
| Ferritin, µg/L | 578.15 (319.70, 1125.20) | 1508.3 (1054.0, 2456.7) | <0.001 |
| Interleukin 2 receptor, U/ml | 574.5 (379.5, 928.0) | 1137.5 (890.0, 1833.0) | <0.001 |
| Interleukin 6, pg/ml | 7.49 (2.14, 27.57) | 68.00 (18.66, 137.35) | <0.001 |
| Interleukin 8, pg/ml | 11.85 (6.95, 23.55) | 26.6 (16.0, 70.7) | <0.001 |
| Interleukin 10, pg/ml | 5.00 (5.00, 7.85) | 10.5 (6.4, 16.8) | <0.001 |
| Tumour necrosis factor α, pg/ml | 8.7 (6.8, 11.4) | 11.1 (8.0, 15.9) | 0.004 |
| Fibrinogen, g/L | 4.50 ± 1.44 | 5.38 ± 2.04 | 0.005 |
| Prothrombin time | 13.8 (13.2, 14.5) | 15.6 (14.0, 17.2) | <0.001 |
| d-dimer, µg/ml FEU | 0.94 (0.40, 1.44) | 2.70 (1.21, 21.00) | <0.001 |
| High-sensitivity cardiac troponin I, pg/ml | 4.5 (2.0, 8.8) | 24.40 (9.35, 103.70) | <0.001 |
| N-terminal pro-brain natriuretic peptide | 98 (5, 1877) | 798 (176, 70000) | <0.001 |
| Interval, | 11 (9, 21) | 10.5 (7.0, 15.0) | 0.038 |
Categorical variables are shown in frequency (%). Continuous variables following normal or approximately normal distribution are shown in mean ± standard deviation. Continuous variables not following normal distribution are shown in median (interquartile range).
FEU, fibrinogen equivalent units.
The interval between date of first symptom and date of admission.
Figure 3The area under the receiver operating characteristic curve (AUROC) of the logistic regression model based on selected variables in the derivation set (A) and the external validation set (B)
Figure 4The relationship between probability of death and the normalized probability (A) and the cost curves of the predictive models using different cutoffs (B)
Figure 5The formula to calculate the risk scores (A) and their distributions among survivors and non-survivors (B) and the corresponding probability of death (C)