| Literature DB >> 33584997 |
Shijie Qin1,2, Weiwei Li1, Xuejia Shi2, Yanjun Wu3,4, Canbiao Wang2, Jiawei Shen1, Rongrong Pang1,5, Bangshun He1,6, Jun Zhao1, Qinghua Qiao7,4, Tao Luo1, Yanju Guo1,2, Yang Yang1, Ying Han1, Qiuyue Wu1, Jian Wu1, Wei Dai1, Libo Zhang1,5, Liming Chen2, Chunyan Xue1, Ping Jin2, Zhenhua Gan8,4, Fei Ma2, Xinyi Xia1,5.
Abstract
Critical patients and intensive care unit (ICU) patients are the main population of COVID-19 deaths. Therefore, establishing a reliable method is necessary for COVID-19 patients to distinguish patients who may have critical symptoms from other patients. In this retrospective study, we firstly evaluated the effects of 54 laboratory indicators on critical illness and death in 3044 COVID-19 patients from the Huoshenshan hospital in Wuhan, China. Secondly, we identify the eight most important prognostic indicators (neutrophil percentage, procalcitonin, neutrophil absolute value, C-reactive protein, albumin, interleukin-6, lymphocyte absolute value and myoglobin) by using the random forest algorithm, and find that dynamic changes of the eight prognostic indicators present significantly distinct within differently clinical severities. Thirdly, our study reveals that a model containing age and these eight prognostic indicators can accurately predict which patients may develop serious illness or death. Fourthly, our results demonstrate that different genders have different critical illness rates compared with different ages, in particular the mortality is more likely to be attributed to some key genes (e.g. ACE2, TMPRSS2 and FURIN) by combining the analysis of public lung single cells and bulk transcriptome data. Taken together, we urge that the prognostic model and first-hand clinical trial data generated in this study have important clinical practical significance for predicting and exploring the disease progression of COVID-19 patients.Entities:
Keywords: COVID-19; China; Clinical characteristics; SARS-CoV-2; prognostic factors
Year: 2021 PMID: 33584997 PMCID: PMC7870437 DOI: 10.1016/j.csbj.2021.01.042
Source DB: PubMed Journal: Comput Struct Biotechnol J ISSN: 2001-0370 Impact factor: 7.271
Fig. 1The research flow and overall distribution of 3044 COVID-19 patients. A: The roadmap of research. B: Age distribution of COVID-19 patients. C: Treatment outcome chart of COVID-19 patients. D: Ratio chart of the highest historical classification (mild, severe, critical) of COVID-19 patients. E: Scale diagram of COVID-19 patients staying in the ICU. F: Sex ratio chart of COVID-19 patients.
Baseline characteristics of 3044 COVID-19 patients.
| Total. NO. (Highest classification) | Total (n = 3044) | Mild (n = 1467) | Severe (n = 1418) | Critical (n = 159) | |
|---|---|---|---|---|---|
| Age, median [IQR] | 60.0 (49.0–68.0) | 56.0 (45.0–65.0) | 63.0 (53.0–71.0) | 68.0 (61.5–76.5) | |
| Sex | |||||
| Female | 1498 (49.21%) | 719 (49.01%) | 722 (50.91%) | 57 (35.85%) | 0.002 |
| Male | 1546 (50.78%) | 748 (50.99%) | 696 (49.08%) | 102 (64.15%) | |
| Stay in ICU | 127 (4.17%) | 1 (0.07%) | 23 (1.62%) | 103 (64.78%) | |
| State of Death | 66 (2.17%) | 0 (0.00%) | 4 (0.35%) | 61 (38.36%) | |
| Hypertension | 935 (30.72%) | 366 (24.95%) | 492 (34.7%) | 77 (48.43%) | |
| Diabetes | 435 (14.29%) | 169 (11.52%) | 224 (15.8%) | 42 (26.42%) | |
| Coronary atherosclerosis | 165 (5.42%) | 50 (3.41%) | 98 (6.91%) | 17 (10.69%) | |
| Tumor | 61 (2.00%) | 15 (1.02%) | 38 (2.68%) | 8 (5.03%) | |
| Chronic obstructive pulmonary disease | 30 (0.99%) | 5 (0.34%) | 21 (1.48%) | 4 (2.52%) | 0.001 |
| Hyperlipidemia | 23 (0.76%) | 11 (0.75%) | 12 (0.85%) | 0 (0%) | 0.505 |
| Abnormal liver function | 59 (1.94%) | 27 (1.84%) | 27 (1.9%) | 5 (3.14%) | 0.522 |
| Gastritis | 36 (1.18%) | 17 (1.16%) | 17 (1.20%) | 2 (1.26%) | 0.991 |
| Cirrhosis | 16 (0.53%) | 4 (0.27%) | 11 (0.71%) | 1 (0.63%) | 0.245 |
| Hepatitis | 55 (1.81%) | 30 (2.04%) | 24 (1.69%) | 1 (0.63%) | 0.403 |
| Nephritis | 7 (0.23%) | 3 (0.2%) | 3 (0.21%) | 1 (0.63%) | 0.558 |
| Benign prostatic hyperplasia | 32 (1.05%) | 6 (0.41%) | 22 (1.55%) | 4 (2.52%) | 0.002 |
| Prostatitis | 4 (0.13%) | 1 (0.07%) | 3 (0.21%) | 0 (0%) | 0.509 |
| Asthma | 8 (0.26%) | 3 (0.2%) | 4 (0.28%) | 1 (0.63%) | 0.599 |
| Respiratory failure | 52 (1.71%) | 0 (0%) | 16 (1.13%) | 46 (28.93%) | |
| Acute respiratory distress syndrome | 24 (0.79%) | 0 (0%) | 2 (0.14%) | 22 (13.84%) | |
| Abnormal kidney function | 26 (0.85%) | 6 (0.41%) | 17 (1.20%) | 3 (1.89%) | 0.024 |
| Heart failure | 14 (0.46%) | 2 (0.13%) | 6 (0.42%) | 6 (3.77%) | |
| Venous thrombosis | 10 (0.33%) | 3 (0.20%) | 5 (0.35%) | 2 (1.25%) | 0.079 |
| Thrombocytopenia | 20 (0.66%) | 5 (0.34%) | 11 (0.78%) | 4 (2.51%) | 0.010 |
| 13.0 (8–19) | 12.0 (8–17) | 14.0 (8–22) | 19.0 (11–32) | ||
| 1386 (45.53%) | 751 (51.19%) | 604 (42.60%) | 31 (19.50%) | ||
| Hypertension | 1.483 | 0.394 | 1.081 | 2.029 | 0.014 |
| Diabetes | 1.557 | 0.443 | 1.076 | 2.216 | 0.016 |
| Coronary atherosclerosis | 1.174 | 0.161 | 0.671 | 1.947 | 0.553 |
| Cancer | 2.315 | 0.839 | 1.068 | 4.570 | 0.022 |
| Chronic obstructive pulmonary disease | 0.822 | −0.196 | 0.233 | 2.241 | 0.728 |
| Hyperlipidemia | 0 | −13.76 | NA | NA | 0.978 |
| Abnormal liver function | 1.573 | 0.453 | 0.530 | 3.758 | 0.355 |
| Abnormal renal function | 1.313 | 0.272 | 0.303 | 3.950 | 0.667 |
| Gastritis | 0.778 | −0.252 | 0.124 | 2.648 | 0.735 |
| Cirrhosis | 1.918 | 0.651 | 0.288 | 7.507 | 0.411 |
| Hepatitis | 0.651 | −0.430 | 0.105 | 2.173 | 0.559 |
| Nephritis | 2.426 | 0.886 | 0.123 | 15.895 | 0.430 |
| Benign prostatic hyperplasia | 0.919 | −0.084 | 0.264 | 2.461 | 0.880 |
| Prostatitis | 0 | −11.778 | NA | NA | 0.978 |
| Asthma | 2.605 | 0.957 | 0.132 | 16.834 | 0.393 |
Note: IQR: The 25% and 75% quantiles. ICU: intensive care unit. OR: odds ratio. log2OR: log2 (odds ratio). 95% CI lower: The lower of 95% confidence interval. 95% CI upper: The upper of 95% confidence interval.
The impact of different laboratory test indicators on clinical death and critical illness outcomes.
| Laboratory testing index | OR | log2OR | 95% CI lower | 95% CI upper | Total | |
|---|---|---|---|---|---|---|
| Neutrophil percentage | 1.136 | 0.127 | 1.118 | 1.155 | 2976 | |
| Neutrophil absolute value | 1.466 | 0.382 | 1.387 | 1.553 | 2976 | |
| Basophil percentage | 0.014 | −4.260 | 0.006 | 0.035 | 2976 | |
| Absolute value of basophil | 0.000 | −48.688 | 0.000 | 0.000 | 2976 | |
| Eosinophil percentage | 0.539 | −0.618 | 0.462 | 0.622 | 2976 | |
| Eosinophil absolute value | 0.003 | −5.689 | 0.000 | 0.023 | 2976 | |
| Monocyte percentage | 0.657 | −0.420 | 0.610 | 0.705 | 2976 | |
| Monocyte absolute value | 0.937 | −0.065 | 0.415 | 2.003 | 0.872 | 2976 |
| Lymphocyte percentage | 0.854 | −0.157 | 0.836 | 0.873 | 2976 | |
| Lymphocyte absolute value | 0.150 | −1.895 | 0.103 | 0.217 | 2976 | |
| Blood leukocytes | 1.349 | 0.299 | 1.282 | 1.422 | 2976 | |
| Red blood cells | 0.803 | −0.212 | 0.600 | 1.079 | 0.1424 | 2976 |
| Potassium | 1.113 | 0.107 | 0.833 | 1.474 | 0.4623 | 2851 |
| sodium | 0.966 | −0.034 | 0.925 | 1.009 | 0.126 | 2851 |
| chlorine | 0.903 | −0.102 | 0.867 | 0.942 | 2851 | |
| calcium | 0.002 | −6.116 | 0.001 | 0.008 | 2850 | |
| phosphorus | 0.193 | −1.644 | 0.091 | 0.406 | 2390 | |
| Serum magnesium | 18.143 | 2.898 | 3.165 | 105.618 | 0.0012 | 2389 |
| Alanine aminotransferase | 1.004 | 0.004 | 1.001 | 1.008 | 0.0184 | 2898 |
| Aspartate aminotransferase | 1.007 | 0.007 | 1.003 | 1.012 | 0.0072 | 2907 |
| Total protein | 0.918 | −0.085 | 0.894 | 0.942 | 2901 | |
| albumin | 0.800 | −0.223 | 0.769 | 0.832 | 2901 | |
| Total bilirubin | 1.038 | 0.037 | 1.020 | 1.057 | 2900 | |
| Direct bilirubin | 1.080 | 0.077 | 1.046 | 1.124 | 2900 | |
| Total bile acid | 0.971 | −0.029 | 0.939 | 0.998 | 0.064 | 2899 |
| Indirect bilirubin | 1.050 | 0.049 | 1.010 | 1.094 | 0.017 | 2378 |
| globulin | 1.014 | 0.014 | 0.976 | 1.051 | 0.475 | 2380 |
| Alkaline phosphatase | 1.008 | 0.008 | 1.005 | 1.011 | 2899 | |
| γ-glutamyl transpeptidase | 1.005 | 0.005 | 1.003 | 1.007 | 2899 | |
| Cystatin C | 1.837 | 0.608 | 1.431 | 2.390 | 2894 | |
| PH | 1.002 | 0.002 | 0.750 | 1.332 | 0.99 | 2313 |
| Urine red blood cells | 1.001 | 0.001 | 1.001 | 1.002 | 0.004 | 2377 |
| Urine leukocyte | 1.000 | 0.000 | 1.000 | 1.001 | 0.184 | 2378 |
| Urea nitrogen | 1.250 | 0.223 | 1.187 | 1.319 | 2903 | |
| Creatinine | 1.003 | 0.003 | 1.001 | 1.005 | 0.0023 | 2903 |
| Uric acid | 0.996 | −0.004 | 0.994 | 0.998 | 2898 | |
| Total carbon dioxide | 0.962 | −0.039 | 0.912 | 1.015 | 0.1551 | 2897 |
| Creatine kinase | 1.003 | 0.003 | 1.001 | 1.004 | 2847 | |
| Lactate dehydrogenase | 1.010 | 0.010 | 1.009 | 1.012 | 2850 | |
| alpha-hydroxybutyrate dehydrogenase | 1.012 | 0.012 | 1.010 | 1.013 | 2850 | |
| Creatine kinase isoenzyme | 1.006 | 0.006 | 1.000 | 1.017 | 0.0821 | 2846 |
| Myoglobin | 1.008 | 0.008 | 1.005 | 1.012 | 1270 | |
| Hypersensitive troponin I | 1.525 | 0.422 | 1.168 | 2.353 | 0.012 | 1276 |
| B-type natriuretic peptide | 1.001 | 0.001 | 1.001 | 1.002 | 1638 | |
| Fibrinogen | 1.064 | 0.062 | 0.895 | 1.222 | 0.385 | 2528 |
| Activated partial thromboplastin time | 1.046 | 0.045 | 1.021 | 1.078 | 0.001 | 2529 |
| Prothrombin time | 1.328 | 0.284 | 1.224 | 1.445 | 2529 | |
| Thrombin time | 1.186 | 0.170 | 1.100 | 1.291 | 2529 | |
| International standardized ratio | 27.539 | 3.316 | 10.362 | 75.970 | 2529 | |
| DD dimer | 1.265 | 0.235 | 1.200 | 1.339 | 2510 | |
| C-reactive protein | 1.023 | 0.023 | 1.020 | 1.027 | 2926 | |
| Hypersensitive C-reactive protein | 1.227 | 0.204 | 1.179 | 1.278 | 2923 | |
| Interleukin-6 | 1.012 | 0.012 | 1.008 | 1.017 | 1472 | |
| Procalcitonin | 1.881 | 0.632 | 1.368 | 2.899 | 0.002 | 2018 |
| Glucose | 1.252 | 0.225 | 1.184 | 1.326 | 2900 |
Note: OR: odds ratio. log2OR: log2 (odds ratio). 95% CI lower: The lower of 95% confidence interval.
95% CI upper: The upper of 95% confidence interval.
Independent prognostic factors for laboratory inspection indicators of various categories.
| Immune cell percentage | OR | log2OR | 95% CI lower | 95% CI upper | |
|---|---|---|---|---|---|
| (Intercept) | 0.000 | (10.812) | 0.000 | 0.000 | |
| Red blood cells | 0.783 | (0.245) | 0.598 | 1.023 | 0.073 |
| Blood leukocytes | 1.063 | 0.061 | 1.010 | 1.124 | 0.026 |
| Neutrophil percentage | 1.130 | 0.122 | 1.110 | 1.151 | |
| (Intercept) | 0.121 | (2.111) | 0.068 | 0.215 | |
| Lymphocyte absolute value | 0.215 | (1.535) | 0.147 | 0.310 | |
| Monocyte absolute value | 0.424 | (0.859) | 0.165 | 1.022 | 0.066 |
| Neutrophil absolute value | 1.415 | 0.347 | 1.330 | 1.510 | |
| (Intercept) | 271.269 | 5.603 | 0.448 | 188182.081 | 0.090 |
| Serum magnesium | 72.921 | 4.289 | 11.323 | 482.399 | |
| phosphorus | 0.170 | (1.772) | 0.086 | 0.331 | |
| chlorine | 0.840 | (0.174) | 0.791 | 0.892 | |
| calcium | 0.001 | (6.881) | 0.000 | 0.004 | |
| Potassium | 1.747 | 0.558 | 1.285 | 2.366 | |
| sodium | 1.154 | 0.143 | 1.079 | 1.234 | |
| (Intercept) | 0.014 | (4.263) | 0.009 | 0.022 | |
| Cystatin C | 3.782 | 1.330 | 2.683 | 5.485 | |
| Urine red blood cells | 1.001 | 0.001 | 1.000 | 1.002 | 0.006 |
| (Intercept) | 0.097 | (2.334) | 0.059 | 0.158 | |
| Urea nitrogen | 1.526 | 0.423 | 1.431 | 1.632 | |
| Creatinine | 0.995 | (0.006) | 0.991 | 0.998 | 0.001 |
| Uric acid | 0.991 | (0.009) | 0.989 | 0.993 | |
| (Intercept) | 33.286 | 3.505 | 6.222 | 182.502 | |
| Alanine aminotransferase | 0.990 | (0.010) | 0.982 | 0.997 | |
| Aspartate aminotransferase | 1.016 | 0.016 | 1.006 | 1.026 | 0.001 |
| albumin | 0.797 | (0.227) | 0.764 | 0.829 | |
| Total bilirubin | 0.948 | (0.053) | 0.882 | 1.017 | 0.142 |
| Direct bilirubin | 1.288 | 0.253 | 1.122 | 1.485 | |
| Total bile acid | 0.904 | (0.100) | 0.875 | 0.930 | |
| globulin | 1.042 | 0.041 | 1.001 | 1.083 | 0.041 |
| Alkaline phosphatase | 1.007 | 0.007 | 1.003 | 1.012 | 0.001 |
| (Intercept) | 0.009 | (4.709) | 0.005 | 0.016 | |
| Creatine kinase | 0.995 | (0.005) | 0.993 | 0.998 | 0.001 |
| Lactate dehydrogenase | 1.013 | 0.013 | 1.010 | 1.015 | |
| Creatine kinase isoenzyme | 0.970 | (0.031) | 0.937 | 0.999 | 0.059 |
| Myoglobin | 1.012 | 0.012 | 1.008 | 1.017 | |
| Hypersensitive troponin I | 0.614 | (0.488) | 0.269 | 0.998 | 0.260 |
| (Intercept) | 0.001 | (7.562) | 0.000 | 0.002 | |
| Prothrombin time | 1.187 | 0.171 | 1.105 | 1.289 | |
| Fibrinogen | 1.277 | 0.244 | 1.090 | 1.585 | 0.014 |
| Thrombin time | 1.099 | 0.094 | 1.033 | 1.195 | 0.017 |
| DD dimer | 1.255 | 0.227 | 1.189 | 1.330 | |
| (Intercept) | 0.058 | (2.841) | 0.046 | 0.073 | |
| Interleukin-6 | 1.014 | 0.014 | 1.010 | 1.019 | |
| (Intercept) | 0.017 | (4.103) | 0.011 | 0.023 | |
| C-reactive protein | 1.018 | 0.018 | 1.014 | 1.022 | |
| Hypersensitive C-reactive protein | 1.136 | 0.127 | 1.088 | 1.188 | |
| (Intercept) | 0.076 | (2.575) | 0.064 | 0.090 | |
| Procalcitonin | 2.363 | 0.860 | 1.562 | 3.857 | |
| (Intercept) | 0.019 | (3.966) | 0.014 | 0.026 | |
| Glucose | 1.227 | 0.205 | 1.175 | 1.282 |
Note: OR: odds ratio. log2OR: log2(odds ratio). 95% CI lower: The lower of 95% confidence interval. 95% CI upper: The upper of 95% confidence interval.
Fig. 2Screening the most important prognostic indicators through random forest machine learning algorithms. A: Principal component analysis chart of age, gender and 29 laboratory test indexes. B: The 5 times 10-fold cross-validation curve shows the relationship between the model error and the number of variables used for fitting. C: The ranking of the importance of 31 prognostic indicators calculated based on the random forest algorithm. MeanDecreaseGini represents the influence of each variable on the heterogeneity of the observations on each node of the classification tree. The larger the value, the greater the importance of the variable. D: The overall dynamic changes of the eight most important prognostic indicators at different time points before the end event. Red and blue lines represent the fit curve. 1 represents the composite endpoint event group in which patients developed into critical illness or death or entered the ICU while 0 represents the group without composite endpoint events where patients have milder symptoms and better treatment effects. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
Fig. 3A joint prognostic model that can be used for clinical decision-making. A ~ H: The cumulative event rate of patients with abnormal and non-abnormal groups of 8 important prognostic indicators. The normal reference range is as follows: neutrophil percentage: 40%-75%, procalcitonin: 0–0.05 ng/ml, neutrophil absolute value: 1.8–6.3*10-9 /L, C reactive protein: 0–4 mg/L, albumin: 40–55 g/L, interleukin 6: 0–5.90 pg/mL, lymphocyte absolute value: 1.1–3.2*10-9 /L, myoglobin: 0–80 ng/ml. I: ROC curve of the joint model in the training set. J: Validate the ROC curve of the joint model in the concentration. K: The nomogram shows the model prediction in all 611 samples detecting 8 indicators at the same time. The line segment corresponding to each variable is marked with a scale, which represents the range of possible values of the variable, and the length of the line segment reflects the contribution of this factor to the ending event. Point in the Fig represents the individual score corresponding to each variable under different values. Total Point represents the total score of the individual scores after the values of all variables are added up. The risk probability represents the patient's probability that a composite endpoint event will occur.
Fig. 4Analysis of lung single cell transcriptomes of different ages and genders. A: The heat map shows the marker genes corresponding to different cell types in the lung. B: The UMAP cluster map shows the clustering of different cells in the lungs. C: Expression and distribution of ACE2 in different cell clusters in the lung. D: TMPRSS2 expression and distribution in different cell clusters in the lung. E: Expression and distribution of FURIN in different cell clusters in the lung. F: Differences in expression and proportion of ACE2, TMPRSS2 and FURIN in old and young people. G: Differences in expression and proportion of ACE2, TMPRSS2 and FURIN in men and women.