Literature DB >> 32616597

Development of a clinical decision support system for severity risk prediction and triage of COVID-19 patients at hospital admission: an international multicentre study.

Guangyao Wu^1,2,3, Pei Yang^4,2, Yuanliang Xie⁴, Henry C Woodruff^5,6, Xiangang Rao⁷, Julien Guiot⁸, Anne-Noelle Frix⁸, Renaud Louis⁸, Michel Moutschen⁹, Jiawei Li¹⁰, Jing Li¹¹, Chenggong Yan^5,12, Dan Du⁴, Shengchao Zhao⁴, Yi Ding⁴, Bin Liu⁴, Wenwu Sun¹³, Fabrizio Albarello¹⁴, Alessandra D'Abramo¹⁴, Vincenzo Schininà¹⁴, Emanuele Nicastri¹⁴, Mariaelena Occhipinti¹⁵, Giovanni Barisione¹⁶, Emanuela Barisione¹⁷, Iva Halilaj⁵, Pierre Lovinfosse¹⁸, Xiang Wang^4,3, Jianlin Wu¹⁹, Philippe Lambin^5,6.

Abstract

BACKGROUND: The outbreak of coronavirus disease 2019 (COVID-19) has globally strained medical resources and caused significant mortality.
OBJECTIVE: To develop and validate a machine-learning model based on clinical features for severity risk assessment and triage for COVID-19 patients at hospital admission.
METHOD: 725 patients were used to train and validate the model. This included a retrospective cohort from Wuhan, China of 299 hospitalised COVID-19 patients from 23 December 2019 to 13 February 2020, and five cohorts with 426 patients from eight centres in China, Italy and Belgium from 20 February 2020 to 21 March 2020. The main outcome was the onset of severe or critical illness during hospitalisation. Model performances were quantified using the area under the receiver operating characteristic curve (AUC) and metrics derived from the confusion matrix.
RESULTS: In the retrospective cohort, the median age was 50 years and 137 (45.8%) were male. In the five test cohorts, the median age was 62 years and 236 (55.4%) were male. The model was prospectively validated on five cohorts yielding AUCs ranging from 0.84 to 0.93, with accuracies ranging from 74.4% to 87.5%, sensitivities ranging from 75.0% to 96.9%, and specificities ranging from 55.0% to 88.0%, most of which performed better than the pneumonia severity index. The cut-off values of the low-, medium- and high-risk probabilities were 0.21 and 0.80. The online calculators can be found at www.covid19risk.ai.
CONCLUSION: The machine-learning model, nomogram and online calculator might be useful to access the onset of severe and critical illness among COVID-19 patients and triage at hospital admission.

Entities: Chemical Disease Gene Species

Mesh：

Year: 2020 PMID： 32616597 PMCID： PMC7331655 DOI： 10.1183/13993003.01104-2020

Source DB: PubMed Journal: Eur Respir J ISSN： 0903-1936 Impact factor: 16.671

Introduction

In December 2019, a novel coronavirus, severe acute respiratory syndrome-coronavirus-2 (SARS-CoV-2; earlier named as 2019-nCoV), emerged in Wuhan, China [1]. The disease caused by SARS-CoV-2 was named coronavirus disease 2019 (COVID-19). As of 15 May 2020, more than 4 490 000 COVID-19 patients have been reported globally, and over 300 000 patients have died [2]. The outbreak of COVID-19 has developed into a pandemic [3]. Among COVID-19 patients, around 80% present with mild illness whose symptoms usually disappear within 2 weeks [4]. However, around 20% of the patients may proceed and necessitate hospitalisation and increased medical support. The mortality rate for severe patients is around 13.4% [4]. Therefore, risk assessment of patients preferably in a quantitative, non-subjective way is extremely important for patient management and medical resource allocation. General quarantine and symptomatic treatment at home or mobile hospitals can be used for most non-severe patients, while a higher level of care and fast track to the intensive care unit (ICU) is needed for severe patients. Previous studies have summarised the clinical and radiological characteristics of severe COVID-19 patients, while the prognostic value of different variables is still unclear [5, 6]. Several scoring systems that are in common clinical use (e.g. Sequential Organ Failure Assessment score; Confusion, Urea, Respiratory rate, Blood pressure, Age 65; Acute Physiology and Chronic Health Evaluation, etc.) could be applied to the triage problem, albeit each with their own problems and limitations, such as the need for laboratory variables that are hard to obtain at hospital admission [7]. The pneumonia severity index (PSI) stands out as it is used to assess the probability of severity and mortality among adult patients with community-acquired pneumonia and to help hospitalisation management [8]. A better solution could possibly be found using machine learning, a branch of artificial intelligence that learns from past data in order to build a prognostic model [9]. In recent years, machine learning has been developed as a useful tool to analyse large amounts of data from medical records or images [10]. Previous modelling studies focused on forecasting the potential international spread of COVID-19 [11]. Therefore, our objective is to develop and validate a prognostic machine-learning model based on clinical, laboratory and radiological variables of COVID-19 patients at hospital admission for severity risk assessment during hospitalisation, and compare the performance with that of PSI as a representative clinical assessment method. Our ambition is to develop a multifactorial decision support system with different datasets to facilitate risk prediction and triage (home or mobile hospital quarantine, hospitalisation or ICU) of the patient at hospital admission.

Methods

Patients

The institutional review board approved this study (2020-71), which followed the Standards for Reporting of Diagnostic Accuracy Studies statement [12], and the requirement for written informed consent was waived. 299 adult confirmed COVID-19 patients from the central hospital of Wuhan, China were included consecutively and retrospectively between 23 December 2019 and 13 February 2020. The inclusion criteria were: 1) patients with confirmed COVID-19 disease; and 2) patients presenting at hospital for admission. The exclusion criteria were: 1) patients who already had a severe illness at hospital admission; 2) time interval>2 days between admission and examinations; and 3) no data available or delayed results as described below. The patients included from this centre were divided into two datasets according to the entrance time of hospitalisation, 80% for training (239 patients from 23 December 2019 to 28 January 2020) and 20% for internal validation (60 patients from 29 January to 13 February 2020). The five test datasets were collected between 20 February 2020 and 31 March 2020 from other eight centres (supplementary material) in China, Italy and Belgium under the same inclusion and exclusion criteria (figure 1).

FIGURE 1

Flowchart of the patient selection process. COVID-19: coronavirus disease 2019; RT-PCR: reverse transcriptase-PCR.

Flowchart of the patient selection process. COVID-19: coronavirus disease 2019; RT-PCR: reverse transcriptase-PCR. Patients were labelled as having a “severe disease” if at least one of the following criteria were met during hospitalisation [6, 13]: 1) respiratory failure requiring mechanical ventilation; 2) shock; 3) ICU admission; 4) organ failure; or 5) death. Patients were labelled as having a “non-severe disease” if none of the above mentioned criteria were met during the whole hospitalisation process until deemed recovered and discharged from the hospital.

Data collection

Clinical, laboratory and radiological characteristics, and outcome data were obtained from the case record form shared by the International Severe Acute Respiratory and Emerging Infection Consortium from the electronic medical records [14]. A confirmed case with COVID-19 was defined as a positive result of high-throughput sequencing or real-time reverse-transcriptase PCR assay for nasal and pharyngeal swab specimens. After consultation with respiratory specialists and review of the recent COVID-19 literature, a set of clinical, laboratory and radiological characteristics was identified and data collected from the electronic medical system. The clinical characteristics included basic information (five variables), comorbidities (11 variables) and symptoms (13 variables). All clinical characteristics were obtained when the patients were admitted to hospital for the first time. 42 laboratory results were recorded, including complete blood count, white blood cell differential count, d-dimer, C-reactive protein (CRP), cardiac enzymes, procalcitonin, liver function test, kidney function test, B-type natriuretic peptide and electrolyte test. The arterial blood gas was not taken into account due to missing data for most early-stage patients. The metric conversion of laboratory results was performed using an online conversion table [15]. A detailed list of variables can be found in tables 1 and 2.

TABLE 1

Clinical characteristics and radiological findings of patients confirmed with coronavirus disease 2019

	Non-severe group	Severe group	p-value^#
Subjects n	228	71
Age years	43.0 (33.0–61.0)	62.0 (52.5–71.5)	<0.001
Males	95 (41.7)	42 (59.2)	0.014
Smoking history	17 (7.5)	14 (19.7)	0.006
Hospital staff	86 (37.7)	4 (5.6)	<0.001
Time of onset of illness days	4.0 (2.0–7.0)	4.0 (2.0–7.0)	0.963
Comorbidities
Hypertension	44 (19.3)	31 (43.7)	<0.001
Diabetes	19 (8.3)	18 (25.4)	<0.001
Hyperlipidaemia	11 (4.8)	5 (7.0)	0.545
Cardiopathy disease	2 (0.88)	8 (11.3)	<0.001
COPD	8 (3.5)	13 (18.3)	<0.001
Cerebrovascular disease	6 (2.6)	16 (22.5)	<0.001
Kidney disease	5 (2.2)	11 (15.5)	<0.001
Fatty liver	28 (12.3)	12 (16.9)	0.322
Hepatitis B virus carrier	2 (0.88)	5 (7.0)	0.009
Cancer history	12 (5.3)	4 (5.6)	1
Surgical history	30 (13.2)	15 (21.1)	0.127
Symptoms
Fever	172 (75.4)	47 (66.2)	0.128
Body temperature C	37.8 (37.3–38.4)	37.5 (36.8–38.0)	0.027
Cough	153 (67.1)	46 (64.8)	0.774
Sputum	59 (25.9)	25 (35.2)	0.133
Weakness	102 (44.7)	31 (43.7)	0.892
Diarrhoea	25 (11.0)	8 (11.3)	1
Vomiting	16 (7.0)	9 (12.7)	0.144
Chest tightness	50 (21.9)	30 (42.3)	0.001
Dyspnoea	11 (4.8)	4 (5.6)	0.760
Muscular soreness	61 (26.8)	17 (23.9)	0.757
Chill	38 (16.7)	11 (15.5)	1
Conjunctival congestion	1 (0.44)	1 (1.4)	0.419
Headache or dizziness	34 (14.9)	12 (16.9)	0.708
Radiological findings
Main findings			0.928
Normal	4 (1.8)	1 (1.4)
Ground-glass opacity only	132 (57.9)	39 (54.9)
Consolidation only	22 (9.6)	6 (8.5)
Mixed	70 (30.7)	25 (35.2)
Vascular enlargement	66 (28.9)	36 (50.7)	<0.001
Air bronchogram	49 (21.5)	30 (42.3)	0.001
Lesion range score	4.5 (2.0–7.0)	6.0 (4.0–10.5)	0.001

Data are presented median (interquartile range) or n (%), unless otherwise stated. #: obtained using Fisher's exact test or Mann–Whitney U-test.

TABLE 2

Laboratory results of patients with coronavirus disease 2019 at hospital admission

Laboratory results	Non-severe group	Severe group	p-value^#
Subjects n	228	71
Complete blood cell count
White blood count ×10⁹ cells·L⁻¹	4.5 (3.3–5.8)	5.5 (4.0–7.6)	<0.001
Red blood cell count ×10¹² cells·L⁻¹	4.4 (4.1–4.7)	4.3 (4.0–4.7)	0.217
Haemoglobin g·L⁻¹	130.5 (121.0–142.0)	132.0 (117.0–142.5)	0.968
Platelets ×10⁹ cells·L⁻¹	174.0 (140.8–214.5)	149.0 (116.5–183.5)	<0.001
Haematocrit %	39.5 (36.6–42.7)	38.9 (36.0–42.6)	0.472
Mean corpuscular volume fL	90.5 (87.5–93.4)	90.5 (87.8–94.7)	0.526
Mean corpuscular haemoglobin pg	30.0 (28.7–30.9)	30.0 (29.1–31.5)	0.266
Mean corpuscular haemoglobin concentration g·dL⁻¹	330.0 (323.0–336.0)	329.0 (323.0–337.0)	0.684
Red blood cell distribution width standard deviation fL	39.1 (36.4–41.1)	40.0 (15.3–41.7)	0.193
Red blood cell distribution width coefficient of variation %	12.7 (12.1–14.9)	13.0 (12.5–38.3)	0.015
Platelet distribution width %	12.9 (10.7–16.3)	12.4 (10.8–15.8)	0.371
Platelet large cell ratio %	24.0 (19.7–30.1)	26.8 (21.1–32.3)	0.028
Mean platelet volume fL	9.8 (9.2–10.6)	10.2 (9.4–10.9)	0.016
Thrombocytocrit %	0.17 (0.14–0.21)	0.15 (0.13–0.18)	0.002
White cell differential count
Neutrophil %	64.5 (56.6–74.7)	76.3 (67.1–85.7)	<0.001
Lymphocyte %	26.1 (17.8–26.8)	15.3 (7.6–22.8)	<0.001
Monocyte %	7.7 (5.7–9.8)	6.9 (4.5–8.9)	0.061
Eosinophil %	0.10 (0.00–0.60)	0.00 (0.00–0.31)	0.007
Basophil %	0.20 (0.10–0.30)	0.20 (0.10–0.30)	0.558
Neutrophil count ×10⁹ cells·L⁻¹	2.9 (1.9–3.9)	4.2 (2.7–6.1)	<0.001
Lymphocyte count ×10⁹ cells·L⁻¹	1.1 (0.78–1.5)	0.77 (0.49–1.14)	<0.001
Monocyte count ×10⁹ cells·L⁻¹	0.33 (0.24–0.44)	0.37 (0.23–0.52)	0.215
Eosinophil count ×10⁹ cells·L⁻¹	0.01 (0.00–0.03)	0.00 (0.00–0.01)	0.087
Basophil count ×10⁹ cells·L⁻¹	0.01 (0.01–0.01)	0.01 (0.01–0.02)	0.364
D-dimer mg·L⁻¹	0.45 (0.24–0.90)	0.73 (0.46–1.83)	<0.001
C-reactive protein mg·dL⁻¹	1.4 (0.46–3.1)	3.9 (2.4–7.2)	<0.001
Cardiac enzymes
Aspartate amino transferase U·L⁻¹	22.1 (17.3–31.1)	33.6 (22.2–42.8)	<0.001
Alpha-hydroxybutyric dehydrogenase U·L⁻¹	143.0 (113.0–174.1)	189.0 (157.5–268.5)	<0.001
Lactate dehydrogenase U·L⁻¹	183.5 (142.8–231.2)	252.0 (199.7–331.8)	<0.001
Creatine kinase U·L⁻¹	78.5 (45.0–132.4)	108.4 (59.8–248.0)	0.001
Liver function
Alanine aminotransferase U·L⁻¹	21.6 (13.4–33.4)	25.8 (15.7–38.8)	0.202
Aspartate transaminase U·L⁻¹	22.3 (17.1–31.2)	31.1 (21.0–40.1)	<0.001
Gamma-glutamyl transpeptidase U·L⁻¹	22.3(13.7–42.8)	36.8 (22.7–51.5)	<0.001
Kidney function
Urea mmol·L⁻¹	4.1 (3.2–5.1)	6.3 (4.6–8.2)	<0.001
Creatinine μmol·L⁻¹	64.7 (52.8–75.1)	77.3 (63.8–90.8)	<0.001
Procalcitonin ng·mL⁻¹	0.05 (0.04–0.09)	0.11 (0.06–0.31)	<0.001
B-type natriuretic peptide pg·mL⁻¹	59.4 (22.4–106.7)	156.0 (56.4–486.7)	<0.001
Electrolyte
Potassium mmol·L⁻¹	4.1 (4.0–4.3)	4.1 (3.8–4.3)	0.353
Sodium mmol·L⁻¹	141.1 (140.0–142.2)	139.9 (137.8–141.5)	<0.001
Chloride mmol·L⁻¹	103.9 (102.5–105.6)	102.7 (100.9–105.1)	0.006
Calcium mmol·L⁻¹	2.3 (2.2–2.4)	2.2 (2.1–2.3)	<0.001
Phosphate mmol·L⁻¹	1.0 (0.9–1.1)	0.95 (0.80–1.1)	0.005

Data are presented as median (interquartile range), unless otherwise stated. #: obtained using Mann–Whitney U-test.

Clinical characteristics and radiological findings of patients confirmed with coronavirus disease 2019 Data are presented median (interquartile range) or n (%), unless otherwise stated. #: obtained using Fisher's exact test or Mann–Whitney U-test. Laboratory results of patients with coronavirus disease 2019 at hospital admission Data are presented as median (interquartile range), unless otherwise stated. #: obtained using Mann–Whitney U-test. The semantic computed tomography (CT) characteristics (including ground-glass opacity, consolidation, vascular enlargement, air bronchogram and lesion range score) were independently evaluated on all datasets by two radiologists (P. Yang, a radiologist with 5 years’ experience in chest CT images, and Y. Xie, a radiologist with 20 years’ experience in chest CT images), who were blinded to clinical and laboratory results. Any disagreement was resolved by a consensus read. Lesion range was identified as areas of ground-glass opacity or consolidation and was graded with a six-point scale according to the lesion volume proportion in each single lobe: 0=no lung parenchyma involved; 1=up to 5% of lung parenchyma involved; 2=5–25%; 3=26–50%; 4=51–75%; and 5=76–100% of lung parenchyma involved. The lesion volume proportion was automatically calculated by the Pneumonia Assisted Diagnosis System (version 1.17.0; Shukun Technology), and the final score was the total score from five lobes (figure 2). Detailed CT acquisition and reconstruction parameters are presented in the supplementary material.

FIGURE 2

Chest computed tomography images of two patients with coronavirus disease 2019 pneumonia. a) A 48-year-old man, the focal ground-glass opacities in the bilateral lung lobes (yellow arrow) were automatically segmented (orange areas) and the lesion volume in each lobe was calculated: right superior lobe: 0.2%; right middle lobe: 0.3%; right inferior lobe: 0.1%; left superior lobe: 0.9%; and left inferior lobe: 9.4%. The lesion range score was 6 (1+1+1+1+2). b) A 70-year-old man, the peripheral ground-glass opacities in the bilateral lung lobes (yellow arrow) were automatically segmented (orange areas) and the lesion volume in each lobe was calculated: right superior lobe: 32.1%; right middle lobe: 16.4%; right inferior lobe: 62.7%; left superior lobe: 12.8%; and left inferior lobe: 7.3%. The lesion range score was 13 (3+2+4+2+2).

Feature selection and modelling

All feature selection and model training were performed in the training dataset alone to prevent information leakage. An overview of the functions used is given in table S1. In order to reduce feature dimensionality, features showing high pairwise Spearman correlation (r=>0.8) and the highest mean correlation with all remaining features were removed, followed by application of the Boruta algorithm to select important features [16]. The Boruta algorithm combines feature rank based on the random forest classification algorithm and selection frequency based on multiple iterations of the feature selection procedure. Recursive feature elimination based on bagged tree models with a cross-validation technique (10 folds, 10 times) was performed to select the best performing combination of features. In order to balance the positive and negative sample size, an adaptive synthetic sampling approach for imbalanced learning (ADASYN) was used during feature selection and modelling [17]. The feature selection process was used for clinical, laboratory and CT semantic models alone, and in combination. Logistic regression models based on selected features were trained and the validation dataset was used to internally validate the prognostic performance of the models. Four models were trained: model 1 contained only baseline clinical features without symptoms; model 2 used all selected clinical features; model 3 used selected semantic CT features, age and sex; and model 4 employed all selected clinical, laboratory and CT features. The prognostic performances of the best model were compared with other models on the training dataset, due to a bigger sample size. The performance of the best model and PSI scoring were gauged on the datasets via the receiver operator characteristic (ROC) and confusion matrix. In order to gauge the level of overfitting, the outcomes were randomised on the best model and the entire process repeated, from feature selection to model building and evaluation. The patients from the training datasets were divided into low, medium and high risk according to the first quartile (25th percentile) and the third quartile (75th percentile) of probabilities from the best performing model. Nomograms and online calculators were used to provide the interpretability of the best trained models. The test datasets were used to gauge the prognostic performance and the validity for the best model.

Statistical analysis

Baseline data were summarised as median, and categorical variables as frequency (%). Differences between the severe group and the non-severe group were tested using the Mann–Whitney test for continuous data and Fisher's exact test for categorical data. Feature correlations were measured using the Spearman correlation coefficient. We determined the area under the ROC curve (AUC) with its 95% CI and tested AUC difference between models 1–3 and model 4 using the method of DeLong et al. [18], measures of prognostic performance including the AUC and metrics derived from the confusion matrix accuracy, sensitivity, specificity, positive prediction value (PPV) and negative prediction value (NPV). A calibration-plot based on the Hosmer–Lemeshow test was used to estimate the goodness-of-fit and consistency of the model on the test datasets. All p-values were two-sided and p<0.05 was regarded as significant. All statistical analyses, modelling and plotting were performed in R (version 3.5.3), and the detailed package characteristics are listed in table S1.

Results

Demographic and clinical characteristics

Out of 299 hospitalised COVID-19 patients in the retrospective cohort, the median (interquartile range) age was 50 (35.5–63.0) years and age range was 20–94 years. 137 (45.8%) were male. All the clinical characteristics and CT findings are summarised in table 1, and more details of laboratory findings can be seen in table 2. Out of 426 hospitalised COVID-19 patients in five cohorts as test datasets, the median (interquartile range) age was 62 (50.0–72.0) years and age range was 19–94 years. 236 (55.4%) were male.

Feature selection

Among the clinical features, age, hospital employment, body temperature and the time of onset to admission were selected. Lymphocyte (proportion), neutrophil, (proportion), CRP, lactate dehydrogenase (LDH), creatine kinase, urea and calcium were selected from the laboratory feature set. Only the lesion range score was selected from CT semantic features. When putting these three category features together to select features, age, lymphocyte (proportion), CRP, LDH, creatine kinase, urea and calcium were finally included in the combination model.

Models performance in the training and validation dataset

Model performance was as follows. Model 1 was based on age and hospital employment and showed an AUC of 0.74 (95% CI 0.69–0.79) on the training dataset and 0.83 (95% CI 0.72–0.94) on the validation dataset. Model 2 with the clinical features of age, hospital employment, body temperature and the time of onset yielded an AUC of 0.78 (95% CI 0.73–0.83) on the training dataset and 0.74 (95% CI, 0.59–0.89) on the validation dataset. Model 3 was based on age and lesion range score on CT and had an AUC of 0.75 (95% CI 0.70–0.80) on the training dataset and 0.83 (95% CI, 0.72–0.94) on the validation dataset. When pooling these three categories of features, the combination model (model 4) selected seven features (age, lymphocyte (proportion), CRP, LDH, creatine kinase, urea and calcium), which achieved the highest AUC of 0.86 (95% CI 0.82–0.90) on the training dataset and a 0.90 (95% CI 0.82–0.98) on the validation dataset. The AUC value of model 4 was significantly higher than model 1 (p=0.001), model 2 (p=0.033) and model 3 (p=0.003) on the training dataset. The cut-off values from reclassification of low-, medium- and high-risk probabilities were 0.21 and 0.80.

External validation

Model 4 was validated on the five test datasets, which showed AUCs ranging from 0.84 to 0.93 with accuracies ranging from 74.4% to 87.5%, sensitivities ranging from 75.0% to 96.9%, specificities ranging from 55.0% to 88.0%, PPVs ranging from 66.7% to 84.1%, and NPVs ranging from 73.9% to 95.7% (table 3). The ROC, confusion matrix and calibration plots are shown in figure 3. The results of randomising the outcomes and re-running the analysis yielded an AUC of 0.50 (95% CI 0.44–0.55) for model 4.

TABLE 3

The prognostic performance of the combination model (model 4) on five test datasets

Dataset	Tool	AUC (95% CI)	Accuracy (95% CI) %	Sensitivity %	Specificity %	PPV %	NPV %
Test 1	Model	0.88 (0.75–1.0)	80.6 (64.0–91.8)	75.0	85.0	80.0	81.0
	PSI	0.87 (0.74–1.0)	77.8 (60.9–89.9)	56.3	95.0	90.0	73.1
Test 2	Model	0.88 (0.80–0.95)	78.9 (69.0–86.8)	75.5	82.9	84.1	73.9
	PSI	0.86 (0.79–0.94)	65.6 (54.8–75.3)	38.8	97.6	95.0	57.1
Test 3	Model	0.93 (0.83–1.0)	87.5 (71.0–96.5)	85.7	88.0	66.7	95.7
	PSI	0.89 (0.77–1.0)	75.0 (56.6–88.5)	0.00	96.0	0.00	77.4
Test 4	Model	0.84 (0.76–0.93)	74.4 (64.2–83.1)	90.0	55.0	71.4	81.5
	PSI	0.78 (0.68–0.87)	67.8 (57.1–77.3)	62.2	73.3	70.0	66.0
Test 5	Model	0.89 (0.85–0.94)	79.2 (72.5–84.9)	96.9	57.5	73.6	93.9
	PSI	0.71 (0.63–0.78)	62.9 (55.4–70.0)	42.9	87.5	80.8	55.6
Mean (Test 1–5)	Model	0.88	80.1	84.6	73.7	75.2	85.2
	PSI	0.82	69.8	40.0	89.9	67.2	65.8

PSI: pneumonia severity index; AUC: area under the receiver operating characteristic curve; PPV: positive prediction value; NPV: negative prediction value.

FIGURE 3

The a) receiver operator characteristic curve, b) confusion matrix and c) calibration curve for the test datasets. AUC: area under the curve.

The prognostic performance of the combination model (model 4) on five test datasets PSI: pneumonia severity index; AUC: area under the receiver operating characteristic curve; PPV: positive prediction value; NPV: negative prediction value. The a) receiver operator characteristic curve, b) confusion matrix and c) calibration curve for the test datasets. AUC: area under the curve.

Clinical use

Based on the selected features from the best models, a nomogram was established to quantitatively assess the severity risk of illness (figure 4). The developed online calculators can be found at www.covid19risk.ai. Compared to PSI scoring (cut-off value: 90), model 4 (cut-off value: 0.53) showed higher AUCs, accuracies, sensitivities and NPVs on the five test datasets (table 3). There was significant difference for the proportion of severe patients among low-, medium- and high-risk groups in the five test datasets (figure 5).

FIGURE 4

Severe-illness risk nomogram and triage tool for clinicians.

FIGURE 5

Histogram plot of the proportion of severe patients in low-, medium- and high-risk groups of the test datasets. ***: p<0.001; #: p=0.006.

Severe-illness risk nomogram and triage tool for clinicians. Histogram plot of the proportion of severe patients in low-, medium- and high-risk groups of the test datasets. ***: p<0.001; #: p=0.006.

Discussion

This international multicentre study analysed individually and in combination, clinical, laboratory and radiological characteristics for COVID-19 patients at hospital admission to retrospectively develop and prospectively validate a prognostic model and tool to assess the severity of the illness, and its progression, and to compare these with PSI scoring. We found that COVID-19 patients who developed a severe illness were often of an advanced age, had multiple comorbidities, presented with chest tightness, and had abnormal laboratory results and broader lesion range on lung CT on admission. Using simpler linear regression models yielded better prognostic performance than PSI scoring in the test datasets. We believe these models could be useful for risk assessment and triage. Previous studies have reported that age and underlying comorbidities (such as hypertension, diabetes and cardiovascular diseases) may be risk factors for the COVID-19 patients requiring ICU admission [19, 20]. In this study, we found that the elderly COVID-19 patients who were male, non-hospital staff, suffering from hypertension, diabetes, cardiopathy disease, COPD, cerebrovascular disease, renal disease, hepatitis B virus infection, lower body temperature and chest tightness were more vulnerable to develop a severe illness in the early stages of the disease. Among these features, age, hospital staff, body temperature and the time of onset to admission had certain prognostic abilities. Age was the most important feature, which may interact with other features, which was why only age was selected into our combination model (model 4) from these features. Zhou et al. [21] have confirmed that SARS-CoV-2 uses the same cell entry receptor (angiotensin-converting enzyme-2) as SARS-CoV. However, whether COVID-19 patients with hypertension and diabetes have higher severe illness risk, which is due to treatment with angiotensin-converting enzyme-2, increasing drugs is still unknown [22]. Hospital staff had a lower risk of progression, possibly due lower age, higher levels of education, and more medical knowledge once infected, although the unbalanced nature of this type of data has to be taken into account. Furthermore, early studies have shown that COVID-19 patients with severe illness had more laboratory abnormalities such as CRP, d-dimer, lymphocytes, neutrophils and LDH, than those patients with non-severe illness, which were associated with the prognosis [19, 20, 23]. In our study, we also found that the severe group had numerous laboratory abnormalities in complete blood cell count, white cell differential count, d-dimer, CRP, liver function, renal function, procalcitonin, B-type natriuretic peptides and electrolytes. Among these abnormalities, lymphocyte proportion, neutrophil proportion, CRP, LDH, creatine kinase, urea and calcium were significant prognostic factors, which suggest that COVID-19 may cause damage to multiple organ systems when developing into a severe illness. However, current pathological findings of COVID-19 suggest that there is no evidence that SARS-CoV-2 can directly impair the other organs such as liver, kidney and heart [24]. Current reports have shown that thin-slice chest CT is a powerful tool in clinical diagnosis due to the high sensitivity and the ability to monitor the development of the disease [25, 26]. In addition, a previous study reported that ground-glass opacity and consolidation were the most common CT findings for COVID-19 patients with pneumonia, while being nonspecific [27]. Clinical observations showed that there were significantly more consolidation lesions in ICU patients on admission, while more ground-glass opacity lesions were observed in non-ICU patients [28]. In our study, we found that vascular enlargement, air bronchogram and lesion range score differed significantly between non-severe and severe groups. Among these features, only the lesion range score had prognostic power, but not enough to be selected for the combination model. This indicates that while these early stage CT semantic features could have diagnostic value, they have limited ability to prognose the onset of severe illness in COVID-19 patients. The Chinese National Health Committee added some warning indicators for severe or critical cases in the updated diagnosis and treatment plan for COVID-19 patients (version 7) [29], which includes progressive reduction of peripheral blood lymphocytes, a progressive increase of interleukin-6, CRP and lactate, and rapid progression of lung CT findings in a short period. In this study, we used age, lymphocyte fraction, CRP, LDH, creatine kinase, urea and calcium scores from clinical, laboratory, and radiological examinations recorded at hospital admission to train a model for the prediction of the onset of severe illness. Our model combining these features from multiple sources showed a favourable performance when validated in the five external datasets from China, Italy and Belgium. In addition, the model is able to stratify COVID-19 patients into low-, medium- and high-risk groups for developing severe illness. We propose that this model, with its higher prediction performance and simplicity than PSI score, could be used for a preliminary screening and triage tool at hospital admission for the potential to develop severe illness. Furthermore, the model could be used for the selection and/or stratification of patients in clinical trials in order to homogenise the patient population. Follow-up laboratory tests are needed to assess the severity risk with a higher accuracy. As one of the coronavirus family's infecting humans, SARS-CoV-2 has similar aetiologic, clinical, radiological and pathological features to those of severe acute respiratory syndrome coronavirus and Middle East Respiratory Syndrome coronavirus [23, 30, 31]. Therefore, we believe that developing a reliable early warning model based on present clinical, radiological and pathological data is necessary for current outbreaks and possible future outbreaks of coronaviruses. Our study has several limitations. First, selection bias is unavoidable as is the limited and unbalanced sample size. Secondly, patients from different races and ethnicities may have diverse clinical and laboratory results, and the self-medication of patients before admission may affect the clinical and laboratory results. Thirdly, the threshold to go to the hospital and hospitalisation management can vary from country to country, and we are also aware that RNA viruses can mutate rapidly and could have an impact of the performance of the models. We therefore propose that those models should be continuously updated to achieve a better performance for example using privacy preserving distributed learning approaches [32, 33]. Fourthly, the CT features used for this study are semantic features from the first CT scan, and radiomics or deep learning approaches may improve its prognostic performance, and follow-up CT scans may yield more information. Finally, due to the large number of predictors included in the analysis, and the complexity of feature selection and modelling, overfitting is always possible. We have mitigated this with the use of external validation cohorts, and by re-running the analysis on randomised outcomes to arrive at a “chance” (AUC=0.5) result.

Conclusions

Elderly COVID-19 patients and non-hospital staff seem more vulnerable to develop a severe illness after hospitalisation as per defining criteria, which can cause a wide range of laboratory and CT anomalies. Furthermore, our model based on lactate dehydrogenase, C-reactive protein, calcium, age, lymphocyte proportion, urea and creatine kinase might be a more useful preliminary screening and triage tool than pneumonia severity index for risk assessment of COVID-19 patients at hospital admission. Please note: supplementary material is not edited by the Editorial Office, and is uploaded as it has been supplied by the author. Supplementary material ERJ-01104-2020.SUPPLEMENT This one-page PDF can be shared freely online. Shareable PDF ERJ-01104-2020.Shareable

24 in total

1. Machine Learning in Medicine.

Authors: Mike Fralick; Errol Colak; Muhammad Mamdani
Journal: N Engl J Med Date: 2019-06-27 Impact factor: 91.245

2. Clinical Characteristics of 138 Hospitalized Patients With 2019 Novel Coronavirus-Infected Pneumonia in Wuhan, China.

Authors: Dawei Wang; Bo Hu; Chang Hu; Fangfang Zhu; Xing Liu; Jing Zhang; Binbin Wang; Hui Xiang; Zhenshun Cheng; Yong Xiong; Yan Zhao; Yirong Li; Xinghuan Wang; Zhiyong Peng
Journal: JAMA Date: 2020-03-17 Impact factor: 56.272

3. Epidemiologic Features and Clinical Course of Patients Infected With SARS-CoV-2 in Singapore.

Authors: Barnaby Edward Young; Sean Wei Xiang Ong; Shirin Kalimuddin; Jenny G Low; Seow Yen Tan; Jiashen Loh; Oon-Tek Ng; Kalisvar Marimuthu; Li Wei Ang; Tze Minn Mak; Sok Kiang Lau; Danielle E Anderson; Kian Sing Chan; Thean Yen Tan; Tong Yong Ng; Lin Cui; Zubaidah Said; Lalitha Kurupatham; Mark I-Cheng Chen; Monica Chan; Shawn Vasoo; Lin-Fa Wang; Boon Huan Tan; Raymond Tzer Pin Lin; Vernon Jian Ming Lee; Yee-Sin Leo; David Chien Lye
Journal: JAMA Date: 2020-04-21 Impact factor: 56.272

4. A prediction rule to identify low-risk patients with community-acquired pneumonia.

Authors: M J Fine; T E Auble; D M Yealy; B H Hanusa; L A Weissfeld; D E Singer; C M Coley; T J Marrie; W N Kapoor
Journal: N Engl J Med Date: 1997-01-23 Impact factor: 91.245

5. Short term outcome and risk factors for adverse clinical outcomes in adults with severe acute respiratory syndrome (SARS).

Authors: J W M Chan; C K Ng; Y H Chan; T Y W Mok; S Lee; S Y Y Chu; W L Law; M P Lee; P C K Li
Journal: Thorax Date: 2003-08 Impact factor: 9.139

6. Are patients with hypertension and diabetes mellitus at increased risk for COVID-19 infection?

Authors: Lei Fang; George Karakiulakis; Michael Roth
Journal: Lancet Respir Med Date: 2020-03-11 Impact factor: 30.700

7. A pneumonia outbreak associated with a new coronavirus of probable bat origin.

Authors: Peng Zhou; Xing-Lou Yang; Xian-Guang Wang; Ben Hu; Lei Zhang; Wei Zhang; Hao-Rui Si; Yan Zhu; Bei Li; Chao-Lin Huang; Hui-Dong Chen; Jing Chen; Yun Luo; Hua Guo; Ren-Di Jiang; Mei-Qin Liu; Ying Chen; Xu-Rui Shen; Xi Wang; Xiao-Shuang Zheng; Kai Zhao; Quan-Jiao Chen; Fei Deng; Lin-Lin Liu; Bing Yan; Fa-Xian Zhan; Yan-Yi Wang; Geng-Fu Xiao; Zheng-Li Shi
Journal: Nature Date: 2020-02-03 Impact factor: 69.504

8. Diagnosis and Treatment of Adults with Community-acquired Pneumonia. An Official Clinical Practice Guideline of the American Thoracic Society and Infectious Diseases Society of America.

Authors: Joshua P Metlay; Grant W Waterer; Ann C Long; Antonio Anzueto; Jan Brozek; Kristina Crothers; Laura A Cooley; Nathan C Dean; Michael J Fine; Scott A Flanders; Marie R Griffin; Mark L Metersky; Daniel M Musher; Marcos I Restrepo; Cynthia G Whitney
Journal: Am J Respir Crit Care Med Date: 2019-10-01 Impact factor: 21.405

9. Nowcasting and forecasting the potential domestic and international spread of the 2019-nCoV outbreak originating in Wuhan, China: a modelling study.

Authors: Joseph T Wu; Kathy Leung; Gabriel M Leung
Journal: Lancet Date: 2020-01-31 Impact factor: 79.321

10. Clinical Characteristics of Coronavirus Disease 2019 in China.

Authors: Wei-Jie Guan; Zheng-Yi Ni; Yu Hu; Wen-Hua Liang; Chun-Quan Ou; Jian-Xing He; Lei Liu; Hong Shan; Chun-Liang Lei; David S C Hui; Bin Du; Lan-Juan Li; Guang Zeng; Kwok-Yung Yuen; Ru-Chong Chen; Chun-Li Tang; Tao Wang; Ping-Yan Chen; Jie Xiang; Shi-Yue Li; Jin-Lin Wang; Zi-Jing Liang; Yi-Xiang Peng; Li Wei; Yong Liu; Ya-Hua Hu; Peng Peng; Jian-Ming Wang; Ji-Yang Liu; Zhong Chen; Gang Li; Zhi-Jian Zheng; Shao-Qin Qiu; Jie Luo; Chang-Jiang Ye; Shao-Yong Zhu; Nan-Shan Zhong
Journal: N Engl J Med Date: 2020-02-28 Impact factor: 91.245

56 in total

1. Assessment of Data Quality Variability across Two EHR Systems through a Case Study of Post-Surgical Complications.

Authors: Sunyang Fu; Andrew Wen; Gavin M Schaeferle; Patrick M Wilson; Gabriel Demuth; Xiaoyang Ruan; Sijia Liu; Curtis Storlie; Hongfang Liu
Journal: AMIA Annu Symp Proc Date: 2022-05-23

2. Artificial Intelligence and Precision Medicine: A Perspective.

Authors: Jacek Lorkowski; Oliwia Kolaszyńska; Mieczysław Pokorski
Journal: Adv Exp Med Biol Date: 2022 Impact factor: 2.622

3. Fast prototyping of a local fuzzy search system for decision support and retraining of hospital staff during pandemic.

Authors: Evgeny A Bakin; Oksana V Stanevich; Daria M Danilenko; Dmitry A Lioznov; Alexander N Kulikov
Journal: Health Inf Sci Syst Date: 2021-05-11

4. Comparing machine learning algorithms for predicting ICU admission and mortality in COVID-19.

Authors: Ashish Verma; Ankit B Patel; Sonu Subudhi; C Corey Hardin; Melin J Khandekar; Hang Lee; Dustin McEvoy; Triantafyllos Stylianopoulos; Lance L Munn; Sayon Dutta; Rakesh K Jain
Journal: NPJ Digit Med Date: 2021-05-21

5. The value of computed tomography in assessing the risk of death in COVID-19 patients presenting to the emergency room.

Authors: Giulia Besutti; Marta Ottone; Tommaso Fasano; Pierpaolo Pattacini; Valentina Iotti; Lucia Spaggiari; Riccardo Bonacini; Andrea Nitrosi; Efrem Bonelli; Simone Canovi; Rossana Colla; Alessandro Zerbini; Marco Massari; Ivana Lattuada; Anna Maria Ferrari; Paolo Giorgi Rossi
Journal: Eur Radiol Date: 2021-05-12 Impact factor: 5.315

6. Assessing the likelihood of contracting COVID-19 disease based on a predictive tree model: A retrospective cohort study.

Authors: Francesc X Marin-Gomez; Mireia Fàbregas-Escurriola; Francesc López Seguí; Eduardo Hermosilla Pérez; Mència Benítez Camps; Jacobo Mendioroz Peña; Anna Ruiz Comellas; Josep Vidal-Alaball
Journal: PLoS One Date: 2021-03-03 Impact factor: 3.240

7. Early Prediction of COVID-19 Ventilation Requirement and Mortality from Routinely Collected Baseline Chest Radiographs, Laboratory, and Clinical Data with Machine Learning.

Authors: Abdulrhman Fahad Aljouie; Ahmed Almazroa; Yahya Bokhari; Mohammed Alawad; Ebrahim Mahmoud; Eman Alawad; Ali Alsehawi; Mamoon Rashid; Lamya Alomair; Shahad Almozaai; Bedoor Albesher; Hassan Alomaish; Rayyan Daghistani; Naif Khalaf Alharbi; Manal Alaamery; Mohammad Bosaeed; Hesham Alshaalan
Journal: J Multidiscip Healthc Date: 2021-07-30

Review 8. A State-of-the-Art Survey on Artificial Intelligence to Fight COVID-19.

Authors: Md Mohaimenul Islam; Tahmina Nasrin Poly; Belal Alsinglawi; Ming Chin Lin; Min-Huei Hsu; Yu-Chuan Jack Li
Journal: J Clin Med Date: 2021-05-02 Impact factor: 4.241

Review 9. Predicting clinical outcomes among hospitalized COVID-19 patients using both local and published models.

Authors: William Galanter; Jorge Mario Rodríguez-Fernández; Kevin Chow; Samuel Harford; Karl M Kochendorfer; Maryam Pishgar; Julian Theis; John Zulueta; Houshang Darabi
Journal: BMC Med Inform Decis Mak Date: 2021-07-24 Impact factor: 2.796

10. Long-term clinical follow-up of patients suffering from moderate-to-severe COVID-19 infection: a monocentric prospective observational cohort study.

Authors: Gilles Darcis; Antoine Bouquegneau; Nathalie Maes; Marie Thys; Monique Henket; Florence Labye; Anne-Françoise Rousseau; Perrine Canivet; Colin Desir; Doriane Calmes; Raphael Schils; Sophie De Worm; Philippe Léonard; Paul Meunier; Michel Moutschen; Renaud Louis; Julien Guiot
Journal: Int J Infect Dis Date: 2021-07-14 Impact factor: 3.623