| Literature DB >> 35280897 |
Syunsuke Yamanaka1, Koji Morikawa2, Hiroyuki Azuma3, Maki Yamanaka4, Yoshimitsu Shimada5, Toru Wada6, Hideyuki Matano7, Naoki Yamada1, Osamu Yamamura8, Hiroyuki Hayashi1.
Abstract
Background: Early prediction of oxygen therapy in patients with coronavirus disease 2019 (COVID-19) is vital for triage. Several machine-learning prognostic models for COVID-19 are currently available. However, external validation of these models has rarely been performed. Therefore, most reported predictive performance is optimistic and has a high risk of bias. This study aimed to develop and validate a model that predicts oxygen therapy needs in the early stages of COVID-19 using a sizable multicenter dataset.Entities:
Keywords: COVID-19; PROBAST; TRIPOD; machine learning; medical triage; multicenter; prognostic model
Year: 2022 PMID: 35280897 PMCID: PMC8904892 DOI: 10.3389/fmed.2022.846525
Source DB: PubMed Journal: Front Med (Lausanne) ISSN: 2296-858X
Characteristics of the patient on the admission of the patients with COVID-19.
|
|
|
|
|---|---|---|
| Age, median (IQR), year | 54 (34–70) | 42 (27–53) |
| Female | 192 (48) | 326 (45) |
| Smoking history | 116 (32) | – |
| Drinking alcohol | 123 (52) | – |
| Height, median (IQR), cm | 164 (157–171) | 165 (158–173) |
| Body weight, median (IQR), kg | 61 (52–72) | 62 (53–73) |
| Body mass index, median (IQR), kg/m2 | 23 (20–25) | 23 (20–25) |
|
| ||
| Any comorbidity | 193 (49) | 170 (24) |
| Cardiovascular all | 32 (9) | – |
| Myocardial infarction | 16 (4) | – |
| Congestive heart failure | 5 (1) | – |
| Peripheral vascular disease | 3 (1) | – |
| Cerebrovascular disease | 16 (4) | – |
| Chronic Obstructive Pulmonary Disease (COPD) | 5 (1) | – |
| Bronchial asthma | 16 (4) | – |
| Chronic lung disease (excluding COPD) | 4 (1) | – |
| Chronic kidney disease (CKD) | 9 (3) | – |
| Hypertension | 117 (30) | 90 (12) |
| Hyperlipidemia | 48 (13) | – |
| Diabetes mellites | 52 (14) | – |
| Malignancy | 15 (4) | – |
|
| ||
| Any symptoms | 298 (75) | – |
| Fever (37.0 to 38.0°C) | 190 (53) | – |
| Fever (38.0°C or more) | 47 (13) | – |
| Malaise or fatigue | 110 (31) | – |
| Sore throat | 92 (26) | – |
| Headache | 60 (17) | – |
| Rhinorrhea | 71 (20) | – |
| Arthralgia | 37 (10) | – |
| Diarrhea | 9 (3) | – |
| Loss of smell | 45 (13) | – |
| Dyspnea | 15 (4) | – |
| Muscle ache | 37 (31) | – |
| Loss of taste | 39 (11) | – |
| Disturbance of consciousness | 2 (1) | – |
| Conjunctival hyperemia | 1 (0) | – |
| Period from onset of symptom to PCR positive (IQR) (days) | 3 (1–6) | 3 (1–6) |
Data are shown as no (%) otherwise is specified.
Cardiovascular diseases include congestive heart failure, unstable angina pectoris, atrial fibrillation, and hypertension. Respiratory diseases include bronchial asthma, chronic obstructive pulmonary disease (COPD), tuberculosis, pleuritis, and pneumonia. Malignancy includes colonic cancer, spinal tumor, brain tumor, prostate cancer, oral cancer, and malignant lymphoma.
CKD, chronic kidney disease; GERD, gastroesophageal reflux disease; PCR, polymerase chain reaction.
Smoking history includes patients who are currently smoking or smoking in the past.
Drinking alcohol includes patients who drink daily or occasionally.
Blood pressure, percutaneous oxygen saturation, laboratory findings, X-ray on admission, and outcome of the patients with COVID-19.
|
|
|
|
|---|---|---|
|
| ||
| Systolic blood pressure (mmHg) | 126 (115–139) | – |
| Diastolic blood pressure (mmHg) | 83 (75–91) | – |
| Saturation of percutaneous oxygen (%) | 97 (96–98) | 98 (97–98) |
|
| ||
| White blood cells (×103/μL) | 46 (37–56) | – |
| Lymphocytes (%) | 25 (20–34) | – |
| Platelets (×104/μL) | 10 (16–25) | – |
|
| ||
| Prothrombin time (seconds) | 11 (10–12) | – |
| Activated partial thromboplastin time (seconds) | 32 (30–35) | – |
| Fibrinogen (mg/dl) | 329 (247–428) | 343 (290–404) |
| D-dimer (μg/ml) | 0.8 (0.5–1.1) | 0.7 (0.51–0.9) |
|
| ||
| Na (mEq/l) | 140 (138–141) | 141 (140–151) |
| K (mEq/l) | 4 (3.8–4.2) | – |
| Albumin (g/dl) | 4.2 (3.9–4.5) | – |
| Blood urea nitrogen (mg/dl) | 12.6 (10.0–15.3) | – |
| Creatinine (mg/dl) | 0.78 (0.66–0.95) | – |
| Lactate dehydrogenase (U/l) | 194 (168–235) | |
| Aspartate aminotransferase (U/l) | 24 (20–33) | 23 (19–32) |
| Alanine aminotransferase (U/I) | 21 (14–35) | – |
|
| ||
| C-reactive protein (mg/dl) | 0.45 (0.12–1.59) | 0.44 (0.14–1.39) |
| X-ray (pneumonia) | ||
| Non | 170 (43) | 382 (52) |
| Unilateral | 42 (11) | 78 (11) |
| Bilateral | 106 (33) | 244 (35) |
|
| ||
| A-DROP (≥1) | 99 (25) | – |
| Oxygen needs | 102 (26) | 106 (15) |
A-DROP, Age-dehydration-respiration-orientation-blood-pressure criteria.
The ability of eight machine-learning models and A-DROP as the risk stratification tool.
|
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|---|
|
|
|
| ||||||
| A-DROP criteria ≥1 (Reference) | 0.69 (0.62–0.75) | – | 0.54 (0.48–0.59) | 0.83 (0.79–0.87) | 0.54 (0.48–0.59) | 0.83 (0.79–0.87) | 3.08 (2.24–4.25) | 0.56 (0.41–0.77) |
| Penalized logistic regression | 0.85 (0.75–0.95) | 0.01 | 0.79 (0.70–0.88) | 0.73 (0.64–0.82) | 0.56 (0.45–0.66) | 0.89 (0.82–0.95) | 2.96 (1.83–4.77) | 0.28 (0.18–0.46) |
| Random forest | 0.88 (0.78–0.97) | <0.01 | 0.79 (0.70–0.73) | 0.96 (0.93–1.00) | 0.90 (0.84–0.96) | 0.92 (0.85–0.97) | 22.17 (5.60–7.38) | 0.22 (0.05–0.86) |
| SVM | 0.83 (0.73–0.94) | 0.03 | 0.75 (0.65–0.84) | 0.80 (0.71–0.89) | 0.62 (0.51–0.75) | 0.88 (0.81–0.95) | 3.82 (2.14–6.81) | 0.31 (0.17–0.55) |
| KNN | 0.78 (0.66–0.91) | 0.23 | 0.25 (0.16–0.35) | 0.98 (0.95–1.00) | 0.86 (0.78–0.93) | 0.75 (0.66–0.85) | 14.01 (1.78–110) | 0.76 (0.10–6.01) |
| XG boost | 0.89 (0.79–0.96) | <0.01 | 0.83 (0.75–0.91) | 0.90 (0.84–0.96) | 0.71 (0.61–0.81) | 0.95 (0.90–0.99) | 8.61 (3.92–18.9) | 0.18 (0.08–0.41) |
| MLP | 0.86 (0.76–0.96) | <0.01 | 0.54 (0.44–0.65) | 0.95 (0.89–0.99) | 0.81 (0.72–0.90) | 0.83 (0.74–0.91) | 10.11 (3.17–32.8) | 0.48 (0.15–1.55) |
| XG boost with eight features (Internal validation) | 0.92 (0.86–0.98) | <0.001 | 0.94 (0.89–0.99) | 0.69 (0.59–0.79) | 0.47 (0.36–0.59) | 0.98 (0.94–1.00) | 3.08 (2.08–4.56) | 0.08 (0.05–0.12) |
| XG boost with eight features (External validation) | 0.88 (0.81–0.95) | <0.001 | 0.64 (0.55–0.71) | 0.93 (0.88–0.97) | 0.61 (0.53–0.68) | 0.93 (0.89–0.97) | 8.77 (4.34–17.7) | 0.39 (0.19–0.79) |
A-DROP criteria, age-dehydration-respiration-orientation-blood-pressure criteria.
CI, confidence interval; PPV, positive predictive value; NPV, negative predictive value; PLR, positive likelihood ratio; NLR, negative likelihood ratio; KNN, K-point nearest neighbor; MLPerceptron, multilayer perceptron; XG boost, extreme gradient boosting.
Figure 1C-statistic of XG Boost model with eight variables in external validation.
Feature importance of XGboost with eight variables.
|
|
|
|---|---|
| SpO2 at the first visit | 0.26135576 |
| Age | 0.26132179 |
| Lactate dehydrogenase | 0.16435097 |
| Aminotransferase | 0.15464792 |
| Any comorbidity | 0.124125525 |
| C-reactive protein | 0.06478236 |
| Hypertension | 0.062515706 |
| Pneumonia | <0.01 |
SpO.
Figure 2XG boost model calibration plot in external validation.