| Literature DB >> 33261629 |
Thomas Langer1,2, Martina Favarato3,4, Riccardo Giudici4, Gabriele Bassi4, Roberta Garberi3, Fabiana Villa3, Hedwige Gay3,4, Anna Zeduri3, Sara Bragagnolo3, Alberto Molteni5, Andrea Beretta6, Matteo Corradin7, Mauro Moreno7, Chiara Vismara8, Carlo Federico Perno8, Massimo Buscema9,10, Enzo Grossi11,12, Roberto Fumagalli3,4.
Abstract
BACKGROUND: Reverse Transcription-Polymerase Chain Reaction (RT-PCR) for Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-COV-2) diagnosis currently requires quite a long time span. A quicker and more efficient diagnostic tool in emergency departments could improve management during this global crisis. Our main goal was assessing the accuracy of artificial intelligence in predicting the results of RT-PCR for SARS-COV-2, using basic information at hand in all emergency departments.Entities:
Keywords: Artificial intelligence; Critical care; Emergency service, hospital; Pandemics; Severe acute respiratory syndrome coronavirus 2; Supervised machine learning
Year: 2020 PMID: 33261629 PMCID: PMC7705856 DOI: 10.1186/s13049-020-00808-8
Source DB: PubMed Journal: Scand J Trauma Resusc Emerg Med ISSN: 1757-7241 Impact factor: 2.953
Variables under study (n = 74)
| Age (years) | |
| Sex (M; F) | |
| Asthma | |
| Atrial fibrillation | |
| Autoimmune/inflammatory disease | |
| Chronic Interstitial lung disease | |
| Chronic kidney disease | |
| Chronic liver disease | |
| Chronic obstructive pulmonary disease (COPD) | |
| Diabetes mellitus | |
| Hematologic malignancy | |
| Human immunodeficiency virus | |
| Hypertension | |
| Ischemic cardiomyopathy | |
| Smoking (active) | |
| Solid organ tumor | |
| 5-alpha-reductase inhibitors | |
| Angiotensin converting enzyme inhibitor | |
| Anti-arrhythmic drugs | |
| Antibiotics before Emergency Department | |
| Anticoagulant therapy | |
| Anti-epileptic therapy | |
| Anti-platelets therapy | |
| Beta blocker | |
| Calcium channel antagonist | |
| Diuretics | |
| Drugs for psychiatric disorders | |
| Hydroxymethylglutaryl-CoA reductase inhibitors | |
| Immunosuppressant drugs | |
| Sartans | |
| Arthralgia | |
| Asthenia | |
| Chest pain | |
| Cough | |
| Dyspnea | |
| Fever | |
| Gastrointestinal symptoms | |
| Headache | |
| Sore throat | |
| Syncope | |
| Glasgow coma scale, score | |
| Body temperature, Celsius | |
| Systolic blood pressure, mmHg | |
| Diastolic blood pressure, mmHg | |
| Heart rate, beats per min | |
| Sinus rhythm – no. (%) | |
| Normal ST segment – no. (%) | |
| Respiratory rate, breaths per min | |
| Oxygen supplementation – no. (%) | |
| Non-Invasive Ventilation – no. (%) | |
| Chest X-rays opacity – no. (%) | |
| Chest X-rays pleural effusion – no. (%) | |
| Total white blood cell count, 103/μL | |
| Neutrophils, % | |
| Lymphocytes, % | |
| Monocytes, % | |
| Eosinophils, % | |
| Basophils, % | |
| Total red blood cells count, 106/μL | |
| Haematocrit, % | |
| Haemoglobin, g/dL | |
| Mean corpuscular haemoglobin concentration, g/dL | |
| Mean corpuscular haemoglobin, pg | |
| Mean red blood cell volume, μm3 | |
| Red blood cells distribution width, | |
| Platelets, 103/μL | |
| Glycaemia, mg/dL | |
| Creatinine, mg/dL | |
| Urea, mg/dL | |
| Sodium, mEq/L | |
| Total bilirubin, mg/dL | |
| C-reactive protein, mg/dL | |
| Negative (yes; no) | |
| Positive (yes; no) |
Medical and medication history and reported sign and symptoms were collected by physicians in the Emergency Room. Clinical findings refer to data gathered in the Emergency Room at hospital admission
Nicknames of Machine Learning Systems in WEKA software Package
| Machine Learning software | Nick name |
|---|---|
| Logistic [ | Logistic |
| Pruned decision tree [ | J 48 |
| Multi Layer Perceptron [ | MLP |
| NaivBayes [ | NaivBayes |
| RandomForest [ | RandomForest |
| RotationForest [ | RotationForest |
| Sequential Minimal Optimization [ | SMO |
| K Nearest Neighborhood [ | KNN47 |
| Additive Logistic Regression [ | Logit Boost48 |
Main Features of Semeion Machine Learning Systems employed in the study
| ANNs Architecture | Hidden layers | Learning Rate | Epochs | Initialization | Output Function | Nickname |
|---|---|---|---|---|---|---|
| Conic Net [ | 5 Layers= 4x12x12x12 | 0.01 | 1000 | Auto-encoders | Soft Max | D_FF_Conic (4x12x12x12) |
5 Layers= 6x12x12x12 | 0.01 | 1000 | Auto-encoders | Soft Max | D_FF_Conic (6x12x12x12) | |
| Sine Net [ | 1 Layer = 48 | 0.1 | 2000 | Random | Soft Max | D_FF_Sn [ |
| Back Propagation [ | 0 Layer = L | 0.1 | 1000 | Random | Soft Max | D_FF_Bp (0) |
| 1 Layer = 24 | 0.1 | 1000 | Random | Soft Max | D_FF_Bp [ | |
5 Layers= 16x16x16x16 | 0.01 | 2000 | Auto-encoders | Soft Max | D_FF_Bp (16x16x16x16) | |
| Bi-Modal Net | 1 Layer = 48 | 0.1 | 1000 | Random | Soft Max | D_FF_Bm [ |
| Gauss Net [ | 1 Layer = 64 | 0.01 | 1000 | Random | Soft Max | D_FF_GNet [ |
Characteristics of the study population
| Demographic characteristics | Overall | Positive | Negative | |
|---|---|---|---|---|
| Age, years | 65 [46–78] | 65 [49–77] | 66 [38–82] | 0.94 |
| Male – no. (%) | 127 (63.8) | 78 (62.9) | 49 (65.3) | 0.85 |
| Asthma | 13 (6.6) | 9 (7.3) | 4 (5.4) | 0.83 |
| Atrial fibrillation | 19 (9.6) | 11 (8.9) | 8 (10.7) | 0.87 |
| Autoimmune/inflammatory disease | 6 (3.0) | 3 (2.4) | 3 (4) | 0.84 |
| Chronic Interstitial lung disease | 1 (0.5) | 1 (0.8) | 0 (0.0) | 0.80 |
| Chronic kidney disease | 11 (5.5) | 8 (6.5) | 3 (4.0) | 0.68 |
| Chronic liver disease | 7 (3.5) | 3 (2.4) | 4 (5.3) | 0.49 |
| Chronic obstructive pulmonary disease | 15 (7.5) | 5 (4.0) | 10 (13.3) | 0.03 |
| Diabetes mellitus | 29 (14.6) | 16 (12.9) | 13 (17.3) | 0.51 |
| Hematologic malignancy | 6 (3.0) | 4 (3.2) | 2 (2.7) | 0.84 |
| Human immunodeficiency virus | 1 (0.5) | 0 (0.0) | 1 (1.3) | 0.80 |
| Hypertension | 85 (42.7) | 55 (44.4) | 30 (40) | 0.65 |
| Ischemic cardiomyopathy | 19 (9.6) | 10 (8.1) | 9 (12.0) | 0.51 |
| Smoking (active) | 12 (6.0) | 4 (3.2) | 8 (10.7) | 0.07 |
| Solid organ tumour | 16 (8.0) | 7 (5.7) | 9 (12.0) | 0.18 |
| 5-alpha-reductase inhibitors | 12 (6.0) | 9 (7.3) | 3 (4.0) | 0.53 |
| Angiotensin converting enzyme inhibitors | 33 (16.6) | 21 (16.9) | 12 (16.0) | 0.98 |
| Anti-arrhythmic drugs | 11 (5.5) | 6 (4.8) | 5 (6.7) | 0.82 |
| Antibiotics before Emergency Department | 47 (23.6) | 36 (29.0) | 11 (14.7) | 0.03 |
| Anticoagulant drugs | 25 (12.6) | 15 (12.1) | 10 (13.3) | 0.97 |
| Anti-epileptic drugs | 10 (5.0) | 1 (0.8) | 9 (12.0) | 0.002 |
| Antiplatelets drugs | 33 (16.6) | 20 (16.1) | 13 (17.3) | 0.98 |
| Beta blockers | 40 (20.1) | 24 (19.4) | 16 (21.3) | 0.88 |
| Calcium channel antagonists | 27 (13.6) | 18 (14.5) | 9 (12.0) | 0.77 |
| Diuretics | 35 (17.6) | 21 (16.9) | 14 (18.7) | 0.91 |
| Drugs for psychiatric disorders | 7 (3.5) | 1 (0.8) | 6 (8.0) | 0.02 |
| Hydroxymethylglutaryl-CoA (HMG-CoA) reductase inhibitors | 28 (14.1) | 18 (14.5) | 10 (13.3) | 0.98 |
| Immunosuppressant drugs | 12 (6.0) | 8 (6.5) | 4 (5.3) | 0.99 |
| Sartans | 24 (12.1) | 19 (15.3) | 5 (6.7) | 0.11 |
| Arthralgia | 7 (3.5) | 6 (4.8) | 1 (1.3) | 0.37 |
| Asthenia | 18 (9.1) | 9 (7.3) | 9 (12.0) | 0.38 |
| Chest pain | 7 (3.5) | 7 (5.7) | 0 (0.0) | 0.09 |
| Cough | 130 (65.3) | 91 (73.4) | 39 (52) | 0.004 |
| Dyspnea | 75 (37.7) | 43 (34.7) | 32 (42.7) | 0.33 |
| Fever | 174 (87.4) | 119 (96.0) | 55 (73.3) | < 0.001 |
| Gastrointestinal symptoms | 27 (13.6) | 18 (14.5) | 9 (12.0) | 0.77 |
| Headache | 12 (6.0) | 8 (6.5) | 4 (5.3) | 0.99 |
| Sore throat | 8 (4.0) | 3 (2.4) | 5 (6.7) | 0.27 |
| Syncope | 5 (2.5) | 2 (1.6) | 3 (4.0) | 0.57 |
| Glasgow coma scale, score | 15 [15–15] | 15 [15–15] | 15 [15–15] | > 0.99 |
| Body temperature, Celsius | 37.6 ± 0.9 | 37.8 ± 0.8 | 37.2 ± 1 | < 0.001 |
| Systolic blood pressure, mmHg | 131 ± 22 | 133 ± 19 | 128 ± 25 | 0.09 |
| Diastolic blood pressure, mmHg | 75 [65–80] | 75 [70–80] | 70 [65–80] | 0.09 |
| Heart rate, beats per min | 90 [80–105] | 90 [83–105] | 90 [80–105] | 0.87 |
| Sinus rhythm – no. (%) | 185 (93) | 117 (94.4) | 68 (90.7) | 0.48 |
| Normal ST segment – no. (%) | 198 (99.5) | 123 (99.2) | 75 (100.0) | 0.80 |
| Respiratory rate, breaths per min | 18 [16–22] | 18 [16–22] | 18 [16–24] | 0.80 |
| Oxygen supplementation – no. (%) | 51 (25.6) | 27 (21.8) | 24 (32.0) | 0.15 |
| Non-Invasive Ventilation – no. (%) | 8 (4.0) | 4 (3.2) | 4 (5.3) | 0.72 |
| Chest X-rays opacity – no. (%) | 158 (79.4) | 104 (83.9) | 54 (72.0) | 0.07 |
| Chest X-rays pleural effusion – no. (%) | 20 (10.1) | 7 (5.7) | 13 (17.3) | 0.01 |
| Total white blood cell count, 103/μL | 6.64 [4.65–9.65] | 5.44 [4.21–7.23] | 9.28 [6.87–13.64] | < 0.001 |
| Neutrophils, % | 72.5 [64.4–81.7] | 70.4 [62.5–79.9] | 76.7 [68.4–85.3] | 0.001 |
| Lymphocytes, % | 18.0 [10.5–25.1] | 20.6 [12.9–27.7] | 14.5 [7.4–19.8] | < 0.001 |
| Monocytes, % | 7.9 [5.6–10.2] | 8.0 [6–10.9] | 7.6 [5.4–9.7] | 0.18 |
| Eosinophils, % | 0 [0–0.3] | 0 [0–0.2] | 0.2 [0–1.2] | < 0.001 |
| Basophils, % | 0.3 [0.2–0.4] | 0.2 [0.2–0.4] | 0.3 [0.2–0.5] | 0.03 |
| Total red blood cells count, 106/μL | 4.80 [4.29–5.25] | 4.88 [4.42–5.28] | 4.52 [4.01–5.14] | 0.01 |
| Haematocrit, % | 41.6 ± 5.3 | 42.1 ± 4.5 | 40.8 ± 6.3 | 0.10 |
| Haemoglobin, g/dL | 13.7 ± 1.9 | 14.0 ± 1.6 | 13.3 ± 2.2 | 0.01 |
| Mean corpuscular haemoglobin concentration, g/dL | 33.1 [32–33.9] | 33.2 [32.5–34.1] | 32.8 [31.6–33.8] | 0.01 |
| Mean corpuscular haemoglobin, pg | 29.2 [28.1–30.5] | 29.2 [27.9–30.5] | 29.3 [28.6–30.2] | 0.56 |
| Mean red blood cell volume, μm3 | 88.3 [85.4–91.4] | 87.7 [84.5–90.4] | 89.7 [87–92.6] | 0.002 |
| Red blood cells distribution width, | 13.2 [12.4–14.4] | 13.1 [12.3–13.9] | 13.6 [12.5–15.5] | 0.01 |
| Platelets, 103/μL | 193 [162–244] | 183 [145–234] | 221 [175–283] | < 0.001 |
| Glycaemia, mg/dL | 120 [104–142] | 120 [104–138] | 117 [101–158] | 0.80 |
| Creatinine, mg/dL | 1.0 [0.8–1.3] | 0.98 [0.82–1.21] | 1.09 [0.83–1.45] | 0.19 |
| Urea, mg/dL | 35 [23–54.5] | 34 [22–49] | 41.5 [26–64.5] | 0.07 |
| Sodium, mEq/L | 138 [135–140] | 138 [134–140] | 138 [136–141] | 0.37 |
| Total bilirubin, mg/dL | 0.5 [0.4–0.7] | 0.5 [0.3–0.6] | 0.5 [0.4–0.7] | 0.05 |
| C-reactive protein, mg/dL | 4.1 [1.2–9] | 4.0 [1.3–8.9] | 4.5 [0.9–9.3] | 0.90 |
Variables are ordered by categories. Continuous variables are expressed as median [interquartile range] or mean ± Standard Deviation, as appropriate. Medical and medication history, reported signs and symptoms were collected from physicians of the Emergency Department. Clinical findings (Vital signs, ECG – Electrocardiogram, Chest X-ray and Laboratory findings) refer to the first measurements performed in the Emergency department
Study variables positively and negatively correlated with SARS-COV-2 positivity
| *Fever | 0.33 | 0.06 | |
| *Body temperature (degree Celsius) | 0.27 | *Chronic kidney disease | 0.05 |
| Lymphocytes percentage | 0.26 | 0.05 | |
| Cough | 0.22 | Hypertension | 0.04 |
| *Total red blood cells count (106/μL) | 0.19 | Asthma | 0.04 |
| *Haemoglobin (g/dL) | 0.17 | *Age | 0.04 |
| Glasgow coma scale (GCS) | 0.17 | Gastrointestinal symptoms | 0.04 |
| *Antibiotics before Emergency Department | 0.16 | 0.04 | |
| *Chest pain | 0.15 | Mean corpuscular haemoglobin concentration (%) | 0.03 |
| *Chest X-rays opacity | 0.14 | Respiratory rate (breaths/min) | 0.03 |
| *Sartans | 0.13 | *Female | 0.02 |
| *Systolic blood pressure (mmHg) | 0.12 | 0.02 | |
| Haematocrit (%) | 0.12 | 0.02 | |
| Arthralgia | 0.09 | reductase inhibitors | 0.02 |
| *Diastolic blood pressure (mmHg) | 0.07 | Hematologic malignancy | 0.02 |
| P wave present and normal | 0.07 | 0.01 | |
| *5-alpha-reductase inhibitors | 0.07 | Heart rate (beats per minute) | 0.00 |
| Chronic Interstitial lung disease | 0.06 | ||
| *Total white blood cells count (103/μL) | −0.46 | *Dyspnoea | −0.07 |
| *Eosinophils percentages | −0.34 | −0.06 | |
| Platelets (103/μL) | − 0.33 | − 0.06 | |
| *Anti-epileptic therapy | −0.25 | Diabetes mellitus | −0.06 |
| *Calcium (mg/dL) | −0.22 | Creatinine (mg/dL) | −0.06 |
| Mean red blood cell volume (μm3) | −0.21 | −0.06 | |
| Neutrophils percentages | −0.19 | −0.05 | |
| *Drugs for psychiatric disorders | −0.19 | Autoimmune/inflammatory disease | −0.04 |
| *Chest X-rays pleural effusion | −0.19 | Red blood cells distribution width (%) | −0.04 |
| *Chronic obstructive pulmonary disease | −0.17 | Anti-arrhythmic drugs | −0.04 |
| *Smoking (active) | −0.15 | Syncope | −0.04 |
| *Urea (mg/dL) | −0.14 | Glycaemia (mg/dL) | −0.03 |
| *Total bilirubin (mg/dL) | −0.14 | −0.03 | |
| Oxygen supplementation | −0.11 | Atrial fibrillation | −0.03 |
| *Solid organ tumour | −0.11 | C-reactive protein (mg/dL) | −0.03 |
| *Sore throat | −0.10 | −0.02 | |
| Human Immunodeficiency Virus | −0.09 | Beta blocker | −0.02 |
| Asthenia | −0.08 | Anticoagulant therapy | −0.02 |
| Chronic liver disease | −0.08 | *Anti-platelets therapy | −0.02 |
| *Sodium (mEq/L) | −0.07 | ||
Variables were divided according to their positive or negative correlation to the target variable (positivity of RT-PCR for SARS-COV-2) and are listed in decreasing order of positive or negative correlation coefficient, respectively. Asterisks indicate variables included in the TWIST model
Predictive results with variables selection using Semeion (a) and WEKA (b) Machine learning systems
| Machine learning system | Sensitivity | Specificity | Overall accuracy | Balanced accuracy | Variance | PPV | AUROC |
|---|---|---|---|---|---|---|---|
| 94.1 | 88.7 | 91.4 | 92.2 | 0.5 | 93.5 | 0.90 | |
| 92.5 | 90.2 | 91.3 | 91.7 | 0.0 | 94.1 | 0.91 | |
| 89.2 | 93.0 | 91.1 | 90.6 | 1.0 | 95.6 | 0.93 | |
| 93.2 | 88.7 | 91.0 | 91.7 | 1.0 | 93.4 | 0.92 | |
| 90.7 | 90.2 | 90.5 | 90.6 | 1.0 | 94.0 | 0.90 | |
| 91.7 | 88.9 | 90.3 | 90.6 | 1.0 | 93.2 | 0.92 | |
| 91.6 | 88.9 | 90.2 | 90.6 | 1.0 | 93.2 | 0.91 | |
| 91.6 | 88.7 | 90.2 | 90.6 | 0.0 | 93.3 | 0.92 | |
| 89.1 | 90.3 | 89.7 | 89.6 | 2.1 | 93.8 | 0.91 | |
| 81.0 | 84.8 | 82.9 | 82.3 | 3.1 | 89.8 | 0.90 | |
| 86.6 | 65.3 | 75.9 | 78.7 | 1.6 | 80.6 | 0.86 | |
| 85.8 | 64.5 | 75.1 | 78.1 | 4.2 | 81.0 | 0.83 | |
| 88.3 | 60.8 | 74.6 | 78.1 | 0.0 | 79.1 | 0.85 | |
| 80.7 | 67.9 | 74.3 | 76.0 | 1.0 | 80.8 | 0.63 | |
| 81.0 | 61.2 | 71.1 | 73.4 | 1.6 | 77.6 | 0.81 | |
| 77.4 | 60.7 | 69.0 | 71. 4 | 0.5 | 77.0 | 0.57 | |
| 96.8 | 27.9 | 62.4 | 70.3 | 9.9 | 61.5 | 0.65 | |
| 75.9 | 36.5 | 56.2 | 60.9 | 2.6 | 66.6 | 0.56 |
Employed machine learning systems are listed in decreasing order of overall accuracy. The results are the average of two testing experiments with training-testing A-B and B-A sequence. A hundred cases were presented in subset A and ninety-nine cases in subset B. Overall accuracy Arithmetic average of sensitivity and specificity, Balanced accuracy Weighted average of sensitivity and specificity, PPV Positive Predictive Value, AUROC Area Under the Receiver Operator Curve. Sensitivity, Specificity, Overall accuracy, Balanced accuracy, Variance and PPV are all expressed as percentage.
Predictive results with 5-K fold protocol, using Semeion (a) and WEKA (b) machine learning systems
| Machine learning system | Sensitivity | Specificity | Overall accuracy | Balanced accuracy | Variance | PPV | AUROC |
|---|---|---|---|---|---|---|---|
| 89.2 | 86.2 | 87.7 | 88.0 | 2.6 | 92.1 | 0.86 | |
| 87.5 | 84.6 | 86.0 | 86.5 | 3.4 | 91.0 | 0.84 | |
| 88.3 | 83.2 | 85.8 | 86.4 | 4.3 | 90.4 | 0.85 | |
| 88.3 | 77.6 | 83.0 | 84.3 | 6.0 | 87.1 | 0.81 | |
| 90.0 | 58.0 | 74.0 | 78.1 | 3.2 | 78.9 | 0.83 | |
| 82.5 | 62.5 | 72.5 | 75.1 | 5.6 | 79.7 | 0.74 |
Employed machine learning systems are listed in decreasing order of overall accuracy. The results are the average of five testing experiments. Overall accuracy Arithmetic average of sensitivity and specificity, Balanced accuracy Weighted average of sensitivity and specificity, PPV Positive Predictive Value, AUROC Area Under the Receiver Operator Curve. Sensitivity, Specificity, Overall accuracy, Balanced accuracy, Variance and PPV are all expressed as percentage.