| Literature DB >> 35699441 |
Paula Cremades-Martínez1, Lucy A Parker2,3, Elisa Chilet-Rosell2,3, Blanca Lumbreras2,3.
Abstract
We aimed to review strategies for identifying SARS-CoV-2 infection before the availability of molecular test results, and to assess the reporting quality of the studies identified through the application of the STARD guideline. We screened 3,821 articles published until 30 April 2021, of which 23 met the inclusion criteria: including at least two diagnostic variables, being designed for use in clinical practice or in a public health context and providing diagnostic accuracy rates. Data extraction and application of STARD criteria were performed independently by two researchers and discrepancies were discussed with a third author. Most of the studies (16, 69.6%) included symptomatic patients with suspected infection, six studies (26.1%) included patients already diagnosed and one study (4.3%) included individuals with close contact to a COVID-positive patient. The main variables considered in the studies, which included symptomatic patients, were imaging and demographic characteristics, symptoms, and lymphocyte count. The values for area under the receiver operating characteristic curve (AUC)ranged from 53-97.4. Seven studies (30.4%) validated the diagnostic model in an independent sample. The average number of STARD criteria fulfilled was 17.6 (maximum, 27 and minimum, 5). High diagnostic accuracy values are shown when more than one diagnostic variable is considered, mainly imaging and demographic characteristics, symptoms, and lymphocyte count. This could offer the potential to identify individuals with SARS-CoV-2 infection with high accuracy when molecular testing is not available. However, external validation for developed models and evaluations in populations as similar as possible to those in which they will be applied is urgently needed. IMPORTANCE According to this review, the inclusion of more than one diagnostic test in the diagnostic process for COVID-19 infection shows high diagnostic accuracy values. Imaging characteristics, patients' symptoms, demographic characteristics, and lymphocyte count were the variables most frequently included in the diagnostic models. However, developed models should be externally validated before reaching conclusions on their utility in practice. In addition, it is important to bear in mind that the test should be evaluated in populations as similar as possible to those in which it will be applied in practice.Entities:
Keywords: COVID-19; SARS-CoV-2; STARD; diagnosis
Mesh:
Year: 2022 PMID: 35699441 PMCID: PMC9430610 DOI: 10.1128/spectrum.00300-22
Source DB: PubMed Journal: Microbiol Spectr ISSN: 2165-0497
FIG 1Flow chart of the inclusion of articles in the systematic review, according to Preferred Reporting Items for Systematic Reviews and MetaAnalyses (PRISMA) recommendations.
Description of the main characteristics of the studies included in the systematic review (study characteristics, setting, and population)
| Reference | Yr | Country | Study design | Aim | Setting | Sample size | Gender | Age | Inclusion criteria | Exclusion criteria |
|---|---|---|---|---|---|---|---|---|---|---|
| Symptomatic patients with suspected COVID 19 infection | ||||||||||
| Guo X ( | 2020 | China | Retrospective cohort study | To develop an integrated multi-feature predictive model based on random forest to differentiate COVID-19 from seasonal flu and pneumonia caused by other common respiratory viruses. | Two hospitals | 105 patients: (a) Discovery cohort, 50 patients and (b) Validation cohort, 55 patients. | Discovery cohort: males, 60%; validation cohort: males, 60% | Age range Discovery cohort: 25-31 yrs (23.1-62.5); Validation cohort: 31-39 yrs (25.0-52.3) | Suspected COVID-19 patients who presented to hospital. | NA |
| Langer T ( | 2020 | Italy | Retrospective cohort study | To develop a machine learning model to predict the results of RT-qPCR for SARS-CoV-2 using only basic clinical, radiological and routine laboratory data at hand in all emergency departments (training and testing and k-fold cross-validation protocol) | Emergency Department at Niguarda Ca′ Granda, Milan, Italy. | 199 | Males 63.8% | Median : 65 [46–78] | Symptoms of presentation compatible with COVID-19 (fever, sore throat, cough, dyspnoea, chest pain, headache, syncope, asthenia, arthralgia, diarrhoea, nausea and vomit) | Age < 12 yrs and absence of evaluation of the leukocyte formula (defined as percent- ages of the five types of leukocytes: neutrophils, lymphocytes, eosinophils, basophils and monocytes) in the emergency department. |
| Hermans JJR ( | 2020 | The Netherlands | Prospective cohort study | To construct a predictive machine learning model based on chest CT and additional data to improve the diagnostic accuracy of chest CT (10-fold cross validation). | Franciscus Gasthuis & Vlietland hospital in Rotterdam and Schiedam, the Netherlands, which has a level 2 trauma center with 48,000 visits annually at the Emergency Department. | 319 patients | Males RT-qPCR+: 55.6%; -RT-qPCR-: 44.6% | Median (IQR) RT-qPCR+: 59 (50-68); RT-qPCR-: 62 (44-75) | Consecutive patients who visited the emergency department between March 27 and April 20, 2020, (a) age ≥ 18 yrs; (b) suspected infection with COVID-19 in combination with at least one of the following: (i) new respiratory symptoms persisting for ≤ 2 wks and present during the last 24 h, (ii) saturation ≤ 94% and/or respiration rate ≥ 20/min and/or abdominal complaints, and (iii) a high clinical suspicion even in the absence of symptoms; and (c) RT-qPCR and chest CT performed within 24 h after each other. | (a) Previously confirmed COVID-19 infection; (b) instability defined as a peripheral oxygen saturation < 92% despite 5 l of oxygen and/or a systolic blood pressure < 90 mmHg; (c) principal presentation due to high energetic trauma, thrombolysis, or acute coronary syndrome; (d) pregnancy; and (e) non-interpretable first RT-qPCR result. |
| Yang HS ( | 2020 | USA | Cross-sectional study | To develop a machine learning model integrating age, gender, race and routine laboratory blood tests, which are readily available with a short turn-around time. | New York Presbyterian Hospital/Weill Cornell Medicine (NYPH/WCM) and New York Presbyterian Hospital/Lower Manhattan Hospital (NYPH/LMH) during March 11 to April 29. | 1898 patients: Training set: 1,402 positive and 1,954 negative patients; Validation set: 496 positive and 968 negative patients. | NYPH/WCM: 46.4% and NYPH/LMH: 43.31% | NYPH/WCM: Mean, 56.44 (SD, 19.46) and NYPH/LMH: 56.20 (20.81) | Patients that received a COVID-19 RT-qPCR test in the hospital. | Patients < 18 yrs old, patients who had indeterminate RT-qPCR results, and patients who did not have laboratory results within two days prior to the completion of RT-qPCR testing |
| AlJame M ( | 2021 | Brazil and Italy | Cross–sectional study | To propose a machine learning prediction model to accurately diagnose COVID-19 from clinical and/or routine laboratory data. | Albert Einstein Israelita Hospital located in São Paulo, Brazil and San Raffaele Hospital, Milan, Italy | 5923 | NA | NA | Patients admitted to hospitals from 28 March to 3 April 2020 in Brazil, from the end of February 2020 to mid-March 2020 in Italy | NA |
| Tschoellitsch T ( | 2021 | Austria | Retrospective cohort study | To evaluate whether machine learning could exclude SARS-CoV-2 RT-qPCR infection using routinely available laboratory values. (internal validation using five-fold cross validation) | Kepler University Hospital in Linz, Austria | 1,357 | Males: 49.9% | Mean 56.3 yrs (sd 26.6) | Patients with a SARS test performed from 1 March until 30 April | |
| Du R ( | 2021 | China | Retrospective cohort study | To apply machine learning for the task of COVID-19 detection using basic laboratory markers and explore the adjunctive role of chest radiographs | 24 hospitals of Hong Kong | Primary cohort: 5230 patients; validation cohort: 605 patients | Males: 56% | NA | Patients with clinical suspicion of COVID-19 infection presenting to the accident and emergency department from the start of the COVID-19 outbreak and had a RT-PCR testing for SARS-CoV-2. The inclusion criteria were: (i) had frontal chest radiographs on the date of the RT-qPCR test, (ii) had laboratory testing done, specifically hematological blood count with or without differential counts, C-reactive protein and lactate dehydrogenase on the date of the RT-qPCR test. | Patients younger than 16 yrs old |
| Kurstjens S ( | 2020 | The Netherland | Retrospective cohort study | To develop an algorithm to rapidly evaluate an individual’s risk of SARS- CoV-2 infection at the emergency department. | Emergency department of three (discovery cohort) and four (validation cohort) different hospitals. | 967 patients: (a) discovery cohort: 375 patients, ad (b) validation cohort: 592 patients. | Discovery cohort: COVID-19 negative men 43%; COVID-19 positive men 64.1%. Validation cohort: COVID-19 negative men 53.3%; COVID-19 positive men 63.8%. | Mean (SD). Discovery cohort: COVID-19 negative 62 +-16; COVID-19 positive 70 +-12 Validation cohort: COVID-19 negative 63 +-17; COVID-19 positive 69 +-12 | Patients presenting at the emergency department with respiratory symptoms, or suspected COVID-19 infection because of gastro-intestinal complaints (1–2% of this cohort), and subsequent SARS-CoV-2 RT-qPCR | Patients from other departments and patients without any respiratory symptoms or suspicion of COVID-19. For the validation population, patients with missing values or hemolytic samples were excluded. |
| Tordjman M ( | 2020 | France | Retrospective cohort study | To determine a pre-test probability score for SARS-CoV-2 infection. | Four different university hospitals between 10 March and 30 April 2020. | 605: Discovery cohort: 200 patients (100 patients and 100 controls). | Discovery cohort: males: patients 65% and controls 45%. Validation cohort: 55% males | Discovery cohort: patients mean 65 (SD, 24) and controls mean 60 (SD, 31). Validation cohort: mean 65 | Discovery cohort: outpatients with both RT-qPCR and CT-scan results evaluated for a suspicion of SARS-CoV-2 infection between 10 March and 8 April 2020. Validation cohort: consecutive outpatients suspected of SARS-CoV-2 infection with both RT-qPCR and CT-scan results available. | Discovery cohort: (i) diagnosis still under investigation; (ii) lack of blood test including complete white blood cell count and serum electrolytes; (iii) absence of reported clinical characteristics. |
| Gupta-Wright A ( | 2021 | UK | Retrospective cohort study | To develop and internally validate (bootstrap) a diagnostic risk score to predict risk of COVID-19 (including RT-PCR-negative COVID-19) among medical admissions. | Two hospitals within an acute NHS Trust in London, UK. | 4,008 patients | Males: 58.3% | Median, 65 (IQR 57–76) | All patients admitted to medical wards between 2 March and 3 May 2020 with a RT-qPCR test (the decision to test was based on a clinical suspicion). | NA |
| Vieceli T ( | 2020 | Brazil | Cross-sectional study | To develop a useful predictive tool for COVID-19 diagnosis based on clinical, laboratory and image data prior to RT-PCR test confirmation. (Internal validation with bootstrapping) | Hospital de Clínicas de Porto Alegre | 100 | Males 43% | Median 58 (IQR 40–69.5) | The first consecutive patients aged 18 or older admitted to hospital due to suspected COVID-19 from 17 March to 10 April 2020. | Patients discharged within 24 h of admission |
| Pardo Lledias J ( | 2020 | Spain | Cross-sectional study | To define a score with different clinical probabilities and analyzing the usefulness of repetition of nasopharyngeal smears based on these. | Two COVID internal medicine wards at the University Hospital Marqués de Valdecilla, Santander, Spain (tertiary hospital that covers a population of about 350,000 inhabitants), from March to April 2020. | 145 | Males 59.9% | Mean, 66.9 (SD,17.8) | Patients admitted for suspected SARS-CoV-2 infection | NA |
| Trubiano JA ( | 2020 | Australia | Cross-sectional study | To report the clinical and epidemiological predictors of COVID-19 from a uniquely derived prospective database and present a point-of-care ready COVID-19 clinical decision tool (internal validation with booststrap). | A COVID-19 rapid assessment screening clinic at Austin Health on 11 March 2020 | 2935 | Males 36.5% | Median, 39 (IQR: 29, 53) | Patients assessed for COVID-19 at a screening clinic due to symptoms (98.3%), contact with known COVID-19 positive patient (17.3%) or overseas travel (24.6%) | NA |
| Ahmed S ( | 2021 | Pakistan | Cross-sectional study | To validate the performance of Corona-Score in a cohort of Pakistani patients pursuing care for suspected infection. | Section of Chemical Pathology in collaboration with the Section of Molecular Pathology, Department of Pathology and Laboratory Medicine, Aga Khan University (AKU), Karachi, | 60 patients: 30 RT-qPCR negative and 30 RT-PCR positive. | Males: 41.7% | RT-qPCR positive: mean, 33.1 yrs (SD, 6.5); RT-PqCR negative: mean 60.5 (SD, 16.5) | Suspected COVID-19 cases with respiratory symptoms were recouped from electronical medical records. | NA |
| Elimian KO ( | 2021 | Nigeria States and the Federal Capital Territory. | Retrospective cohort study | To develop and validate the predictive capacity of clinical signs and symptoms about testing positive for COVID-19 | Surveillance, Outbreak Response Management and Analysis System (SORMAS) database from 27 February to 27 August 2020. | 181,544 patients: derivation set: 90,722; validation set: 90,693. | Males: children, 62.2%; adults, 64%, and elderly, 63.7% | Children (≤17 yrs): 37%; Adults: 61.8%, Elderly: 1.8%. | Patients who were tested for SARS-CoV-2 (according with NCDC COVID-19 standard case definition for suspected cases) with at least one symptom positively recorded. Persons concerned about COVID-19 infection on presentation to testing centers were also included. | No PCR result available; no data on age, sex |
| McRae AD ( | 2021 | Canada | Prospective cohort study | To develop and validate a clinical risk score that can accurately quantify the probability of SARS-CoV- 2 infection in patients presenting to an emergency department without the need for laboratory testing. | 32 emergency departments in eight Canadian provinces.(Canadian COVID-19 Emergency Department Rapid Response Network (CCEDRRN) registry) | 27,665 | Males: 49.5% | Derivation cohort: median 57 yrs (IQR 38–73); validation cohort: median 56 yrs (IQR 37–73) | Consecutive patients aged 18 yrs and older who had a biological sample collected for nucleic acid amplification test (NAAT) on their index emergency department visit or, if admitted, within 24 h of emergency department arrival for known risk factors for SARS-CoV-2 infection, work as a healthcare provider, institutional living, close personal or household contacts with SARS-CoV-2 infection, and symptoms including cough, anosmia or dysgeusia, fever, myalgias and vital signs. | Patients who had a positive SARS-CoV- 2 NAAT within 14 days prior to their emergency department visit, patients with cardiac arrest prior to emergency department arrival and those with missing outcome data. |
| Confirmed COVID 19 infection | ||||||||||
| Gatti M ( | 2020 | Italy | Retrospective cohort study | To describe CXR findings and clinical and laboratory characteristics associated with positive and negative CXR. | Emergency department of two Northern Italy hospitals | 260 | Males 61% | 62.8 ± 15.8 yrs | Consecutive patients who were admitted to the emergency department of two Northern Italy hospitals between 1 March and 31 March 2020 with COVID-19 confirmed by RT-qPCR, and who underwent CXR within 24 h of the swab execution. | NA |
| Góreke V ( | 2021 | Israel | Cross-sectional study | To obtain a new feature group based on laboratory findings using the developed algorithm and to detect COVID-19 with a new hybrid classifier based on deep learning that uses the new feature group as input parameters. | Albert Einstein Israelita Hospital | 600 patients: 520 diagnosed as COVID-19 and 80 healthy individuals | NA | NA | NA | NA |
| Arpaci I ( | 2021 | China | Cross-sectional study | To predict the COVID-19 positive or negative cases based on 14 clinical features using machine learning classification algorithms. (Internal validation using 10-fold-cross validation) | Taizhou hospital of Zhejiang Province in China | 114 | Male: 59.6% | Mean, 39.6 (SD 18.8) (range 1–80) | NA | NA |
| Marateb HR ( | 2021 | Iran | Open-cohort study | To design and implement a reliable COVID-19 diagnosis method to provide the risk of infection using demographics, symptoms and signs, blood markers, and family history of diseases to have excellent agreement with the results obtained by the RT-PqCR and CT-scan. (Internal validation using 10-fold-cross validation) | Khorshid Covid Cohort is a hospital-based surveillance study to investigate COVID-19 and non-COVID pneumonia patients since February 2020. | Dataset 1: 634 patients with COVID-19 and 118 with non-COVID-19 pneumonia. | Dataset 1: Males: 47.5% (non-COVID-19 pneumonia) and 61.4% (COVID-19) Dataset 2: 45.9%. Dataset 3 not provided. | Dataset 1: Mean,61.7 (SD, 18.3) (non-COVID-19 pneumonia) and 57.0 (15.4) (patients with COVID-19) Dataset 2: Over 60; 89 (14.0%) Dataset 3: not provided | NA | NA |
| Yousif AY ( | 2022 | Iraq | Cross-sectional study | To find a diagnostic method for the COVID-19 virus in Iraq through machine learning algorithms based on blood tests features of Iraqi patients trying to enhance the classification accuracy by selecting the appropriate one for the early prediction of COVID-19. | Samples collected from many private laboratories in Iraq/Baghdad | 300 (213 not infected, 87 infected individuals). | NA | NA | NA | NA |
| Plante TB ( | 2020 | USA and Israel | Case-control study | To describe the development of a machine learning model for ruling out COVID-19 using only routinely collected laboratory tests. Furthermore, to assess the AUROC curve of the model with both COVID-19 PCR test results (cases) and prepandemic patients (controls). | Three data sets of patients in emergency department : (a) The Premier Healthcare Database (155 hospitals); (b) Cedars-Sinai Medical Center an 886-bed academic medical center in Los Angeles, CA; and (3) the Beth Israel Deaconess Medical Center a 673-bed academic medical center in Boston. | 192,778 patients: training set (12,183) and validation set (172,754); sensitivity analysis (7,842) | Median age decile 50 (IQR 30-60) | Males: 40.5% | Adults aged ≥20 yrs in an emergency department at an included center during one of the prepandemic or pandemic time frames. | Missing a laboratory result included in the model on the day of presentation to the ED or if any of their laboratory results were reported with inappropriate units or incorrect specimen type. |
| Other | ||||||||||
| Banerjee A ( | 2020 | Brasil | Cross-sectional study | To use machine learning, an artificial neural network, random forest, and a simple statistical test to identify SARS-CoV-2-positive patients from full blood counts without knowledge of symptoms or history of the individuals. | Hospital Israelita Albert Einstein, at São Paulo, Brazil. | 598 | NA | NA | Patients who had samples collected to perform the SARS-CoV-2 RT-qPCR and additional laboratory tests during a visit to the hospital. | Patients in semi-intensive unit and ICU |
AUC, area under the receiver operating characteristic curve; CT, computerized tomography; CXR, chest X-rays; ED, emergency department; ICU, intensive care unit; IQR, interquartile range; NA, not available; SD, Standard deviation.
Description of the variables included in the study, the results obtained and the main conclusions of the study
| Reference | Variables/tests included | Reference standard | Results | Authors' conclusion |
|---|---|---|---|---|
| Symptomatic patients with suspected COVID 19 infection | ||||
| Guo X ( | CT scans; pharyngeal swab samples for first RT-qPCR analysis; blood samples (WBC, LYC, and LYP); age, sex, epidemiological record and clinical symptoms (fever, sore throat, cough, fatigue). | First RT-qPCR | An integrated multi-feature model (RT-qPCR, CT features, and LYP) established with random forest algorithm showed the diagnostic accuracy of 92.0% (95% CI: 73.9 - 99.1) in the training set, and 96. 6% (95% CI: 79.6 - 99.9) in the internal validation cohort. The model also performed well in the external validation cohort with an AUC of 93 (95% CI: 79 - 100). | The developed multivariate model based on machine learning techniques could be an efficient tool for COVID-19 screening in nonendemic regions with a high rate of influenza and CAP in the post-COVID-19 era. |
| Langer T ( | Age, gender, presence and type of comorbidities, reported symptoms. Vital signs upon admission to the ED (first measurement), presence and type of ventilatory support, routinely performed blood tests, major electrocardiographic characteristics (presence of sinus rhythm and ST abnormalities) and CXR (presence of any type of parenchymal involvement, presence of pleural effusion) | RT-qPCR | The best Machine Learning System reached an accuracy of 91.4% with 94.1% sensitivity and 88.7% specificity. | Basic clinical data might be sufficient for properly trained algorithms to predict with good accuracy the positivity and negativity to SARS-CoV-2. |
| Hermans JJR ( | CXR (classified according to the CO-RADS classification), ferritin, leucocyte count, CK, presence of diarrhea and no. of days since onset of disease. | RT-qPCR | The prediction model with CO-RADS, ferritin, leucocyte count, CK, days of complaints, and diarrhea as predictors had a sensitivity, specificity, PPV, and NPV of 89.3%, 93.4%, 90.8%, and 92.3%, respectively. AUC = 93.4% | Combining a predictive machine learning model could further improve the accuracy of diagnostic chest CT for COVID-19. Further candidate predictors should be analyzed to improve our model. However, RT-PCR should remain the primary standard of testing as up to 9% of RT-PCR positive patients are not diagnosed by chest CT or our machine learning model. |
| Yang HS ( | Patient demographic features (age, sex, race) with 27 routine laboratory tests | RT-qPCR | The model achieved an area under the AUC of 85.4 (95% CI: 82.9-87.8). | This model employing routine laboratory test results offers opportunities for early and rapid identification of high-risk SARS-CoV-2 infected patients before their RT- PCR results are available. |
| AlJame M ( | Clinical tests: AST, ALT, WBC, platelets, CRP, ALP, LDH, monocytes, gender, age. | RT-qPCR | Experimental results show that the proposed DF model has an accuracy of 99.5%, sensitivity of 95.28%, and specificity of 99.96%. | These performance metrics are comparable to other well-established machine learning techniques, and hence, DF model can serve as a fast-screening tool for COVID-19 patients at places where testing is scarce. |
| Tschoellitsch T ( | Standard laboratory values: blood count, electrolytes, C-reactive protein, creatinine, blood urea nitrogen, liver enzymes, bilirubin, cholinesterase, and prothrombin time. | RT-qPCR | The machine learning model could predict SARS-CoV-2 test results with an accuracy of 86% and an area under the ROC of 0.74. | Machine learning methods can reliably predict a negative SARS-CoV-2 RT-PCR test result using standard blood tests |
| Du R ( | Sex, age, laboratory data: hemoglobin, hematocrit, white blood cells, lymphocytes, monocyte, neutrophil, platelet, CRP, LDH, and chest radiographs. | RT-qPCR | For predicting SARS-CoV-2 infection, the ML model achieved high AUCs and specificity but low sensitivity in all three validation sets (AUC: 89.9–95.8%; sensitivity: 55.5–77.8%; specificity: 91.5–98.3%). When used in adjunction with radiologist interpretations of chest radiographs, the sensitivity was over 90% while keeping moderate specificity. | The study showed that machine learning model based on readily available laboratory markers could achieve high accuracy in predicting SARS-CoV-2 infection. |
| Kurstjens S ( | Laboratory measurements (CRP, ALC, ANC, LDH and ferritin), age, sex and CXR/CT | RT-qPCR | The corona-score model yielded an AUC of 91% in the validation population. | The corona-score provides the means for medical professionals to rapidly evaluate SARS-CoV-2 infection status of patients presenting at the ED with respiratory symptoms. |
| Tordjman M ( | Demographic characteristics, comorbidities (hypertension, respiratory diseases [asthma, COPD], immunodeficiency, renal insufficiency), clinical symptoms (cough, fever, headache, diarrhea, anosmia, ageusia, oxygen desaturation), and biological tests (WBC, serum electrolytes and RP) | RT-qPCR and/or CT-scan showing signs of COVID-19 pneumonia | In the multivariate analysis, lymphocyte (<1.3 G/L), eosinophil (<0.06 G/L), basophil (<0.04 G/L) and neutrophil counts (<5 G/L) were associated with high probability of SARS-CoV-2 infection, but no clinical variable was statistically significant. The score had a good performance in the validation cohort (AUC = 91.8 (CI: [89.1–94.6]; STD = 0.014) with a Positive Predictive Value of high-probability score of 93% (95%CI: [89–96]). Low-probability score excluded SARS-CoV-2 infection with a Negative Predictive Value of 98% (95%CI: [93–99]). | The PARIS score has a good performance to categorize the pre-test probability of SARS- CoV-2 infection based on complete white blood cell count. It could help clinicians adapt testing and for rapid triage of patients before test results. |
| Gupta-Wright A ( | Demographic characteristics (age, sex, ethnicity), CXR, clinical symptoms associated with COVID-19 (cough, fever or shortness of breath), vital signs (NEWS 2 and laboratory bloods (CRP and arterial/venous blood gas) | Patients with a positive SARS-CoV-2 RT-qPCR within 7 days before or after the date of admission and had a discharge diagnosis of COVID-19. | The following variables: age, sex, ethnicity, cough, fever or shortness of breath, NEW 2, CRP, and CXR appearance had moderate discrimination (AUC 83%, 95% CI 82 – 85%), good calibration and was internally validated. | Diagnostic risk scores could potentially help triage patients requiring admission but need external validation. |
| Vieceli T ( | Clinical characteristics, PSI/PORT score, comorbidities, laboratory findings and radiographic findings. | RT-qPCR | Variables associated with COVID-19 diagnosis in multivariate analysis were leukocyte count ≤7.7 × 103 mm–3, LDH >273U/L, and CXR abnormality. After bootstrapping, the corrected AUC for this model was 82.7 (95% CI 75–90). | This predictive score that can be easily applied in clinical practice, but it is yet to be validated in larger populations. |
| Pardo Lledias J ( | Epidemiological contact, clinical presentation as pneumonia, absence of pneumonia in the last yr, onset of symptoms > 7 days, two or more of the following symptoms -dyspnea, cough or fever- and serum lactate dehydrogenase levels >350 U/L (p < 0.05). | RT-qPCR | A score based on the independent variables yielded an AUC-ROC of 89 (CI95%, 83.1–94.6; p < 0.001). The accuracy of the first nasopharyngeal swabs was 54.9%. Repeating nasopharyngeal swabs two or three times allows to detect an additional 16% of positive cases. The overall accuracy of successive RT-PCR tests in patients with low pre-test probability was <5%. | The pre-test probability score based on epidemiological and clinical data showed a high accuracy for diagnosis of SARS-CoV-2. Repeating nasopharyngeal swabs avoids sampling errors, but only in medium of high probability pre-test clinical scenarios. |
| Trubiano JA ( | Clinical data from the data collection tool (baseline demographics, clinical symptoms, clinical observations) | RT-qPCR | The 7 features associated with a positive COVID- 19 test on multivariable analysis were: COVID-19 patient exposure or international travel, myalgia/malaise, anosmia or ageusia, temperature, coryza/sore throat, hypoxia–oxygen saturation < 97%, 65 yrs or older. Internal validation showed an AUC of 83.6. | The clinical decision rule, COVID-MATCH65 has a high sensitivity and negative predictive value for SARS-CoV-2 and can be employed in the pandemic, adjusted for disease prevalence, to aid COVID-19 risk-assessment and vital testing resource allocation. Authors encourage readers to urgently employ and validate COVID-MATCH65 in their own datasets. |
| Ahmed S ( | Biochemical data (serum LDH, CRP and ferritin), hematological data (absolute neutrophil and lymphocyte count) and imaging data (presence of infiltrates on chest X-ray). | RT-qPCR | The AUC of Corona-Score in our population of patients was 0.59 (95% CI: 0.45–0.74). Using the cut-off values of 4 originally identified by Kurstjens et al. for their study population, the model displayed 43.3% sensitivity and 70% specificity with an overall accuracy of 56.67%. | Corona Score is an easy-to-use algorithm for identification of COVID-19 patients with respiratory symptoms and needs to be further validated on a bigger sample size. Large multi-center studies across the country are in dire need of time to evaluate the score in overly. |
| Elimian KO ( | Clinical signs and symptoms: chills/sweat; cough; breathing difficulty; rapid breathing; runny nose; abdominal pain/diarrhea; gastrointestinal symptoms; chest pain; fatigue; headache; musculoskeletal pain; sore throat; loss of taste; loss of smell; fever. | RT-qPCR | Best individual symptom predictor of COVID-19 positivity was loss of smell in children (AUROC 0.56, 95%; CI 0.55 to 0.56), either fever or cough in adults (AUROC 0.57, 95%; CI 0.56 to 0.58) and difficulty in breathing in the elderly (AUROC 0.53, 95%; CI 0.48 to 0.58) patients. | The predictive capacity of various symptom scores for COVID-19 positivity was poor overall. However, the findings could serve as an advocacy tool for more investments in resources for capacity strengthening of molecular testing for COVID-19 in Nigeria. |
| McRae AD ( | Demographics: age, sex, arrival from (home + other, single room + no fixed address + shelter, institutional living, inter-hospital transfer), infection risk, emergency department variables, COVID symptoms, 7-day avg incident COVID-19 cases | NAAT | The score had a c-statistic of 0.838 with excellent calibration. | The score can identify patients at sufficiently high risk of SARS-CoV-2 infection to warrant isolation and empirical therapy prior to test confirmation while also identifying patients at sufficiently low risk of infection that they may not need testing |
| Confirmed COVID 19 infection | ||||
| Gatti M ( | Comorbidities (presence of cardiac disease, diabetes, obesity, hypertension, smoke history, ACEi/Sartan or FANS therapy), Clinical data (fever, cough, rhinitis, dyspnea, pharyngodynia, myalgias, asthenia, conjunctivitis, headache, nausea, vomit and diarrhea), laboratory data (WBC, CRP, LDH, hepatic enzymes, CK, blood’s pH, PaCO2) and CXR. | RT-qPCR | The ROC curve procedure determined that CXR+ was associated with LDH > 500 UI/L (AUC = 87.8%), CRP > 30 mg/L (AUC = 83.0%) and interval between the onset of symptoms and the execution of CXR > 4 days (AUC = 75.0%). The presence of two out of three of the above-mentioned predictors resulted in CXR+ in 92.5% of cases, whereas their absence in 7.4%. | CXR has a low sensitivity. LDH, CRP and interval between the onset of symptoms and the execution of CXR are major predictors for a positive CXR. |
| Góreke V ( | Laboratory tests routinely collected, age, race, sex, and disease severity subgroups. | RT-qPCR | A development and external validation study of a machine learning model for COVID-19 status using laboratory tests routinely collected in adult ED patients found high discrimination across age, race, sex, and disease severity subgroups. The AUROC for the training and external validation data set was 91% (95% CI 90%–92%). | A machine learning model developed with multicenter clinical data integrating commonly collected ED laboratory data demonstrated high rule-out accuracy for COVID-19 status and might inform selective use of PCR-based testing. |
| Arpaci I ( | Laboratory data: hematocrit, hemoglobin, platelets, red blood cells, lymphocytes, leukocytes, basophils, eosinophils, monocytes, serum glucose, neutrophils, urea, CRP, creatinine, potassium, sodium, ASL, ASP. | RT-qPCR | Classification performance indicators were obtained as accuracy of 94.95%, F1-score of 94.98%, precision of 94.98%, recall of 94.98% and AUC of 100%. | Proposed method shows superior performance and can provide more convenience and precision to experts for diagnosis of COVID-19 disease. |
| Marateb HR ( | Laboratory data: white blood cell count, neutrophil, lymphocyte, monocytes, eosinophil, basophils, neutrophil-lymphocyte, lymphocyte/monocyte, hemoglobin, hematocrit, mean red blood cell volume, platelet, thrombocytocrit and procalcitonin. | RT-qPCR | The CR meta-classifier is the most accurate classifier for predicting the positive and negative COVID-19 cases with an accuracy of 84.21%. | The results could help in the early diagnosis of COVID-19, specifically when the RT-PCR kits are not sufficient for testing the infection and assist countries, specifically the developing ones that suffer from the shortage of RT-PCR tests and specialized laboratories. |
| Yousif AY ( | Demographics: age, gender, occupation; laboratory data: white blood cells, CRP, LDH, PLT, lymphocytes, hemoglobin sodio; symptoms and signs: shortness of breath, PCO2, cough details, decreased appetite, headache, body temperature; other: contact with conformed COVID-19 patients, other comorbidities, sore throat, myalgia, chronic respiratory disease, symptom duration, COPD, weight loss, chills, diarrhea, SatO2 | RT-qPCR and chest CT | Sensitivity of 96% (CI, 95%: 94–98), specificity of 95% (90–99), positive predictive value (PPV) of 99% (98–100)], negative predictive value (NPV) of 82% (76–89), diagnostic odds ratio (DOR) of 496 (198–1,245), area under the ROC 0.96 (0.94–0.97), Matthews Correlation Coefficient of 0.87 (0.85–0.88), accuracy of 96% (94–98), and Cohen’s Kappa of 0.86 (0.81–0.91). The AUC on the datasets 2 and 3 was 0.97 (0.96–0.98) and 0.92 (0.91–0.94), respectively. The most important feature was white blood cell count, shortness of breath, and C-reactive protein for datasets 1, 2, and 3, respectively. | The proposed algorithms, thus, a promising COVID-19 diagnosis method, which could be an amendment to simple blood tests and screening of symptoms. However, the RT-PCR and chest CT-scan, performed as the gold standard, are not 100% accurate. |
| Plante TB ( | Laboratory data: oxygen content, ferritin, CRP, WBC, LYM, GRA, RBC. | NA | The results show that the best classification accuracy obtained was 0.87, associated with an F1-Score of 0.91. | Machine learning algorithms can be used in conjunction with blood tests in countries with insufficient resources to combat this pandemic. |
| Other | ||||
| Banerjee A ( | Standard full blood count: hematocrit, hemoglobin, platelets, MPV, RBC, lymphocytes, MCHC, leukocytes, basophils, neutrophils, MCH, eosinophils, MCV, monocytes and RBCDW. | RT-qPCR | Full blood counts random forest, shallow learning and a flexible model predict SARS- CoV-2 patients with high accuracy between populations on regular wards (AUC = 94–95%) and those not admitted to hospital or in the community (AUC = 80–86%). A simple linear combination of 4 blood counts can be used to have an AUC of 85% for patients within the community. | This new methodology has potential to greatly improve initial screening for patients where based diagnostic tools are limited. Further validation will be required to determine if the model can distinguish fully from other pathogens. |
ACEi/Sartan or FANS therapy; ALC, absolute leukocyte count; ANC, absolute neutrophil count; AUC, area under the receiver operating characteristic curve; CK, creatin kinase; COPD, chronic obstructive pulmonary disease; CRP, C-reactive protein; CT, computerized tomography; CXR, chest X-rays; ED, emergency department; LDH, lactate dehydrogenase; LYC, lymphocyte count; LYP, blood lymphocyte percentage; MCHC, mean corpuscular hemoglobin concentration; MCV, mean corpuscular volume; MPV, mean platelet volume; NAAT, nucleic acid amplification test; NEWS 2, National Early Warning Score 2; PaCO2, arterial partial pressure of carbon dioxide; PSI/PORT score, Pneumonia Severity Index for community-acquired pneumonia; RBC, red blood cells; RBCDW, red blood cell distribution width; WBC, white blood cell count.
Variables finally included in the model to diagnose infection with symptomatic patients with suspected COVID 19 infection
| References | AUC (IC 95%) | RT-qPCR | Demographic (age, sex) | Clinical characteristics | Image | Whole blood tests | Biochemical measurements | |||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Comorbidities | Symptoms | Exposure or international travel | ECG | Days since onset | CT | CXR | RBC | LYP | LYC | WBC | Laboratory routine tests | CRP | LDH | Ferritin | CK | Arterial/venous blood gas | Cholinesterase | Prothrombin time | ||||
| Guo X ( | 93 (79–100) | X | X | X | ||||||||||||||||||
| Langer T ( | 90 | X | X | X | X | X | X | X | ||||||||||||||
| Hermans JJR ( | 91.4 (87.9–94.9) | X | X | X | X | X | X | |||||||||||||||
| Yang HS ( | 85.4 (82.9–87.8) | X | X | X | X | X | X | |||||||||||||||
| Aljame M ( | Accuracy 99.5% | X | X | X | X | X | X | |||||||||||||||
| Tschoellitsch T ( | 0.74 | X | X | X | X | X | ||||||||||||||||
| Du R ( | 89.9–95.8 | X | X | X | X | X | X | X | X | |||||||||||||
| Kurstjens S ( | 91 | X | X | X | X | X | X | X | X | |||||||||||||
| Tordjman M ( | 91.8 (89.1–94.6) | X | X | |||||||||||||||||||
| Gupta-Wright A ( | 83 (82–85) | X | X | X | X | X | ||||||||||||||||
| Vieceli T ( | 82.7 (75–90) | X | X | X | ||||||||||||||||||
| Pardo Lledias J ( | 89 (83.1–94.6) | X | X | X | ||||||||||||||||||
| Trubiano J ( | 83.6 | X | X | X | ||||||||||||||||||
| Ahmed S ( | 59 (0.45–0.74) | X | X | X | X | X | X | |||||||||||||||
| Elimian KO ( | Loss of smell in children: 0.56 (0.55– 0.56), either fever or cough in adults: 0.57 (0.56–0.58), and difficulty breathing in the elderly: 0.53 (0.48–0.58). | X | ||||||||||||||||||||
| Mc Rae AD ( | Sensitivity: 97.4 (96.4–98.3), Specificity: 95.9 (95.5–96) | X | X | X | X | X | X | X | X | X | X | X | X | X | X | X | X | X | X | |||
CK, Creatin Kinase; CRP, C-reactive protein; CT, computerized tomography; CXR, chest X-rays; ECG, electrocardiogram; LDH, lactate dehydrogenase; LYP, blood lymphocyte percentage; LYC, lymphocyte count; RBC, red blood cell; WBC, white blood cell count.
Variables finally included in the model to diagnose infection with SARS-COV-2 in patients diagnosed with COVID-19
| References | AUC | Demographic (age, sex, race) | Occupation | Contact with COVID-19 patients | Clinical characteristics | Image | Whole blood tests | Biochemical measurements | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Comorbidities | Symptoms | Days since onset | CT | CXR | RBC | LYP | LYC | WBC | Laboratory routine tests | CRP | LDH | Arterial/venous blood gas | Procalcitonin | |||||
| Gatti M ( | 87.8 | X | X | X | X | |||||||||||||
| Plante TB ( | 91 (90–92) | X | X | X | ||||||||||||||
| Goreke V ( | 100 | X | X | X | X | X | X | |||||||||||
| Arpaci I ( | Accuracy: 84.2 | X | X | X | X | X | ||||||||||||
| Marateb HR ( | 87–96 | X | X | X | X | X | X | X | X | |||||||||
| Yousif AY ( | Accuracy: 87 | X | X | X | X | X | X | |||||||||||
CRP, C-reactive protein; CT, computerized tomography; CXR, chest X-rays; LDH, lactate dehydrogenase; LYP, blood lymphocyte percentage; LYC, lymphocyte count; RBC, red blood cell; WBC, white blood cell count.
Description of the compliance with items included in the checklist of the Standards for Reporting Diagnostic Accuracy Studies guidelines (STARD)
| Item from STARD Checklist | Compliance | |||
|---|---|---|---|---|
| N | (%) | |||
| Title or abstract | ||||
| 1 | Identification as a study of diagnostic accuracy using at least one measure of accuracy (such as sensitivity, specificity, predictive values, or AUC) | 12 | 35.3 | |
| Abstract | ||||
| 2 | Structured summary of study design, methods, results, and conclusions (for specific guidance, see STARD for Abstracts) | 16 | 47.1 | |
| Introduction | ||||
| 3 | Scientific and clinical background, including the intended use and clinical role of the index test | 21 | 61.8 | |
| 4 | Study objectives and hypotheses | 21 | 61.8 | |
| Methods | ||||
| Study design | 5 | Whether data collection was planned before the index test and reference standard were performed (prospective study) or after (retrospective study) | 18 | 52.9 |
| Participants | 6 | Eligibility criteria | 13 | 38.2 |
| 7 | On what basis potentially eligible participants were identified (such as symptoms, results from previous tests, inclusion in registry) | 17 | 50 | |
| 8 | Where and when potentially eligible participants were identified (setting, location, and dates) | 19 | 55.9 | |
| 9 | Whether participants formed a consecutive, random or convenience series | 12 | 35.3 | |
| Test methods | 10a | Index test, in sufficient detail to allow replication | 17 | 50 |
| 10b | Reference standard, in sufficient detail to allow replication | 14 | 41.2 | |
| 11 | Rationale for choosing the reference standard (if alternatives exist) | 10 | 29.4 | |
| 12a | Definition of and rationale for test positivity cut-offs or result categories of the index test, distinguishing prespecified from exploratory | 14 | 41.2 | |
| 12b | Definition of and rationale for test positivity cutoffs or result categories of the reference standard, distinguishing prespecified from exploratory | 9 | 26.5 | |
| 13a | Whether clinical information and reference standard results were available to the performers/readers of the index test | 1 | 2.9 | |
| 13b | Whether clinical information and index test results were available to the assessors of the reference standard | 6 | 17.6 | |
| Analysis | 14 | Methods for estimating or comparing measures of diagnostic accuracy | 20 | 58.8 |
| 15 | How indeterminate index test or reference standard results were handled | 1 | 2.9 | |
| 16 | How missing data on the index test and reference standard were handled | 5 | 14.7 | |
| 17 | Any analyses of variability in diagnostic accuracy, distinguishing pre-specified from exploratory | 13 | 38.2 | |
| 18 | Intended sample size and how it was determined | 2 | 5.9 | |
| Participants | 19 | Flow of participants using a diagram | 8 | 23.5 |
| 20 | Baseline demographic and clinical characteristics of participants | 18 | 52.9 | |
| 21a | Distribution of severity of disease in those with the target condition | 10 | 29.4 | |
| 21b | Distribution of alternative diagnoses in those without the target condition | 5 | 14.7 | |
| 22 | Time interval and any clinical interventions between index test and reference standard | 4 | 11.8 | |
| Test results | 23 | Cross tabulation of the index test results (or their distribution) by the results of the reference standard | 19 | 55.9 |
| 24 | Estimates of diagnostic accuracy and their precision (such as 95% confidence intervals) | 20 | 58.8 | |
| 25 | Any adverse events from performing the index test or the reference standard | 0 | 0 | |
| Discussion | ||||
| 26 | Study limitations, including sources of potential bias, statistical uncertainty, and generalizability | 21 | 61.8 | |
| 27 | Implications for practice, including the intended use and clinical role of the index test | 19 | 55.9 | |
| Other Information | ||||
| 28 | Registration no. and name of registry | 2 | 5.9 | |
| 29 | Where the full study protocol can be accessed | 2 | 5.9 | |
| 30 | Sources of funding and other support; role of funders | 15 | 44.1 | |