| Literature DB >> 32265220 |
Laure Wynants1,2, Ben Van Calster2,3, Gary S Collins4,5, Richard D Riley6, Georg Heinze7, Ewoud Schuit8,9, Marc M J Bonten8,10, Darren L Dahly11,12, Johanna A A Damen8,9, Thomas P A Debray8,9, Valentijn M T de Jong8,9, Maarten De Vos2,13, Paul Dhiman4,5, Maria C Haller7,14, Michael O Harhay15,16, Liesbet Henckaerts17,18, Pauline Heus8,9, Michael Kammer7,19, Nina Kreuzberger20, Anna Lohmann21, Kim Luijken21, Jie Ma5, Glen P Martin22, David J McLernon23, Constanza L Andaur Navarro8,9, Johannes B Reitsma8,9, Jamie C Sergeant24,25, Chunhu Shi26, Nicole Skoetz19, Luc J M Smits1, Kym I E Snell6, Matthew Sperrin27, René Spijker8,9,28, Ewout W Steyerberg3, Toshihiko Takada8, Ioanna Tzoulaki29,30, Sander M J van Kuijk31, Bas van Bussel1,32, Iwan C C van der Horst32, Florien S van Royen8, Jan Y Verbakel33,34, Christine Wallisch7,35,36, Jack Wilkinson22, Robert Wolff37, Lotty Hooft8,9, Karel G M Moons8,9, Maarten van Smeden8.
Abstract
OBJECTIVE: To review and appraise the validity and usefulness of published and preprint reports of prediction models for diagnosing coronavirus disease 2019 (covid-19) in patients with suspected infection, for prognosis of patients with covid-19, and for detecting people in the general population at increased risk of covid-19 infection or being admitted to hospital with the disease.Entities:
Mesh:
Year: 2020 PMID: 32265220 PMCID: PMC7222643 DOI: 10.1136/bmj.m1328
Source DB: PubMed Journal: BMJ ISSN: 0959-8138
Fig 1PRISMA (preferred reporting items for systematic reviews and meta-analyses) flowchart of study inclusions and exclusions. CT=computed tomography
Overview of prediction models for diagnosis and prognosis of covid-19
| Study; setting; and outcome | Predictors in final model | Sample size: total No of participants for model development set (No with outcome) | Predictive performance on validation | Overall risk of bias | ||
|---|---|---|---|---|---|---|
| Type of validation* | Sample size: total No of participants for model validation (No with outcome) | Performance* (C index, sensitivity (%), specificity (%), PPV/NPV (%), calibration slope, other (95% CI, if reported)) | ||||
|
| ||||||
| Decaprio et al | Age, sex, number of previous hospital admissions, 11 diagnostic features, interactions between age and diagnostic features | 1.5 million (unknown) | Training test split | 369 865 (unknown) | C index 0.73 | High |
| Decaprio et al | Age and ≥500 features related to diagnosis history | 1.5 million (unknown) | Training test split | 369 865 (unknown) | C index 0.81 | High |
| Decaprio et al | ≥500 undisclosed features, including age, diagnostic history, social determinants of health, Charlson comorbidity index | 1.5 million (unknown) | Training test split | 369 865 (unknown) | C index 0.81 | High |
|
| ||||||
| Original review | ||||||
| Feng et al | Age, temperature, heart rate, diastolic blood pressure, systolic blood pressure, basophil count, platelet count, mean corpuscular haemoglobin content, eosinophil count, monocyte count, fever, shiver, shortness of breath, headache, fatigue, sore throat, fever classification, interleukin 6 | 132 (26) | Temporal validation | 32 (unclear) | C index 0.94 | High |
| Lopez-Rincon et al | Specific sequences of base pairs | 553 (66) | 10-fold cross validation | Not applicable | C index 0.98, sensitivity100, specificity 99 | High |
| Meng et al | Age, activated partial thromboplastin time, red blood cell distribution width SD, uric acid, triglyceride, serum potassium, albumin/globulin, 3-hydroxybutyrate, serum calcium | 620 (302) | External validation | 145 (80) | C index 0.87‡ | High |
| Song et al | Fever, history of close contact, signs of pneumonia on CT, neutrophil to lymphocyte ratio, highest body temperature, sex, age, meaningful respiratory syndromes | 304 (73) | Training test split | 95 (18) | C index 0.97 (0.93 to 1.00) | High |
| Yu et al | Direct bilirubin; alanine transaminase | 105 (8) | Apparent performance only | Not applicable | F1 score 1.00 | High |
| Update 1 | ||||||
| Martin et al | Unknown | Not applicable | External validation only (simulation) | Not applicable | Sensitivity 97, specificity 96 | High |
| Sun et al | Age, sex, temperature, heart rate, systolic blood pressure, diastolic blood pressure, sore throat | 292 (49) | Leave-one-out cross validation | Not applicable | C index 0.65 (0.57 to 0.73) | High |
| Sun et al | Sex, temperature, heart rate, respiration rate, diastolic blood pressure, sore throat, sputum production, shortness of breath, gastrointestinal symptoms, lymphocytes, neutrophils, eosinophils, creatinine | 292 (49) | Leave-one-out cross validation | Not applicable | C index 0.88 (0.83 to 0.93) | High |
| Sun et al | Sex, temperature, heart rate, respiration rate, diastolic blood pressure, sputum production, gastrointestinal symptoms, chest radiograph or CT scan suggestive of pneumonia, neutrophils, eosinophils, creatinine | 292 (49) | Leave-one-out cross validation | Not applicable | C index 0.88 (0.83 to 0.93) | High |
| Sun et al | Sex, covid-19 case contact, travel to Wuhan, travel to China, temperature, heart rate, respiration rate, diastolic blood pressure, sore throat, sputum production, gastrointestinal symptoms, chest radiograph or CT scan suggestive of pneumonia, neutrophils, eosinophils, creatinine, sodium | 292 (49) | Leave-one-out cross validation | Not applicable | C index 0.91 (0.86 to 0.96) | High |
| Wang et | Epidemiological history, wedge shaped or fan shaped lesion parallel to or near the pleura, bilateral lower lobes, ground glass opacities, crazy paving pattern, white blood cell count | 178 (69) | External validation | 116 (68) | C index 0.85, calibration slope 0.56 | High |
| Wu et al | Lactate dehydrogenase, calcium, creatinine, total protein, total bilirubin, basophil, platelet distribution width, kalium, magnesium, creatinine kinase isoenzyme, glucose | 108 (12) | Training test split | 107 (61) | C index 0.99, sensitivity 100, specificity 94 | High |
| Zhou et al | Age, sex, onset-admission time, high blood pressure, diabetes, CHD, COPD, white blood cell counts, lymphocyte, neutrophils, alanine transaminase, aspartate aminotransferase, serum albumin, serum creatinine, blood urea nitrogen, CRP | 250 (79) | Training test split | 127 (38) | C index 0.88 (0.94 to 0.92), sensitivity 89, specificity 74 | High |
|
| ||||||
| Original review | ||||||
| Barstugan et al | Not applicable | 53 (not applicable) | Cross validation | Not applicable | Sensitivity 93, specificity 100 | High |
| Chen et al | Not applicable | 106 (51) | Training test split | 27 (11) | Sensitivity 100, specificity 82 | High |
| Gozes et al | Not applicable | 50 (unknown) | External validation with Chinese cases and US controls | Unclear | C index 0.996 (0.989 to 1.000) | High |
| Jin et al | Not applicable | 416 (196) | Training test split | 1255 (183) | C index 0.98, sensitivity 94, specificity 95 | High |
| Jin et al | Not applicable | 1136 (723) | Training test split | 282 (154) | C index: 0.99, sensitivity 97, specificity 92 | High |
| Li et al | Not applicable | 2969 (400) | Training test split | 353 (68) | C index 0.96 (0.94 to 0.99), sensitivity 90 (83 to 94), specificity 96 (93 to 98) | High |
| Shan et al | Not applicable | 249 (not applicable) | Training test split | 300 (not applicable) | Dice similarity coefficient 91.6%** | High |
| Shi et al | 5 categories of location features from imaging: volume, number, histogram, surface, radiomics | 2685 (1658) | Fivefold cross validation | Not applicable | C index 0.94 | High |
| Wang et al | Not applicable | 259 (79) | Internal, other images from same people | Not applicable | C index 0.81 (0.71 to 0.84), sensitivity 83, specificity 67 | High |
| Xu et al | Not applicable | 509 (110) | Training test split | 90 (30) | Sensitivity 87, PPV 81 | High |
| Song et al | Not applicable | 123 (61) | Training test split | 51 (27) | C index 0.99 | High |
| Song et al | Not applicable | 131 (61) | Training test split | 57 (27) | C index 0.96 | High |
| Zheng et al | Not applicable | Unknown | Temporal validation | Unknown | C index 0.96 | High |
| Update 1 | ||||||
| Abbas et al | Not applicable | 137 (unknown) | Training test split | 59 (unknown) | C index 0.94, sensitivity 98, specificity 92 | High |
| Apostolopoulos et al | Not applicable | 1427 (224) | 10-fold cross validation | Not applicable | Sensitivity 99, specificity 97 | High |
| Bukhari et al | Not applicable | 223 (unknown) | Training test split | 61 (17) | Sensitivity 98, PPV 91 | High |
| Chaganti et al | Not applicable | 631 (not applicable) | Training test split | 100 (not applicable) | Correlation§§ 0.98 | High |
| Chaganti et al | Not applicable | 631 (not applicable) | Training test split | 100 (not applicable) | Correlation§§ 0.98 | High |
| Chaganti et al | Not applicable | 631 (not applicable) | Training test split | 100 (not applicable) | Correlation§§ 0.97 | High |
| Chaganti et al | Not applicable | 631 (not applicable) | Training test split | 100 (not applicable) | Correlation§§ 0.97 | High |
| Chowdhury et al | Not applicable | Unknown | Fivefold cross validation | Not applicable | C index 0.99 | High |
| Chowdhury et al | Not applicable | Unknown | Fivefold cross validation | Not applicable | C index 0.98 | High |
| Chowdhury et al | Not applicable | Unknown | Fivefold cross validation | Not applicable | C index 0.998 | High |
| Chowdhury et al | Not applicable | Unknown | Fivefold cross validation | Not applicable | C index 0.99 | High |
| Fu et al | Not applicable | 610 (100) | External validation | 309 (50) | C index 0.99, sensitivity 97, specificity 99 | High |
| Gozes et al | Not applicable | 50 (unknown) | External validation | 199 (109) | C index 0.95 (0.91 to 0.99) | High |
| Imran et al | Not applicable | 357 (48) | Twofold cross validation | Not applicable | Sensitivity 90, specificity 81 | High |
| Li et al | Severity score based on CT scans | Not applicable | External validation of existing score | 78 (not applicable) | C index 0.92 (0.84 to 0.99) | High |
| Li et al | Not applicable | 360 (120) | Training test split | 135 (45) | C index 0.97 | High |
| Hassanien et al | Not applicable | Unknown | Training test split | Unknown | Sensitivity 95, specificity 100 | High |
| Tang et al | Not applicable | 176 (55) | Threefold cross validation | Not applicable | C index 0.91, sensitivity 93, specificity 75 | High |
| Wang et al | Not applicable | 709 (560) | External validation in other centres | 508 (223) | C index (average) 0.87 | High |
| Zhang et al | Not applicable | 1078 (70) | Twofold cross validation | Not applicable | C index 0.95, sensitivity 96, specificity 71 | High |
| Zhou et al | Not applicable | 191 (35) | External validation in other centres | 107 (57) | C index 0.92, sensitivity 83, specificity 86 | High |
|
| ||||||
| Original review | ||||||
| Bai et al | Combination of demographics, signs and symptoms, laboratory results and features derived from CT images | 133 (54) | Unclear | Not applicable | C index 0.95 (0.94 to 0.97) | High |
| Caramelo et al | Age, sex, presence of any comorbidity (hypertension, diabetes, cardiovascular disease, chronic respiratory disease, cancer)†† | Unknown | Not reported | Not applicable | Not reported | High |
| Gong et al | Age, serum LDH, CRP, variation of red blood cell distribution width, blood urea nitrogen, albumin, direct bilirubin | 189 (28) | External validation (two centres) | 165 (40) and 18 (4) | Centre 1: C index 0.85 (0.79 to 0.92), sensitivity 78, specificity 78; centre 2: sensitivity 75, specificity 100 | High |
| Lu et al | Age, CRP | 577 (44) | Not reported | Not applicable | Not reported | High |
| Qi et al | 6 features derived from CT images‡‡ (logistic regression model) | 26 (20) | 5 fold cross validation | Not applicable | C index 0.92 | High |
| Qi et al | 6 features derived from CT images‡‡ (random forest) | 26 (20) | 5 fold cross validation | Not applicable | C index 0.96 | High |
| Shi et al | Age (dichotomised), sex, hypertension | 478 (49) | Validation in less severe cases | 66 (15) | Not reported | High |
| Xie et al | Age, LDH, lymphocyte count, SPO2 | 299 (155) | External validation (other Chinese centre) | 130 (69) | C index 0.98 (0.96 to 1.00), calibration slope 2.5 (1.7 to 3.7) | High |
| Yan et al | LDH, lymphocyte count, high sensitivity CRP | 375 (174) | Temporal validation, selecting only severe cases | 29 (17) | Sensitivity 92, PPV 95 | High |
| Yuan et al | Clinical scorings of CT images (zone, left/right, location, attenuation, distribution of affected parenchyma) | Not applicable | External validation of existing model | 27 (10) | C index 0.90 (0.87 to 0.93) | High |
| Update 1 | ||||||
| Huang et al | Underlying diseases, fast respiratory rate >24/min, elevated CRP level (>10 mg/dL), elevated LDH level (>250 U/L) | 125 (32) | Apparent performance only | Not applicable | C index 0.99 (0.97 to 1.00), sensitivity 0.91, specificity 0.96 | High |
| Pourhomayoun et al | Unknown | Unknown | 10-fold cross validation | Not applicable | C index 0.96, sensitivity 90, specificity 0.97 | High |
| Sarkar et al | Age, days from symptom onset to hospitalisation, from Wuhan, sex, visit to Wuhan | 80 (37) | Apparent performance only | Not applicable | C index 0.97 | High |
| Wang et al | Age and CT features | 301 (not applicable) | Not reported | Not applicable | Not reported | High |
| Zeng et al | CT features | 338 (76) | Cross validation (number of folds unclear) | Not applicable | C index 0.88 | High |
| Zeng et al | CT features and laboratory markers | 338 (76) | Cross validation (number of folds unclear) | Not applicable | C index 0.88 | High |
CHD=coronary heart disease; COPD=chronic obstructive pulmonary disease; covid-19=coronavirus disease 2019; CRP=C reactive protein; CT=computed tomography; LDH=lactate dehydrogenase; NPV=negative predictive value; PPV=positive predictive value; PROBAST=prediction model risk of bias assessment tool; SPO2=oxygen saturation.
Performance is given for the strongest form of validation reported. This is indicated in the column “type of validation.” When a training test split was used, performance on the test set is reported. Apparent performance is the performance observed in the development data.
Proxy events used: pneumonia (except from tuberculosis), influenza, acute bronchitis, or other specified upper respiratory tract infections (no patients with covid-19 pneumonia in data).
Calibration plot presented, but unclear which data were used.
The development set contains scans from Chinese patients, the testing set contains scans from Chinese cases and controls, and US controls.
Data contain mixed cases and controls. Chinese data and controls from US and Switzerland.
Describes similarity between segmentation of the CT scan by a medical doctor and automated segmentation.
Outcome and predictor data were simulated.
Wavelet-HLH_gldm_SmallDependenceLowGrayLevelEmphasis, wavelet-LHH_glcm_Correlation, wavelet-LHL_glszm_GrayLevelVariance, wavelet-LLH_glszm_SizeZoneNonUniformityNormalized, wavelet-LLH_glszm_SmallAreaEmphasis, wavelet-LLH_glcm_Correlation. §§Pearson correlation between the predicted and ground truth scores for patients with lung abnormalities.
Risk of bias assessment (using PROBAST) based on four domains across 51 studies that created prediction models for coronavirus disease 2019
| Authors | Risk of bias | |||
|---|---|---|---|---|
| Participants | Predictors | Outcome | Analysis | |
|
| ||||
| DeCaprio et al | High | Low | High | High |
|
| ||||
| Original review | ||||
| Feng et al | Low | Unclear | High | High |
| Lopez-Rincon et al | Unclear | Low | Low | High |
| Meng et al | High | Low | High | High |
| Song et al | High | Unclear | Low | High |
| Yu et al | Unclear | Unclear | Unclear | High |
| Update 1 | ||||
| Martin et al | High | High | High | High |
| Sun et al | Low | Low | Unclear | High |
| Wang et al | Low | Unclear | Unclear | High |
| Wu et al | High | Unclear | Low | High |
| Zhou et al | Unclear | Low | High | High |
|
| ||||
| Original review | ||||
| Barstugan et al | Unclear | Unclear | Unclear | High |
| Chen et al | High | Unclear | Low | High* |
| Gozes et al | Unclear | Unclear | High | High |
| Jin et al | High | Unclear | Unclear | High† |
| Jin et al | High | Unclear | High | High* |
| Li et al | Low | Unclear | Low | High |
| Shan et al | Unclear | Unclear | High | High† |
| Shi et al | High | Unclear | Low | High |
| Wang et al | High | Unclear | Low | High |
| Xu et al | High | Unclear | High | High |
| Song et al | Unclear | Unclear | Low | High |
| Zheng et al | Unclear | Unclear | High | High |
| Update 1 | ||||
| Abbas et al | High | Unclear | Unclear | High |
| Apostolopoulos et al | High | Unclear | High | High |
| Bukhari et al | Unclear | Unclear | Unclear | High |
| Chaganti et al | High | Unclear | Low | Unclear |
| Chowdhury et al | High | Unclear | Unclear | High |
| Fu et al | High | Unclear | Unclear | High |
| Gozes et al | High | Unclear | Unclear | High |
| Imran et al | High | Unclear | Unclear | High* |
| Li et al | Low | Low | Unclear | High |
| Li et al | High | Unclear | High | High* |
| Hassanien et al | Unclear | Unclear | Unclear | High* |
| Tang et al | Unclear | Unclear | High | High |
| Wang et al | Low | Unclear | Unclear | High |
| Zhang et al | High | Unclear | High | High |
| Zhou et al | High | Unclear | High | High* |
|
| ||||
| Original review | ||||
| Bai et al | Low | Unclear | Unclear | High |
| Caramelo et al | High | High | High | High |
| Gong et al | Low | Unclear | Unclear | High |
| Lu et al | Low | Low | Low | High |
| Qi et al | Unclear | Low | Low | High |
| Shi et al | High | High | High | High |
| Xie et al | Low | Low | Low | High |
| Yan et al | Low | High | Low | High |
| Yuan et al | Low | High | Low | High |
| Update 1 | ||||
| Huang et al | Unclear | Unclear | Unclear | High |
| Pourhomayoun et al | Low | Low | Unclear | High |
| Sarkar et al | High | High | High | High |
| Wang et al | Low | Low | Low | High |
| Zeng et al | Low | Low | Low | High |
PROBAST=prediction model risk of bias assessment tool.
Risk of bias high owing to calibration not being evaluated. If this criterion is not taken into account, analysis risk of bias would have been unclear.
Risk of bias high owing to calibration not being evaluated. If this criterion is not taken into account, analysis risk of bias would have been low.