| Literature DB >> 35864468 |
Jordan H Chamberlin1, Gilberto Aquino1, Sophia Nance1, Andrew Wortham1, Nathan Leaphart1, Namrata Paladugu1, Sean Brady1, Henry Baird1, Matthew Fiegel1, Logan Fitzpatrick1, Madison Kocher1, Florin Ghesu2, Awais Mansoor2, Philipp Hoelzer2, Mathis Zimmermann2, W Ennis James3, D Jameson Dennis3, Brian A Houston4, Ismail M Kabakus1, Dhiraj Baruah1, U Joseph Schoepf1, Jeremy R Burt5,6.
Abstract
BACKGROUND: Airspace disease as seen on chest X-rays is an important point in triage for patients initially presenting to the emergency department with suspected COVID-19 infection. The purpose of this study is to evaluate a previously trained interpretable deep learning algorithm for the diagnosis and prognosis of COVID-19 pneumonia from chest X-rays obtained in the ED.Entities:
Keywords: COVID-19; Critical care; Deep learning; Pulmonology; Radiology
Mesh:
Year: 2022 PMID: 35864468 PMCID: PMC9301895 DOI: 10.1186/s12879-022-07617-7
Source DB: PubMed Journal: BMC Infect Dis ISSN: 1471-2334 Impact factor: 3.667
Fig. 1Flow-diagram describing inclusion of patients for COVID-19 training and test datasets. 23,805 X-rays were queried and ultimately 2488 met criteria of a documented COVID-19 test within 14 days of an eligible PA or AP CXR. 2000 were used in the training cohort with 488 retained as internal holdout for validation. Missing data from 32 patients, defined as images that failed the AI segmentation due to poor imaging quality, were excluded
Fig. 2Visual representation of neural network annotations and outputs. A AP portable CXR with left lower lobe airspace opacities scored a 4/10 by the dCNN. EKG leads overlie the chest bilaterally. B Upright portable AP view CXR with bilateral airspace opacities scored an 8/10 by the dCNN. Dual chamber pacemaker with atrial and ventricular leads overlies the left chest. C dCNNs architecture used for classification and detection of airspace opacities. A ResNet backbone for the image anatomy feeds forward into a voxel feature pyramid which is then forwarded to a convolutional network-based detector for classification of the airspace opacity. A detailed description of the architecture can be found in the materials and methods under Deep Convolutional Neural Network Algorithm
Demographics and clinical variables of test cohort patients stratified by SARS-CoV-2 RT-PCR results
| Variables | RT-PCR Positive (N = 236) | RT-PCR Negative (N = 220) | ||
|---|---|---|---|---|
| Mean | SD | Mean | SD | |
| Age (years) | 55.3 | 17 | 49.2 | 16.3 |
| BMI kg/m2 | 31.6 | 8.5 | 27.7 | 7.4 |
| CXR–PCR Interval (days) | 3.4 | 3.8 | 3.1 | 14.4 |
SARS-CoV-2: Severe acute respiratory syndrome coronavirus 2; RT-PCR: Reverse transcription polymerase chain reaction; SD: Standard deviation; BMI: Body mass index; CXR: Chest X-ray; COPD: Chronic obstructive pulmonary disease; HTN: Hypertension; HIV: Human immunodeficiency virus
Fig. 3Prediction of Positive SARS-CoV-2 PCR by extent of AI-determined airspace disease. A Logistic probability plot of positive SARS-CoV-2 PCR as a function of AI-determined airspace extent. Median airspace extent (40%) had just under 50% probability of a concurrent positive PCR. McFadden R2 = 0.412. B ROC curve for prediction of SARS-CoV-2 PCR positivity in comparison to radiologist impression of airspace extent. Radiologist (AUC = 0.936, 95% CI 0.918–0.960) and AI (AUC = 0.890, 95% CI 0.861–0.920) annotations were both highly accurate
Fig. 4Comparison of differences between AI and Radiologist measurement of airspace opacity extent. A Airspace opacity extent percentage as a function of observer. Adjusted R2 = 0.656; Spearman ρ = 0.797. Overall agreement is considered excellent for positive cases (single fixed raters ICC = 0.810, 95% CI 0.765–0.840). Agreement for all cases is considered excellent (single fixed raters ICC = 0.820, 95% CI 0.790–0.840). B Bland–Altman plot for difference of methods. Mean difference -22.4%; SE 21.1%. C Confusion matrix for discrete scores compared between expert and AI. Weighted macro F1 score for categorical agreement is 0.157
Diagnostic performance of empirically derived threshold models for SARS-CoV-2 RT-PCR Positivity
| Accuracy | Sensitivity | Specificity | PPV | NPV | |
|---|---|---|---|---|---|
| Metric | |||||
| 0.792 (0.735–0.842) | 0.850 (0.791–0.891) | 0.850 (0.799–0.894) | 0.792 (0.740–0.843) | ||
| > 10% | 0.776 (0.738–0.815) | 0.646 (0.578–0.709) | 0.731 (0.680–0.782) | ||
| > 80% | 0.774 (0.736–0.813) | 0.593 (0.528–0.656) | 0.689 (0.638–0.741) | ||
| Most accurate model (AI airspace opacity severity ≥ 40%) | |||||
| False Positive Rate | 0.155 (0.107–0.202) | LR+ | 5.13 (3.74–7.03) | RR | 4.06 (3.15–5.23) |
| False Negative Rate | 0.208 (0.156–0.259) | LR− | 0.246 (0.190–0.317) | OR | 20.9 (12.9–33.7) |
Bolded values indicate highest values for each category
SARS-CoV-2: severe acute respiratory syndrome coronavirus 2; RT-PCR: reverse transcription polymerase chain reaction; PPV: positive predictive value; NPV: negative predictive value; LR: likelihood ratio; RR: relative risk; OR: odds ratio
Association of AI-ASOS with clinical outcomes amongst patients stratified by SARS-CoV-2 RT-PCR
| Outcome | N | SARS-CoV-2 (+) | N | SARS-CoV-2 (−) | P | ||
|---|---|---|---|---|---|---|---|
| Mean ASOS | SD | Mean ASOS | SD | ||||
| Hospitalization | 175 | 5.4 | 4.0 | 124 | 2.7 | 3.3 | < 0.001 |
| ICU admit | 120 | 8.3 | 2.5 | 10 | 3.0 | 3.4 | < 0.001 |
| Intubation | 88 | 7.7 | 3.3 | 17 | 3.6 | 3.7 | < 0.001 |
| ARDS | 115 | 8.8 | 1.9 | 1 | 3.0 | 3.4 | < 0.001 |
| Mortality | 53 | 8.6 | 2.2 | 2 | 3.9 | 3.8 | < 0.001 |
| Pulmonary mortality | 47 | 8.9 | 1.9 | 0 | –- | –- | – |
SARS-CoV-2: Severe acute respiratory syndrome coronavirus 2; RT-PCR: Reverse transcription polymerase chain reaction; ASOS: Airspace Opacity Severity Score; SD: Standard deviation; ICU: Intensive care unit; ARDS: acute respiratory distress syndrome
Fig. 5Prediction of outcomes by use of AI-determined airspace opacity extent (AI-ASOS) using simple logistic regression. A Prediction of outcomes in all patients. AI-ASOS is best at predicting ICU admission (AUC = 0.870, 95% CI 0.834–0.904) and pulmonary mortality (AUC = 0.845, 95% CI 0.802–0.888). B Prediction of outcomes statistics amongst all patients using a multivariate empirically derived model of additional clinical risk factors. Use of AI-ASOS, age, and BMI had a high accuracy for prediction of mortality statistics and ICU admission (AUC = 0.906, 0.896, and 0.880, respectively)
Logistic regression model parameters and predictive intervals for AI severity scores alone and with age + BMI
| McFadden R2 | OR Score (95% CI) | AUC (95% CI) | |
|---|---|---|---|
| Hospitalization | 0.082 | 1.20 (1.14–1.27) | 0.687 (0.639–0.735) |
| ICU admission | 0.336 | 1.57 (1.45–1.72) | 0.869 (0.834–0.934) |
| Intubation | 0.186 | 1.36 (1.26–1.46) | 0.791 (0.742–0.840) |
| Mortality | 0.226 | 1.51 (1.34–1.73) | 0.829 (0.782–0.876) |
| Pulmonary mortality | 0.244 | 1.59 (1.39–1.90) | 0.845 (0.802–0.888) |
| Hospitalization | 0.153 | 1.22 (1.14–1.31) | 0.758 (0.710–0.806) |
| ICU admission | 0.359 | 1.59 (1.45–1.75) | 0.880 (0.845–0.915) |
| Intubation | 0.202 | 1.36 (1.26–1.48) | 0.806 (0.759–0.853) |
| Mortality | 0.369 | 1.55 (1.35–1.84) | 0.906 (0.873–0.939) |
| Pulmonary mortality | 0.331 | 1.55 (1.33–1.85) | 0.896 (0.860–0.932) |
AI: Artificial Intelligence; BMI: Body Mass Index (kg/m2); OR: Odds ratio; AUC: Area under curve; CI: Confidence Interval; ICU: Intensive care unit
Fig. 6Probabilities of outcomes as a function of AI-determined airspace opacity extent (AI-ASOS). A Probability of ICU admission. 50% airspace opacity extent (AI-ASOS = 5) confers a ~ 20% chance of ICU admission. B Probability of pulmonary death. Risk of pulmonary death begins increasing at roughly 50% airspace opacity extent (AI-ASOS = 5)