| Literature DB >> 32729810 |
Nikolas Lessmann1, Clara I Sánchez1, Ludo Beenen1, Luuk H Boulogne1, Monique Brink1, Erdi Calli1, Jean-Paul Charbonnier1, Ton Dofferhoff1, Wouter M van Everdingen1, Paul K Gerke1, Bram Geurts1, Hester A Gietema1, Miriam Groeneveld1, Louis van Harten1, Nils Hendrix1, Ward Hendrix1, Henkjan J Huisman1, Ivana Išgum1, Colin Jacobs1, Ruben Kluge1, Michel Kok1, Jasenko Krdzalic1, Bianca Lassen-Schmidt1, Kicky van Leeuwen1, James Meakin1, Mike Overkamp1, Tjalco van Rees Vellinga1, Eva M van Rikxoort1, Riccardo Samperna1, Cornelia Schaefer-Prokop1, Steven Schalekamp1, Ernst Th Scholten1, Cheryl Sital1, Lauran Stöger1, Jonas Teuwen1, Kiran Vaidhya Venkadesh1, Coen de Vente1, Marieke Vermaat1, Weiyi Xie1, Bram de Wilde1, Mathias Prokop1, Bram van Ginneken1.
Abstract
Background The COVID-19 pandemic has spread across the globe with alarming speed, morbidity and mortality. Immediate triage of suspected patients with chest infections caused by COVID-19 using chest CT may be of assistance when results from definitive viral testing are delayed. Purpose To develop and validate an artificial intelligence (AI) system to score the likelihood and extent of pulmonary COVID-19 on chest CT scans using the CO-RADS and CT severity scoring systems. Materials and Methods CORADS-AI consists of three deep learning algorithms that automatically segment the five pulmonary lobes, assign a CO-RADS score for the suspicion of COVID-19 and assign a CT severity score for the degree of parenchymal involvement per lobe. This study retrospectively included patients who received an unenhanced chest CT scan due to clinical suspicion of COVID-19 at two medical centers. The system was trained, validated, and tested with data from one of the centers. Data from the second center served as an external test set. Diagnostic performance and agreement with scores assigned by eight independent observers were measured using receiver operating characteristic (ROC) analysis, linearly-weighted kappa and classification accuracy. Results 105 patients (62 ± 16 years, 61 men) and 262 patients (64 ± 16 years, 154 men) were evaluated in the internal and the external test set, respectively. The system discriminated between COVID-19 positive and negative patients with areas under the ROC curve of 0.95 (95% CI: 0.91-0.98) and 0.88 (95% CI: 0.84-0.93). Agreement with the eight human observers was moderate to substantial with a mean linearly-weighted kappa of 0.60 ± 0.01 for CO-RADS scores and 0.54 ± 0.01 for CT severity scores. Conclusion CORADS-AI correctly identified COVID-19 positive patients with high diagnostic performance from chest CT exams, assigned standardized CO-RADS and CT severity scores in good agreement with eight independent observers and generalized well to external data.Entities:
Year: 2020 PMID: 32729810 PMCID: PMC7393955 DOI: 10.1148/radiol.2020202439
Source DB: PubMed Journal: Radiology ISSN: 0033-8419 Impact factor: 11.105
Figure 1.Flowchart for patient inclusion in the training and test sets. Note that n refers to the number of patients. The number of CT images is higher in the training set since several patients received multiple chest CT scans within the inclusion period. However, in the test sets, only the earliest available scan of each patient was used.
Characteristics of the Training and Test Cohorts
CO-RADS and CT Severity Scores according to the Radiological Reports or according to the Radiologist Who Reviewed Scans Initially Scored CO-RADS 6 (proven COVID-19)
Figure 2.ROC curves for automatically predicted CO-RADS 5 probability vs. COVID-19 diagnosis. The receiver operating characteristic (ROC) curve is based on the probability that the algorithm assigned to CO-RADS 5. The shaded area around the ROC curve reflects the 95% confidence interval. A, The performance of the eight observers is shown as individual points on the graph for the internal test set, and, B, the diagnostic performance of the scores from the radiological reports is shown for the external test set. Different colors indicate different cut-offs, where patients were considered predicted COVID-19 positive if the observer assigned a CO-RADS score of 5 (orange), 4 or 5 (green), 3 to 5 (magenta), or 2 to 5 (yellow). COVID-19 diagnosis meant either a positive RT-PCR test or very high clinical suspicion of COVID-19 despite at least one negative RT-PCR test.
Sensitivity of the Observers and the AI Algorithm for Identification of COVID-19 at the Specificity Levels Corresponding to Various Operating Points of the Observers
Agreement of the Observers and the AI System with the Median Score Assigned by the Remaining Seven Observers in the Internal Test Set
Figure 3.CT severity score predictions vs. median of observer scores. Shown as box plots are the distribution of the percentage of affected lung parenchyma per lobe according to the automatic lesion (affected volume) and lobe segmentations (total volume) for the internal test set. The notch in each box plot illustrates the 95% confidence interval around the median. The CT severity score cut-offs are marked on the y-axis.