| Literature DB >> 33230503 |
Francisco Dorr1, Hernán Chaves1,2, María Mercedes Serra1,2, Andrés Ramirez1, Martín Elías Costa1, Joaquín Seia1, Claudia Cejas2, Marcelo Castro3, Eduardo Eyheremendy4, Diego Fernández Slezak1,5,6, Mauricio F Farez1,7.
Abstract
PURPOSE: To investigate the diagnostic performance of an Artificial Intelligence (AI) system for detection of COVID-19 in chest radiographs (CXR), and compare results to those of physicians working alone, or with AI support.Entities:
Keywords: AI, artificial intelligence; AUPR, area under the precision-recall; AUROC, area under the receiver operating characteristic; Artificial intelligence; COVID-19; CT, computed tomography; CXR, chest radiographs; Chest; DL, deep learning; Diagnostic performance; RT-PCR, real-time reverse transcriptase–polymerase chain reaction; Radiography
Year: 2020 PMID: 33230503 PMCID: PMC7674009 DOI: 10.1016/j.ibmed.2020.100014
Source DB: PubMed Journal: Intell Based Med ISSN: 2666-5212
Fig. 1Convolutional Neural Network Diagram. This chart summarizes the strategy used in the study. Using a convolutional neural network, pre-trained with a dataset of over 200,000 CXRs and 5 output classes; all layers but the last block of layers were frozen and transferred onto a new network with new labels (COVID-19 pneumonia, Other pneumonias, Normal/Other findings). Final fully-connected layers were then retrained over the transferred ones.
Fig. 2Performance of the Artificial Intelligence (AI) System in COVID-19 Prediction. Receiver operating characteristic (ROC) curve and area under the curve (AUC) of the AI system on the validation set for each of the 5 folds, with a mean area under the receiver operating characteristic (AUROC) curve of 0.96 ± 0.02, n = 302).
Performance of the AI system in the training dataset using the average of 5-fold cross-validation.
| Diagnosis | Sensitivity | Specificity | AUROC |
|---|---|---|---|
| Covid-19 pneumonia (n = 102) | 94% | 81% | 0.96 |
| Non-Covid-19 pneumonia (n = 100) | 55% | 95% | 0.87 |
| Other (n = 100) | 84% | 91% | 0.93 |
Fig. 3Activation Maps of the Artificial Intelligence (AI) System. a) Example of a single activation map on a CXR image from the COVID-19 group. b) Mean activation map of Non-COVID-19 pneumonia category. c) Mean activation map of COVID-19 pneumonia category. d) Delta activation map between COVID-19 and Non-COVID-19 pneumonia categories calculated by , 0) for each pixel (i,j), depicting lower and peripheral areas as more relevant for the differentiation.
Performance of the AI system in the test dataset.
| Diagnosis | Sensitivity | Specificity | AUROC | F1 score | Brier score | MAE |
|---|---|---|---|---|---|---|
| Covid-19 pneumonia (n = 20) | 80% | 80% | 0.84 | 0.73 | 0.16 | 0.28 |
| Non-Covid-19 pneumonia (n = 20) | 60% | 90% | 0.88 | 0.67 | 0.14 | 0.26 |
| Other (n = 20) | 65% | 83% | 0.86 | 0.65 | 0.15 | 0.26 |
AI: artificial intelligence, AUROC: area under the receiver operating characteristics, MAE: mean absolute error.
Fig. 4Performance of the Artificial Intelligence (AI) System on the Train and Test Sets, Compared to the Performance of Physicians in COVID-19 Prediction. Receiver operating characteristic (ROC) curve and area under the curve (AUC) of the AI system on the train and test sets. Physician performance with and without AI support is compared.