| Literature DB >> 32796848 |
Stephanie A Harmon1,2, Thomas H Sanford3, Sheng Xu4, Evrim B Turkbey5, Holger Roth6, Ziyue Xu6, Dong Yang6, Andriy Myronenko6, Victoria Anderson4, Amel Amalou4, Maxime Blain4, Michael Kassin4, Dilara Long4, Nicole Varble4,7, Stephanie M Walker1, Ulas Bagci8, Anna Maria Ierardi9, Elvira Stellato9, Guido Giovanni Plensich9, Giuseppe Franceschelli10, Cristiano Girlando11, Giovanni Irmici11, Dominic Labella3, Dima Hammoud5, Ashkan Malayeri5, Elizabeth Jones5, Ronald M Summers5, Peter L Choyke1, Daguang Xu6, Mona Flores6, Kaku Tamura12, Hirofumi Obinata12, Hitoshi Mori12, Francesca Patella10, Maurizio Cariati10, Gianpaolo Carrafiello9,13, Peng An14, Bradford J Wood15, Baris Turkbey16.
Abstract
Chest CT is emerging as a valuable diagnostic tool for clinical management of COVID-19 associated lung disease. Artificial intelligence (AI) has the potential to aid in rapid evaluation of CT scans for differentiation of COVID-19 findings from other clinical entities. Here we show that a series of deep learning algorithms, trained in a diverse multinational cohort of 1280 patients to localize parietal pleura/lung parenchyma followed by classification of COVID-19 pneumonia, can achieve up to 90.8% accuracy, with 84% sensitivity and 93% specificity, as evaluated in an independent test set (not included in training and validation) of 1337 patients. Normal controls included chest CTs from oncology, emergency, and pneumonia-related indications. The false positive rate in 140 patients with laboratory confirmed other (non COVID-19) pneumonias was 10%. AI-based algorithms can readily identify CT scans with COVID-19 associated pneumonia, as well as distinguish non-COVID related pneumonias with high specificity in diverse patient populations.Entities:
Mesh:
Year: 2020 PMID: 32796848 PMCID: PMC7429815 DOI: 10.1038/s41467-020-17971-2
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Patient cohorts utilized in model development and testing. Demographic values are reported as absolute numbers for patient sex and as median (range) for patient age.
| Disease cohort | Center | Demographics | Training | Validation | Testing |
|---|---|---|---|---|---|
| COVID-19 | Hubei, China | 363 Male, 353 female Median 49 (18a–92) | 369 Scans | 122 Scans | 207 Scans |
| Milan, Italy | 220 Male, 90 female Median 60 (18–96) | 57 Scans | 24 Scans | 54 Scans | |
| Tokyo, Japan | 91 Male, 60 female Median 60 (4–87) | 100 Scans | 31 Scans | 49 Scans | |
| Milan, Italy | 10 Male, 5 female Median 55 (31–85) | – | – | 15 Scans | |
| Syracuse, NY, USA | bSee footnote | – | – | 1 Scan | |
| Any clinical indication | Syracuse, NY, USA | 437 Male, 534 female Median 65 (19–100) | 356 Scans | 93 Scans | 500 Scans |
| Cancer diagnosis and/or staging | LIDC[ | N/A | 149 Scans | 50 Scans | 271 Scans |
| NIH, USA | 100 Male Median 69 (30–89) | – | – | 100 Scans | |
| Pneumonia | Syracuse, NY, USA | 73 Male, 42 female Median 66 (13–101) | – | – | 140 Scans |
| NIH, USA | 28 Male, 8 female Median 21 (4–71) | 28 Scans | 8 Scans | – | |
| Total |
aAge was not readily available for all Hubei, China patients.
bDemographics for COVID-19 diagnosis from SUNY is included in all-comer/any clinical indication grouping.
Fig. 13D classification workflow.
All CT images under lung segmentation for localization to chest cavity region. Following cropping to lung region, two methods were considered for differentiation of COVID-19 from other clinical entities. a Full 3D Model resampled the cropped lung region of CT to a fixed size (192 × 192 × 64 voxels) for input to algorithm. b Hybrid CT resampled the cropped lung region of CT to fixed resolution (1mm × 1mm × 5mm) and sampled multiple 3D regions (192 × 192 × 32) for input to algorithm. At training, 6 regions/patient were used. At inference 15 regions/patient were used and results were averaged to produce final probability of COVID-19.
Performance of 3D and hybrid 3D classification models for two experimental conditions.
| Design | Model | Validation accuracy | Test summary stats | |||||
|---|---|---|---|---|---|---|---|---|
| ACC | SENS | SPEC | PPV | NPV | AUC | |||
| Original training schema | 3D | 0.917 | 0.908 | 0.840 | 0.930 | 0.794 | 0.948 | 0.949 |
| Hybrid 3D | 0.924 | 0.889 | 0.853 | 0.901 | 0.735 | 0.950 | 0.947 | |
| Independent testing population | 3D | 0.939 | 0.896 | 0.845 | 0.916 | 0.793 | 0.939 | 0.941 |
| Hybrid 3D | 0.905 | 0.895 | 0.751 | 0.951 | 0.853 | 0.909 | 0.938 | |
Original training design included 1337 patients in testing cohort (of which, n = 326 patients with COVID-19 positivity). Independent testing population design included 1397 patients in testing cohort (entire patient cohort from Tokyo, Japan excluded from training/validation), with a total of n = 386 patients with COVID-19 positivity.
ACC accuracy, SENS sensitivity, SPEC specificity, PPV positive predictive value, NPV negative predictive value, AUC area under the curve.
Fig. 2Model performance.
Receiver operating characteristic (ROC) curve for 3D and hybrid 3D classification models. Both experimental conditions are shown, with highlighted area to zoom in at upper left area of the curve. Solid lines represent original training design, dotted lines indicate independent testing population design.
Fig. 3Grad-CAM* resultant saliency maps for five representative COVID-19 patients from testing set.
All images are of correctly predicted positive by 3D model. Within the heatmap, areas of red indicate activation of the algorithm related with COVID-19 prediction. a, b Images and (f, g) associated maps from Hubei, China cohort. c Image and (h) associated map from Tokyo, Japan cohort. d Image and (i) associated map from an advanced case in Milan, Italy Center #1. Note activation in non-consolidating areas for prediction of COVID-19, indicating specific features independent of pneumonia-related consolidation are learned. e Image and (j) associated map of an advanced case in Milan, Italy Center #2. Note: case (e) represents an unseen testing center from training/validation centers. *footnote: Grad-CAM images are produced from preprocessed input data, including cropping to lung region and resizing to fixed dimension, which may result in visible changes to anatomic aspect ratio.