| Literature DB >> 34698988 |
Francesca Lizzi1,2, Abramo Agosti3, Francesca Brero4,5, Raffaella Fiamma Cabini4,3, Maria Evelina Fantacci6,7, Silvia Figini4,8, Alessandro Lascialfari4,5, Francesco Laruina9,6, Piernicola Oliva10,11, Stefano Piffer12,13, Ian Postuma4, Lisa Rinaldi4,5, Cinzia Talamonti12,13, Alessandra Retico6.
Abstract
PURPOSE: This study aims at exploiting artificial intelligence (AI) for the identification, segmentation and quantification of COVID-19 pulmonary lesions. The limited data availability and the annotation quality are relevant factors in training AI-methods. We investigated the effects of using multiple datasets, heterogeneously populated and annotated according to different criteria.Entities:
Keywords: COVID-19; Chest Computed Tomography; Ground-glass opacities; Machine Learning; Segmentation; U-net
Mesh:
Year: 2021 PMID: 34698988 PMCID: PMC8547130 DOI: 10.1007/s11548-021-02501-2
Source DB: PubMed Journal: Int J Comput Assist Radiol Surg ISSN: 1861-6410 Impact factor: 2.924
A summary of the datasets used in this study. The CT Severity Score (CT-SS) information is not available for all datasets, but it can be computed for data which has both lung masks and ground-glass opacification (GGO) masks
| Dataset name | Lung | GGO | CT-SS | N. of |
|---|---|---|---|---|
| mask | mask | cases | ||
| Plethora [ | Yes | No | No | 402 |
| Lung CT Segmentation Challenge [ | Yes | No | No | 60 |
| COVID-19 Challenge [ | No | Yes | No | 199 |
| MosMed [ | No | No | No | 1110 |
| MosMed (annotated subsample) | No | Yes | Inferable | 50 |
| MosMed (in-house annotated subsample) | Yes | No | No | 91 |
| COVID-19-CT-Seg [ | Yes | Yes | Inferable | 10 |
Fig. 1A summary of the whole analysis pipeline: the input CT scans are used to train U-net, which is devoted to lung segmentation; its output is refined by a morphology-based method. A bounding box containing the segmented lungs is made and applied to all CT scans for training U-net, which is devoted to COVID-19 lesion segmentation. Finally, the output of U-net is the definitive COVID-19 lesion mask, whereas the definitive lung mask is obtained as the union between the outputs of U-net and U-net. The ratio between the COVID-19 lesion mask and the lung mask provides the CT-SS for each patient
Fig. 2U-net scheme: the neural network is made of 6 levels of depth. In the compression path (left), the input is processed through convolutions, activation layers (ReLu) and instance normalization layers, while in the decompression one (right), in addition to those already mentioned, 3D Transpose Convolution (de-convolution) layers are also introduced
Number of CT scans assigned to the train, validation (val) and test sets used during the training and performance assessment of the U-net and the U-net networks
| Train | Val | Test | |
|---|---|---|---|
| Plethora | 319 | 40 | 40 |
| MosMed (91 CT-0) | 55 | 18 | 18 |
| LCTSC | 36 | 12 | 12 |
| COVID-19-CT-Seg | – | – | 10 |
Performances achieved by U-net in lung segmentation on different test sets, evaluated in terms of the vDSC at three successive stages of the segmentation procedure
| Test set | Masks of U-net size | Masks before refinement | Masks after refinement |
|---|---|---|---|
| vDSC | vDSC | vDSC | |
| Plethora | 0.96 ± 0.02 | 0.95 ± 0.02 | 0.95 ± 0.04 |
| MosMed | 0.97 ± 0.02 | 0.97 ± 0.02 | 0.97 ± 0.02 |
| LCTSC | 0.96 ± 0.03 | 0.95 ± 0.03 | 0.96 ± 0.01 |
| COVID-19-CT-Seg | 0.96 ± 0.01 | 0.95 ± 0.01 | 0.95 ± 0.01 |
Performances achieved by U-net in COVID-19 lesion segmentation, evaluated in terms of the vDSC
| U-net | Trained on | Test set | U-net size | Original CT size |
|---|---|---|---|---|
| (vDSC) | (vDSC) | |||
| U-net | COVID-19 challenge | COVID-19 challenge | 0.51 ± 0.24 | 0.51 ± 0.25 |
| COVID-19 Challenge | MosMed | 0.39 ± 0.19 | 0.40 ± 0.19 | |
| MosMed | MosMed | 0.54 ± 0.22 | 0.55 ± 0.22 | |
| MosMed | COVID-19 challenge | 0.25 ± 0.23 | 0.25 ± 0.23 | |
| COVID-19 challenge | COVID-19 challenge | 0.49 ± 0.21 | 0.50 ± 0.21 | |
| + MosMed | + MosMed | |||
| U-net | COVID-19 challenge | COVID-19 challenge | 0.64 ± 0.23 | 0.65 ± 0.23 |
| + MosMed | + MosMed |
The composition of the train and test sets is reported in Table 2
Performances of the LungQuant system on the independent COVID-19-CT-Seg test dataset. The vDSC and sDSC computed between the lung and lesion reference masks and those predicted by the LunQuant system are reported
| Metrics | Lung segmentation | |||
|---|---|---|---|---|
| vDSC | sDSC (1 mm) | sDSC (5 mm) | sDSC (10 mm) | |
| 0.96 ± 0.01 | 0.66 ± 0.09 | 0.95 ± 0.02 | 0.98 ± 0.01 | |
| 0.95 ± 0.01 | 0.65 ± 0.09 | 0.95 ± 0.02 | 0.98 ± 0.01 | |
| Infection Segmentation | ||||
| 0.62 ± 0.09 | 0.29 ± 0.06 | 0.75 ± 0.11 | 0.90 ± 0.09 | |
| 0.66 ± 0.13 | 0.36 ± 0.13 | 0.76 ± 0.18 | 0.87 ± 0.13 | |
Fig. 3On the rows: three axial slices of the first CT scan on the COVID-19-CT-Seg test dataset (coronacases001.nii) are shown. On the columns: original images (left); overlays between the predicted and the reference lung (centre) and COVID-19 lesion (right) masks. The reference masks are in green, while the predicted ones, obtained by the LungQuant system integrating U-net,are in blue
Fig. 4Estimated percentages P of affected lung volume versus the ground truth percentages, as obtained by the LungQuant system integrating U-net (left) and U-net (right). The grey areas in the plot backgrounds guide the eye to recognize the CT-SS values assigned to each value of P (from left to right: CT-SS 1, CT-SS 2, CT-SS 3)
Classification performances of the whole system in predicting CT Severity Score on MosMed and COVID-19-CT-Seg datasets. The number of misclassified cases is reported
| U-net | Dataset | Accuracy | Misclassified | Misclassified |
|---|---|---|---|---|
| by 1 class | by 2 classes | |||
| U-net | COVID-19-CT-Seg | 6/10 | 4/10 | 0 |
| U-net | COVID-19-CT-Seg | 9/10 | 1/10 | 0 |