| Literature DB >> 36257092 |
Mohammad Arafat Hussain1, Zahra Mirikharaji2, Mohammad Momeny3, Mahmoud Marhamati4, Ali Asghar Neshat5, Rafeef Garbi6, Ghassan Hamarneh7.
Abstract
Supervised deep learning has become a standard approach to solving medical image segmentation tasks. However, serious difficulties in attaining pixel-level annotations for sufficiently large volumetric datasets in real-life applications have highlighted the critical need for alternative approaches, such as semi-supervised learning, where model training can leverage small expert-annotated datasets to enable learning from much larger datasets without laborious annotation. Most of the semi-supervised approaches combine expert annotations and machine-generated annotations with equal weights within deep model training, despite the latter annotations being relatively unreliable and likely to affect model optimization negatively. To overcome this, we propose an active learning approach that uses an example re-weighting strategy, where machine-annotated samples are weighted (i) based on the similarity of their gradient directions of descent to those of expert-annotated data, and (ii) based on the gradient magnitude of the last layer of the deep model. Specifically, we present an active learning strategy with a query function that enables the selection of reliable and more informative samples from machine-annotated batch data generated by a noisy teacher. When validated on clinical COVID-19 CT benchmark data, our method improved the performance of pneumonia infection segmentation compared to the state of the art.Entities:
Keywords: Active learning; COVID-19; Deep learning; Noisy teacher; Pneumonia; Segmentation; Semi-supervised learning
Year: 2022 PMID: 36257092 PMCID: PMC9540707 DOI: 10.1016/j.compmedimag.2022.102127
Source DB: PubMed Journal: Comput Med Imaging Graph ISSN: 0895-6111 Impact factor: 7.422
Fig. 1Schematic diagram of the proposed example re-weighted active learning from noisy teacher approach for 3D image segmentation.
Our 3D UNet architecture. Acronyms—BN: batch normalization, Conv3D: 3D convolution, Conv3D-Res: Conv3D is used for residual connection, PReLU: parametric rectified linear unit, and I: identity connection.
| Block | Conv3D | Stride | Activation | BN | Repeat | Input | Output | # trainable |
|---|---|---|---|---|---|---|---|---|
| type | Kernel | function | size | size | parameters | |||
| Conv3D | 3 | 2 | PReLU | Yes | 2 | 96 | 48 | 897 |
| Conv3D | 3 | 1 | PReLU | Yes | 1 | 48 | 48 | 6,929 |
| Conv3D-Res | 3 | 2 | – | – | 1 | 48 | 48 | 0 |
| Conv3D | 3 | 2 | PReLU | Yes | 2 | 48 | 24 | 27,713 |
| Conv3D | 3 | 1 | PReLU | Yes | 1 | 24 | 24 | 27,681 |
| Conv3D-Res | 3 | 2 | – | – | 1 | 24 | 24 | 0 |
| Conv3D | 3 | 2 | PReLU | Yes | 2 | 24 | 12 | 110,720 |
| Conv3D | 3 | 1 | PReLU | Yes | 1 | 12 | 12 | 110,657 |
| Conv3D-Res | 3 | 2 | – | – | 1 | 12 | 12 | 0 |
| Conv3D | 3 | 2 | PReLU | Yes | 2 | 12 | 6 | 442,625 |
| Conv3D | 3 | 1 | PReLU | Yes | 1 | 6 | 6 | 442,497 |
| Conv3D-Res | 3 | 2 | – | – | 1 | 6 | 6 | 0 |
| Conv3D | 3 | 1 | PReLU | Yes | 2 | 6 | 6 | 918,017 |
| Conv3D | 3 | 1 | PReLU | Yes | 1 | 6 | 6 | 1,769,729 |
| Conv3D-Res | 1 | 1 | – | – | 1 | 6 | 6 | 0 |
| Conv3D | 3 | 2 | PReLU | Yes | 1 | 6 | 12 | 663,617 |
| Conv3D | 3 | 1 | PReLU | Yes | 1 | 12 | 12 | 110,657 |
| Conv3D-Res | I | – | – | – | 1 | 12 | 12 | 0 |
| Conv3D | 3 | 2 | PReLU | Yes | 1 | 12 | 24 | 110,621 |
| Conv3D | 3 | 1 | PReLU | Yes | 1 | 24 | 24 | 27,681 |
| Conv3D-Res | I | – | – | – | 1 | 24 | 24 | 0 |
| Conv3D | 3 | 2 | PReLU | Yes | 1 | 24 | 48 | 27,665 |
| Conv3D | 3 | 1 | PReLU | Yes | 1 | 48 | 48 | 6,929 |
| Conv3D-Res | I | – | – | – | 1 | 48 | 48 | 0 |
| Conv3D | 3 | 2 | PReLU | Yes | 1 | 48 | 96 | 1,731 |
| Conv3D | 3 | 1 | – | – | 1 | 96 | 96 | 110 |
| Conv3D-Res | I | – | – | – | 1 | 96 | 96 | 0 |
| Total | ||||||||
Represents the skip connection between the encoder and decoder sides of the network.
5-fold cross-validation performance in terms of Dice scores and Hausdorff distances using our methods implemented for segmenting COVID-19 pneumonia infection in Challenge data. The upward arrow () indicates that ‘higher is better, and the downward arrow () indicates that ‘lower is better’. Values indicated by the colors and indicate the best performance in terms of the Dice score and the Hausdorff distance, respectively. (Acronyms—RGS: sample re-weighting based on gradient similarity only, RGSAL: sample re-weighting based on gradient similarity followed by active learning, RGS&MAL: sample re-weighting based on both gradient similarity and last layer gradient magnitude followed by active learning, Met: metrics, DS: Dice score, HD: Hausdorff distance.)
4-fold cross-validation performance in terms of Dice scores and Hausdorff distances by our implemented methods for segmenting pneumonia infection in the Benchmark data. The upward arrow () indicates that higher is better, and the downward arrow () indicates that lower is better. The values indicated by the colors and indicate the best performance in terms of dice score and Hausdorff distance, respectively. (Acronyms—RGS: sample re-weighting based on gradient similarity only, RGSAL: sample re-weighting based on gradient similarity followed by active learning, RGS&MAL: sample re-weighting based on both gradient similarity and last layer gradient magnitude followed by active learning, Met: metrics, DS: Dice score, HD: Hausdorff distance.)
Fig. 2Qualitative performance comparison by our implemented methods in pneumonia infection segmentation on the Challenge data. The first row shows the axial CT slices of seven COVID-19-infected patients. The second row shows the expert-generated infection mask overlaid on the corresponding CT slices. The third to eighth rows show infection segmentation masks generated by different approaches. Blue arrows indicate false positives and yellow arrows indicate false negatives.
Fig. 3Qualitative performance comparison by our implemented methods in pneumonia infection segmentation on the Benchmark data. The first row shows the axial CT slices of five COVID-19-infected patients. The second row shows the expert-generated infection mask overlaid on the corresponding CT slices. The third to sixth rows show infection segmentation masks generated by different approaches.
Dice scores achieved by contrasting methods in the segmentation of pneumonia infections in Benchmark data. (Acronyms—RGS: sample re-weighting based on gradient similarity only, RGSAL: sample re-weighting based on gradient similarity followed by active learning, RGS&MAL: sample re-weighting based on both gradient similarity and last layer gradient magnitude followed by active learning.)
| Method type | Methods | Mean Dice |
|---|---|---|
| Fully supervised | ||
| Our 3D UNet | ||
| Semi-supervised | ||
| 0.5970 | ||
| Proposed RGS | ||
| Active Learning from Noisy Teacher | Proposed RGS | |
| Proposed RGS&M | ||