| Literature DB >> 31830973 |
Vitoantonio Bevilacqua1, Antonio Brunetti2, Giacomo Donato Cascarano2, Andrea Guerriero2, Francesco Pesce3, Marco Moschetta3, Loreto Gesualdo3.
Abstract
BACKGROUND: The automatic segmentation of kidneys in medical images is not a trivial task when the subjects undergoing the medical examination are affected by Autosomal Dominant Polycystic Kidney Disease (ADPKD). Several works dealing with the segmentation of Computed Tomography images from pathological subjects were proposed, showing high invasiveness of the examination or requiring interaction by the user for performing the segmentation of the images. In this work, we propose a fully-automated approach for the segmentation of Magnetic Resonance images, both reducing the invasiveness of the acquisition device and not requiring any interaction by the users for the segmentation of the images.Entities:
Keywords: ADPKD; Convolutional neural network; Deep learning; Magnetic resonance; R-CNN; Semantic segmentation
Mesh:
Year: 2019 PMID: 31830973 PMCID: PMC6907104 DOI: 10.1186/s12911-019-0988-4
Source DB: PubMed Journal: BMC Med Inform Decis Mak ISSN: 1472-6947 Impact factor: 2.796
Fig. 1Workflow for the semantic segmentation starting from the full image
Fig. 2Workflow for the semantic segmentation of ROIs automatically detected with R-CNN
Fig. 3Example of an input image segmented manually; left: the representation of a DICOM image in greyscale; right: the mask obtained after the manual contouring of the selected slice
Fig. 4Encoder–Decoder architecture for SegNet [32]
Configurations designed and tested for the semantic segmentation of the full image
| Network ID | Number of layers per encoder | Number of convolutional filters per layer | Learner |
|---|---|---|---|
| [2 2 3 3 3] | [64 128 256 512 512] | ADAM | |
| [3 2 3 3 3] | [64 128 256 512 512] | ADAM | |
| [3 2 3 3 3] | [96 128 256 512 512] | ADAM |
Each layer is a sequence of a convolutional layer, a batch normalization layer and a ReLu layer
Configurations designed and tested for the CNN in the ROI detector
| Network ID | Number of layers per encoder | Number of convolutional filters per layer | Learner |
|---|---|---|---|
| [3 3] | [32 32] | SGDM | |
| [1 1] | [16 32] | SGDM | |
| [3 3] | [64 32] | SGDM |
Each layer is a sequence of a convolutional layer, and a ReLu layer
Fig. 5Precision – Recall plot and log Average Miss rate for R-CNN-1
Fig. 6Precision – Recall plot and log Average Miss rate for R-CNN-2
Fig. 7Precision – Recall plot and log Average Miss rate for R-CNN-3
Fig. 8Results from R-CNN classifier. Input image is on the left; the image on the right contains squares on the detected ROIs, each one is associated with a score
Performance indices for the classifiers working on MR images
| Network ID | Mean accuracy | Weighted IoU | Mean BF score |
|---|---|---|---|
| 0.88076 | 0.75288 | 0.41117 | |
| 0.88359 | 0.76294 | 0.38205 | |
| 0.79824 | 0.52781 | 0.38643 |
Normalized Confusion Matrix for VGG-16, S-CNN-1 and S-CNN-2 segmenting the MR images for the test set
| VGG-16 | S-CNN-1 | S-CNN-2 | |||||
|---|---|---|---|---|---|---|---|
| True condition | True condition | True condition | |||||
| Predicted | |||||||
| Condition | |||||||
Fig. 9Result of the semantic segmentation considering an image sample. Top left: the MR slice represented in greyscale; top right: the segmentation result; bottom left: the ground-truth mask; bottom right: superimposition of the segmentation result to the ground-truth mask
Performance indices for the classifiers working on the ROIs
| Network ID | Mean accuracy | Weighted IoU | Mean BF score |
|---|---|---|---|
| VGG-16 | 0.86016 | 0.75426 | 0.34828 |
| S-CNN-1 | 0.8726 | 0.8540 | 0.4332 |
| S-CNN-2 | 0.8550 | 0.82931 | 0.41515 |
Normalized Confusion Matrix for VGG-16, S-CNN-1 and S-CNN-2 segmenting the ROIs detected by the R-CNN-1 from the MR images of the test set
| VGG-16 | S-CNN-1 | S-CNN-2 | |||||
|---|---|---|---|---|---|---|---|
| True condition | True condition | True condition | |||||
| Predicted | |||||||
| Condition | |||||||
Fig. 10Example result for ROI detection and semantic segmentation. Top left: the MR slice represented in greyscale; top right: the R-CNN detection result; middle left: one of the detected ROIs; middle right the segmentation result; bottom left: the ground-truth mask for the considered ROI; bottom right: superimposition of the classification result to the ground-truth mask