| Literature DB >> 35310182 |
Priyanka Malhotra1, Sheifali Gupta1, Deepika Koundal2, Atef Zaguia3, Wegayehu Enbeyle4.
Abstract
Image segmentation is a branch of digital image processing which has numerous applications in the field of analysis of images, augmented reality, machine vision, and many more. The field of medical image analysis is growing and the segmentation of the organs, diseases, or abnormalities in medical images has become demanding. The segmentation of medical images helps in checking the growth of disease like tumour, controlling the dosage of medicine, and dosage of exposure to radiations. Medical image segmentation is really a challenging task due to the various artefacts present in the images. Recently, deep neural models have shown application in various image segmentation tasks. This significant growth is due to the achievements and high performance of the deep learning strategies. This work presents a review of the literature in the field of medical image segmentation employing deep convolutional neural networks. The paper examines the various widely used medical image datasets, the different metrics used for evaluating the segmentation tasks, and performances of different CNN based networks. In comparison to the existing review and survey papers, the present work also discusses the various challenges in the field of segmentation of medical images and different state-of-the-art solutions available in the literature.Entities:
Mesh:
Year: 2022 PMID: 35310182 PMCID: PMC8930223 DOI: 10.1155/2022/9580991
Source DB: PubMed Journal: J Healthc Eng ISSN: 2040-2295 Impact factor: 2.682
Description of few review papers in medical image segmentation.
| Ref. | Year | Models discussed | Performance metrics | Dataset | Challenges | Remarks | |
|---|---|---|---|---|---|---|---|
| [ | 2017 | CNN | No coverage | No coverage | Challenges with CNN covered | Image classification, object detection, segmentation, and registration mechanisms discussed | |
| [ | 2017 | Stacked autoencoder, deep belief network, and deep Boltzmann machine | No coverage | No coverage | No coverage | — | |
| [ | 2018 | CNN, R-CNN | Image classification metrics discussed but segmentation metrics not covered | Medical image modalities covered | No coverage | All areas of medical image analysis discussed | |
| [ | 2019 | CNN. FCN, U-Net, VNet, CRN, and RNN | No coverage | Covered | Challenges and possible solutions discussed | — | |
| [ | 2020 | Supervised, weakly supervised models (RNN, U-Net) | No coverage | Covered | Challenges and possible solutions discussed | ---- | |
| [ | 2021 | CNN, FCN, DeepLab, SegNet, U-Net, and VNet | Covered | Covered | Challenges discussed but the solutions not discussed | — | |
| Ours | CNN,FCN,R-CNN, fast R-CNN, faster R-CNN, mask R-CNN, U-Net, VNet, and DeepLab | Covered | Covered | Challenges and possible state-of-the-art solutions discussed | Paper provides extended coverage to the different deep neural networks for image segmentation | ||
Structure of the paper.
| S. no. | Main section | Subsection |
|---|---|---|
| 1 | Introduction | Introduction and motivation literature review major contributions |
| 2 | Deep neural network structures | Artificial neural network convolutional neural network encoder-decoder models regional convolutional network deepLab model comparison, limitations, and advantages/ |
| 3 | Application of deep neural network to medical image segmentation | Deep learning-based system literature review on DNN based image segmentation models for different organs summary on deep learning-based medical image segmentation methods ( |
| 4 | Medical image segmentation datasets | Types and format of dataset different types of modalities summary of medical image segmentation datasets ( |
| 5 | Evaluation metrics | Importance of metrics popular image segmentation algorithm performance metrics |
| 6 | Major challenges and state-of-the-art solutions | Dataset challenges with DL model possible solution to the problems related to dataset and DL model |
| 7 | Future direction | Motivation for further study and future research |
| 8 | Conclusion | Concluding remarks |
Figure 1Artificial neural network (ANN) model.
Figure 2Different types of deep neural network architectures for image segmentation.
Figure 3Convolutional neural network architecture.
Figure 4R-CNN architecture.
Figure 5DeepLab architecture.
Comparison between different image segmentation algorithms.
| Deep learning algorithm | Algorithm description | Advantages | Limitations |
|---|---|---|---|
| CNN | It consists of three main neural layers, which are convolutional layers, pooling layers, and fully connected layers | (a) It is simple | (a) It cannot manage different input sizes |
| FCN | All fully connected layers of CNN are replaced with the fully convolutional layers | The model outputs a spatial segmentation map instead of classification scores | It is hard to train a FCN model to get good performance |
| U-Net | It combines the location information obtained from the downsampling path and the contextual information obtained from upsampling path to predict segmentation map | It can perform efficient segmentation of images using limited number of labelled training images | (a) Input image size is limited to 572 × 572. |
| VNet | It performs convolutions on each stage using volumetric kernels of size 5 × 5 × 5 | It can be applied to 3D data for segmentation | |
| R-CNN | It uses selective search algorithm to extract 2000 regions from the image called region proposals | (a) It predicts the presence of an object within the region proposals | (a) Huge amount of time is needed to train network to classify 2000 region proposals per image |
| Fast R-CNN | It uses selective search algorithm which takes the whole image and region proposals as input in its CNN architecture in one forward propagation | It improves mean average precision (mAP) as compared to R-CNN | There is high computation time due to selective search region proposal generation algorithm |
| Faster R-CNN | It uses region proposal network | It generates the bounding boxes of different shapes and sizes | There is lower computation time |
| Mask R-CNN | It gives three outputs for each object in the image: its class, bounding box coordinates, and object mask | a) It is simple and flexible approach | There is high training time |
| DeepLabv1 | a) It uses atrous convolution to extract the features from an image | a) There is high speed due to atrous convolution | Use of CRFs makes algorithm slow |
| DeepLabv2 | It uses | Atrous spatial pyramid pooling (ASPP) robustly segments objects at multiple scales | There are challenges in capturing fine object boundaries |
| DeepLabv3 | It uses atrous separable convolution to capture sharper object boundaries | It can segment sharper targets | It still needs more refinement for object boundaries |
| DeepLabv3+ | It extends DeepLabv3 by adding a decoder module to refine the segmentation results along the object boundaries | There is better segmentation performance as compared to deepLabv3 | It is a large model with number of parameters to train. So, while training on higher resolution images and batch sizes, it needs large GPU memory. |
Figure 6Basic layout of typical deep learning-based system.
Summary on deep learning-based medical image segmentation methods.
| Organ | Segmented area | Model utilized | Dataset | Modality | Remarks |
|---|---|---|---|---|---|
| Cardiac | Cardiac, left, and right ventricular cavities and myocardium [ | 2D/3d CNN | ADC2017 | Cardiac MR images | — |
| Heart [ | RFCN | MICCAI2 2009 challenge dataset | Cardiac MR images | RFCN reduces computational time, simplifies segmentation, and enables real time applications | |
| Heart [ | U-Net | — | DT-CMR images | U-Net automated the DT-CMR postprocessing, supporting real time results | |
| Brain | Brain tissues [ | 2D CNN | — | Multimodal MR images | Model performance increases by employing multiple modalities |
| Brain tumor [ | SDResU-Net | — | MR images | U-Net has generalization capability | |
| Brain [ | Voxel-wise residual network | MRBrainS | MRI | — | |
| Brain [ | DNN | ISBI 2012 EM | TEM | — | |
| Pixel-wise brain segmentation [ | MD-LSTM | MRBrainS13 | Brain MR images | It can parallelize for 3D data | |
| Brain tumor core [ | FCN, U-Net | MR images | Bounding box technique used | ||
| Brain tumor [ | DeepLab | CT images | DeepLab with conditional random fields produces high accuracy | ||
| Lungs | Pulmonary nodules [ | 3D FCN | LIDC dataset | Chest CT images | Increased speed of screening |
| Lung segmentation [ | JU-Net based CNN | JSRT | CXR | ____ | |
| Pneumothorax segmentation [ | FC-DenseNet with SCSE module | PACS | Chest X-ray images | Spatial weighted cross-entropy loss function improves precision at boundaries | |
| Pneumothorax segmentation [ | Mask R-CNN | SIIM-ACR | Chest X-ray images | Bounding box regression helps in improving classification | |
| Pneumothorax segmentation [ | U-Net and PSPNet | Routine chest CT dataset | Chest CT images | ||
| Liver | Liver and tumor segmentation [ | Cascaded FCN | DIRCAD dataset | CT and MRI images | Separate set of filters applied at each stage improves segmentation |
| Liver segmentation [ | HED-mask R-CNN | CHAOS challenge | CT and MR images | High segmentation accuracy obtained | |
| Liver segmentation [ | FCN | MICCAI SLiver07 dataset | CT images | — | |
| Reproductive system | Prostate [ | VNet | — | 3D MRI | — |
| Digestive system | Pancreas [ | Recurrent NN (LSTM) | NIH-CT-82, ufl-mri-79 | Abdominal CT and MRI images | RNN performs better than HNN and UNET |
| Breast | Breast masses [ | DBN + CRF/SSVM | DDSM-BCRP, INbreast databases | Mammograms | CRF model is faster than SSVM |
| Eyes | Retinal blood vessels [ | U-Net with modifications | DRIVE/STARE | Retinal images | Modification allows precise and faster segmentation of blood vessels |
| Retinal blood vessels [ | U-Net, LadderNet | DRIVE/STARE/CHASE | Retinal images | — |
ADC: Alzheimer Disease Center. MICCAI: Medical Image Computing and Computer Assisted Intervention. MRBrainS: MR brain segmentation. ISBI: IEEE International Symposium on Biomedical Imaging. LIDC: Lung Image Database Consortium. JSRT: Japanese society of radiological technology. PACS: Picture Archiving and Communication System. SIIM-ACR: Society for Imaging Informatics in Medicine-American College of Radiology. DIRCAD: 3D image reconstruction for comparison of algorithm database. CHAOS: combined (CT-MR) healthy abdominal organ segmentation challenge. DDSM: digital database for screening mammography. DRIVE: digital retinal images for vessel extraction. STARE: Structural Analysis of Retinal Dataset. CHASE: Combined Healthy Abdominal Organ Segmentation Challenge.
Figure 7(a) MR image of brain. (b) CT scan of brain [30].
Summary of medical image segmentation datasets.
| Organ examined | Imaging modality | Dataset name | Dataset size | Dimensions | Image format | Segmented area | Used in reference |
|---|---|---|---|---|---|---|---|
| Brain | MRI | BraTS1 2018 | 285 | 3D (240 × 240 × 155) | NIFTI | Gliomas tumor | [ |
| Knee | MRI | SK110 | 60 | 3D (0.39 × 0.39 × 1.0) | NIFTI | Bones and cartilage | [ |
| OA1ZIB | 507 | 3D (0.36 × 0.36 × 0.7) | NIFTI | Bones and cartilage | |||
| Eyes | Retinal images | DRIVE | 40 | 2D (768 × 584) | JPEG | Retinal vessels | [ |
| Retinal images retinal images | PALM2 STARE | 1200 20 | -- 700 × 605 | JPEG JPEG | Lesions in pathological myopia blood vessels | [ | |
| Abdominal area | CT | CHAOS3 | 40 | 512 × 512 | DICOM | Liver and vessels | [ |
| MRI | CHAOS | 120 | 2D (256 × 256) | DICOM | |||
| Chest | Chest X-ray | SIIM-ACR4 | — | 2D (1024 × 1024) | DICOM | Pneumothorax | [ |
| Chest X-ray CT | SCR5 SegTHOR | 247 60 | 2D (2048 × 2048) ----- | JPEG----- | Lungs, heart, and clavicles segmentation of heart, aorta, trachea, and esophagus | [ | |
| Kidney | CT | KiTS6 19 | 300 | ----- | NIFTI | Kidney tumor | [ |
| Liver | WSI CT | PAIP ---- | 50 201 | 3D 3D | — | Liver cancer tumor | [ |
| Cardiac | MRI | 30 | 3D | Left atrium | [ | ||
| Lung | CT CT | Luna7 16 DSB8 | 888 1397 | 2D 2D | MetaImage | Nodules nucleus segmentation | [ |
ACR: Society for Imaging Informatics in Medicine-American College of Radiology. BraTS: Brain Tumor Segmentation. CHAOS: Combined Healthy Abdominal Organ Segmentation Challenge. DSB: Data Science Bowl. KiTS: kidney tumor segmentation challenge. Luna: Lung Nodule Analysis. PALM: Pathologic Myopia Challenge. SCR: Segmentation in Chest Radiographs.