| Literature DB >> 35751030 |
Zeeshan Shaukat1,2, Qurat Ul Ain Farooq3, Shanshan Tu4, Chuangbai Xiao5, Saqib Ali4.
Abstract
Glioma is the most aggressive and dangerous primary brain tumor with a survival time of less than 14 months. Segmentation of tumors is a necessary task in the image processing of the gliomas and is important for its timely diagnosis and starting a treatment. Using 3D U-net architecture to perform semantic segmentation on brain tumor dataset is at the core of deep learning. In this paper, we present a unique cloud-based 3D U-Net method to perform brain tumor segmentation using BRATS dataset. The system was effectively trained by using Adam optimization solver by utilizing multiple hyper parameters. We got an average dice score of 95% which makes our method the first cloud-based method to achieve maximum accuracy. The dice score is calculated by using Sørensen-Dice similarity coefficient. We also performed an extensive literature review of the brain tumor segmentation methods implemented in the last five years to get a state-of-the-art picture of well-known methodologies with a higher dice score. In comparison to the already implemented architectures, our method ranks on top in terms of accuracy in using a cloud-based 3D U-Net framework for glioma segmentation.Entities:
Keywords: 3D U-Net; Brain tumor; Cloud computing; Deep learning; Semantic segmentation
Mesh:
Year: 2022 PMID: 35751030 PMCID: PMC9229514 DOI: 10.1186/s12859-022-04794-9
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.307
Fig. 1Segmentation showing different regions of tumor in a multimodality MRI image
Fig. 2Published articles in Google Scholar, PubMed, Scopus and Web of Science in last 5 years
Number of publications related to tumor Segmentation in last 5 years
| Year | Google scholar | PubMed | Scopus | Web of science |
|---|---|---|---|---|
| 2021 | 3020 | 1495 | 319 | 200 |
| 2020 | 2350 | 1100 | 268 | 170 |
| 2019 | 1710 | 742 | 185 | 140 |
| 2018 | 1140 | 408 | 89 | 74 |
| 2017 | 709 | 224 | 26 | 26 |
Fig. 3Semantic Segmentaion Workflow for Tumor MRIs
BraTS brain tumors dataset specifications
| Target | Gliomas segmentation necrotic/active tumor and oedema | |
|---|---|---|
| Modality | Multimodal multisite MRI data | FLAIR |
| T1w | ||
| T1gd | ||
| T2w | ||
| Size | 750 4D volumes | 484 training |
| 266 testing | ||
| Dimensions | Height | 240 |
| Width | 240 | |
| Depth | 155 | |
| Dimen | Different scan modalities | |
Fig. 4Dataset Ground Truth versus Pixel Labels
Fig. 5Preview of four different labeled training volumes from the dataset
Experimental specifications used to perform semantic segmentation on cloud
| Cloud server | RAM | 56 GiB |
| Storage | 340 GiB | |
| GPU | 1 × K80 | |
| Instance | Azure NC6 | |
| vCPU(s) | 6 | |
| Cost | $1.321/hour | |
| NVIDIA Tesla K80 accelerator | Memory | 24 GB GDDR5 |
| Bandwidth | 480 GB/s | |
| CUDA cores | 4992 | |
| Single-precision | 8.73 teraflops | |
| Double-precision | 2.91 teraflops |
Fig. 6Random patch extraction datastore
Random patch extraction datastore specifications
| Patch size | 64 × 64 × 64 voxels |
| Patch per image | 16 |
| Mini-batch size | 8 |
Fig. 73-D U-Net Layers Diagram
Fig. 83D U-Net Deep Network Diagram used to train the system
Network analysis result of 3D U-net layers with reference to Figs. 7 and 8
| Sr | Name | Type | Activations | Learnable | Total learnable |
|---|---|---|---|---|---|
| 1 | Input 64 × 64 × 64 × 4 images | 3-D Image Input | 64 × 64 × 64 × 4 | – | 0 |
| 2 | en1_conv1 32 3 × 3 × 3 × 4 convolution with stride [1 1 1] and padding ‘same’ | Convolution | 64 × 64 × 64 × 32 | Weights 3 × 3 × 3 × 4 × 32 Bias 1 × 1 × 1 × 32 | 3488 |
| 3 | en1_bn1 Batch normalization with 32 channels | Batch Normalization | 64 × 64 × 64 × 32 | Offset 1 × 1 × 1 × 32 Scale 1 × 1 × 1 × 32 | 64 |
| 4 | en1_relu1 ReLU | ReLU | 64 × 64 × 64 × 32 | – | 0 |
| 5 | en1_conv2 64 3 × 3 × 3 × 32 convolution with stride [1 1 1] and padding ‘same’ | Convolution | 64 × 64 × 64 × 64 | Weights 3 × 3 × 3 × 32 × 64 Bias 1 × 1 × 1 × 64 | 55,360 |
| 6 | en1_relu2 ReLU | ReLU | 64 × 64 × 64 × 64 | – | 0 |
| 7 | en1_maxpool 2 × 2 × 2 max pooling with stride [2 2 2] and padding ‘same’ | 3-D Max Pooling | 32 × 32 × 32 × 64 | – | 0 |
| 8 | en2_conv1 64 3 × 3 × 3 × 64 convolution with stride [1 1 1] and padding ‘same’ | Convolution | 32 × 32 × 32 × 64 | Weights 3 × 3 × 3 × 64 × 64 Bias 1 × 1 × 1 × 64 | 110,656 |
| 9 | en2_bn1 Batch normalization with 64 channels | Batch Normalization | 32 × 32 × 32 × 64 | Offset 1 × 1 × 1 × 64 Scale 1 × 1 × 1 × 64 | 128 |
| 10 | en2_relu1 ReLU | ReLU | 32 × 32 × 32 × 64 | – | 0 |
| 11 | en2_conv2 128 3 × 3 × 3 × 64 convolution with stride [1 1 1] and padding ‘same’ | Convolution | 32 × 32 × 32 × 128 | Weights 3 × 3 × 3 × 64 × 128 Bias 1 × 1 × 1 × 128 | 221,312 |
| 12 | en2_relu2 ReLU | ReLU | 32 × 32 × 32 × 128 | – | 0 |
| 13 | en2_maxpool 2 × 2 × 2 max pooling with stride [2 2 2] and padding ‘same’ | 3-D Max Pooling | 16 × 16 × 16 × 128 | – | 0 |
| 14 | en3_conv1 128 3 × 3 × 3 × 128 convolution with stride [1 1 1] and padding ‘same’ | Convolution | 16 × 16 × 16 × 128 | Weights 3 × 3 × 3 × 128 × 128 Bias 1 × 1 × 1 × 128 | 442,496 |
| 15 | en3_bn1 Batch normalization with 128 channels | Batch Normalization | 16 × 16 × 16 × 128 | Offset 1 × 1 × 1 × 128 Scale 1 × 1 × 1 × 128 | 256 |
| 16 | en3_relu1 ReLU | ReLU | 16 × 16 × 16 × 128 | – | 0 |
| 17 | en3_conv2 256 3 × 3 × 3 × 128 convolution with stride [1 1 1] and padding ‘same’ | Convolution | 16 × 16 × 16 × 256 | Weights 3 × 3 × 3 × 128 × 256 Bias 1 × 1 × 1 × 256 | 884,992 |
| 18 | en3_relu2 ReLU | ReLU | 16 × 16 × 16 × 256 | – | 0 |
| 19 | en3_maxpool 2 × 2 × 2 max pooling with stride [2 2 2] and padding ‘same’ | 3-D Max Pooling | 8 × 8 × 8 × 256 | – | 0 |
| 20 | de4_conv1 256 3 × 3 × 3 × 256 convolution with stride [1 1 1] and padding ‘same’ | Convolution | 8 × 8 × 8 × 256 | Weights 3 × 3 × 3 × 256 × 256 Bias 1 × 1 × 1 × 256 | 1,769,728 |
| 21 | de4_relu1 ReLU | ReLU | 8 × 8 × 8 × 256 | – | 0 |
| 22 | de4_conv2 512 3 × 3 × 3 × 256 convolution with stride [1 1 1] and padding ‘same’ | Convolution | 8 × 8 × 8 × 512 | Weights 3 × 3 × 3 × 256 × 512 Bias 1 × 1 × 1 × 512 | 3,539,456 |
| 23 | de4_relu2 ReLU | ReLU | 8 × 8 × 8 × 512 | – | 0 |
| 24 | de4_transconv 512 2 × 2 × 2 × 512 transposed 3D convolutions with stride [2 2 2] and cropping [0 0 0; 0 0 0] | Transposed Convolution 3D | 16 × 16 × 16 × 512 | Weights 2 × 2 × 2 × 512 × 512 Bias 1 × 1 × 1 × 512 | 2,097,664 |
| 25 | concat3 Concatenation of 2 inputs along dimension 4 | Concatenation | 16 × 16 × 16 × 768 | – | 0 |
| 26 | de3_conv1 256 3 × 3 × 3 × 768 convolution with stride [1 1 1] and padding ‘same’ | Convolution | 16 × 16 × 16 × 256 | Weights 3 × 3 × 3 × 758 × 256 Bias 1 × 1 × 1 × 256 | 5,308,672 |
| 27 | de3_relu1 ReLU | ReLU | 16 × 16 × 16 × 256 | – | 0 |
| 28 | de3_conv2 256 3 × 3 × 3 × 256 convolution with stride [1 1 1] and padding ‘same’ | Convolution | 16 × 16 × 16 × 256 | Weights 3 × 3 × 3 × 256 × 256 Bias 1 × 1 × 1 × 256 | 1,769,728 |
| 29 | de3_relu2 ReLU | ReLU | 16 × 16 × 16 × 256 | – | 0 |
| 30 | de3_transconv 256 2 × 2 × 2 × 256 transposed 3D convolutions with stride [2 2 2] and cropping [0 0 0; 0 0 0] | Transposed Convolution 3D | 32 × 32 × 32 × 256 | Weights 2 × 2 × 2 × 256 × 256 Bias 1 × 1 × 1 × 256 | 524,544 |
| 31 | concat2 Concatenation of 2 inputs along dimension 4 | Concatenation | 32 × 32 × 32 × 384 | – | 0 |
| 32 | de2_conv1 128 3 × 3 × 3 × 384 convolution with stride [1 1 1] and padding ‘same’ | Convolution | 32 × 32 × 32 × 128 | Weights 3 × 3 × 3 × 384 × 128 Bias 1 × 1 × 1 × 128 | 1,327,232 |
| 33 | de2_relu1 ReLU | ReLU | 32 × 32 × 32 × 128 | – | 0 |
| 34 | de2_conv2 128 3 × 3 × 3 × 128 convolution with stride [1 1 1] and padding ‘same’ | Convolution | 32 × 32 × 32 × 128 | Weights 3 × 3 × 3 × 128 × 128 Bias 1 × 1 × 1 × 128 | 442,496 |
| 35 | de2_relu2 ReLU | ReLU | 32 × 32 × 32 × 128 | – | 0 |
| 36 | de2_transconv 128 2 × 2 × 2 × 128 transposed 3D convolutions with stride [2 2 2] and cropping [0 0 0; 0 0 0] | Transposed Convolution 3D | 64 × 64 × 64 × 128 | Weights 2 × 2 × 2 × 128 × 128 Bias 1 × 1 × 1 × 128 | 131,200 |
| 37 | concat1 Concatenation of 2 inputs along dimension 4 | Concatenation | 64 × 64 × 64 × 192 | – | 0 |
| 38 | de1_conv1 64 3 × 3 × 3 × 192 convolution with stride [1 1 1] and padding ‘same’ | Convolution | 64 × 64 × 64 × 64 | Weights 3 × 3 × 3 × 192 × 64 Bias 1 × 1 × 1 × 64 | 331,840 |
| 39 | de1_relu1 ReLU | ReLU | 64 × 64 × 64 × 64 | – | 0 |
| 40 | de1_conv2 64 3 × 3 × 3 × 64 convolution with stride [1 1 1] and padding ‘same’ | Convolution | 64 × 64 × 64 × 64 | Weights 3 × 3 × 3 × 64 × 64 Bias 1 × 1 × 1 × 64 | 110,656 |
| 41 | de1_relu2 ReLU | ReLU | 64 × 64 × 64 × 64 | – | 0 |
| 42 | convlast 2 1 × 1 × 1 × 64 convolution with stride [1 1 1] and padding ‘same’ | Convolution | 64 × 64 × 64 × 2 | Weights 1 × 1 × 1 × 64 × 2 Bias 1 × 1 × 1 × 2 | 130 |
| 43 | softmax softmax | Softmax | 64 × 64 × 64 × 2 | – | 0 |
| 44 | Output Dice loss | Classification Output | – | – | 0 |
System parameters to train 3D U-Net network
| Initial learning rate | 5e-4 |
| Maxepochs | 100 |
| Learning rate schedule | Piecewise |
| Learning rate drop period | 5 |
| Learning rate drop factor | 0.95 |
| Validation frequency | 400 |
| Verbose | False |
| Mini-batch size | 8 |
Fig. 93D Orientation at Corresponding angles of a Labeled Volume
List of methods with a high accuracy and dice score
| Dataset | Segmentation method | Dice score | Publication year | References |
|---|---|---|---|---|
BRATS 2013 BRATS 2015 | FCNNs and CRFs | 0.83 0.82 | 2017 | [ |
| BRATS 2013 | CNN | 0.80 | 2017 | [ |
| BRATS 2015 | DeepMedic + CRF | 0.71 | 2017 | [ |
| BRATS 2019 | CNN | 0.84 | 2020 | [ |
| BRATS 2015 | FCN | 0.89 | 2019 | [ |
| BRATS 2013 | CNN | HG 0.943 LG 0.950 | 2019 | [ |
| BRATS 2015 | RDM-Net | 0.73 | 2019 | [ |
| BRATS 2015 | CNN + TVS | 0.857 | 2018 | [ |
| BRATS 2013 | Hybrid CNN | 0.86 | 2019 | [ |
| BRATS 2015 | WRN-PPNet | 0.94 | 2019 | [ |
| BRATS 2019 | Two-stage Unet | 0.84 | 2020 | [ |
| BRATS 2019 | DNN | 0.85 | 2020 | [ |
BRATS 2017 BRATS 2018 Oslo Dataset | 3D U-Net | 0.82 0.84 0.80 | 2020 | [ |
| BRATS 2020 | Ensemble + post-processing | 0.87 | 2021 | [ |
| BRATS 2018 | C-ConvNet | 0.90 | 2021 | [ |
| BRATS 2020 | 3D U-NET | 0.95 | 2022 | This Study |
Summary of evaluation matrics
| Metrics | Expression |
|---|---|
| Dice score (DSC) | |
| Sensitivity (SEN) | |
| Specificity (SPEC) |
Fig. 10Labled Ground Truth Left versus Network Predicted Right
Fig. 11Dice Accuracy Plot of 3D U-NET Trained Network