| Literature DB >> 32512401 |
Naofumi Tomita1, Steven Jiang2, Matthew E Maeder3, Saeed Hassanpour4.
Abstract
In this paper, we demonstrate the feasibility and performance of deep residual neural networks for volumetric segmentation of irreversibly damaged brain tissue lesions on T1-weighted MRI scans for chronic stroke patients. A total of 239 T1-weighted MRI scans of chronic ischemic stroke patients from a public dataset were retrospectively analyzed by 3D deep convolutional segmentation models with residual learning, using a novel zoom-in&out strategy. Dice similarity coefficient (DSC), average symmetric surface distance (ASSD), and Hausdorff distance (HD) of the identified lesions were measured by using manual tracing of lesions as the reference standard. Bootstrapping was employed for all metrics to estimate 95% confidence intervals. The models were assessed on a test set of 31 scans. The average DSC was 0.64 (0.51-0.76) with a median of 0.78. ASSD and HD were 3.6 mm (1.7-6.2 mm) and 20.4 mm (10.0-33.3 mm), respectively. The latest deep learning architecture and techniques were applied with 3D segmentation on MRI scans and demonstrated effectiveness for volumetric segmentation of chronic ischemic stroke lesions.Entities:
Keywords: Deep learning; Ischemic stroke; MRI; Segmentation
Mesh:
Year: 2020 PMID: 32512401 PMCID: PMC7281812 DOI: 10.1016/j.nicl.2020.102276
Source DB: PubMed Journal: Neuroimage Clin ISSN: 2213-1582 Impact factor: 4.881
Fig. 1Overview of our 52-layer segmentation model. (a) The network consists of residual blocks (in green), down-sampling blocks (in blue), and up-sampling blocks (in yellow). (2) Our 3D residual block uses a group-normalization (gn) layer to stabilize optimization for a small mini-batch. (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.)
The details and hyperparameters for the model optimization in our experiments. Additional details about the input volume selection are available in Appendix E3.
| Optimization Stage | Zoom-In Stage | Zoom-Out Stage |
|---|---|---|
| Input volume size (mm3) | 128 × 128 × 128 | 144 × 172 × 168 |
| Training Length | 1200 epochs | 150 epochs |
| Initial learning rate | 1.00E−03 | 1.00E−04 |
| Optimizer | Adam optimizer and cosine annealing with warm restart scheduler ( | |
| GPU | Nvidia Titan Xp | Nvidia Titan RTX |
| Deep learning framework | PyTorch ( | |
Summary of evaluation metrics. A higher rate is better for DSC, mDSC, TPR, and Precision. For distance metrics (HD and ASSD), a smaller number is better. Best scores are marked in bold. The inter-rater scores are calculated based on tracing of five brain MRIs by 11 non-expert individuals trained by an expert neuroradiologist (Liew et al., 2018). The model’s performance, based on the primary stroke locations and the vascular territories, is available in the Supplementary Material, Tables E6 and E7.
| Methods | DSC | mDSC | HD | ASSD | TPR | Precision |
|---|---|---|---|---|---|---|
| 3D-ResU-Net | 0.64 | 0.62 | ||||
| 3D-ResU-Net-E | 0.64 | 0.65 | 21.5 | 3.7 | 0.79 | |
| Trained human tracer | – | 22.6 | – | – |
Summary of different approaches on the ATLAS dataset. LR: learning rate; H: height; W: width; D: depth; SGD: stochastic gradient descent. “–” denotes that the corresponding information is not available.
| Methods | X-Net ( | Multi-path 2.5D-CNN ( | D-UNet ( | CLCI-Net ( | 3D-ResU-Net (ours) |
|---|---|---|---|---|---|
| Training data source | ATLAS | KF & MCW | ATLAS | ATLAS | ATLAS |
| ATLAS split ratio (train, validation, test) (%) | 5-fold cross-validation | (0, 0, 100) | (80, 20, 0) | (55, 18, 27) | (76, 11, 13) |
| Base architecture | 2D U-Net | 2D U-Net with 3D post-processing | 3D U-Net | 2D U-Net | 3D U-Net |
| Regularization layers | Batch normalization | Batch normalization | Batch normalization | Batch normalization | Group normalization |
| Training strategy | Adam optimizer, reduce LR on plateau | SGD optimizer, exponential LR decay | SGD optimizer, constant LR | Adam optimizer, constant LR | Adam optimizer, cosine annealing |
| Loss function | Dice loss & Cross Entropy | Dice loss | Dice loss & Focal loss | Dice loss | Dice loss & Cross Entropy |
| Input size (W × H × D) | 192 × 224 × 1 | 192 × 224 × 192 | 192 × 4 × 192 | 176 × 233 × 1 | 144 × 172 × 168 |
| Reported DSC | 0.49 (–) | 0.54 (–) | 0.54 (0.26–0.81) | 0.58 (–) | 0.64 (0.51–0.76) |
Fig. 2Visualization of reference standard labels (in blue) and lesion predictions by our model (in red). The higher the predicted value is at a voxel, the brighter in red the voxel is. Two groups of samples are shown: large reference labels in (a) and small labels in (b). For each group, the first column is a computed DSC value and the rest are visualized reference standards and predictions, from left-front and right-front views. Three typical samples are shown in a row in each group (best viewed in color). The visualization of segmentation results in axial slices is also available as videos in the Supplementary Material, Video E1. (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.)
Fig. 3DSC scores and the total number of positive lesion voxels (in base-10 log scale) are computed and plotted for each sample in the test set. The R2 value of this distribution is 0.34.
Comparison of evaluation metrics with respect to the subsets of test set samples. Test samples are sorted by the number of positive lesion voxels in increasing order and grouped in four ranges, shown in the first column. DSC, HD, ASSD, TPR, and precision are computed for each group. Best scores are marked in bold.
| Percentile in per-sample lesion size distribution | DSC | HD | ASSD | TPR | Precision |
|---|---|---|---|---|---|
| 0–25% | 0.41 | 23.4 | 4.9 | 0.74 | 0.39 |
| 25–50% | 0.72 | 18.0 | 3.8 | 0.78 | 0.71 |
| 50–75% | 0.62 | 26.1 | 4.1 | 0.82 | 0.60 |
Results of our ablation study examining the effect of our zoom-in&out training strategy. Finetuning with larger extracted volumes is applied on a 3D-ResU-Net-F model to obtain a 3D-ResU-Net model. The last row is the difference in performance between the 3D-ResU-Net and 3D-ResU-Net-F for each metric. Best scores are marked in bold.
| Methods | microDSC | DSC | HD | ASSD | TPR | Precision |
|---|---|---|---|---|---|---|
| 3D-ResU-Net-F | 0.73 | 0.60 | 35.1 | 7.6 | 0.54 | |
| 3D-ResU-Net | 0.81 | |||||
| Δ | +0.06 | +0.04 | −14.7 | −4.0 | −0.02 | +0.08 |