| Literature DB >> 32265680 |
Théo Estienne1,2,3,4, Marvin Lerousseau1,2,3,5, Maria Vakalopoulou1,4,5, Emilie Alvarez Andres1,2,3, Enzo Battistella1,2,3,4, Alexandre Carré1,2,3, Siddhartha Chandra5, Stergios Christodoulidis6, Mihir Sahasrabudhe5, Roger Sun1,2,3,5, Charlotte Robert1,2,3, Hugues Talbot5, Nikos Paragios1, Eric Deutsch1,2,3.
Abstract
Image registration and segmentation are the two most studied problems in medical image analysis. Deep learning algorithms have recently gained a lot of attention due to their success and state-of-the-art results in variety of problems and communities. In this paper, we propose a novel, efficient, and multi-task algorithm that addresses the problems of image registration and brain tumor segmentation jointly. Our method exploits the dependencies between these tasks through a natural coupling of their interdependencies during inference. In particular, the similarity constraints are relaxed within the tumor regions using an efficient and relatively simple formulation. We evaluated the performance of our formulation both quantitatively and qualitatively for registration and segmentation problems on two publicly available datasets (BraTS 2018 and OASIS 3), reporting competitive results with other recent state-of-the-art methods. Moreover, our proposed framework reports significant amelioration (p < 0.005) for the registration performance inside the tumor locations, providing a generic method that does not need any predefined conditions (e.g., absence of abnormalities) about the volumes to be registered. Our implementation is publicly available online at https://github.com/TheoEst/joint_registration_tumor_segmentation.Entities:
Keywords: brain tumor segmentation; convolutional neural networks; deep learning; deformable registration; multi-task networks
Year: 2020 PMID: 32265680 PMCID: PMC7100603 DOI: 10.3389/fncom.2020.00017
Source DB: PubMed Journal: Front Comput Neurosci ISSN: 1662-5188 Impact factor: 2.380
Figure 1A schematic representation of the proposed framework. The framework is composed by two decoders, one which provides tumor segmentation masks for both S and R images, and one the provides the optimal displacement grid G that will accurately map the S to the R image. The merge bloc will combine the forward signal of the source input and the reference input (which are forwarded independently in the encoder).
Layer architectures of the shared encoder, the segmentation decoder and the registration decoder.
| Enc1 | 4D MRI | Conv1,8, ReLU, (Conv3,8, ReLU), AddId, | (144, 208, 144, 8) | |
| Enc2 | Enc1 | Conv2,16, ReLU, (Conv3,16, ReLU)*2, AddId | (72, 104, 72, 16) | |
| Enc3 | Enc2 | Conv2,32, ReLU, (Conv3,32, ReLU)*3, AddId | (36, 52, 36, 32) | |
| Enc4 | Enc3 | Conv2,64, ReLU, (Conv3,64, ReLU)*3, AddId | (18, 26, 18, 64) | |
| Enc5 | Enc4 | Conv2,128, ReLU, (Conv3,128, ReLU)*3, AddId | (9, 13, 9, 128) | |
| Enc5 | Enc4 | DeConv2,64,ReLU, ResConc, (Conv3,64, ReLU)*3, AddId | (18, 26, 18, 64) | |
| Enc3 | DeConv2,32, ReLU, ResConc, (Conv3,32, ReLU)*3, AddId | (36, 52, 36, 32) | ||
| Enc2 | DeConv2,16, ReLU, ResConc, (Conv3,16, ReLU)*2, AddId | (72, 104, 72, 16) | ||
| Enc1 | DeConv2,8, ReLU, ResConc, (Conv3,8, ReLU), AddId | (144, 208, 144, 8) | ||
| Conv1,4, Softmax | (144, 208, 144, 4) | |||
| Merge | For all | |||
| MEnc5 | MEnc4 | DeConv2,64, ReLU, ResConc, (Conv3,64, ReLU)*3, AddId | (18, 26, 18, 64) | |
| MEnc3 | DeConv2,32, ReLU, ResConc, (Conv3,32, ReLU)*3, AddId | (36, 52, 36, 32) | ||
| MEnc2 | DeConv2,16, ReLU, ResConc, (Conv3,16, ReLU)*2, AddId | (72, 104, 72, 16) | ||
| MEnc1 | DeConv2,8, ReLU, ResConc, (Conv3,8, ReLU), AddId | (144, 208, 144, 8) | ||
| Conv1,3, Sigmoid | (144, 208, 144, 3) | |||
The sub-architectures are grouped into blocks, one per table line, whose names are indicated in the first column. Each block processed a forward signal as input identified by the second column. Additionally, both decoders have residual connections from different stages of the encoder, identified by the third column. The blocks are made of a set of successive operations where Conv.
Figure 2Illustration of a slice extracted from two different subjects for both BraTS 2018 and OASIS 3 datasets. The BraTS dataset consists of four modalities (T1, T1 gadolinium [T1 Gd], T2, T2 FLAIR [Flair]), along with voxelwise annotations for the three tumor tissue subclasses depicting the overall extent of tumors. OASIS 3 consists of a single T1 modality, and images are provided with voxelwise annotations for 47 different normal brain structures for patients without brain tumors.
Quantitative results of the different methods on the segmentation task on the BraTS 2018 validation dataset.
| Baseline segmentation | 0.79 ± 0.29 | 7.0 ± 9.6 | 0.73 ± 0.29 | 0.87 ± 0.13 | 0.75 ± 0.24 | 4.7 ± 8.2 | 7.2 ± 9.4 | 9.2 ± 8.9 |
| Proposed | ||||||||
| Concatenation w/o | 0.74 ± 0.29 | 8.3 ± 10.4 | 0.70 ± 0.29 | 0.87 ± 0.11 | 0.65 ± 0.29 | 6.2 ± 9.8 | 7.8 ± 11.1 | 11.3 ± 7.1 |
| Concatenation with | 0.73 ± 0.29 | 7.6 ± 9.9 | 0.68 ± 0.30 | 0.87 ± 0.12 | 0.66 ± 0.28 | 6.3 ± 9.9 | 5.6 ± 4.2 | 10.8 ± 6.6 |
| Subtraction w/o | 0.76 ± 0.27 | 7.8 ± 10.3 | 0.71 ± 0.28 | 0.88 ± 0.10 | 0.70 ± 0.24 | 6.5 ± 10.8 | 7.4 ± 11.0 | 10.0 ± 7.4 |
| Subtraction with | 0.76 ± 0.27 | 7.9 ± 10.1 | 0.71 ± 0.29 | 0.88 ± 0.10 | 0.69 ± 0.25 | 5.8 ± 9.6 | 7.7 ± 11.5 | 11.1 ± 8.3 |
Dice and Hausdorff95 are reported for the three classes Whole Tumor (WT), Enhancing Tumor (ET), and Tumor Core (TC) together with their average values. Results are reported with mean across patients (MRIs) along with the associated standard deviation. We upload our predictions on the official leaderboard of the validation set (66 patients).
Statistical significance of the proposed methods with Milletari et al. (2016) on the BraTS segmentation task.
| Baseline segmentation | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 |
| Proposed | ||||||||
| Concatenation w/o | 0.32 | 0.46 | 0.55 | 1.00 | 0.03 | 0.34 | 0.74 | 0.14 |
| Concatenation with | 0.24 | 0.72 | 0.33 | 1.00 | 0.05 | 0.31 | 0.21 | 0.24 |
| Subtraction w/o | 0.55 | 0.65 | 0.69 | 0.62 | 0.24 | 0.28 | 0.91 | 0.58 |
| Subtraction with | 0.55 | 0.60 | 0.69 | 0.62 | 0.16 | 0.48 | 0.79 | 0.21 |
For each model (line) and each performance measure (column), the displayed value is the p-value, up to two significant figures, of the statistical significance between the model and Milletari et al. (.
Figure 3The segmentation maps produced by the different evaluated methods displayed on post-contrast Gadolinium T1-weighted modalities. We present the provided segmentation maps both on our test dataset and on the BraTS 2018 validation dataset. NCR/NET, necrotic core; ET, GD-enhancing tumor; ED, peritumoral edema.
The mean and standard deviation of the dice coefficient for the 15 different classes of OASIS 3 dataset for the different evaluated methods.
| Rigid | 0.58 ± 0.15 | 0.39 ± 0.11 | 0.46 ± 0.13 | 0.40 ± 0.14 | 0.49 ± 0.05 | 0.44 ± 0.13 | 0.47 ± 0.13 | 0.35 ± 0.17 | 0.27 ± 0.15 | 0.40 ± 0.13 | 0.34 ± 0.13 | 0.39 ± 0.17 | 0.15 ± 0.15 | 0.24 ± 0.18 | 0.36 ± 0.04 | 0.38 ± 0.13 |
| Voxelmorph | 0.69 ± 0.12 | 0.42 ± 0.14 | 0.5 ± 0.11 | 0.33 ± 0.14 | 0.42 ± 0.17 | 0.62 ± 0.14 | 0.38 ± 0.13 | 0.25 ± 0.17 | ||||||||
| Proposed | ||||||||||||||||
| Concatenation | ||||||||||||||||
| Only reg. | 0.65 ± 0.15 | 0.34 ± 0.1 | 0.58 ± 0.11 | 0.48 ± 0.14 | 0.6 ± 0.056 | 0.46 ± 0.12 | 0.47 ± 0.12 | 0.38 ± 0.14 | 0.35 ± 0.15 | 0.54 ± 0.14 | 0.35 ± 0.13 | 0.4 ± 0.16 | 0.21 ± 0.17 | 0.27 ± 0.18 | 0.46 ± 0.051 | 0.44 ± 0.13 |
| w/o | 0.72 ± 0.13 | 0.42 ± 0.1 | 0.61 ± 0.11 | 0.51 ± 0.12 | 0.63 ± 0.056 | 0.47 ± 0.14 | 0.51 ± 0.12 | 0.37 ± 0.16 | 0.46 ± 0.17 | 0.31 ± 0.22 | 0.31 ± 0.19 | 0.48 ± 0.052 | 0.49 ± 0.13 | |||
| With | 0.7 ± 0.15 | 0.44 ± 0.12 | 0.6 ± 0.13 | 0.52 ± 0.14 | 0.66 ± 0.06 | 0.47 ± 0.14 | 0.52 ± 0.13 | 0.38 ± 0.16 | 0.42 ± 0.16 | 0.65 ± 0.14 | 0.4 ± 0.15 | 0.51 ± 0.19 | 0.3 ± 0.22 | 0.28 ± 0.2 | 0.49 ± 0.058 | 0.49 ± 0.14 |
| Subtraction | ||||||||||||||||
| Only reg. | 0.71 ± 0.13 | 0.41 ± 0.1 | 0.61 ± 0.12 | 0.53 ± 0.13 | 0.66 ± 0.058 | 0.47 ± 0.12 | 0.5 ± 0.11 | 0.37 ± 0.15 | 0.43 ± 0.14 | 0.63 ± 0.12 | 0.4 ± 0.13 | 0.47 ± 0.16 | 0.34 ± 0.22 | 0.29 ± 0.19 | 0.49 ± 0.054 | 0.49 ± 0.13 |
| w/o | 0.7 ± 0.13 | 0.41 ± 0.1 | 0.6 ± 0.11 | 0.52 ± 0.12 | 0.65 ± 0.057 | 0.43 ± 0.14 | 0.64 ± 0.13 | 0.41 ± 0.13 | 0.49 ± 0.17 | 0.3 ± 0.22 | 0.29 ± 0.18 | 0.48 ± 0.053 | 0.49 ± 0.13 | |||
| With | 0.4 ± 0.11 | 0.61 ± 0.11 | 0.53 ± 0.12 | 0.64 ± 0.058 | 0.47 ± 0.12 | 0.51 ± 0.11 | 0.38 ± 0.15 | 0.41 ± 0.15 | 0.63 ± 0.13 | 0.43 ± 0.13 | 0.44 ± 0.17 | 0.3 ± 0.22 | 0.48 ± 0.054 | 0.49 ± 0.13 |
The first two rows are baseline methods. The rest of the rows present the results of our proposed method evaluating the different variants and merging operators. The names of the columns represent various brain structures, namely: brain stem (BS), cerebrospinal fluid (CSF), 4th ventricle (4V), amygdala (Am), caudate (Ca), cerebellum cortex (CblmC), cerebellum white matter (CblmWM), cerebral cortex (CeblC), cerebral white matter (CeblWM), hippocampus (Hi), lateral ventricle (LV), pallidum (Pa), putamen (Pu), ventral DC (VDC), and 3rd ventricle (3V). Bold indicates best performance per column.
Figure 4Qualitative evaluation of the registration performance for the different evaluated methods, displayed on T1 modalities. For an easier visualization, we group left and right categories and only display the following nine classes: caudate (Ca), cerebellum cortex (CblmC), cerebellum white matter (CblmWM), cerebral cortex (CeblC), cerebral white matter (CeblWM), lateral ventricle (LV), pallidum (Pa), putamen (Pu), ventral DC (VDC).
The table presents the average distance between (i) the ratio of the area of the deformed tumor mask to the area of the original tumor mask, and (ii) the ratio of area of the reference brain volume to the area of the source brain volume.
| Dalca et al. ( | 2.27 ± 2.68 | 0.67 ± 0.55 | 1.96 ± 3.03 | 0.62 ± 0.51 |
| Proposed | ||||
| Concatenation only reg. | 0.51 ± 0.61 | 0.26 ± 0.19 | 0.71 ± 0.94 | 0.22 ± 0.15 |
| Concatenation w/o | 1.35 ± 1.14 | 0.64 ± 0.41 | 1.80 ± 1.82 | 0.64 ± 0.42 |
| Concatenation with | 0.26 ± 0.20 | 0.26 ± 0.13 | 0.30 ± 0.28 | 0.21 ± 0.12 |
| Subtraction only reg. | 1.34 ± 0.77 | 0.77 ± 0.59 | 2.02 ± 1.65 | 0.68 ± 0.52 |
| Subtraction w/o | 1.74 ± 1.35 | 0.72 ± 0.72 | 2.38 ± 1.74 | 0.74 ± 0.76 |
| Subtraction with |
Lower values are better. The average has been calculated over 200 testing pairs from the BraTS 2018 dataset (NCR/NET, ET and ED). On top of the evaluation per tumor class, we also conduct an evaluation by merging all the tumor classes into just one class (called combined). Bold indicates best performance per column.
Summary of the statistical difference between the Dalca et al. (2018) and the proposed method on the BraTS 2018 dataset for the tumor preservation task.
| Dalca et al. ( | <10−3 | <10−3 | <10−3 | <10−3 |
| Proposed | ||||
| Concatenation only reg. | <10−3 | 0.540 | <10−3 | 0.130 |
| Concatenation w/o | <10−3 | <10−3 | <10−3 | <10−3 |
| Concatenation with | 0.282 | 0.442 | 0.006 | 0.386 |
| Subtraction only reg. | <10−3 | <10−3 | <10−3 | <10−3 |
| Subtraction w/o | <10−3 | <10−3 | <10−3 | <10−3 |
| Subtraction with | 1.000 | 1.000 | 1.000 | 1.000 |
For each model (line) and each performance measure (column), the displayed value is the p-value (up to 3 significant figures) of the statistical significance between the model and subtraction with for the tumor preservation measure on the corresponding tumor class (NCR/NET, ET, ED, and their union in the column Combined). line represents the reference model, cells indicate no statistical significant p-values while cells represents statistical significant p-values.
Figure 5Qualitative evaluation of the tumor deformation of the different evaluated methods, displayed on T1 modalities. Each line is a sample, with source MRI in the first column to be registered on reference MRI in the second column. BraTS ground-truth annotations are plotted onto the source MRI. Seven models are benchmarked, one for each of the remaining columns which display the result of applying the predicted grid onto the source MRI. For each model and each line, the source ground-truth annotation masks of the source MRI were also registered with the predicted deformation grid, and the consequently obtained deformed ground-truth were plotted onto each deformed source MRI to illustrate the impact of all methods regarding the preservation of tumor extent.
Figure 6Comparison of the registration grid of the proposed model using the subtraction operation with and w/o . This figure is obtained by sampling three random pairs of test patients, and computing the predicted registration fields, which are displayed by line for the two models, and in consecutive columns, one for each of the three dimensions, showing the registration field as a warped grid (grayscale) and as a colored map obtained by computing its norm pixelwise (blue-green map). Furthermore, the contour of the Whole Tumor is plotted on top of each image, obtained from the ground truth segmentation.