| Literature DB >> 32393280 |
Mark H F Savenije1,2, Matteo Maspero3,4, Gonda G Sikkes1, Jochem R N van der Voort van Zyp1, Alexis N T J Kotte1, Gijsbert H Bol1, Cornelis A T van den Berg1,2.
Abstract
BACKGROUND: Structure delineation is a necessary, yet time-consuming manual procedure in radiotherapy. Recently, convolutional neural networks have been proposed to speed-up and automatise this procedure, obtaining promising results. With the advent of magnetic resonance imaging (MRI)-guided radiotherapy, MR-based segmentation is becoming increasingly relevant. However, the majority of the studies investigated automatic contouring based on computed tomography (CT).Entities:
Keywords: Artificial intelligence; Contouring; Deep learning; Delineation; MR-only treatment planning; Magnetic resonance imaging; Prostate cancer; Radiotherapy; Segmentation
Mesh:
Year: 2020 PMID: 32393280 PMCID: PMC7216473 DOI: 10.1186/s13014-020-01528-0
Source DB: PubMed Journal: Radiat Oncol ISSN: 1748-717X Impact factor: 3.481
Fig. 1Transverse view of in-phase (IP), water (W) and fat (F) images for a patient (69 yo) diagnosed with T2b cancer. Note the large portion of void space surrounding the patient body. Cropping has been applied as preprocessing to remove such void regions
Image parameters of the sequences used for the OARs contouring. The term FOV refers to the field-of-view, while AP to anterior-posterior and LR to right-left
| 1.2/2.5/3.9 | |
| 10 | |
| 55.2x55.2x30 | |
| 324x324x120 | |
| 528x528x120 | |
| 1x1x2.5 | |
| 1072 | |
| AP | |
| RL | |
| 3D | |
| 2 min 17 s |
∗expressed in terms of anterior-posterior (AP), right-left (RL) and superior-inferior directions
Fig. 2Schematic of the study design representing the timeline and the number of patients included. Also, the length and the number of patients for the preliminary study, the training of the final model and the patients used for testing the clinical implementation are reported
Fig. 3Violin plots representing the mean (white dot), σ (black vertical rectangle), 95% percentile (black vertical line) and the probability distribution for the dice similarity coefficient (DSC, top) and 95% Hausdorff distance (HD95, middle) and surface distance (bottom) for the OARs against clinical contours in among the preliminary study. The statistical significance of the Wilcoxon signed-rank test is reported as well as the mean(±σ) of each metric. The asterisks represent p ≤0.05 (∗), p ≤0.01 (∗∗) and p ≤0.001 (∗∗∗)
Fig. 4Pie chart reporting the percentage of the qualitative scoring performed by the expert RTT for each auto-segmentation method
Comparison of performance between the preliminary study (PS) and after the clinical implementation (Clinic) for DeepMedic in terms of (volumetric) dice similarity coefficient (DSC), 95% Hausdorff distance (HD95) and mean surface distance (MSD)
| DSC | HD95 | MSD | ||||
|---|---|---|---|---|---|---|
| PS | Clinic | PS | Clinic | PS | Clinic | |
| [mm] | [mm] | [mm] | [mm] | |||
| 0.95 ±0.03 | 0.96 ±0.02 | 3.8 ±3.4 | 2.5 ±1.1 | 1.0 ±0.6 | 0.6 ±0.3 | |
| 0.85 ±0.07 | 0.88 ±0.05 | 8.3 ±5.0 | 7.4 ±4.4 | 2.1 ±1.1 | 1.7 ±0.8 | |
| 0.96 ±0.01 | 0.97 ±0.01 | 2.2 ±1.4 | 1.6 ±0.5 | 0.6 ±0.2 | 0.5 ±0.1 | |
| 0.96 ±0.01 | 0.97 ±0.01 | 1.9 ±0.4 | 1.5 ±0.6 | 0.6 ±0.1 | 0.5 ±0.1 | |
Fig. 5Example of in-phase MRI after cropping along with segmentations (OARs) obtained with DeepMedic (contours) versus clinical segmentations (filled contours) in the transverse (left), coronal (centre) and sagittal (right) view for a patient in the test. For this patient, average performance was obtained in terms of DSC: 0.96, 0.86, 0.97 and 0.97 for bladder, rectum, and femurs, respectively. Note that DeepMedic also outputs CTV, but it was not considered for clinical evaluation
Fig. 6Boxplots for each structure of surface Dice similarity coefficient (SDSC) as a function of threshold (τ) for the 53 patients after clinial implementation. The data is plotted for the range of τ from sub-pixel (0.5 mm) to above the voxel size (3 mm). Box plots are shown with an inter-quartile range from 25 to 75% with the horizontal line representing mean value. Upper and lower whisker represent the 2.5 and 97.5 percentiles
Overview of the performance of automatic OARs delineations based on MRI and CT subdivided in convolutional network-based and conventional approaches. The number of patients included in the study (Pts), the imaging modality, a brief description of the method and metrics as dice similarity coefficient (DSC), 95% boundary Hausdorff distance (HD95) and mean surface distance (MSD) were reported for each study. HD95 and MSD are expressed in mm
| DSC | DSC | DSC | DSC | ||||
|---|---|---|---|---|---|---|---|
| HD95 | HD95 | HD95 | HD95 | ||||
| MSD | MSD | MSD | MSD | ||||
| Men2017 [ | 218/60 ∗ | CT | 2D | 0.92 | 0.93 | 0.92 | |
| dilated | |||||||
| VGG-16 | |||||||
| Feng2018 [ | 30/10 ∗ | MRI | Multi-task | 0.952 ±0.007 | 0.88 ±0.03 | ||
| residual | |||||||
| 2D FCN | |||||||
| Kazemifar2018 [ | 51/9/20 ∗ | CT | 2D | 0.95 ±0.04 | |||
| U-net | |||||||
| 1.1 ±0.8 | |||||||
| Balagopal2018 [ | 108/28 | CT | 2D U-net | 0.95 ±0.02 | 0.84 ±0.04 | 0.96 ±0.03 | 0.95 ±0.01 |
| mean | + 3D U-net | 17.0 ±14.6 | 4.9 ±3.9 | ||||
| 4 models | (ResNeXT) | 0.5 ±0.7 | 0.8 ±0.7 | ||||
| Dong2019 [ | 140x5 + | MRI | 3D Cycle-GAN | 0.95 ±0.03 | 0.89 ±0.04 | ||
| + deep attention | 6.81 ±9.25 | 10.84 ±15.59 | |||||
| U-net | 0.92 ±1.03 | ||||||
| Elguindi2019 [ | 40/10/50 | MRI | 0.93 ±0.04 | 0.82 ±0.05 | |||
| DeepLabV3+ | |||||||
| 0.92 ±0.1 | 0.87 ±0.07 | ||||||
| 97/53 ∗ | MRI | 3D | 0.88 ±0.05 | ||||
| multi-scale | 2.5 ±1.1 | 7.4 ±4.4 | |||||
| DeepMedic | 1.7 ±0.8 | ||||||
| 0.98 ±0.03 | 0.92 ±0.05 | ||||||
| LaMacchia2012 [ | 5 | CT | ABAS 2.0 | 0.93 ±0.03 | 0.77 ±0.07 | 0.94 ±0.04 | 0.94 ±0.04 |
| VelocityAI 2.6.2 | 0.72 ±0.15 | 0.75 ±0.04 | 0.92 ±0.02 | 0.92 ±0.03 | |||
| MIM 5.1.1 | 0.93 ±0.02 | 0.87 ±0.05 | 0.94 ±0.02 | 0.94 ±0.01 | |||
| Dowling2015 [ | 39 | MRI | multi-atlas | 0.86 ±0.12 | 0.84 ±0.06 | 0.91 ±0.03 | |
| voting | |||||||
| diffeomorphic reg | 5.1 ±4.6 | 2.4 ±1.0 | 1.5 ±0.5 | ||||
| Delpon2016 [ | 10/10 ∗ | CT | Mirada | 0.76 ±0.12 | 0.73 ±0.07 | 0.89 ±0.05 | 0.91 ±0.03 |
| 15 ±9 | 10 ±3 | 0.2 ±6.4 | 8.1 ±5.6 | ||||
| MIM | 0.80 ±0.14 | 0.75 ±0.07 | 0.89 ±0.08 | 0.92 ±0.02 | |||
| 14.0 ±6.3 | 9.9 ±3.4 | 9.9 ±7.9 | 8.2 ±5.3 | ||||
| ABAS | 0.81 ±0.13 | 0.75 ±0.09 | 0.91 ±0.04 | 0.92 ±0.02 | |||
| 13.6 ±7.9 | 9.9 ±4.4 | 8.6 ±6.9 | 8.5 ±6.1 | ||||
| SPICE | 0.76 ±0.26 | 0.68 ±0.12 | 0.70 ±0.05 | 0.72 ±0.03 | |||
| 9.2 ±11.7 | 13 ±5 | 29.7 ±9.0 | 30 ±6.5 | ||||
| Raystation | 0.59 ±0.15 | 0.49 ±0.12 | 0.91 ±0.03 | 0.92 ±0.02 | |||
| 28.5 ±13.1 | 16.5 ±3.7 | 8.8 ±7.2 | 6.4 ±5.0 | ||||
∗ training/(validation)/test; + indicating x... cross-fold validation; mean surface Hausdorff distance; surface dice similarity coefficient as in [48] with τ=3 or 2 mm, respectively