| Literature DB >> 36104424 |
Alessandro Boaro1,2, Jakub R Kaczmarzyk3,4, Vasileios K Kavouridis5, Maya Harary5,6, Marco Mammi5, Hassan Dawood5,7, Alice Shea8, Elise Y Cho5, Parikshit Juvekar7, Thomas Noh7, Aakanksha Rana5,3, Satrajit Ghosh9, Omar Arnaout5,7.
Abstract
Accurate brain meningioma segmentation and volumetric assessment are critical for serial patient follow-up, surgical planning and monitoring response to treatment. Current gold standard of manual labeling is a time-consuming process, subject to inter-user variability. Fully-automated algorithms for meningioma segmentation have the potential to bring volumetric analysis into clinical and research workflows by increasing accuracy and efficiency, reducing inter-user variability and saving time. Previous research has focused solely on segmentation tasks without assessment of impact and usability of deep learning solutions in clinical practice. Herein, we demonstrate a three-dimensional convolutional neural network (3D-CNN) that performs expert-level, automated meningioma segmentation and volume estimation on MRI scans. A 3D-CNN was initially trained by segmenting entire brain volumes using a dataset of 10,099 healthy brain MRIs. Using transfer learning, the network was then specifically trained on meningioma segmentation using 806 expert-labeled MRIs. The final model achieved a median performance of 88.2% reaching the spectrum of current inter-expert variability (82.6-91.6%). We demonstrate in a simulated clinical scenario that a deep learning approach to meningioma segmentation is feasible, highly accurate and has the potential to improve current clinical practice.Entities:
Mesh:
Year: 2022 PMID: 36104424 PMCID: PMC9474556 DOI: 10.1038/s41598-022-19356-5
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
Algorithm performance and inter-expert variability.
| Pairs for comparison | > 2 cc tumors | All tumors | ||
|---|---|---|---|---|
| Mean dice score (%) | Median dice score (%) | Mean dice score (%) | Median dice score (%) | |
| Model/Ground | 87.7 | 89.6 | 84 | 88.2 |
| Model/Expert_1 | 86.9 | 88.7 | 82.7 | 86.4 |
| Model/Expert_2 | 87 | 90.2 | 83.6 | 88 |
| Model/Expert_3 | 85.2 | 89.3 | 80 | 85.1 |
| Ground/Expert_1 | 87.6 | 89.1 | 85.7 | 86.8 |
| Ground/Expert_2 | 91.2 | 92.5 | 89.9 | 91.6 |
| Ground/Expert_3 | 89.5 | 90.5 | 86 | 89.5 |
| Expert_1/Expert_2 | 86.9 | 88.9 | 84.6 | 86 |
| Expert_1/Expert_3 | 84.1 | 86.9 | 79.5 | 82.6 |
| Expert_2/Expert_3 | 90.1 | 92.7 | 87 | 89.7 |
Comparison of tumor segmentation performance (Dice scores) between algorithm output, ground truth and clinical experts (Expert_1(VK), Expert_2(TN), Expert_3(PJ)), expressed as mean and median for all tumors and tumors of volume > 2 cc.
Ground: Ground truth.
Figure 1Example of meningioma segmentation algorithm output. Sagittal and coronal views (a,b) of a brain MRI scan containing two distinct meningiomas, one located in the convexity at the midline, the other located on the anterior skull base. Display of expert label vs computer-generated segmentation respectively of the meningioma of the convexity (c and e) and the meningioma of the skull base (d,f). Display of the mismatch between the expert label and the computer-generated segmentation on the meningioma of the convexity (g) and the meningioma of the skull base (h).
Figure 2Algorithm’s tumor segmentation and volume estimation accuracy. (a) Scatter plot showing the algorithm’s segmentation performance expressed as Dice scores on the test set as a function of the tumor volume. The Dice score correlated with the size of the tumor, quickly reaching a mean of 0.87 and median of 0.89 for tumors > 2 cc. (b–d) Volume estimations by the algorithm, and the 2-D and 3-D traditional estimation techniques, respectively. The algorithm’s predicted volumes constitute almost perfect approximations of the real tumor volumes with a correlation of 0.98, p < 0.001 (b), whereas the 2D and 3D techniques present lower values of respectively 0.88 (c) and 0.96 (d) p < 0.001, with evidence of overestimation.
Manual versus automated tumor segmentation times.
| Mean (s) | SD (s) | Time reduction (%) | ||
|---|---|---|---|---|
| Manual 1 | 142.4 | 93.9 | 98.7 | < 0.001 |
| Manual 2 | 299.8 | 274.9 | 99.4 | < 0.001 |
| Automated | 1.88 | 0.001 | – | Ref |
| Manual 1 | 178.2 | 96.2 | 98.9 | < 0.001 |
| Manual 2 | 391.5 | 296.9 | 99.5 | < 0.001 |
| Automated | 1.88 | 0.001 | – | Ref |
| Manual 1 | 70.9 | 17.3 | 97.3 | < 0.001 |
| Manual 2 | 116.4 | 23.6 | 98.3 | < 0.001 |
| Automated | 1.88 | 0.001 | – | Ref |
Comparison between the average time needed by experts (Manual 1 and Manual 2) and our algorithm (Automated) to produce high-quality tumor segmentations. The algorithm saved 98% of time on average and the difference between the algorithm and each expert in each tumor group reached significance (p < 0.001).
s: seconds, ref: reference.