| Literature DB >> 35590389 |
Muhaddisa Barat Ali1, Irene Yu-Hua Gu2, Alice Lidemar3, Mitchel S Berger4, Georg Widhalm5, Asgeir Store Jakola3,6.
Abstract
BACKGROUND: For brain tumors, identifying the molecular subtypes from magnetic resonance imaging (MRI) is desirable, but remains a challenging task. Recent machine learning and deep learning (DL) approaches may help the classification/prediction of tumor subtypes through MRIs. However, most of these methods require annotated data with ground truth (GT) tumor areas manually drawn by medical experts. The manual annotation is a time consuming process with high demand on medical personnel. As an alternative automatic segmentation is often used. However, it does not guarantee the quality and could lead to improper or failed segmented boundaries due to differences in MRI acquisition parameters across imaging centers, as segmentation is an ill-defined problem. Analogous to visual object tracking and classification, this paper shifts the paradigm by training a classifier using tumor bounding box areas in MR images. The aim of our study is to see whether it is possible to replace GT tumor areas by tumor bounding box areas (e.g. ellipse shaped boxes) for classification without a significant drop in performance.Entities:
Keywords: 1p/19q codeletion; Brain tumor; Deep learning; Ellipse bounding box; IDH genotype
Year: 2022 PMID: 35590389 PMCID: PMC9118766 DOI: 10.1186/s42490-022-00061-3
Source DB: PubMed Journal: BMC Biomed Eng ISSN: 2524-4426
Fig. 1Multi-stream 2D convolutional neural network for glioma-subtype classification from [46]
Fig. 2The pipeline of the method based on proposed strategy. Blue dash box: Tumor areas separated by ellipse bounding box and manually drawn GT boundary. Orange arrow: Training phase. Blue arrow: Testing phase
Fig. 3Illustration of selection of ROIs with tight ellipse bounding box for a FLAIR-MRI from US dataset for all three directional views. The blue line defines the tumor area contour
Fig. 4An example of TCGA dataset from IDH mutation class. Separation of ROIs is shown in both ways (using ellipse bounding box and GT) on FLAIR modality. Left: Axial view. Right: Sagittal view
Summary of Two Datasets (a) Number of 3D scans in each datasets. (b) Description of data for two case studies
| US | 75 | 75 | - | - |
| TCGA | 167 | 167 | 167 | 167 |
| A | 1p/19q cod | 25/450 | 8/144 | 9/162 |
| 1p/19q non-cod | 20/360 | 6/108 | 7/126 | |
| B | IDH-mut | 33/594 | 11/198 | 11/198 |
| IDH-wt | 68/612 | 22/198 | 22/198 | |
*Excluded with augmented slice images
Fig. 5Training (green) and validation (red) curves on ellipse bounding box tumor data as a function of epochs for both case studies. Early stopping strategy was used, where blue dot points to the epoch whose parameters were used for test set. Left: For US dataset, the validation curve converged at epoch = 67. Right: For TCGA dataset, the validation curve converged at epoch = 76, after which the validation losses didn’t improve
Comparison of the average test results for diffuse glioma-subtypes using ellipse bounding box tumor data for 5 runs. The highest values obtained in each run are displayed in bold text. (a) Case-A for US dataset (1p/19q prediction). (b) Case-B for TCGA dataset (IDH genotype)
| Run | Dataset | Accuracy (%) | Precision (%) | Sensitivity(%) | Specificity(%) | F1-Score(%) |
|---|---|---|---|---|---|---|
| (a) Case-A: Prediction Result on Ellipse Bounding Tumor Areas | ||||||
| 1 | 65.97 | 70.00 | 69.14 | 61.90 | 69.57 | |
| 2 | US | 71.53 | 74.10 | 75.93 | 65.87 | 75.00 |
| 3 | (1p/19q Codel/ | 68.06 | 72.73 | 69.14 | 66.67 | 70.90 |
| 4 | Non-Codel) | 71.18 | 74.25 | 76.54 | 65.87 | 75.38 |
| 5 | ||||||
| Average(∣ | 69.86 (2.46) | 72.91(1.55) | 74.20(4.39) | 64.60 (1.92) | 73.51(2.76) | |
| (b) Case-B: Prediction Result on Ellipse Bounding Tumor Areas | ||||||
| 1 | 79.55 | 85.03 | 71.71 | 87.37 | 77.80 | |
| 2 | TCGA | 76.01 | 78.45 | 71.72 | 80.30 | 74.93 |
| 3 | (IDH mut/ | 80.30 | 86.23 | 72.73 | 87.88 | 78.91 |
| 4 | wild-type) | |||||
| 5 | 79.04 | 85.80 | 70.20 | 87.88 | 77.22 | |
| Average(∣ | 79.50(2.12) | 84.84(3.42) | 72.32(1.67) | 86.65(3.28) | 78.06 (2.13) | |
Fig. 6Summary of the evaluation metrics and comparison of prediction on ellipse bounding box data and GT data using multi-stream CNN scheme. Left: Case-A: Comparison on US dataset. Right: Case-B: Comparison on TCGA dataset
Performance difference on average prediction results (over 5 runs) by using GT tumor data and ellipse tumor bounding box data for training, where the standard deviation is included in (·) after each performance value
| Case Study | Tumor Area | Av. Acc.(∣ | Av. Sen.(∣ | Av. Spec.(∣ |
|---|---|---|---|---|
| A | Ellipse | 69.86(2.46) | 74.20(4.39) | 64.60(1.92) |
| GT | 72.78(1.45) | 76.05(1.63) | 68.57(1.78) | |
| Difference | 2.92(1.45) | 1.85(1.78) | 3.97(1.63) | |
| B | Ellipse | 79.50(2.12) | 86.65(3.28) | 72.32(1.67) |
| GT | 82.73(1.82) | 89.70(2.00) | 75.45(3.04) | |
| Difference | 3.23(0.3) | 3.05(1.28) | 3.13(1.37) |
Fig. 7Example images of zoom in brain tumor MRIs. Blue curves are the GT tumor boundaries manually drawn by medical experts, and red curves are the ellipse bounding boxes surrounding the tumors
Averaging tumor dice score calculated between medical experts’ marked GT tumor areas and ellipse tumor bounding box areas
| Case Study | Dataset | Av. Dice score (∣ |
|---|---|---|
| A | US | 0.8046 (0.0652) |
| B | TCGA | 0.8279 (0.0514) |