| Literature DB >> 36105388 |
Irada Pflüger1, Tassilo Wald2, Fabian Isensee2, Marianne Schell1, Hagen Meredig1, Kai Schlamp3, Denise Bernhardt4, Gianluca Brugnara1, Claus Peter Heußel3,5, Juergen Debus6,7,8,9, Wolfgang Wick10,11, Martin Bendszus1, Klaus H Maier-Hein2, Philipp Vollmuth1.
Abstract
Background: Reliable detection and precise volumetric quantification of brain metastases (BM) on MRI are essential for guiding treatment decisions. Here we evaluate the potential of artificial neural networks (ANN) for automated detection and quantification of BM.Entities:
Keywords: artificial intelligence; artificial neural network; brain metastasis; magnetic resonance imaging; neuro-Oncology
Year: 2022 PMID: 36105388 PMCID: PMC9466273 DOI: 10.1093/noajnl/vdac138
Source DB: PubMed Journal: Neurooncol Adv ISSN: 2632-2498
Characteristics of the Patients Included in This Study
| Institutional dataset | External test dataset |
| ||
|---|---|---|---|---|
| Training set | Test set | |||
| Patient [ | 246 | 62 | 30 | – |
| Gender [ | .530 | |||
| Female | 134 (54.5) | 29 (46.7) | 15 (50) | |
| Male | 112 (45.5) | 33 (53.3) | 15 (50) | |
| Mean age [years (± SD)] | 61 (± 11) | 61 (± 12) | 58 (± 12) | .454 |
| No. of metastases (total) | 1682 | 384 | 155 | – |
| Mean no. of metastases per patient (± SD) | 7 (± 15) | 6 (± 11) | 5 (± 8) | .986 |
| Case-wise volumes | ||||
| CE-Lesion | .007 | |||
| Mean CE-lesion volume (± SD) | 8.47 cm3 (± 12.11) | 7.81 cm3 (± 9.77) | 5.31 cm3 (± 12.33) | |
| Median CE-lesion volume (IQR) | 3.91 cm3 (9.4) | 5.31 cm3 (8.63) | 0.63 cm3 (4.38) | |
| NEE-Lesion | 0.014 | |||
| Mean NEE-lesion volume (± SD) | 58.61 cm3 (± 55.54) | 62.1 cm3 (± 52.65) | 36.81cm3 (± 52.69) | |
| Median NEE-lesion volume (IQR) | 42 cm3 (78.69) | 49.22 cm3 (59.82) | 10.10 cm3 (67.25) | |
| Lesion-wise volumes | .141 | |||
| Mean CE-lesion volume (± SD) | 1.23 cm3 (± 4.59) | 1.24 cm3 (± 4.46) | 1.03 cm3 (± 5.17) | |
| Median CE-lesion volume (IQR) | 0.07 cm3 (0.33) | 0.05 cm3 (0.29) | 0.08 cm3 (0.33) | |
| Primary cancer [( | .256 | |||
| Lung | 97 (39.4) | 27 (43.5) | 30 (100) | |
| Breast | 59 (24) | 9 (14.5) | – | |
| Gastrointestinal | 17 (6.9) | 5 (8.1) | – | |
| Cancer of unknown primary origin | 15 (6.1) | 3 (4.8) | – | |
| Kidney | 12 (4.9) | 4 (6.5) | – | |
| Malignant melanoma | 10 (4.1) | 8 (12.9) | – | |
| Soft-tissue sarcoma | 4 (1.6) | 1 (1.6) | – | |
| Multiple primary tumors | 4 (1.6) | – | – | |
| Prostate | 3 (1.2) | – | – | |
| Others | 25 (10.2) | 5 (8.1) | – | |
| MRI sequence [ | – | |||
| T1-w | ||||
| 3D acquisition | 212 (86.2) | 54 (87.1) | 30 (100) | |
| 2D acquisition | 34 (13.8) | 8 (12.9) | – | |
| cT1-w | 246 (100) | 62 (100) | 30 (100) | |
| FLAIR | 246 (100) | 62 (100) | 30 (100) | |
| MR vendors (field strength) [ | – | |||
| Siemens (1.5 T) | – | 1 (1.6) | 30 (100) | |
| Siemens (3.0 T) | 246 (100) | 61 (98.4) | – | |
SD, standard deviation; IQR, inter-quartile range; T, Tesla; CE, contrast-enhancing tumors; NEE, non-enhancing FLAIR signal abnormality/edema.
Group differences were evaluated with chi-square test for categorical and Kruskal–Wallis test or t test (depending on the distribution) for continuous parameters.
Case-wise Segmentation Quality for Contrast-enhancing Tumors (CE) and Non-enhancing FLAIR Signal Abnormality/edema (NEE) and Lesion-wise Segmentation and Detection Quality for CE Lesions for the Institutional Training Set, Institutional Test Set and External Test Set
| Institutional dataset | External dataset | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| Training set | Test set | Test set | |||||||
| Full-model | “Slim”-model |
| Full-model | “Slim”-model |
| Full-model | “Slim”-model |
| |
| Case-wise volumes | |||||||||
| CE-Lesion | |||||||||
| C-DICE (IQR) | 0.90 (0.79–0.93) | 0.87 (0.73–0.92) | < .001 | 0.90 (0.85–0.94) | 0.89 (0.81–0.93) | < .001 | 0.84 (0.76–0.89) | 0.83 (0.70–0.89) | .07 |
| C-Sensitivity (IQR) | 0.91 (0.82–0.95) | 0.89 (0.78–0.94) | < .001 | 0.91 (0.82–0.95) | 0.89 (0.80–0.95) | .003 | 0.91 (0.83–0.96) | 0.89 (0.74–0.93) | < .001 |
| NEE-Lesion | |||||||||
| C-DICE (IQR) | 0.95 (0.88–0.97) | 0.95 (0.88–0.97) | .03 | 0.96 (0.92–0.97) | 0.96 (0.92–0.97) | .2 | 0.85 (0.72–0.91) | 0.86 (0.72–0.91) | .01 |
| C-Sensitivity (IQR) | 0.95 (0.87–0.98) | 0.95 (0.87–0.98) | .57 | 0.95 (0.91–0.98) | 0.95 (0.91–0.98) | .5 | 0.91 (0.83–0.97) | 0.92 (0.83–0.97) | .26 |
| Lesion-wise volumes | |||||||||
| CE-Lesion | |||||||||
| L-DICE (IQR) | 0.72 (0.56–0.90) | 0.72 (0.53–0.88) | < .001 | 0.78 (0.60–0.91) | 0.71 (0.53–0.88) | .0015 | 0.79 (0.67–0.82) | 0.77 (0.59–0.81) | .51 |
| L-Sensitivity (IQR) | 0.77 (0.57–0.92) | 0.76 (0.56–0.91) | .003 | 0.81 (0.63–0.92) | 0.72 (0.54–0.91) | .03 | 0.85 (0.76–0.94) | 0.81 (0.65–0.91) | .02 |
| L-PPV (IQR) | 0.82 (0.65–0.93) | 0.80 (0.64–0.92) | .01 | 0.79 (0.63–0.93) | 0.79 (0.65–0.91) | .35 | 0.76 (0.68–0.88) | 0.79 (0.68–0.90) | .03 |
| F1-Score (IQR) | 0.94 (0.75–1.0) | 0.92 (0.75–1.0) | .02 | 0.93 (0.80–1.0) | 0.96 (0.68–1.0) | .85 | 1.0 (0.89–1.0) | 1.0 (0.81–1.0) | .4 |
| Mean F1-Score (SD) | 0.86 (± 0.19) | 0.83 (± 0.23) | .02 | 0.83 (± 0.24) | 0.83 (± 0.24) | .85 | 0.90 (± 0.19) | 0.89 (± 0.17) | .4 |
CE, contrast-enhancing tumors; NEE, non-enhancing FLAIR signal abnormality/edema; PPV, positive predictive value.
The “Slim”-Model refers to a configuration where the HD-BM is trained only with T1-weighted images after gadolinium contrast agent and FLAIR images and is discussed in detail in the supplement. All values but one are median with respective inter-quartile ranges (IQR) except one mean F1-Score with standard deviation (SD). Comparison of respective datasets on the basis of the necessary input sequences: full-model (T1-weighted images before and after gadolinium contrast agent, FLAIR images and T1-subtraction map) and “Slim”-model (T1-weighted images after gadolinium contrast agent and FLAIR images). Group differences were evaluated with Wilcoxon test.
Figure 1.Segmentation (left column) and detection (right column) agreement between the ground-truth segmentation mask generated by the radiologist and the automatically generated segmentation masks for contrast-enhancing (CE) tumor on a per lesion level (upper row) and a per case level (lower row) within each dataset using violin charts and superimposed box plots. The colors represent each data set.
Figure 2.Example of true positive (A), false negative (B) and false positive (C) findings in the institutional/external dataset. Three example MRI studies with axial T1-weighted postcontrast images. (A) HD-BM (orange) shows accurate detection of BM (white arrow) in the right precentral gyrus comparable to the ground-truth (GT) segmentation (green). (B) Missed BM (green) were mostly small or associated with subtle contrast enhancement as shown here in the right parietal lobe (white arrow). (C) False positive findings (orange) were predominantly associated with vascular changes (white arrow; capillary telangiectasia).
Figure 3.Example of an MRI study with axial FLAIR and T1-weighted post-contrast images of a 71-year-old male patient with malignant melanoma and multiple BM in the institutional dataset (B, arrows and green) and perifocal edema (A, arrowhead). Our HD-BM algorithm detects the perifocal edema accurately (C, yellow) compared to the ground-truth segmentation (D, red).
Figure 4.Volumetric agreement between the ground-truth segmentation mask generated by the radiologist and the automatically generated segmentation masks for contrast-enhancing (CE) tumor (A) and non-enhancing FLAIR signal abnormality/edema (NEE) (B).