| Literature DB >> 35912214 |
Simon A Keek1, Manon Beuque1, Sergey Primakov1, Henry C Woodruff1,2, Avishek Chatterjee1, Janita E van Timmeren3, Martin Vallières4,5, Lizza E L Hendriks6, Johannes Kraft3,7, Nicolaus Andratschke3, Steve E Braunstein8, Olivier Morin8, Philippe Lambin1,2.
Abstract
Introduction: There is a cumulative risk of 20-40% of developing brain metastases (BM) in solid cancers. Stereotactic radiotherapy (SRT) enables the application of high focal doses of radiation to a volume and is often used for BM treatment. However, SRT can cause adverse radiation effects (ARE), such as radiation necrosis, which sometimes cause irreversible damage to the brain. It is therefore of clinical interest to identify patients at a high risk of developing ARE. We hypothesized that models trained with radiomics features, deep learning (DL) features, and patient characteristics or their combination can predict ARE risk in patients with BM before SRT.Entities:
Keywords: MRI; adverse radiation effects; brain metastases (BMs); deep learning - artificial neural network; radiation necrosis (RN); radiomics
Year: 2022 PMID: 35912214 PMCID: PMC9326101 DOI: 10.3389/fonc.2022.920393
Source DB: PubMed Journal: Front Oncol ISSN: 2234-943X Impact factor: 5.738
Figure 1T1-weighted gadolinium-enhanced MRIs of the brain. Delineated in red (A) is a lesion that developed adverse radiation effects after stereotactic radiotherapy and (B) a lesion that did not develop adverse radiation effects after stereotactic radiotherapy.
Figure 2Pre-processing strategies for the “minimalist”, “standardization”, and “harmonization” approaches.
Figure 3Example of pre-processing strategy: deep learning on the “minimalist” approach. The different steps of preprocessing were (A) z-score normalization, (B) shift to positive values only, (C) pixel attenuations with Gaussian smoothing filtering, (D) cropping around the largest bounding box and background set to 0, (E) resizing at 256 × 256, and (F) rescaling the pixel value range to 0–255.
Figure 4General workflow of the model training process: first, the MRI data was pre-processed using 3 pre-processing methods, the most suitable pre-processed set of images was selected according to the radiomics-based model or the DL model performance on the internal test dataset, then the models were ensembled or trained separately, and finally the performance of each model was computed on the external dataset.
Patient characteristics of University of California—San Francisco (UCSF) and University Hospital Zurich (USZ) datasets.
| Patient/tumor characteristic | Total UCSF data | USZ data |
| |
|---|---|---|---|---|
|
|
| |||
| Sex (%) | Male | 571 (41) | 128 (54) | <0.01 |
| Female | 833 (59) | 109 (46) | ||
| Median age ± SD | 59 (13) | 62 (12) | 0.03 | |
| KPS (%) | 80–100 | 1,053 (75) | 198 (83) | <0.01 |
| 40–80 | 351 (25) | 37 (16) | <0.01 | |
| 10–40 | 0 (0) | 2 (1) | – | |
| Primary tumor location (%) | Lung | 530 (38) | 136 (58) | <0.01 |
| Breast | 357 (25) | 27 (11) | <0.01 | |
| Melanoma | 272 (19) | 74 (31) | <0.01 | |
| Kidney | 91 (7) | 0 (0) | – | |
| Gastrointestinal | 57 (4) | 0 (0) | – | |
| Gynecologic | 27 (2) | 0 (0) | – | |
| Sarcoma | 20 (1) | 0 (0) | – | |
| Other | 50 (4) | 0 (0) | – | |
| Histology primary tumor (%) | Adenocarcinoma | 802 (57) | 124 (52) | 0.17 |
| Melanoma | 272 (19) | 74 (31) | <0.01 | |
| Renal cell carcinoma | 88 (6) | 0 (0) | – | |
| Small cell carcinoma | 44 (3) | 0 (0) | – | |
| Squamous cell carcinoma | 40 (3) | 10 (4) | 0.26 | |
| Sarcoma | 18 (1) | 0 (0) | – | |
| Large cell carcinoma | 9 (0.6) | 2 (1) | 0.72 | |
| Bone carcinoma | 8 (0.6) | 0 (0) | – | |
| Adeno squamous carcinoma | 6 (0.4) | 0 (0) | – | |
| Broncho alveolar cell carcinoma | 5 (0.4) | 0 (0) | – | |
| Germ cell carcinoma | 2 (0.1) | 0 (0) | – | |
| Lymphoma | 1 (0.1) | 0 (0) | – | |
| Other/NOS | 109 (8) | 27 (11) | 0.06 | |
| Primary controlled | 974 (70) | 149 (63) | 0.05 | |
| ECM present | 1,097 (78) | 190 (80) | 0.48 | |
| Number of lesions per patient at treatment | Median ± SD | 3 (7) | 2 (3) | <0.01 |
| Symptoms | Headaches | 437 (31) | 31 (13) | <0.01 |
| Hypertension | 407 (29) | 0 (0) | < 0.01 | |
| Seizures | 134 (10) | 16 (7) | 0.17 | |
| Diabetes | 98 (7) | 13 (6) | 0.4 | |
| CTD | 21 (2) | 2 (1) | 0.43 | |
| Number of lesions in total | 7,974 | 646 | – | |
| Number of ARE cases (% of total lesions) | 217 (2.7) | 20 (3.1) | 0.61 | |
| Number of patients with ARE (% of total patients) | 155 (11) | 19 (8) | 0.16 | |
| Prescription dose ± SD (Gy) | 18.5 (1.5) | 20 (5.0) | – | |
The P-value of two-proportion z-test or unpaired two-sample t-test for significant differences between datasets was reported for each characteristic if applicable.
SD, standard deviation; KPS, Karnofsky performance score (80–100, good performance; 50–70, medium performance; and 10–40 bad performance); ECM, extracranial metastasis; BM, brain metastasis; CTD, connective tissue disorder; ARE, adverse radiation effect; Gy, gray.
Figure 5Comparison of predictive performance through receiver operating characteristic curves for (A) radiomics-based machine learning and (B) deep learning models using three different pre-processed image datasets. The shaded areas represent the 95% confidence intervals of the corresponding receiver operating characteristic curves.
Figure 6Receiver operating characteristic curves of the training, testing, and external validation datasets for the different model combinations. The shaded areas represent the 95% confidence intervals of the corresponding receiver operating characteristic curves.
Area under the curve (AUC), balanced accuracy, precision, recall, and F1 metrics with CI on the external validation on patient and lesion levels.
| Per-lesion classification | Per-patient classification | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Approaches | AUC | Balanced accuracy | Precision | Recall | F1 score | Approaches | AUC | Balanced accuracy | Precision | Recall | F1 score |
| Best deep learning model | 0.64 CI (0.50, 0.76) | 0.57 CI (0.48, 0.64) | 0.04 CI (0.02, 0.05) | 0.85 CI (0.67, 1.00) | 0.07 CI (0.04, 0.10) | Best deep learning model | 0.70 CI (0.56, 0.83) | 0.63 CI (0.52, 0.73) | 0.17 CI (0.09, 0.25) | 0.60 CI (0.39, 0.78) | 0.26 CI (0.16, 0.37) |
| Best radiomics model | 0.73 CI (0.63, 0.83) | 0.62 CI (0.51, 0.74) | 0.07 CI (0.03, 0.11) | 0.45 CI (0.23, 0.67) | 0.12 CI (0.05, 0.19) | Best radiomics model | 0.72 CI (0.60, 0.83) | 0.59 CI (0.51, 0.69) | 0.40 CI (0.09, 0.75) | 0.21 CI (0.05, 0.43) | 0.28 CI (0.07, 0.48) |
| Radiomics and DL | 0.71 CI (0.60, 0.82) | 0.67 CI (0.56, 0.76) | 0.05 CI (0.03, 0.08) | 0.80 CI (0.62, 0.96) | 0.10 CI (0.06, 0.14) | Radiomics and DL | 0.71 CI (0.57, 0.83) | 0.66 CI (0.54, 0.77) | 0.14 CI (0.07, 0.22) | 0.63 CI (0.40, 0.84) | 0.23 CI (0.13, 0.34) |
| Radiomics and patient characteristics | 0.70 CI (0.57, 0.80) | 0.62 CI (0.51, 0.74) | 0.06 CI (0.03, 0.10) | 0.50 CI (0.28, 0.73) | 0.11 CI (0.05, 0.17) | Radiomics and patient characteristics | 0.71 CI (0.59, 0.81) | 0.57 CI (0.48, 0.68) | 0.16 CI (0.04, 0.30) | 0.26 CI (0.08, 0.47) | 0.20 CI (0.05, 0.35) |
| Radiomics, DL, and patient characteristics | 0.69 CI (0.56, 0.81) | 0.64 CI (0.53, 0.74) | 0.05 CI (0.03, 0.08) | 0.70 CI (0.48, 0.89) | 0.09 CI (0.05, 0.14) | Radiomics, DL, and patient characteristics | 0.72 CI (0.58, 0.84) | 0.65 CI (0.55, 0.74) | 0.12 CI (0.07, 0.17) | 0.84 CI (0.65, 1.00) | 0.21 CI (0.13, 0.29) |
| Agreed labels | 0.67 CI (0.53, 0.81) | 0.65 CI (0.53, 0.73) | 0.07 CI (0.03, 0.12) | 0.90 CI (0.67, 1.00) | 0.13 CI (0.06, 0.21) | Agreed labels | NA | NA | NA | NA | NA |