Literature DB >> 33963236

Comparison of radiomic feature aggregation methods for patients with multiple tumors.

Enoch Chang1, Marina Z Joel1, Hannah Y Chang2, Justin Du3, Omaditya Khanna4, Antonio Omuro5, Veronica Chiang6, Sanjay Aneja7,8,9.   

Abstract

Radiomic feature analysis has been shown to be effective at analyzing diagnostic images to model cancer outcomes. It has not yet been established how to best combine radiomic features in cancer patients with multifocal tumors. As the number of patients with multifocal metastatic cancer continues to rise, there is a need for improving personalized patient-level prognosis to better inform treatment. We compared six mathematical methods of combining radiomic features of 3,596 tumors in 831 patients with multiple brain metastases and evaluated the performance of these aggregation methods using three survival models: a standard Cox proportional hazards model, a Cox proportional hazards model with LASSO regression, and a random survival forest. Across all three survival models, the weighted average of the largest three metastases had the highest concordance index (95% confidence interval) of 0.627 (0.595-0.661) for the Cox proportional hazards model, 0.628 (0.591-0.666) for the Cox proportional hazards model with LASSO regression, and 0.652 (0.565-0.727) for the random survival forest model. This finding was consistent when evaluating patients with different numbers of brain metastases and different tumor volumes. Radiomic features can be effectively combined to estimate patient-level outcomes in patients with multifocal brain metastases. Future studies are needed to confirm that the volume-weighted average of the largest three tumors is an effective method for combining radiomic features across other imaging modalities and tumor types.

Entities:  

Year:  2021        PMID: 33963236      PMCID: PMC8105371          DOI: 10.1038/s41598-021-89114-6

Source DB:  PubMed          Journal:  Sci Rep        ISSN: 2045-2322            Impact factor:   4.379


Introduction

Recent advances in machine learning and diagnostic imaging have increased enthusiasm surrounding the use of quantitative imaging to model clinical outcomes. One quantitative imaging technique that holds particular promise in modeling cancer outcomes is radiomic feature analysis. Radiomic features are quantitative metrics of size, shape, intensity, and texture that are extracted from medical images using high-throughput computational mining techniques[1,2]. These features represent unique radiographic signatures that have been shown to reveal prognostic insights about underlying gene-expression patterns[3] and treatment response[4] that may not be visible to the human eye. Although radiomic features have shown the ability to risk stratify cancer patients[5,6], radiomic analysis has largely been limited to the evaluation of individual tumor volumes. There is no established methodology regarding the best way to combine radiomic features for patients with multifocal sites of disease to establish a patient-level correlate of clinical outcomes. Within oncology, estimating patient outcomes among patients with multiple sites of metastatic disease is of particular interest as it influences multi-disciplinary treatment paradigms. Prognostication for patients with metastatic disease is an area of increasing need, given as many as 49% of lung cancer patients within the United States are diagnosed with metastatic disease upon initial presentation[7]. More importantly, nearly 75% of patients with metastatic disease have greater than 5 lesions at diagnosis[8]. Even among non-metastatic cancer patients, as many as 17% of patients have multi-focal primary tumors[9]. Brain metastases represent a significant proportion of metastatic cancer patients, and conservatively, at least 50% of patients diagnosed with metastatic brain disease have multiple brain lesions[10]. With improved MRI brain imaging, it is likely that the number of patients found with multiple metastases is even higher[11]. In this study, we attempted to identify optimal techniques to combine radiomic features for patients with multiple brain metastases to model patient-level outcomes. We compared various methods of combining radiomic features of individual tumors in patients with multiple brain metastases and evaluated the performance of different radiomic aggregation methods in estimating survival in patients using several different survival models.

Results

Patient demographics

A total of 831 patients with 3596 brain metastases were included. Patient demographics are presented in Table 1. Median overall survival was 12 months, median follow-up was 36 months, and 257 patients were right-censored. Median age was 63 years, and the most common primary malignancies were NSCLC (41.0%), melanoma (17.3%), breast (13.1%), SCLC (6.6%), renal (5.9%), and GI (5.3%). 543 patients (65.3%) had < 5 metastases, 214 (25.8%) had 5–10 metastases, and 74 (8.9%) had 11 + metastases.
Table 1

Patient demographics.

Age
< 50121 (14.6%)
50–59199 (23.9%)
60–69282 (33.9%)
> 70229 (27.6%)
Primary cancer site
NSCLC341 (41.0%)
Melanoma144 (17.3%)
Breast109 (13.1%)
SCLC55 (6.6%)
Renal49 (5.9%)
GI44 (5.3%)
Other89 (10.7%)
Number of metastases
< 5543 (65.3%)
5–10214 (25.8%)
11 + 74 (8.9%)
Karnofsky performance status*
100216 (29.1%)
90161 (21.7%)
80160 (21.6%)
70102 (13.8%)
6085 (11.5%)
5013 (1.8%)
≤404 (0.5%)
Presence of extracranial metastases
Yes608 (73.2%)
No223 (26.8%)

* If available.

Patient demographics. * If available.

Overall results

Table 2 presents a comparison of various aggregation methods across several survival models.
Table 2

Comparison of aggregation methods.

ModelC-Index (95% CI)
Cox proportional hazards
Unweighted average0.610 (0.570–0.646)
Weighted average0.604 (0.571–0.641)
Weighted average of largest 3 metastases0.627 (0.595–0.661)
Largest + number of metastases0.612 (0.579–0.649)
Largest metastasis0.598 (0.559–0.636)
Smallest metastasis0.595 (0.567–0.631)
Cox proportional hazards with LASSO regression
Unweighted average0.612 (0.585–0.647)
Weighted average0.603 (0.573–0.640)
Weighted average of largest 3 metastases0.628 (0.591–0.666)
Largest + number of metastases0.597 (0.560–0.632)
Largest metastasis0.596 (0.562–0.630)
Smallest metastasis0.597 (0.557–0.630)
Random survival forest
Unweighted average0.649 (0.548–0.709)
Weighted average0.641 (0.567–0.729)
Weighted average of largest 3 metastases0.652 (0.565–0.727)
Largest + Number of metastases0.622 (0.542–0.706)
Largest metastasis0.627 (0.544–0.694)
Smallest metastasis0.621 (0.529–0.709)
Comparison of aggregation methods. For the standard Cox proportional hazards model, the weighted average of the largest three metastases had the highest C-index (95% confidence interval) of 0.627 (0.595–0.661). The weighted average of the largest three metastases also had the highest C-index of all the aggregation methods on the Cox proportional hazards model with LASSO regression with a C-index of 0.628 (0.591–0.666) and on the random survival forest with a C-index of 0.652 (0.565–0.727).

Sub-analysis: number of metastases

Table 3 presents model performance of sub-analyses based on number of brain metastases. For patients with < 5 metastases, the weighted average of their 3 largest metastases had the highest C-index on the standard Cox proportional hazards model with a C-index of 0.640 (0.600–0.686). For patients with 5–10 metastases, the unweighted average of all their metastases had the highest C-index of 0.697 (0.638–0.762). For patients with 11 + metastases, the model including only data from the largest metastasis as well as the number of metastases performed with the highest C-index of 0.909 (0.803–0.993).
Table 3

Sub-Analysis: Number of Metastases.

ModelC-Index (95% CI)
< 5 Metastases (n = 543)
Unweighted average0.621 (0.583–0.661)
Weighted average0.617 (0.570–0.651)
Weighted average of largest 3 metastases0.640 (0.600–0.686)
Largest + number of metastases0.639 (0.593–0.676)
Largest metastasis0.619 (0.580–0.653)
Smallest metastasis0.612 (0.566–0.655)
5–10 Metastases (n = 214)
Unweighted average0.697 (0.638–0.762)
Weighted average0.682 (0.617–0.743)
Weighted average of largest 3 metastases0.688 (0.635–0.749)
Largest + number of metastases0.691 (0.634–0.750)
Largest metastasis0.688 (0.623–0.740)
Smallest metastasis***
11 + metastases (n = 74)
Unweighted average0.876 (0.776–0.964)
Weighted average0.872 (0.771–0.965)
Weighted average of largest 3 metastases0.880 (0.787–0.964)
Largest + number of metastases0.909 (0.803–0.993)
Largest metastasis0.894 (0.765–0.974)
Smallest metastasis***

*** Collinear.

Sub-Analysis: Number of Metastases. *** Collinear.

Sub-analysis: volume of largest metastasis

Table 4 describes model performance of sub-analyses based on volume of the largest metastasis. Across all sub-groups, the weighted average of the largest three metastases had the highest C-indices of 0.701 (0.652–0.748), 0.687 (0.622–0.728), 0.679 (0.623–0.733) for volumes < 0.200 cc, 0.200–0.700 cc, and > 0.700 + cc, respectively.
Table 4

Sub-analysis: volume of largest metastasis.

ModelC-Index (95% CI)
Largest metastasis < 0.200 cc (n = 261)
Unweighted average0.674 (0.630–0.729)
Weighted average0.694 (0.639–0.749)
Weighted average of largest 30.701 (0.652–0.748)
Largest + number of metastases0.698 (0.644–0.767)
Largest0.695 (0.634–0.736)
Smallest***
Largest metastasis 0.200–0.700 cc (n = 281)
Unweighted Average0.663 (0.615–0.719)
Weighted average0.675 (0.622–0.731)
Weighted average of largest 30.687 (0.622–0.728)
Largest + number of metastases0.668 (0.621–0.727)
Largest0.656 (0.611–0.717)
Smallest***
Top 20 features largest metastasis 0.700 + cc (n = 289)
Unweighted average0.671 (0.616–0.717)
Weighted average0.656 (0.602–0.718)
Weighted average of largest 30.679 (0.623–0.733)
Largest + number of metastases0.675 (0.614–0.715)
Largest0.666 (0.615–0.714)
Smallest***

***Collinear.

Sub-analysis: volume of largest metastasis. ***Collinear.

Sensitivity analysis

Model performance was not improved when using the weighted average of the largest 2 and largest 4 metastases across survival models (Supplement 3). Model performance was lower when using PCA rather than mRMR for feature selection (Supplement 4). The Cox proportional hazards models with LASSO regression and random survival forest models using all 851 radiomic features did not have higher c-indices compared to models with prior feature selection – notably, the models using the weighted average of the largest three metastases continued to perform best (Supplement 5).

Discussion

The emerging field of radiomic feature analysis has shown particular promise in cancer research. However, traditional radiomic feature analysis has had limited utility for patients with metastatic or multifocal disease because there is a paucity of established methods which aggregate radiomic features across multiple tumors to establish patient-level outcome estimates. Given the majority of cancer patients with brain metastases have multifocal disease, they represent an optimal patient population to study this question. We compared different methods of aggregating radiomic features and tested their performance across different survival models. Our study suggests that a volume-weighted average of the radiomic features of the largest three brain metastases is the most effective technique to model survival across various methods of survival analysis. Furthermore, this suggests that in patients with multi-focal disease, the largest tumors may be driving prognosis. Practical implications of these findings include more efficient computational and clinical resource utilization. As server costs, data storage requirements, and computation time increase with increasing dataset size and complexity, effective models of prognosis may potentially be developed with just the data from the largest three metastases alone. Furthermore, clinicians may save time by only needing to manually segment the largest three metastases. This study corroborates studies that have found radiomic features to correlate with prognosis among lung[12], breast[13], and prostate cancer patients[6]. To address the issue of combining multi-focal data at a patient-level, prior studies have implemented a weighted average of features of all metastases[14] while others have included all tumors from a specific patient assigned to either a training or validation set to avoid cluster-correlation biases[15]. While tumor-level data may be useful for certain tasks like primary-site prediction, there is a need for aggregation of patient-level data for overall outcomes like survival or recurrence. A notable finding of this study is that across all models, it was important that some measure of multi-focality was incorporated, whether aggregating radiomic features from the largest three metastases or including a clinical measure of multi-focality in the number of metastases. On sub-analysis, across all volume groups of the largest metastasis, the weighted average of the largest three tumors had the highest performance. As the number of metastases increased greater than 5, models incorporating radiomic data from the largest metastasis with the addition of the number of metastases performed better, suggesting that with a higher number of metastases, additional non-imaging data (i.e. clinical data) may improve model performance, as prior studies have demonstrated the promise of clinical data alone in modeling survival[16]. Future studies are planned to incorporate both imaging and clinical data variables. This study has limitations. First, there is an inherent selection bias since all the patients in this study were limited to those with brain metastases so results may vary across other disease sites. Second, these patients were all treated at one institution with the same MRI protocol. Efforts are underway to validate these findings at an independent institution. Third, these patients were ultimately treated with the same treatment modality. Further analysis is needed for patients treated with different interventions. In conclusion, this study suggests that radiomic features can be effectively combined to establish patient-level outcomes in patients with multifocal disease. Future studies are needed to confirm that the volume-weighted average of the largest three tumors is an effective method for combining radiomic features across other imaging modalities and disease sites.

Methods

Patient data

Research was conducted in accordance with the Declaration of Helsinki guidelines and approved by the Yale University Institutional Review Board. Informed consent was obtained from all participants in this study. We analyzed 831 patients with 3,596 brain metastases treated with primary stereotactic radiosurgery (SRS) at our institution between 2000 and 2018. Patients with prior resection or prior radiation treatment were excluded. Metastases < 5 mm were also excluded. The primary outcome of interest was overall survival following SRS treatment defined as time from SRS treatment date to date of death or last follow-up.

Image preprocessing

The image preprocessing workflow is illustrated in Fig. 1. Individual metastases were segmented on T1-weighted contrast enhanced MRI images and approved by both a radiation oncologist and a neurosurgeon. A bounding box with 1 mm of peri-tumoral tissue around the maximum dimension of the tumor contour in the axial plane was created to ensure edge detection of the tumor. Images were resampled to 1 mm pixel spacing, corrected for low frequency intensity non-uniformity present in MRI data with the N4ITK bias field correction algorithm[17], and z-score normalized to reduce inter-scan bias.
Figure 1

Preprocessing Workflow. (a) Input: slices of pre-treatment T1-post contrast brain MRI scans. (b) Identification of the region of interest from manual segmentations. (c) Output: extracted tumors with pixel resampling, N4ITK bias field correction, and z-score normalization.

Preprocessing Workflow. (a) Input: slices of pre-treatment T1-post contrast brain MRI scans. (b) Identification of the region of interest from manual segmentations. (c) Output: extracted tumors with pixel resampling, N4ITK bias field correction, and z-score normalization.

Radiomic feature extraction and aggregation

851 radiomic features[2] (Supplement 1) were extracted from the processed images of each brain metastasis and analyzed as predictors of survival. The following aggregation methods were compared to estimate patient-level risk for patients with multiple metastases: (1) Unweighted Average: Radiomic size and shape features were summed while all other features were averaged for each patient. (2) Weighted Average: Radiomic size and shape features were summed while all other features were averaged for each patient based on a weighted proportion of total volume of all metastases per patient[14]. (3) Weighted Average of Largest Three Metastases: Radiomic size and shape features from the largest three metastases of each patient were summed while all other features from the largest three metastases were averaged based on a weighted proportion of the total volume of the largest three metastases. The largest three metastases were chosen since the Graded Prognostic Assessment, a widely accepted model for prognostication, uses a cut-off of three metastases for prognostic risk stratification[16]. (4) Largest Metastasis + Number of Metastases: The features from the largest metastasis of each patient were selected, and the total number of metastases for each patient was included as an additional variable[16]. (5) Largest Metastasis Alone. (6) Smallest Metastasis Alone as a control with the assumption that the smallest tumor would have decreased prognostic value compared to larger tumors.

Statistical analysis and survival models

The following survival models were tested: a traditional Cox proportional hazards model[18], a Cox proportional hazards model with LASSO (least absolute shrinkage and selection operator) regression[19], and a random survival forest[20]. Dimensionality reduction was performed at the patient level after radiomic feature aggregation. The top 20 ranked features (Supplement 2) were selected via minimum redundancy maximum relevance (mRMR)[21], a method shown in multiple prior publications to effectively reduce dimensionality of radiomic features while maintaining the most relevant features for cancer prognostication[12,22,23]. Hyperparameter tuning for the Cox proportional hazards model with LASSO regression and the random survival forest model was performed with fivefold cross-validation using a grid search. Discriminatory ability of each model was assessed with calculated concordance indices (c-index) using 100 bootstrapped samples of 416 patients (50%). Multiple survival models were assessed to determine if particular aggregation methods were superior for specific survival models.

Sub-group analysis

A sub-group analysis was performed to evaluate the role of the following potential drivers of patient prognosis: the overall number of metastases[16] and the volume of the largest metastasis of each patient[24]. The robustness of the various radiomic aggregation techniques was evaluated by comparing the Cox proportional hazards model across these sub-groups: the number of metastases (< 5 metastases, 5–10 metastases, 11 + metastases) as well as the volume of the largest metastasis (< 0.200 cc, 0.200–0.700 cc, > 0.700 + cc) per patient. A sensitivity analysis was performed to compare the weighted average of the largest 2 and largest 4 metastases across survival models. Additionally, a sensitivity analysis was performed for the Cox proportional hazards models across aggregation methods using Principal Component Analysis (PCA) at the patient level post-aggregation to select a minimum number of features such that 90% of variance was retained. Finally, another analysis was performed where all 851 radiomic features were used for the Cox proportional hazards models with LASSO regression and the random survival forest models. Supplementary Information.
  23 in total

1.  mRMRe: an R package for parallelized mRMR ensemble feature selection.

Authors:  Nicolas De Jay; Simon Papillon-Cavanagh; Catharina Olsen; Nehme El-Hachem; Gianluca Bontempi; Benjamin Haibe-Kains
Journal:  Bioinformatics       Date:  2013-07-03       Impact factor: 6.937

2.  Definitive primary therapy in patients presenting with oligometastatic non-small cell lung cancer.

Authors:  Ravi B Parikh; Angel M Cronin; David E Kozono; Geoffrey R Oxnard; Raymond H Mak; David M Jackman; Peter C Lo; Elizabeth H Baldini; Bruce E Johnson; Aileen B Chen
Journal:  Int J Radiat Oncol Biol Phys       Date:  2014-05-24       Impact factor: 7.038

3.  Increases in the number of brain metastases detected at frame-fixed, thin-slice MRI for gamma knife surgery planning.

Authors:  Aiko Nagai; Yuta Shibamoto; Yoshimasa Mori; Chisa Hashizume; Masahiro Hagiwara; Tatsuya Kobayashi
Journal:  Neuro Oncol       Date:  2010-09-23       Impact factor: 12.300

Review 4.  Radiomics: the process and the challenges.

Authors:  Virendra Kumar; Yuhua Gu; Satrajit Basu; Anders Berglund; Steven A Eschrich; Matthew B Schabath; Kenneth Forster; Hugo J W L Aerts; Andre Dekker; David Fenstermacher; Dmitry B Goldgof; Lawrence O Hall; Philippe Lambin; Yoganand Balagurunathan; Robert A Gatenby; Robert J Gillies
Journal:  Magn Reson Imaging       Date:  2012-08-13       Impact factor: 2.546

5.  Radiomics Signature on Magnetic Resonance Imaging: Association with Disease-Free Survival in Patients with Invasive Breast Cancer.

Authors:  Hyunjin Park; Yaeji Lim; Eun Sook Ko; Hwan-Ho Cho; Jeong Eon Lee; Boo-Kyung Han; Eun Young Ko; Ji Soo Choi; Ko Woon Park
Journal:  Clin Cancer Res       Date:  2018-06-18       Impact factor: 12.531

6.  Computed Tomography-based Radiomics for Risk Stratification in Prostate Cancer.

Authors:  Sarah O S Osman; Ralph T H Leijenaar; Aidan J Cole; Ciara A Lyons; Alan R Hounsell; Kevin M Prise; Joe M O'Sullivan; Philippe Lambin; Conor K McGarry; Suneil Jain
Journal:  Int J Radiat Oncol Biol Phys       Date:  2019-06-26       Impact factor: 7.038

7.  Computational Radiomics System to Decode the Radiographic Phenotype.

Authors:  Joost J M van Griethuysen; Andriy Fedorov; Chintan Parmar; Ahmed Hosny; Nicole Aucoin; Vivek Narayan; Regina G H Beets-Tan; Jean-Christophe Fillion-Robin; Steve Pieper; Hugo J W L Aerts
Journal:  Cancer Res       Date:  2017-11-01       Impact factor: 12.701

8.  Multicenter study demonstrates radiomic features derived from magnetic resonance perfusion images identify pseudoprogression in glioblastoma.

Authors:  Nabil Elshafeey; Aikaterini Kotrotsou; Ahmed Hassan; Nancy Elshafei; Islam Hassan; Sara Ahmed; Srishti Abrol; Anand Agarwal; Kamel El Salek; Samuel Bergamaschi; Jay Acharya; Fanny E Moron; Meng Law; Gregory N Fuller; Jason T Huse; Pascal O Zinn; Rivka R Colen
Journal:  Nat Commun       Date:  2019-07-18       Impact factor: 14.919

9.  Identification of Non-Small Cell Lung Cancer Sensitive to Systemic Cancer Therapies Using Radiomics.

Authors:  Laurent Dercle; Matthew Fronheiser; Lin Lu; Shuyan Du; Wendy Hayes; David K Leung; Amit Roy; Julia Wilkerson; Pingzhen Guo; Antonio T Fojo; Lawrence H Schwartz; Binsheng Zhao
Journal:  Clin Cancer Res       Date:  2020-03-20       Impact factor: 13.801

10.  Decoding tumour phenotype by noninvasive imaging using a quantitative radiomics approach.

Authors:  Hugo J W L Aerts; Emmanuel Rios Velazquez; Ralph T H Leijenaar; Chintan Parmar; Patrick Grossmann; Sara Carvalho; Sara Cavalho; Johan Bussink; René Monshouwer; Benjamin Haibe-Kains; Derek Rietveld; Frank Hoebers; Michelle M Rietbergen; C René Leemans; Andre Dekker; John Quackenbush; Robert J Gillies; Philippe Lambin
Journal:  Nat Commun       Date:  2014-06-03       Impact factor: 14.919

View more
  6 in total

Review 1.  Imaging approaches and radiomics: toward a new era of ultraprecision radioimmunotherapy?

Authors:  Roger Sun; Théophraste Henry; Adrien Laville; Alexandre Carré; Anthony Hamaoui; Sophie Bockel; Ines Chaffai; Antonin Levy; Cyrus Chargari; Charlotte Robert; Eric Deutsch
Journal:  J Immunother Cancer       Date:  2022-07       Impact factor: 12.469

2.  Optimal PET-based radiomic signature construction based on the cross-combination method for predicting the survival of patients with diffuse large B-cell lymphoma.

Authors:  Chong Jiang; Ang Li; Yue Teng; Xiangjun Huang; Chongyang Ding; Jianxin Chen; Jingyan Xu; Zhengyang Zhou
Journal:  Eur J Nucl Med Mol Imaging       Date:  2022-02-11       Impact factor: 10.057

3.  Opportunities for integration of artificial intelligence into stereotactic radiosurgery practice.

Authors:  Rupesh Kotecha; Sanjay Aneja
Journal:  Neuro Oncol       Date:  2021-10-01       Impact factor: 13.029

4.  Radiomics-Based Deep Learning Prediction of Overall Survival in Non-Small-Cell Lung Cancer Using Contrast-Enhanced Computed Tomography.

Authors:  Kuei-Yuan Hou; Jyun-Ru Chen; Yung-Chen Wang; Ming-Huang Chiu; Sen-Ping Lin; Yuan-Heng Mo; Shih-Chieh Peng; Chia-Feng Lu
Journal:  Cancers (Basel)       Date:  2022-08-04       Impact factor: 6.575

5.  Radiomics artificial intelligence modelling for prediction of local control for colorectal liver metastases treated with radiotherapy.

Authors:  Ricky Hu; Ishita Chen; Jacob Peoples; Jean-Paul Salameh; Mithat Gönen; Paul B Romesser; Amber L Simpson; Marsha Reyngold
Journal:  Phys Imaging Radiat Oncol       Date:  2022-09-13

6.  Using Adversarial Images to Assess the Robustness of Deep Learning Models Trained on Diagnostic Images in Oncology.

Authors:  Marina Z Joel; Sachin Umrao; Enoch Chang; Rachel Choi; Daniel X Yang; James S Duncan; Antonio Omuro; Roy Herbst; Harlan M Krumholz; Sanjay Aneja
Journal:  JCO Clin Cancer Inform       Date:  2022-02
  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.