| Literature DB >> 33184313 |
Pierre Fontaine1,2, Oscar Acosta3, Joël Castelli3, Renaud De Crevoisier3, Henning Müller4, Adrien Depeursinge4,5.
Abstract
In standard radiomics studies the features extracted from clinical images are mostly quantified with simple statistics such as the average or variance per Region of Interest (ROI). Such approaches may smooth out any intra-region heterogeneity and thus hide some tumor aggressiveness that may hamper predictions. In this paper we study the importance of feature aggregation within the standard radiomics workflow, which allows to take into account intra-region variations. Feature aggregation methods transform a collection of voxel values from feature response maps (over a ROI) into one or several scalar values that are usable for statistical or machine learning algorithms. This important step has been little investigated within the radiomics workflows, so far. In this paper, we compare several aggregation methods with standard radiomics approaches in order to assess the improvements in prediction capabilities. We evaluate the performance using an aggregation function based on Bags of Visual Words (BoVW), which allows for the preservation of piece-wise homogeneous information within heterogeneous regions and compared with standard methods. The different models are compared on a cohort of 214 head and neck cancer patients coming from 4 medical centers. Radiomics features were extracted from manually delineated tumors in clinical PET-FDG and CT images were analyzed. We compared the performance of standard radiomics models, the volume of the ROI alone and the BoVW model for survival analysis. The average concordance index was estimated with a five fold cross-validation. The performance was significantly better using the BoVW model 0.627 (95% CI: 0.616-0.637) as compared to standard radiomics0.505 (95% CI: 0.499-0.511), mean-var. 0.543 (95% CI: 0.536-0.549), mean0.547 (95% CI: 0.541-0.554), var.0.530 (95% CI: 0.524-0.536) or volume 0.577 (95% CI: 0.571-0.582). We conclude that classical aggregation methods are not optimal in case of heterogeneous tumors. We also showed that the BoVW model is a better alternative to extract consistent features in the presence of lesions composed of heterogeneous tissue.Entities:
Year: 2020 PMID: 33184313 PMCID: PMC7661538 DOI: 10.1038/s41598-020-76310-z
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Influence of the size and localization of the ROI for aggregating the feature maps using the average. Each sub-region and is well separated in the feature space spanned by Simoncelli wavelets and aggregated using the average. The blue region (entire image) involves the averaging of non-stationary sub-regions. As a consequence, this blue region does not represent the true content of the image well, because its representation in the feature space (blue diamond) falls in between the true observations (red circles and green crosses). and represent clusters (called visual words) found using the BoVW approach allowing to reveal and preserve pattern heterogeneity by relying on an aggregation function that is integrative regarding parts in the feature space.
Patient characteristics.
| Cohort | # patient | Mean age, years (SD) | Stage (AJCC) | # events | |
|---|---|---|---|---|---|
| Rennes | 103 | 62 (9) | III | 22 | 63 |
| IV | 81 | ||||
| Besançon | 34 | 63 (8) | III | 13 | 16 |
| IV | 21 | ||||
| Lorient | 16 | NC | III | 5 | 5 |
| IV | 11 | ||||
| Lausanne | 61 | 63 (9) | III | 20 | 7 |
| IV | 41 | ||||
The list of the detailed features used in the study.
| Family | Feature | Quantitative feature |
|---|---|---|
| Filter-based | Laplacian of Gaussian Gabor Sobel | Sigma Sigma Kernel size = 3 × 3 × 3 |
| Grey-level texture matrices | GLRLM Radius Angles Discretization | ShortRunEmphasis LongRunEmphasis GreyLevelNonuniformity RunLengthNonuniformity LowGreyLevelRunEmphasis HighGreyLevelRunEmphasis ShortRunLowGreyLevelEmphasis ShortRunHighGreylevelEmphasis LongRunLowGreyLevelEmphasis LongRunHighGreyLevelEmphasis |
GLCM Radius Angles Discretization | Energy InverseDifferenceMoment Entropy HaralickCorrelation ClusterShade ClusterProminence Inertia Correlation |
Figure 2For each patient , the 42 feature maps are concatenated into a matrix where each coefficient voxel of the ROI is a 42-dimensional vector.
Figure 3Proposed validation strategy using the multi-centric cohort of head and neck cancer.
Figure 4The number of clusters k is chosen based on the Gap value (higher is better) computed on the entire dataset. We chose clusters (i.e. visual words) as a very large number of cluster is required to significantly increases the Gap value beyond .
Figure 5Influence of k on the performance of the survival model measured using the C-index.
Figure 6Average C-indices and 95% CIs for the six proposed models based on various feature aggregation methods. *.
Harrell’s C-indices for the six proposed models.
| Mean (lower bound-upper bound) (95% CI) | |
|---|---|
| Classical radiomics | 0.505 (0.499–0.511) |
| Average–variance | 0.543 (0.536–0.549) |
| Average | 0.547 (0.541–0.554) |
| Variance | 0.530 (0.524–0.536) |
| Volume | 0.577 (0.571–0.582) |
| BoVW | 0.627 (0.616–0.637) |
Figure 7Kaplan–Meier curves using a risk stratification into two groups as defined by the median value of the HS (“Model validation” section).