| Literature DB >> 36045274 |
Abstract
BACKGROUND: Radiomics is a noninvasive method using machine learning to support personalised medicine. Preprocessing filters such as wavelet and Laplacian-of-Gaussian filters are commonly used being thought to increase predictive performance. However, the use of preprocessing filters increases the number of features by up to an order of magnitude and can produce many correlated features. Both substantially increase the dataset complexity, which in turn makes modeling with machine learning techniques more challenging, possibly leading to poorer performance. We investigated the impact of these filters on predictive performance.Entities:
Keywords: Artificial intelligence; Benchmarking; Machine learning; Precision medicine; Radiomics
Mesh:
Year: 2022 PMID: 36045274 PMCID: PMC9433552 DOI: 10.1186/s41747-022-00294-w
Source DB: PubMed Journal: Eur Radiol Exp ISSN: 2509-9280
Datasets used in the experiments
| Dataset | Modality | In-plane resolution (mm) | Slice thickness (m) | |
|---|---|---|---|---|
| CRLM | CT | 76 (1) | 0.7 (0.6–0.9) | 5.0 (1.0–8.0) |
| Desmoid | MRI | 195 (8) | 0.7 (0.2–1.8) | 5.0 (1.0–10.0) |
| GIST | CT | 244 (2) | 0.8 (0.6–1.0) | 3.0 (0.6–6.0) |
| HN | CT | 134 (1) | 1.0 (1.0–1.1) | 3.0 (1.5–3.0) |
| Lipo | MRI | 113 (2) | 0.7 (0.2–1.4) | 5.5 (1.0–9.1) |
| Liver | MRI | 186 (0) | 0.8 (0.6–1.6) | 7.7 (1.0–11.0) |
| Melanoma | CT | 97 (6) | 0.7 (0.5–1.0) | 1.2 (0.6–2.0) |
N denotes the number of samples, with the number of scans removed from the original dataset in parenthesis (excluded); for example, there were 203 scans in the Desmoid dataset, 8 of which have been removed. In-plane resolution and slice thickness are reported as median (range). CT Computed tomography, MRI Magnetic resonance imaging. For the datasets, see references [9, 10]
Overview of the image filters
| Filter | Parameters | |
|---|---|---|
| Original | – | 105 |
| Exponential | – | 91 (196) |
| Gradient | – | 91 (196) |
| Laplacian-of-Gaussian | Sigma 1.0, 2.0, 3.0, 4.0, 5.0 mm | 455 (560) |
| Local binary pattern | Two-dimensional, three-dimensional | 273 (378) |
| Logarithm | – | 91 (196) |
| Square | – | 91 (196) |
| Square root | – | 91 (196) |
| Wavelet | Directions HHH, HHL, HLH, HLL, LHH, LHL, LLH, LLL | 728 (833) |
Original denotes the set of extracted features without any image processing, which also contains of shape features. Parameters denote the chosen extraction parameters for the preprocessing filter, if applicable. HHH, HHL etc. denote the frequency components of the wavelet filters (H = high, L = low). N denotes the number of features that were generated after applying the filter; in parenthesis is the sum of the number of original features and the features after applying the corresponding filter
Fig. 1Overall pipeline used for training. The image data was preprocessed by each filter to generate the different feature sets. A 10-fold nested, stratified cross-validation was applied to train a predictive model
List of hyperparameters tuned during cross-validation
| Classifier | Hyperparameter |
|---|---|
| Logistic regression | C in 2^{-6, -4, -2, 0, 2, 4, 6} |
| Neural networks | Three layers with each 4, 16, or 64 neurons |
| Random forest | Number of estimators 50, 125, or 250 |
Radial basis function Sector vector machine | C in 2^{-6, -4, -2, 0, 2, 4, 6}, gamma was set to auto |
The remaining hyperparameters were left at their default
Areas-under-the-curve at receiver operating characteristics of all the models
| Dataset | AUC-ROC Original | AUC-ROC All | AUC-ROC Tuned | ΔAll—Original | pAll—Original | ΔTuned—Original | pTuned—Original | ΔTuned—All | Best filter | |
|---|---|---|---|---|---|---|---|---|---|---|
| CRLM | 0.6 (0.47–0.73) | 0.7 (0.58–0.82) | -0.1 | 0.237 | 0 | 1 | 0.1 | 0.237 | Original | |
| Desmoid | 0.72 (0.64–0.79) | 0.8 (0.73–0.87) | 0.08 | 0.1 | 0.02 | 0.261 | Wavelet | |||
| GIST | 0.7 (0.63–0.77) | 0.77 (0.71–0.83) | 0.07 | 0.07 | 0 | 1 | All | |||
| HN | 0.85 (0.79–0.92) | 0.81 (0.74–0.89) | -0.04 | 0.216 | 0.01 | 0.651 | 0.05 | 0.086 | Square root | |
| Lipo | 0.74 (0.65–0.83) | 0.82 (0.74–0.9) | 0.08 | 0.08 | 0 | 1 | All | |||
| Liver | 0.73 (0.65–0.8) | 0.69 (0.61–0.77) | -0.04 | 0.393 | 0.02 | 0.573 | 0.06 | 0.188 | Square root | |
| Melanoma | 0.57 (0.45–0.68) | 0.53 (0.41–0.64) | -0.04 | 0.179 | 0.06 | 0.322 | 0.1 | Exponential |
The best-performing model in terms of area-under-the-curve at receiver operating characteristics (AUC-ROC) is denoted by bold face. Δ denotes the difference between the models; for example, a positive Δ denotes an improvement using all features over the original feature set. The reported p values correspond to a DeLong test between the denoted models. Statistically significance (p < 0.05) is also denoted by bold face
Fig. 2Receiver operating characteristic curves for all datasets. Tuned denotes the best-performing preprocessing filter and can differ for each dataset. Note that in the case of CRLM dataset, the best preprocessing filter set was the original set; therefore, these curves coincide
Fig. 3Tradeoffs against sample size. Graphical display of the association of the difference in area-under-the-curve at receiver operating characteristics between the three feature sets and the sample size. Each point corresponds to one of the datasets. a Tradeoff between the original set and all features. b Tradeoff between the original set the best-performing feature set. c Tradeoff of all features and the best-performing feature set
Fig. 4Mean correlation between feature sets. In each cell, the mean pairwise Pearson correlation coefficient (r) between features of the corresponding two feature sets is displayed