| Literature DB >> 32563015 |
Isabella Fornacon-Wood1, Corinne Faivre-Finn2, James P B O'Connor3, Gareth J Price4.
Abstract
Radiomics has become a popular image analysis method in the last few years. Its key hypothesis is that medical images harbor biological, prognostic and predictive information that is not revealed upon visual inspection. In contrast to previous work with a priori defined imaging biomarkers, radiomics instead calculates image features at scale and uses statistical methods to identify those most strongly associated to outcome. This builds on years of research into computer aided diagnosis and pattern recognition. While the potential of radiomics to aid personalized medicine is widely recognized, several technical limitations exist which hinder biomarker translation. Aspects of the radiomic workflow lack repeatability or reproducibility under particular circumstances, which is a key requirement for the translation of imaging biomarkers into clinical practice. One of the most commonly studied uses of radiomics is for personalized medicine applications in Non-Small Cell Lung Cancer (NSCLC). In this review, we summarize reported methodological limitations in CT based radiomic analyses together with suggested solutions. We then evaluate the current NSCLC radiomics literature to assess the risk associated with accepting the published conclusions with respect to these limitations. We review different complementary scoring systems and initiatives that can be used to critically appraise data from radiomics studies. Wider awareness should improve the quality of ongoing and future radiomics studies and advance their potential as clinically relevant biomarkers for personalized medicine in patients with NSCLC.Entities:
Keywords: Imaging biomarkers; Lung cancer; Personalized medicine; Radiomics
Mesh:
Year: 2020 PMID: 32563015 PMCID: PMC7383235 DOI: 10.1016/j.lungcan.2020.05.028
Source DB: PubMed Journal: Lung Cancer ISSN: 0169-5002 Impact factor: 5.705
Fig. 1Visualization of the steps in the radiomics workflow. First, images are acquired and reconstructed. The region of interest is then segmented, from which features will be extracted. Next, pre-processing steps are performed to modify the images before feature extraction. Shape, first order (or histogram) and texture features are then extracted from the region of interest. Finally, data analysis steps attempt to find correlations between features and the specified outcome.
Radiomics studies in NSCLC, categorized into sections based on their investigated endpoint. The Data column specifies the total number of patients involved in the study, in brackets split by training and validation cohorts if applicable and specifying other cancer types of cohorts if applicable. Note: Studies marked with * are validation studies and their RQS score components refer to methodology based on the previous published data. This table has been simplified to clarify presentation – more details for each study are available in Supplementary Table 2.
| Reference | NSCLC stage | Data (training + validation) | Radiomic features in final model | Result |
|---|---|---|---|---|
| Aerts et al. 2014 [ | 1−3b | 647 pCT | Shape, first order and texture | CI = 0.65 |
| Van Timmeren et al. 2017 [ | 1−4 | 252 pCT and CBCT | Shape, first order and texture | CI = 0.69, 0.61, 0.59 (pCT) |
| Grossman et al. 2017 [ | 1−3 | 351 diagnostic CT | Shape, first order and texture | CI = 0.60 |
| Grossman et al. 2017 [ | 1−3 | 351 diagnostic CT | Not specified | CI = 0.61 |
| Yu et al. 2017 [ | 1 | 442 diagnostic CT | First order and texture | CI = 0.64 |
| Chaddad et al. 2017 [ | 1−3b | 315 pCT | Shape and texture | Average AUC = 0.70−0.76 |
| Fave et al. 2017 [ | 3 | 107 4DCT end of exhale, pCT and CBCT | Shape and texture | CI = 0.672 |
| Li et al. 2017 [ | 1−2a | 59 follow up CT | Texture | AUC = 0.81 |
| Li et al. 2017 [ | 1−2a | 92 4DCT | Shape and first order | AUC = 0.728 |
| Tang et al. 2018 [ | 1−3 | 290 staging CT | Shape, first order and texture | CI = 0.72 |
| Bianconi et al. 2018 [ | 1−3 | 203 pCT | Shape and texture | HR = 1.06−1.48 |
| De Jong et al. 2018 [ | 4 | 195 diagnostic CT | Shape, first order and texture | CI = 0.576 |
| Lee et al. 2018 [ | 1−3 | 339 CT | Shape, first order and texture | CI = 0.772 |
| He et al. 2018 [ | 1−3 | 186 CT | Not specified | AUC = 0.9296 |
| Starkov et al. 2018 [ | 1 | 116 pCT | Texture | High risk vs low risk median p-values = 0.04–0.07 |
| Yang et al. 2018 [ | 1−4 | 371 CT | First order and texture | CI = 0.702 |
| Wang et al. 2019 [ | 3 | 70 pre-treatment and 97 post-treatment CT from 118 patients | Texture | CI = 0.743 |
| Shi et al. 2019 [ | 3 | 11 CBCT from 23 patients | First order | HR = 0.21 |
| Van Timmeren et al. 2019 [ | 1−4 | 337 pCT and 2154 CBCTs from 337 patients | First order and texture | CI = 0.59, 0.54, 0.57 |
| Huang et al. 2019 [ | 1−4 | 371 CT | Shape, first order and texture | CI = 0.621, 0.649 |
| Franceschini et al. 2019 [ | 1−2 | 102 4DCT start of inspiration | Shape and texture | AUC = 0.85 |
| Coroller et al. 2015 [ | 2−3 | 182 pCT | First order and texture | CI = 0.6 |
| Mattonen et al. 2016 [ | 1 | 45 follow-up CT | First order and texture | AUC = 0.85 |
| Huynh et al. 2016 [ | 1−2 | 113 CT | First order and texture | Median CI = 0.67 |
| Huynh et al. 2017 [ | 1−2a | 112 CT and AIP CT | Shape, first order and texture | AIP radiomics CI = 0.667 |
| Fave et al. 2017 [ | 3 | 107 4DCT end of exhale, pCT and CBCT | Shape and texture | CI = 0.632, 0.558 (DM, LRR) |
| Li et al. 2017 [ | 1−2a | 59 follow up CT | Texture | AUC = 0.80, 0.80 (RFS, LR-RFS) |
| Li et al. 2017 [ | 1−2a | 92 4DCT | Shape | AUC = 0.747, 0.690 (RFS, LL-RFS) |
| Dou et al. 2018 [ | 2−3 | 200 pCT | Texture | CI = 0.65 |
| Ferreira Junior et al. 2018 [ | 1−4 | 68 CT | Shape and texture | AUC = 0.75, 0.71 |
| Yang et al. 2018 [ | 1−3 | 159CT | Shape, first order and texture | AUC = 0.856 |
| Zhong et al. 2018 [ | 1−2 | 492 CT | First order and texture | AUC = 0.972 |
| Lafata et al. 2019 [ | 1 | 70 CT | Texture | Maximum AUC = 0.72, 0.83, 0.60 (recurrence, LR, non-LR) |
| Akinci D’Antonoli et al. 2019 [ | 1−2b | 124 CT | Shape, first order and texture | AUC 0.731, 0.750 (LR, DM) |
| He et al. 2019 [ | Not specified | 717CT | First order and texture | CI = 0.734 |
| Xu et al. 2019 [ | 3−4 | 132 CT | Texture | AUC = 0.642 |
| Franceschini et al. 2019 [ | 1−2 | 102 4DCT start of inspiration | Shape, first order and texture | AUC = 0.73 |
| Ferreira-Junior et al. 2019 [ | 1−4 | 85 CT | Shape, first order and texture | AUC = 0.92, 0.84 (DM, nodal metastasis) |
| Cong et al. 2019 [ | 1a | 649 venous phase CT | Shape, first order and texture | AUC = 0.851 |
| Coroller et al. 2016 [ | 2−3 | 127 pCT | Shape, first order and texture | Median AUC = 0.65, 0.61 (GRD, pCR) |
| Huang et al. 2016 [ | 1−2 | 282 CT (141 + 141) | First order and texture | HR = 2.09 |
| Song et al. 2016 [ | 1−4 | 152 CT | Texture | HR = 2.35, 2.75 |
| Coroller et al. 2017 [ | 2−3 | 85 pCT | Shape, first order and texture | Median AUC = 0.68, = 0.71 (pCR, GRD) |
| Tunali et al. 2019 [ | 3b-4 | 228 CT | Texture | AUC = 0.804 |
| Franceschini et al. 2019 [ | 1−2 | 102 4DCT start of inspiration | Texture | AUC = 0.88 |
| Moran et al. 2017 [ | 1 | 14 diagnostic CT | First order and texture | AUC = 0.689−0.750 |
| Krafft et al. 2018 [ | Not specified | 192 50 % 4DCT phase | First order and texture | Average AUC = 0.68 |
| Yuan et al. 2018 [ | 1 | 327 CT | First order and texture | AUC = 0.938 |
| Yang et al. 2019 [ | 1−3 | 256 CT | First order and texture | AUC = 0.93 |
Abbreviations: AUC, area under the curve; CBCT, cone-beam CT; CI, concordance index; DFS, disease free survival; DM, distant metastasis; GRD, gross residual disease; H&N, head and neck; HR, hazard ratio; LR, local relapse; LRR, local regional recurrence; LR-RFS, loco-regional recurrence-free survival; OS, overall survival; pCR, pathological complete response; pCT, radiotherapy planning CT scan; PFS, progression free survival; RFS, recurrence free survival.
Radiomics studies in NSCLC with an aspect of biology as the endpoint. The column labeled ‘Data’ specifies the total number of patients involved in the study, in brackets split by training and validation cohorts if applicable and specifying other cancer types of cohorts if applicable. This table has been simplified to clarify presentation – more details for each study are available in Supplementary Table 3.
| Reference | Stage | Endpoint | Data (training + validation) | Radiomic features in final model | Result |
|---|---|---|---|---|---|
| Aerts et al. 2016 [ | Early stage | EGFR | 47 diagnostic CT and follow-up | Shape and texture | AUC = 0.74−0.91 |
| Rios Velazquez et al. 2017 [ | 1−4 | EGFR, KRAS | 705 diagnostic CT | Shape, first order and texture | AUC = 0.69−0.80 |
| Mei et al. 2018 [ | Not specified | EGFR | 296 CT | Texture | AUC = 0.664 |
| Digumarthy et al. 2019 [ | Not specified | EGFR | 93 CT | First order | AUC = 0.713 |
| Jia et al. 2019 [ | 1−4 | EGFR | 504 CT | Shape, first order and texture | AUC = 0.802 |
| Li et al. 2019 [ | 1−4 | EGFR subtypes (19Del and L858R) | 312 CT | Shape, first order and texture | AUC = 0.775−0.793 |
| Tu et al. 2019 [ | 1−4 | EGFR | 404 CT | First order and texture | AUC = 0.775 |
| Yang et al. 2019 [ | Not specified | EGFR | 467 CT | Shape, first order and texture | AUC = 0.789 |
| Wang et al. 2019 [ | 1−2 | EGFR, TP53 | 61 CT | First order and texture | AUC = 0.604, 0.586 |
| Wang et al. 2019 [ | 1−2 | Tumor mutation burden | 61 CT | Texture | AUC = 0.606 |
| Grossman et al. 2017 [ | 1−3 | Various | 351 CT | Shape, first order and texture | AUC = 0.62−0.72 |
| Bak et al. 2018 [ | 1−4 | Various | 57 CT | First order and texture | OR = 0.08−23.94 |
| Patil et al. 2016 [ | Not specified | ADC, LCC, SCC, NOS | 317 pCT | Shape, first order and texture | 88 % accuracy |
| Wu et al. 2016 [ | 1−4 | ADC, SCC | 350 pCT | First order and texture | AUC = 0.72 |
| Ferreira Junior et al. 2018 [ | 1−4 | ADC, SCC | 68 CT | Not specified | AUC = 0.81 |
| Zhu et al. 2018 [ | Not specified | ADC, SCC | 129 CT | First order and texture | AUC = 0.893 |
| Digumarthy et al. 2019 [ | Not specified | ADC, SCC | 93 CT | First order | AUC = 0.744 |
| E et al. 2019 [ | Not specified | ADC, SCC, SCLC | 229 CT | Shape, first order and texture | AUC = 0.657−0.875 |
| Ferreira-Junior et al. 2019 [ | 1−4 | ADC, SCC | 85 CT | Shape, first order, texture | AUC = 0.88 |
| Liu et al. 2019 [ | Not specified | ADC, LCC, SCC, NOS | 349 CT | Not specified | AUC = 0.86 |
| Zhou et al. 2018 [ | 1−4 | Ki-67 | 110 CT | Shape and texture | AUC = 0.61−0.77 |
| Gu et al. 2019 [ | Not specified | Ki-67 | 245 CT | First order and texture | AUC = 0.776 |
| Song et al. 2017 [ | 1−3 | Micropapillary pattern | 339 CT | First order | AUC = 0.751 |
| Chen et al. 2018 [ | Not specified | Degree of differentiation | 487 CT | First order and texture | AUC = 0.782 |
| She et al. 2018 [ | Not specified | Invasive vs non-invasive adenocarcinoma | 402 CT | Shape, first order and texture | AUC = 0.89 |
| Yang et al. 2019 [ | Not specified | Invasive vs non-invasive adenocarcinoma | 192 CT | First order and texture | AUC = 0.77 |
Abbreviations: ADC, adenocarcinoma; AUC, area under the curve; CI, concordance index; EGFR, epidermal growth factor receptor; KRAS, Kirsten rat sarcoma viral oncogene homolog; LCC, large cell carcinoma; NOS, not otherwise specified; OR, odds ratio; SCC, squamous cell carcinoma.
Fig. 2Frequency of CT NSCLC radiomics studies published from 2014 to 2019. Publications are categorized as those investigating radiomics methodological concerns, those evaluating radiomic signatures as prognostic or predictive biomarkers of patient outcome, and those evaluating radiomic signatures as biomarkers of tumor biology.
Potential problems at each step of the radiomics workflow along with possible solutions offered by the literature. Each workflow step with potential problems and solutions identified by the literature is labelled with a letter A-H to reference in-text. Note: Modelling does not have a letter associated with since there is no consensus on the best statistical modelling strategies.
| Problem area | Potential problems | Potential solutions | |
|---|---|---|---|
| Image acquisition | A | Different scanners and acquisition protocols affect feature reproducibility [ | Image phantoms on different scanners to provide baseline [ |
| B | Patient motion affects feature reproducibility [ | Set motion tolerances, reduce ROI boundaries [ | |
| Image acquisition and reconstruction | C | Image resolution parameters (voxel size, slice thickness) affect feature values [ | Control resolution [ |
| Image reconstruction | D | Image reconstruction algorithm and reconstruction parameters (kernel) affects features [ | Pre-processing image correction [ |
| Segmentation | E | Delineation variability [ | Expert ROI definition [ |
| Pre-processing | F | Number of grey levels used to discretize histogram and texture features affects feature values [ | Texture features can be normalized to reduce dependency on the number of grey levels [ |
| Feature extraction | No studies found in the literature search. | ||
| Feature correlation | G | Strong correlations between tumor volume and radiomic features exist [ | Normalization of features to volume [ |
| Test re-test | H | Radiomic features may not be repeatable over multiple measurements [ | Test-retest data acquisition [ |
| Modelling clinical outcome | Different modelling strategies affect model performance [ | Sample sizes above 50 give better predictive performance [ |
Summary of the 4 assessment criteria - TRIPOD score, RQS, number of methodological limitations and testing the added value of radiomics to a clinical model. The added value of radiomics to a clinical model was only tested for the patient outcome studies (N = 50).
| N = 75 | |
|---|---|
| TRIPOD type (n (%)) | |
| 1a – no validation | 10 (13) |
| 1b – internal validation | 27 (36) |
| 2a – dataset randomly split for validation | 18 (24) |
| 2b – dataset non-randomly split for validation | 7 (9) |
| 3 – external validation | 10 (13) |
| 4 – validation only | 3 (4) |
| RQS (median, [IQR]) | 6 [2−12.25] |
| Number of methodological limitations (n (%)) | |
| 0−2 | 0 (0) |
| 3 | 4 (5) |
| 4 | 4 (5) |
| 5 | 15 (20) |
| 6 | 21 (28) |
| 7 | 23 (31) |
| 8 | 8 (11) |
| N = 50 | |
| Added value of radiomics to clinical model tested? (n (%)) | |
| Yes | 32 (64) |
| No | 18 (36) |
Fig. 3The assessment of the literature plotted against each other as boxplots. (A) RQS versus TRIPOD, (B) RQS versus the number of methodological limitations found in this review and (C) TRIPOD versus the number of methodological limitations found in this review.