| Literature DB >> 28336974 |
M Sollini1, L Cozzi2,3, L Antunovic4, A Chiti2,4, M Kirienko2.
Abstract
Imaging with positron emission tomography (PET)/computed tomography (CT) is crucial in the management of cancer because of its value in tumor staging, response assessment, restaging, prognosis and treatment responsiveness prediction. In the last years, interest has grown in texture analysis which provides an "in-vivo" lesion characterization, and predictive information in several malignances including NSCLC; however several drawbacks and limitations affect these studies, especially because of lack of standardization in features calculation, definitions and methodology reporting. The present paper provides a comprehensive review of literature describing the state-of-the-art of FDG-PET/CT texture analysis in NSCLC, suggesting a proposal for harmonization of methodology.Entities:
Mesh:
Year: 2017 PMID: 28336974 PMCID: PMC5428425 DOI: 10.1038/s41598-017-00426-y
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Schematic representation of the process of selection of literature data included in the review.
Figure 2Methodological approaches in image texture analysis (the most frequently evaluated PET features in lung cancer patients are reported as examples).
Figure 3Example of tumor contouring using in (a) a threshold method at 50% of SUVmax and (b) a method based on an absolute SUV cut-off of 2.5. The ROI identified by using the absolute SUV cut-off of 2.5 is greater than that identified by the threshold method, as shown by axial (top), sagittal (right), and coronal (left) images (same slices).
Publications reporting methodological investigations on texture analysis in NSCLC patients.
| Reference | Type of study | Patients, n | Setting, stage | Aspect evaluated | Lesion segmentation method | PET parameter and textural index matrix | Main results |
|---|---|---|---|---|---|---|---|
| Cheng[ | R | 56 | Staging, I–III (only T) | Impact of respiration-averaged CT on PET texture parameters | Adaptive threshold, threshold uptake 45% of the SUVmax * | FOS/IVH = 3 SS = 1 GLCM = 4 GLRLM = 3 NGTDM = 4 | Texture parameters obtained with helical and respiration-averaged PET/CT showed a high degree of agreement (SUV entropy and entropy had the lowest levels of variation) |
| Cui[ | n.r. | 20 | n.r. | Impact of the segmentation method on tumor volume estimation (validation of DM algorithm) | Automatic (DM), fuzzy C-means, threshold uptake 40% of the SUVmax, threshold uptake 50% of the SUVmax, tumor-customized downhill, watershed§ | FOS/IVH = 1 NGTDM = 1 Gr = 1 | DM algorithm was able to segment the tumor (also when adjacent to mediastinum or chest wall) and outperformed the other lung segmentation methods in terms of overlapping measure |
| Cui[ | n.r. | 40 | n.r. | Impact of the segmentation method on tumor volume estimation (validation of topo-poly algorithm) | Threshold uptake 40% of the SUVmax, threshold uptake 50% of the SUVmax, adaptive threshold, fuzzy C-means, tumor-customized downhill, random walks, high-order interactive learning segmentation, PET/CT tumor-background likelihood model, topo-poly§ | NGTDM = 1 | Topo-poly algorithm was able to delineate tumor margins better than other methods |
| Dong[ | R | 50 | Staging, I–IV | Impact of the segmentation method on tumor volume estimation | Absolute SUV cut-off of 2.5, manual (2 observers), threshold at 40% of the SUVmax * | FOS/IVH = 1 SS = 1 GLCM = 1 + visual score | Intratumor heterogeneity significantly correlated with differences in the GTV definition (high heterogeneity corresponded to a larger GTV) |
| Gao[ | n.r. | 132 | Staging, I–III | Impact of computer-based algorithm on diagnosis of mediastinal lymph node metastases (validation of computer-based algorithm) | Manual# | FOS/IVH = 3 GLCM = 5 + visual score | Diagnostic ability of computer-based algorithm and visual experience was similar |
| Hatt[ | n.r. | 25, only 17 analyzed | Staging, Ib–IIIb | Impact of the segmentation method on the tumor volume estimation | Adaptive threshold, fully automatic method (FLAB), manual, threshold at 50% of the maximum* | FOS/IVH = 1 SS = 1 | All delineation methods except the manual one resulted in underevaluation of MTV. Anatomic tumor size and heterogeneity were correlated (larger lesions were more heterogeneous) |
| Hofheinz[ | n.r. | 30 | n.r. | Impact of the segmentation method on tumor volume estimation (validation of voxel-specific threshold algorithm) | Lesion-specific threshold, manual, voxel-specific threshold*° | FOS/IVH = 2 SS = 1 | Voxel-specific threshold method was able to reproduce tumor boundaries accurately, independent of the heterogeneity |
| Leijenaar[ | n.r. | 11 (test-retest cohort) + 23 (inter-observer cohort) | Features’ test–retest reliability and interobserver stability among multiple tumor delineation methods | Manual (by 5 observers), threshold at 50% of the maximum | FOS/IVH = 54 SS = 8 GLCM = 22 GLRLM = 11 GLSZM = 11 | The majority of features had high test–retest (71%) and interobserver (91%) stability in terms of ICC | |
| Leijenaar[ | P | 35 | Staging, I–III | Comparison of different discretization methods for textural features | Manual (SUV discretization using a fixed bin size and a fixed number of bins) | GLCM = 22 GLRLM = 11 GLSZM = 11 | SUV discretization had a crucial effect on textural features |
| Oliver[ | R | 23 | Sensitivity of texture features to tumor motion by comparison of static (3D) and respiratory-gated (4D) PET imaging | Adaptive threshold (background-adapted thresholding method)* | FOS/IVH SS GLCM GLRLM (total 56) | Quantitative analysis using a 3D versus 4D acquisition provided notably different image feature values, mainly due to the impact of respiratory motion | |
| Orlhac[ | P | 24 | Staging, III | Impact of the segmentation method on the tumor volume estimation | Threshold at 40% of the maximum, adaptive threshold*° | FOS/IVH = 8 SS = 1 GLCM = 6 GLRLM = 11 GLSZM = 11 NGLDM = 3 | IVH-based indices strongly depended on the tumor delineation method; 17/31 second- or high-order statistic features were robust with respect to tumor segmentation. Several texture indices included similar information. Some texture indices were highly correlated with MTV |
| Orlhac[ | R | 48 | Staging, I–III | Impact of resampling step on textural features and on the ability of textural features to reflect tissue-specific patterns of metabolic activity | Adaptive threshold (relative resampling approach and absolute resampling approach)*° | FOS/IVH = 1 SS = 1 GLCM = 2 GLRLM = 3 GLSZM = 2 | Textural features computed using an absolute resampling method varied as a function of the tissue type and cancer subtype more than when using the usual relative resampling approach |
| Tixier[ | P | 20 | Staging, I–II | Impact of static and parametric acquisition on PET features | Fully automatic method (FLAB)*° | FOS/IVH = 2 SS = 3 GLCM = 3 GLSZM = 2 | Compared with static SUV images, parametric images did not provide significant complementary information concerning heterogeneity quantification |
| van Velden[ | P | 11 | Staging, IIIb–IV | Repeatability of texture features using different reconstruction settings and delineation methods | Threshold uptake 50% of the 3D SUVpeak on EANM-compliant (reconstruction method 1) and PSF-based (reconstruction method 2) images° | FOS/IVH = 29 FF = 3 SS = 10 GLRLM = 22 GLCM = 44 L = 1 SA = 2 | The majority of features had a high level of repeatability (ICC ≥ 0.90 for 63 features). Features were more sensitive to a change in delineation method (n = 25) than a change in reconstruction method (n = 3) |
| Yan[ | R | 17 | n.r., I–IV | Variability of PET textural features using different reconstruction methods, iteration numbers, and voxel size | Threshold uptake 40% of the SUVmax *° | FOS/IVH = 6 GLCM = 21 GLRLM = 11 GLSZM = 13 NGLDM = 5 NGTDM = 5 | Image features had different sensitivities to reconstruction settings (entropyHist, difference entropy, inverse difference normalized, inverse difference moment normalized, low gray-level run emphasis, high gray-level run emphasis, and low gray-level zone emphasis were the most robust features; skewness, cluster shade, and zone percentage exhibited large variations) |
| Yip[ | R | 26 | Staging, n.r. | Sensitivity of texture features to tumor motion by comparing static (3D) and respiratory-gated (4D) PET imaging | Threshold uptake 40% of the SUVmax | GLCM = 1 GLRLM = 1 NGTDM = 4 | 4D-PET derived textures were less susceptible to tumor motion and may have greater prognostic value |
FF: fractal features; FLAB: fuzzy locally adaptive Bayesian; FOS/IVH: first-order statistics/intensity-volume histogram; GLCM: gray-level co-occurrence matrix; GLRLM: gray-level run-length matrix; GLSZM: gray-level size-zone matrix; Gr: absolute gradient; ICC: intra-class correlation coefficient; L: Laplacian; LF: Laws family; n.a.: not available; n.r.: not reported; NGLDM: neighboring gray-level dependence matrix; NGTDM: neighborhood gray-tone difference matrix; P: prospective; R: retrospective; SA: spatial autocorrelation; SS: shape and size; W: wavelet
*Segmentation of only primary lung lesion.
#Segmentation of lymph nodes.
§ Segmentation of primary lung lesion and other tissues (e.g. lymph nodes).
°Included in the analysis only lung lesion with a volume > of a minimum cut-off (e.g. 3 mL).
Publications reporting studies on the diagnostic, prognostic and predictive role of texture analysis in NSCLC patients.
| Reference | Type of study | Patients, n | Setting, stage | Aspect investigated | Lesion segmentation method | PET features and textural index matrix | Main results |
|---|---|---|---|---|---|---|---|
| Apostolova[ | R | 60 | Staging, I–III | Prognostic value of asphericity | Adaptive threshold method*° | FOS/IVH = 2SS = 4 | Asphericity was a predictor of progression-free survival and overall survival |
| Budiawan[ | R | 44 | Staging, I–IV | Ability of PET features to predict lymph node metastases | Manual#° | FOS/IVH = 4 + visual score | Metastatic lymph nodes had higher heterogeneity (coefficient of variation) than inflammatory ones |
| Carvalho[ | n.r. | 220 | Staging, I–IIIb | Prognostic value of heterogeneity based on PET textural features | Absolute SUV cut-off values of 2.5, 3, and 4, threshold at 40% and 50% of SUVmax | FOS/IVH = 8 SS = 1 | Best prognostic value for overall survival was found for relative portions of the tumor above higher uptakes (80% SUV) |
| Cook[ | R | 53 | Staging, I–III | Ability of PET features to predict prognosis and disease progression after concurrent chemoradiotherapy | Threshold at 45% of the SUVmax * | FOS/IVH = 3 SS = 2 NGTDM = 4 | Coarseness, contrast, and busyness were associated with response to chemoradiotherapy and prognosis |
| Cook[ | P | 47 | Staging, IIIb–IV | Ability of PET features to predict prognosis and disease progression after erlotinib | Threshold at 40% of the SUVmax * | FOS/IVH = 8 SS = 2 NGTDM = 4 | Heterogeneity predictedresponse to erlotinib. Changes in entropyHist (baseline and 6 weeks) were independently associated with overall survival and treatment response |
| Desseroit[ | R | 116 | Staging, I–III | Develop a nomogram by exploiting intratumor heterogeneity (PET and CT features) to identify patients with the poorest prognosis | Fully automatic method (FLAB) | FOS/IVH = 3 SS = 1 GLCM = 2 GLSZM = 2 ( + 35 on CT images) | Intratumor heterogeneity could be used to create a nomogram with a higher stratification power than staging alone (poorest prognosis: stage III, large tumor volume, high PET heterogeneity, and low CT heterogeneity) |
| Fried[ | R | 195 | Staging, III | Ability of PET features to enhance overall survival risk stratification | Manual§° | FOS/IVH = 8 SS = 3 GLCM = 4 | Imaging features (solidity and primary tumor energy) improved risk stratification |
| Fried[ | R | 225 | Staging, III | Ability of PET features to identify patients who might benefit from a higher radiation dose compared with that for the entire stage III | Semiautomatic gradient based§ | FOS/IVH = 1 SS = 3 GLCM = 1 | Imaging features were found to be capable of isolating subgroups of patients who received a benefit or detriment from dose escalation |
| Ha[ | R | 30 | Diagnostic, n.r. | Correlation between metabolic heterogeneity and histopathologic characteristics | Adaptive threshold* | FOS/IVH = 1 GLCM = 21 Gr = 2 | The majority of texture features analyzed (including SUVmax) differed between Adk and Sqc |
| Hatt[ | R | 101 | Staging, I–III | Relationship between tumor MTV and derived heterogeneity measurements | Fully automatic method (FLAB)*° | FOS/IVH = 3 SS = 1 GLCM = 2 GLSZM = 2 | Correlation between MTV and textural features varied greatly depending on the MTV (reduced correlation for increasing volumes) |
| Kang[ | R | 116 | Staging, III | Ability of PET features to predict disease progression after concurrent chemoradiotherapy | Absolute SUV cut-off value of 3.0* | FOS/IVH = 2 SS = 1 | Intratumoral heterogeneity predicted disease progression after chemoradiotherapy in inoperable stage III NSCLC |
| Kim[ | R | 119 | Staging, I | Ability of PET features to predict prognosis after curative surgical resection in pathologically N0 tumor | Absolute SUV cut-off value of 2.5* | FOS/IVH = 2 SS = 2 | Heterogeneity of primary tumor was predictive of recurrence in pN0 Adk but not in Sqc |
| Lovinfosse[ | R | 63 | Staging, I | Ability of PET features to predict prognosis after radiotherapy | Fully automatic method (FLAB)* | FOS/IVH = 7 SS = 2 GLCM = 6 GLSZM = 2 NGTDM = 3 | Intratumoral heterogeneity (dissimilarity) appeared to be a strong independent outcome predictor after radiotherapy |
| Miwa[ | R | 54 | Diagnostic, n.a. | Ability of PET and CT features to differentiate malignant from benign pulmonary nodules | Threshold at 40–100% (intervals of 2%) of SUVmax * | FOS/IVH = 1 FF = 1 (+1 on CT images) | Intratumoral heterogeneity could help to differentiate malignant and benign pulmonary nodules (better diagnostic ability of density fractal dimension on PET than morphological fractal dimension on CT) |
| Nair[ | R | 172 (study cohort = 25, external cohort = 63, validation cohort = 84) | Staging, I–IV (study cohort) and I–II (validation cohort) | Identify individual genes and gene expression signatures associated with prognostically relevant PET features | Adaptive threshold method* | FOS/IVH = 10 SS = 3 | Four genes (LY6E, RNF149, MCM6, FAP) associated with textural features were also associated with survival |
| Ohri[ | P | 250, only 201 analyzed | Staging, IIb–III | Prognostic value of heterogeneity based on PET textural features | Semiautomatic gradient-based | FOS/IVH SS GLCM GLRLM GLSZM NGTDM NGLDM (total 45) + visual score | SumAverg was an independent predictor of overall survival |
| Pyka[ | R | 45 | Staging, I | Ability of PET features to predict prognosis and disease progression after radiotherapy | Absolute SUV cut-off values of 2.0 and 2.5* | FOS/IVH = 3 SS = 1 GLCM = 2 NGTDM = 3 | Tumor heterogeneity was associated with response to radiation therapy |
| Tixier[ | R | 108, only 102 analyzed | Staging, I–III | Prognostic value of heterogeneity | Fully automatic method (FLAB)*^° | FOS/IVH = 3 SS = 2 GLCM = 3 GLSZM = 3 + visual score | High SUV, large metabolic volumes, and high heterogeneity were associated with poorer overall survival and recurrence-free survival |
| Vaidya[ | R | 27 | Staging, I–IV | Ability of PET and CT features to predict disease progression after radiotherapy | Manual | FOS/IVH = 12 SS = 2 GLCM = 4 ( + 32 on CT images) | IVH parameters (Ix metrics for PET and Vx metrics for CT) yielded the highest association with locoregional control |
| van Gómez López[ | R | 38 | Staging, I–IIIa | Correlation between metabolic heterogeneity and pathologic staging | Absolute SUV cut-off value of 2.5* | FOS/IVH = 2 SS = 2 GLCM = 5 | Tumor heterogeneity was correlated with global metabolic parameters, and both were associated with macroscopic tumor diameter and, under special conditions (exclusion of a small tumor with high AJCC stage), with the AJCC stage |
| Win[ | P | 122 (study cohort = 56, validation cohort = 66) | Staging, I–IV | Ability of PET and CT features to predict survival | Threshold at 42% of the SUVmax * | FOS/IVH = 2 ( + 1 on CT images) | PET-derived heterogeneity was predictive of survival at univariate analysis; at multivariate analysis only CT-derived heterogeneity, stage, and permeability were independent predictors of survival |
| Wu[ | R | 101 (study cohort = 70, validation cohort = 31) | Staging, I | Ability of PET features to predict distant metastases | Fully automatic method* | FOS/IVH = 11 SS = 2 GLCM = 3 W = 24 LF = 30 | The optimal prognostic model for identifying groups at risk of developing distant metastasis included SUVpeak2mL and Gauss cluster shadeLaws |
Adk: adenocarcinoma type; FF: fractal features; FLAB: fuzzy locally adaptive Bayesian; FOS/IVH: first-order statistics/intensity-volume histogram; GLCM: gray-level co-occurrence matrix; GLRLM: gray-level run-length matrix; GLSZM: gray-level size-zone matrix; Gr: absolute gradient; ICC: intra-class correlation coefficient; L: Laplacian; LF: Laws family; n.a.: not available; n.r.: not reported; NGLDM: neighboring gray-level dependence matrix; NGTDM: neighborhood gray-tone difference matrix; P: prospective; R: retrospective; SA: spatial autocorrelation; Sqc: squamocellular types; SS: shape and size; W: wavelet.
*Segmentation of only primary lung lesion.
#Segmentation of lymph nodes.
§Segmentation of primary lung lesion and other tissues (e.g. lymph nodes).
^Application of partial volume correction.
°Included in the analysis only lung lesion with a volume > of a minimum cut-off (e.g. 3 mL).
Summary of clinically relevant results in investigations assessing the diagnostic, prognostic and predictive role of FDG-PET/CT texture analysis.
|
|
| Compared with non-malignant lesions, malignant lung nodules are characterized by higher SUVmax and lower morphological and density fractal dimensions[ |
| Metastatic lymph nodes are characterized by higher heterogeneity (coefficient of variation) than inflammatory ones[ |
| Large lesions are characterized by high heterogeneity (i.e., visual score, entropyGLCM, coefficient of variation)[ |
| Each subtype of NSCLC tumor has different metabolic heterogeneity characteristics. Compared with Adk, Sqc is characterized by higher SUVmax, AUC-IVH, energyGLMC, entropyGLCM, sum entropy, difference entropy, and inverse different moment and by lower homogeneityGLCM, sum of squares, angular second moment, ratio of non-zeroGr, and difference variance[ |
|
|
| Heterogeneity (i.e., AUC-CSH) can predict recurrence in pN0 Adk patients who have undergone curative surgery but not in Sqk patients (high heterogeneity is associated with a shorter DFS)[ |
| Best prognostic value for overall survival is found for relative portions of the tumor above higher uptakes defined as SUVmax > 80% (i.e., V80) in patients who received radiation therapy (sequential chemoradiation, concurrent chemoradiation, or only radiation). The higher the portion above higher uptake (V80), the better the prognosis[ |
| Heterogeneity (i.e., low AUC-CSH) identifies patients with inoperable stage III NSCLC with poor PFS[ |
| High SUVmax, large MTV, and high heterogeneity (i.e., high entropyGLCM, high asphericity, homogeneityGLCM, and high dissimilarity, size-zone variability, and low zone percentage) are associated with poorer OS and RFS in stage I–III NSCLC[ |
| Tumor heterogeneity (i.e., entropyGLCM) is associated with response to radiation therapy in NSCLC (DSS is lower for patients with high entropyGLCM)[ |
| Lesions in responders (complete or partial response) to chemoradiotherapy are characterized by lower coarseness, contrastNGTDM, and busyness than non-responders (stable or progressive disease). High coarseness values are associated with an increased risk of progression (increased risk of death), whereas high contrastNGTDM and busyness values are associated with a lower risk of progression (PFS and LPFS)[ |
| Large primary tumors with low SumAverage (i.e., more heterogeneous) have a poor prognosis following chemoradiotherapy[ |
| Lesions in responders to erlotinib are characterized by lower heterogeneity than those in non-responders. Specifically, lower heterogeneity after 6 weeks of treatment, as measured by contrast NGTDM, is independently associated with longer survival, and a larger reduction in heterogeneity between baseline and 6 weeks of treatment, as measured by entropyHist, is independently associated with longer survival and with treatment response[ |
| Tumor heterogeneity (i.e., dissimilarity) appears to be a strong independent outcome predictor (DSS and DFS) after radiotherapy. Low dissimilarity is associated with a higher risk of recurrence[ |
| The optimal prognostic model for identification of groups of NSCLC patients at risk for developing distant metastasis includes SUVpeak2mL and Gauss cluster shadeLaws. High SUVpeak2mL and Gauss cluster shadeLaws are associated with an increased risk of distant metastases[ |
| Solidity (which quantifies the dispersion of primary and nodal disease in a local region, with high values corresponding to disease that is compact and in close proximity, and low values corresponding to disease that is dispersed) and primary tumor energyGLCM (higher level for tumors that are more heterogeneous) improve risk stratification compared with a model with conventional prognostic factors alone in stage III NSCLC. Solidity and primary tumor energyGLCM are capable of isolating subgroups of patients who will receive a benefit or detriment from dose escalation (i.e., as disease solidity and primary co-occurrence matrix energy increase, patients receiving higher dose radiation therapy have improved OS and PFS compared with those receiving lower doses)[ |
Adk: adenocarcinoma type; AUC-IVH: area under the curve within the intensity volume histogram; DFS: disease-free survival; DSS: disease-specific survival; GLCM: gray-level co-occurrence matrix; GLRLM: gray-level run-length matrix; GLSZM: gray-level size-zone matrix; Gr: absolute gradient; LPFS: local progression-free survival; MTV: metabolic tumor volume; NGTDM: neighborhood gray-tone difference matrix; NSCLC: non-small cell lung cancer; OS: overall survival; PFS: progression-free survival; Sqc: squamocellular types; SUV: standardized uptake value.
Summary of relevant methodological issues in calculating and reporting FDG-PET/CT texture analysis.
|
| |
| a) | scanner |
| b) | method of images acquisition (e.g. respiratory motion, dynamic) |
| c) | parameters used to acquire images |
| d) | parameters used to reconstruct images |
| e) | type of images used to extract features (i.e., PET or both PET and CT) |
| f) | “target” of texture analysis (e.g., primary tumor, lymph nodes, metastases) |
| g) | application of PVC and/or a minimum lesion size/volume |
| h) | method of segmentation (e.g., threshold uptake 40% of the SUVmax) |
| i) | discretization method (e.g., fixed number of bins) |
| j) | software |
| k) | features and matrix computation method* |
|
| |
| Datasets of 10–15 patients per feature have been recommended to test the prognostic power of texture features | |
|
| |
| The use of the radiomics features insensitive to acquisition modes and reconstruction parameters is recommended. A correlation of conventional metrics (SUV, MTV, etc.) and texture features should be assessed to evaluate the potential complementary value of the measures. Independent validation datasets are needed to confirm the results. | |
*A proposal for a consistent terminology is reported within the Supplementary material.