| Literature DB >> 28748524 |
Usman Bashir1, Gurdip Azad2, Muhammad Musib Siddique2, Saana Dhillon2, Nikheel Patel2, Paul Bassett3, David Landau2,4, Vicky Goh2,5, Gary Cook2,6.
Abstract
BACKGROUND: Measures of tumour heterogeneity derived from 18-fluoro-2-deoxyglucose positron emission tomography/computed tomography (18F-FDG PET/CT) scans are increasingly reported as potential biomarkers of non-small cell lung cancer (NSCLC) for classification and prognostication. Several segmentation algorithms have been used to delineate tumours, but their effects on the reproducibility and predictive and prognostic capability of derived parameters have not been evaluated. The purpose of our study was to retrospectively compare various segmentation algorithms in terms of inter-observer reproducibility and prognostic capability of texture parameters derived from non-small cell lung cancer (NSCLC) 18F-FDG PET/CT images. Fifty three NSCLC patients (mean age 65.8 years; 31 males) underwent pre-chemoradiotherapy 18F-FDG PET/CT scans. Three readers segmented tumours using freehand (FH), 40% of maximum intensity threshold (40P), and fuzzy locally adaptive Bayesian (FLAB) algorithms. Intraclass correlation coefficient (ICC) was used to measure the inter-observer variability of the texture features derived by the three segmentation algorithms. Univariate cox regression was used on 12 commonly reported texture features to predict overall survival (OS) for each segmentation algorithm. Model quality was compared across segmentation algorithms using Akaike information criterion (AIC).Entities:
Keywords: 18F-FDG PET; Inter-observer reproducibility; Non-small cell lung cancer; Prognosis; Segmentation
Year: 2017 PMID: 28748524 PMCID: PMC5529305 DOI: 10.1186/s13550-017-0310-3
Source DB: PubMed Journal: EJNMMI Res ISSN: 2191-219X Impact factor: 3.138
Patient demographics and clinical characteristics
| Patient characteristic | Value |
|---|---|
| Male:female | 31:22 |
| Tumour subtype | |
| Adenocarcinoma | 21 (40%) |
| Squamous cell carcinoma | 24 (45%) |
| Not specified | 8 (15%) |
| T status | |
| T1 | 6 (11%) |
| T2 | 14 (27%) |
| T3 | 15 (28%) |
| T4 | 17 (32%) |
| Tx | 1 (2%) |
| N status | |
| N0 | 11 (21%) |
| N1 | 4 (8%) |
| N2 | 33 (62%) |
| N3 | 5 (9%) |
| Tumour stage | |
| IB | 3 (6%) |
| IIB | 5 (9%) |
| IIIA | 24 (45%) |
| IIIB | 21 (40%) |
| Median interval between 18F-FDG PET and start of treatment (days) | 45 (range 0–174) |
| Median radiotherapy dose (Gy) | 64 (range 55–64) |
| Median chemotherapy cycles | 4 (range 1–6) |
Fig. 1Boxplots comparing the three segmentation algorithms in volume measurement. The boxes represent the interquartile range (IQR). Horizontal lines through the boxes show median values. The whiskers represent values within 1.5*IQR. 40P = 40% of maximum intensity threshold. FH freehand, FLAB fuzzy locally adaptive Bayesian
Fig. 2Lesion volumes computed with FLAB are plotted against JSI between FLAB and FH (a) and JSI between FLAB and 40P (b). Slope lines are shown along with 95% standard error of slope (dashed lines). r 2 values are displayed on the figures. The nearly straight slope lines and small r 2 imply that there is no particular trend to the degree of mismatch between FH and FLAB derived volumes over the range of tumour sizes. JSI Jaccard similarity index
Fig. 3Boxplots comparing ICC values for the three segmentation algorithms over the four groups of texture parameters, i.e. first-order, second-order, and higher-order statistics, and model-based parameters. 40P 40% of maximum intensity threshold. FH freehand, FLAB fuzzy locally adaptive Bayesian, ICC intraclass correlation coefficient
Comparison of ICC values of 11 commonly reported texture parameters derived with 3 contending segmentation algorithms
| Texture parameter | ICC FH (95% CI) | ICC FLAB (95% CI) | ICC 40P (95% CI) |
|---|---|---|---|
| TLG* | 0.948(0.919–0.967) | 0.939(0.906–0.962) | 0.968(0.95–0.98) |
| SUVmean | 0.9 (0.84–0.93) | 0.91 (0.86–0.94) | 0.94 (0.91–0.96) |
| SUVmax | 0.951 (0.925–0.97) | 0.927 (0.887–0.954) | 0.943 (0.911–0.964) |
| SUV Standard deviation | 0.911 (0.865–0.945) | 0.907 (0.859–0.942) | 0.937 (0.903–0.961) |
| First-order entropy | 0.745 (0.634–0.834) | 0.775 (0.673–0.854) | 0.87 (0.805–0.918) |
| GLCM entropy | 0.767 (0.663–0.849) | 0.779 (0.679–0.857) | 0.868 (0.801–0.916) |
| GLCM homogeneity | 0.782 (0.682–0.859) | 0.833 (0.752–0.893) | 0.912 (0.866–0.945) |
| GLCM dissimilarity | 0.753 (0.644–0.839) | 0.82 (0.734–0.885) | 0.898 (0.845–0.936) |
| GLSZM intensity variability* | 0.917 (0.874–0.949) | 0.908 (0.86–0.943) | 0.931 (0.894–0.957) |
| NGTDM coarseness | 0.613 (0.469–0.738) | 0.657 (0.522–0.77) | 0.876 (0.814–0.922) |
| NGTDM contrast* | 0.704 (0.581–0.805) | 0.72 (0.601–0.816) | 0.852 (0.779–0.906) |
Note that ICC of MATV was not calculated as it was substituted for by JSI
*Variable was log-transformed
Results of univariate cox proportional hazards done on 12 commonly reported texture parameters
| Texture parameter | FH | FLAB | 40P | |||
|---|---|---|---|---|---|---|
| HR (95% CI) | AIC (95% CI) | HR (95% CI) | AIC (95% CI) | HR 40P (95% CI) | AIC (95% CI) | |
| MATV† | 1.1 (0.55–2.02) | 220.81 (178.62–261.66) | 1.06 (0.51–2.16) | 220.8(176.18–259.29) | 0.8 (0.39–1.61) | 220.46 (178.15–259.25) |
| TLG† | 1.0 (0.6–1.7) | 220.84 (177.82–261.74) | 1.0(0.57–1.73) | 220.84 (173.1–257.46) | 0.88 (0.53–1.47) | 220.55 (176.2–264.39) |
| SUVmean | 1 (1–1) | 220.41(176.44–261.64) | 1 (1–1) | 220.24(178.26–261.34) | 1 (1–1) | 220.5(172.19–262.68) |
| SUVmax | 1 (1–1) | 220.77 (174.54–259.51) | 1 (1–1) | 220.77 (178.24–258.84) | 1 (1–1) | 220.69 (175.34–261.62) |
| SUV standard deviation | 1 (1–1) | 220.23 (176.06–260.46) | 1 (1–1) | 220.33 (177.47–262.87) | 1 (1–1) | 220.26 (177.06–264.65) |
| first-order entropy | 0.16 (0.04–0.74)* | 216.12 (174.54–255.93) | 0.2 (0.06–0.73)* | 215.91 (175.69–259.35) | 0.04 (0.005–0.328 | 212.47 (169.02–256.07) |
| GLCM entropy | 0.56 (0.22–1.42) | 219.53 (178.49–261.04) | 0.45 (0.2–1.01) | 217.8 (173.74–258.62) | 0.21 (0.04–0.89)* | 216.75 (173.21–256.01) |
| GLCM homogeneity | 14.89 (0.12–1799.35) | 219.68 (175.77–258.64) | 32.51 (0.26–3999.05) | 219.12 (173.93–256.75) | 193.94 (0.02–2,314,503.14) | 219.62 (176.21–259.9) |
| GLCM dissimilarity | 0.836 (0.54–1.28) | 220.18 (176.78–257.79) | 0.75 (0.49–1.13) | 219.09 (173.93–257.71) | 0.83 (0.5–1.37) | 220.32 (177.06–257.92) |
| GLSZM intensity variability† | 0.46 (0.18–1.18) | 218.24 (169.74–256.67) | 0.65 (0.32–1.34) | 219.55 (174.97–260.52) | 0.72 (0.35–1.47) | 220.07 (173.75–258.65) |
| NGTDM coarseness | 0.88 (0.74–1.05) | 218.64 (170.64–257.63) | 0.9 (0.74–1.08) | 219.6 (175.71–260.62) | 0.82 (0.67–1.02) | 217.62 (168.78–252.86) |
| NGTDM contrast† | 0.24 (0.04–1.35) | 218.45 (178.17–260.37) | 0.18 (0.03–0.85) | 216.85 (173.33–256) | 0.46 (0.04–4.6) | 220.42 (173.91–257.85) |
AIC Akaike information criterion, CI confidence interval, HR hazard ratio
*p value <0.05
†Variable log-transformed