| Literature DB >> 33368399 |
Elisabeth Pfaehler1, Liesbet Mesotten2,3, Ivan Zhovannik4,5, Simone Pieplenbosch1, Michiel Thomeer2,3, Karolien Vanhove2,6, Peter Adriaensens7, Ronald Boellaard1,8.
Abstract
BACKGROUND: Radiomics refers to the extraction of a large number of image biomarker describing the tumor phenotype displayed in a medical image. Extracted from positron emission tomography (PET) images, radiomics showed diagnostic and prognostic value for several cancer types. However, a large number of radiomic features are nonreproducible or highly correlated with conventional PET metrics. Moreover, radiomic features used in the clinic should yield relevant information about tumor texture. In this study, we propose a framework to identify technical and clinical meaningful features and exemplify our results using a PET non-small cell lung cancer (NSCLC) dataset.Entities:
Keywords: clinical value; feature selection; radiomics
Mesh:
Substances:
Year: 2021 PMID: 33368399 PMCID: PMC7985880 DOI: 10.1002/mp.14684
Source DB: PubMed Journal: Med Phys ISSN: 0094-2405 Impact factor: 4.071
Patient characteristics.
| Cancer stage | IA | 28 patients |
| IB | 13 patients | |
| IIA | 10 patients | |
| IIB | 8 patients | |
| IIIA | 29 patients | |
| IIIB | 15 patients | |
| IV | 47 patients | |
| Age | Mean | 67 yr |
| Std | 9.3 yr | |
| Sex | Men | 93 |
| Women | 57 |
Fig. 1Original tumor (left), two examples (middle and right) of the same tumor after randomly shuffling the intensity values in the VOI.
Fig. 2Illustration of the correlation between features and MATV or SUVMEAN: a): the feature Gray‐level nonuniformity GLSZM2D yielding a correlation coefficient with MATV of 0.97 (upper row) and a correlation with SUVMEAN of 0.34 (lower row). b): The feature joint average GLCM 3D avg yielding a correlation of 0.45 with MATV (upper row) and a correlation of 0.99 with SUVMEAN. [Color figure can be viewed at wileyonlinelibrary.com]
Fig. 3Number of features outside the random range (criteria 1) and not correlated with conventional metrics (criteria 1 + 2), and number of features after eliminating features highly correlated between each other (after mutual correlation) for the different volume ranges (left: all volumes are included, from left to right: more and more smaller lesions are excluded from the analysis until on the very right only tumors with a MATV > 103.94 mL are left). [Color figure can be viewed at wileyonlinelibrary.com]
Features selected for different volume ranges.
| Feature name | Selected by which volume range |
|---|---|
| Gray_level_non_uniformity (GLCM, 2Davg) | All volume ranges |
| Gray_level_non_uniformity (GLCM, 2Dmrg) | All volume ranges >= 3.2 mL |
| long_runs_emphasis (GLRLM, 2Davg) | All volume ranges >= 11.4 mL |
| Run_percentage (GLRLM 2DWmrg) | All volume ranges >= 33.4 mL |
| Run_length_variance (GLRLM 2Dvmrg) | All volume ranges >= 33.4 mL |
| Gray_level_non_uniformity (GLRLM, 3Davg) | All volume ranges >= 33.4 mL |
| Gray_level_non_uniformity (GLDZM 2Davg) | All volume ranges >=33.4 mL |
| Zone_percentage (GLDZM 2Davg) | All volume ranges >= 33.4 mL |
| Gray_level_non_uniformity (GLDZM 3D) | All volume ranges >= 33.4 mL |
| Zone_distance_non_uniformity (GLDZM 2Davg) | All volume ranges >= 45 mL |
| Zone_distance_non_uniformity (GLDZM 2Dmrg) | All volume ranges >= 45 mL |
| coarseness (NGLDM 2Dmrg) | All volume ranges >= 103.9 mL |
| small_distance_emphasis (GLDZM 2Davg) | All volume ranges >= 103.9 mL |
| Dependence_count_non_uniformity (NGTDM 2Dmrg) | ALL volume ranges >= 103.9 mL |
| Dependence_count_entropy (NGTDM 2Davg) | All volume ranges >= 103.9 mL |
Features selected by the RELIEF algorithm. Features that were also found by our procedure are displayed in bold, while features not reflecting actual texture are marked with (NoTex), features highly correlated with volume or SUVMEAN are marked with (CORR).
| Volume > 0 mL | Volume > 3.2 mL | Volume > 11.48 mL | Volume > 33.04 mL | Volume > 45 mL | Volume > 103.94 mL |
|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| difference variance (GLCM 2Dvmrg) |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| SUVPEAK |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| small distance emphasis (GLDZM 2Davg) | Zone size entropy (GLSZM 2Davg) (CORR) |
|
|
|
Features selected by the MRMR algorithm for the different volume ranges. Features that were also found by our procedure are displayed in bold, while features not reflecting actual texture are marked with (NoTex), features highly correlated with volume or SUVMEAN are marked with (CORR).
| Volume > 0 mL | Volume > 3.2 mL | Volume > 11.48 mL | Volume > 33.04 mL | Volume > 45 mL | Volume > 103.94 mL |
|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Fig. 4Model 1: Mean cross‐validation AUC of: (a) the feature run percentage which shows an increasing AUC with larger volumes included in the analysis; (b) the feature run length nonuniformity results in a reasonable accuracy when including the whole dataset, while the accuracy is decreasing with decreasing volume range and decreasing correlation with volume. [Color figure can be viewed at wileyonlinelibrary.com]
Fig. 5Model 2: Mean cross‐validation AUC of: (a) the feature run percentage, (b) the feature run length nonuniformity used together with MATV in the logistic regression model. [Color figure can be viewed at wileyonlinelibrary.com]
Fig. 6Model 3: Combined AUCs: for lesions above threshold, the feature run percentage is used for prediction, while for features below the threshold, volume is used for prediction. [Color figure can be viewed at wileyonlinelibrary.com]