| Literature DB >> 31063500 |
Qian Du1, Michael Baine2, Kyle Bavitz2, Josiah McAllister2, Xiaoying Liang3, Hongfeng Yu4, Jeffrey Ryckman2, Lina Yu4, Hengle Jiang4, Sumin Zhou2, Chi Zhang1, Dandan Zheng2.
Abstract
Radiomic analysis has recently demonstrated versatile uses in improving diagnostic and prognostic prediction accuracy for lung cancer. However, since lung tumors are subject to substantial motion due to respiration, the stability of radiomic features over the respiratory cycle of the patient needs to be investigated to better evaluate the robustness of the inter-patient feature variability for clinical applications, and its impact in such applications needs to be assessed. A full panel of 841 radiomic features, including tumor intensity, shape, texture, and wavelet features, were extracted from individual phases of a four-dimensional (4D) computed tomography on 20 early-stage non-small-cell lung cancer (NSCLC) patients. The stability of each radiomic feature was assessed across different phase images of the same patient using the coefficient of variation (COV). The relationship between individual COVs and tumor motion magnitude was inspected. Population COVs, the mean COVs of all 20 patients, were used to evaluate feature motion stability and categorize the radiomic features into 4 different groups. The two extremes, the Very Small group (COV≤5%) and the Large group (COV>20%), each accounted for about a quarter of the features. Shape features were the most stable, with COV≤10% for all features. A clinical study was subsequently conducted using 140 early-stage NSCLC patients. Radiomic features were employed to predict the overall survival with a 500-round bootstrapping. Identical multiple regression model development process was applied, and the model performance was compared between models with and without a feature pre-selection step based on 4D COV to pre-exclude unstable features. Among the systematically tested cutoff values, feature pre-selection with 4D COV≤5% achieved the optimal model performance. The resulting 3-feature radiomic model significantly outperformed its counterpart with no 4D COV pre-selection, with P = 2.16x10-27 in the one-tailed t-test comparing the prediction performances of the two models.Entities:
Mesh:
Year: 2019 PMID: 31063500 PMCID: PMC6504105 DOI: 10.1371/journal.pone.0216480
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Patient, tumor, and treatment characteristics and clinical outcomes.
| Total (n = 140) | Median (range) or Number of patients (percentage) | |
|---|---|---|
| Age | 73 (40–94) | |
| Gender | Female | 72 (51%) |
| Male | 68 (49%) | |
| Ethnicity | African-American | 15 (11%) |
| Caucasian | 124 (89%) | |
| Others | 1 (1%) | |
| Pack-years | 40 (0–180) | |
| ECOG performance status | 0 | 40 (29%) |
| 1 | 67 (48%) | |
| 2 | 28 (20%) | |
| 3 | 5 (4%) | |
| Histology | Adenocarcinoma | 61 (44%) |
| Squamous cell carcinoma | 46 (33%) | |
| NSCLC not otherwise specified | 33 (24%) | |
| Overall stage | Stage 1 | 140 (100%) |
| Tumor stage | T1 | 122 (87%) |
| T2 | 18 (13%) | |
| Tumor location | Right upper lobe | 45 (32%) |
| Right middle lobe | 5 (4%) | |
| Right lower lobe | 31 (22%) | |
| Left upper lobe | 37 (26%) | |
| Left lower lobe | 21 (15%) | |
| Chestwall | 1 (1%) | |
| SBRT prescription dose and fractionation | 60 Gy in 5 fractions | 2 (1%) |
| 50 Gy in 5 fractions | 120 (86%) | |
| 48 Gy in 4 fractions | 18 (13%) | |
| Follow-up time (months) | 22.1 (0.5–121.3) | |
| Overall survival at last follow-up | No | 57 (41%) |
| Yes | 83 (59%) | |
| Time to death (months) | 15.6 (0.6–67.1) | |
Fig 1Radiomic feaure 4D stability COV grouping.
(a) The COV categories comparing the original features vs. derived features applying different wavelet filters. (b) The COV categories comparing different feature classes.
Radiomic feature 4D variability grouping based on population mean COV calculated over individual 4D breathing phases.
| Feature Class | Very Small (COV≤5%) | Small (5%<COV≤10%) | Intermediate (10%<COV≤20%) | Large (COV>20%) |
|---|---|---|---|---|
| Shape | ||||
| First Order | ||||
| GLCM | ||||
| GLRLM | LRE, SRE, RP, RunEntropy, RLNUN | GLV, GLNUN, GLNU, RLNU | RV, SRHGLE, LRHGLE, HGLRE | SRLGLE, LGLRE, LRLGLE |
| GLSZM | GLNUN, SZNUN, ZP, SAE, ZE | GLV, SZNU, GLNU | SAHGLE, HGLZE | ZV, LAE, LALGLE, LAHGLE, LGLZE, SALGLE |
| GLDM | SDE, DE | GLV, DNU, DNUN | HGLE, GLNU, SDHGLE, LDE | LDLGLE, Emphasis, DV, LDHGLE, SDLGLE, LGLE |
| NGTDM | Coarseness | Complexity, Strength, Contrast, Busyness |
Fig 2The distributions of COV for the population (total count = 20) among different stability bins (COV≤5%, 5%
Fig 3Heat map of the individual COV4D plotted against the patients’ tumor motion magnitude. Rows of the heatmap represent different patients (ordered by the patient’s tumor motion magnitude) and columns describe features. The cell color, changing gradually from green to red, indicates increasing 4D COV from 0% to≥20%. Features were grouped with the complete-linkage hierarchical clustering method, based on the Euclidean distance.
Fig 4Radiomic heat map.
Unsupervised clustering of patients based on the Stable features (COV≤5%) on the y axis (n = 140) and the radiomic feature expression patterns separately within the Stable (n = 200) and Unstable groups (n = 641) on the x axis. Clinical parameters for the patients are also illustrated at the bottom. For gender, 0 = male and 1 = female; for race (ethnicity), 0 = Caucasian, 1 = African-American, and 2 = others; for pack-year, the patients were divided into 4 quartiles and 0–3 represent the quartiles from low to high; for histology, 0 = adenocarcinoma, 1 = squamous cell carcinoma, and 2 = non-small cell not otherwise classified pathologically; for tumor location (site), 0 = right upper lobe, 1 = right middle lobe, 2 = right lower lobe, 3 = left upper lobe, 4 = left lower lobe, and 5 = chest wall.
Fig 5A box plot of the χ2 values of data in the test datasets during the 500 bootstrapping round of multiple regression for overall survival prediction with different COV cutoffs and all features.
For each model, a higher χ2 indicates a superior prediction performance, i.e. a large difference between high risk cases and low risk cases. Except COV cutoffs, all models had the same setting, i.e. a univariate regression P-value cutoff of 0.01 and Q-value cutoff of 0.05 were used by all models. The box plot shows χ2 statistics among the 500 bootstrapping rounds, including the median (middle horizontal line), the first and third quartile (the lower and upper bound of the box), and the minimum and maximum (the dash line limits). A comparison of prediction performance between using a given COV cutoff and not using any COV cutoff (i.e. modeling with all features) was conducted. The P-value of the one-tailed t-test comparing the logarithm of χ2 values between the model applying a given 4D COV cutoff and its corresponding model without any 4D COV pre-selection (i.e. all) is shown for each 4D pre-selection model with a given COV cutoff. *** indicates P-value < 10−5, ** indicates <10−3, * indicates P-value < 0.05, and “ns” indicates not significant (p-value > 0.05).
Fig 6Kaplan-Meier survival curves of an example bootstrapping round to compare the 4D pre-selection model with the optimal cutoff (4D COV≤5%) and the corresponding model without 4D stability pre-selection.
For each case, the Kaplan-Meier survival curves are shown to compare the overall survival of the model predicted low- and high-risk groups based on the median risk score. The χ2 value from the logrank test comparing the two risk groups is also shown.
The P-values of the one-tailed t-test comparing the logarithm of χ2 values between the model meeting a given pre-selection criteria (applying both the 4D COV and the combined univariate regression P-value and FDR adjusted Q-value cutoffs) and its corresponding model without 4D COV pre-selection (applying the combined univariate regression P-value and FDR adjusted Q-value cutoffs alone).
A 3-feature radiomic model developed from the stable features with 4D COV≤5% and univariate regression P-value<0.01 and Q-value<0.05 achieved the most significantly improved performance over its counterpart without 4D COV pre-selection (i.e. developed from all features with univariate regression P-value<0.01 and Q-value<0.05 regardless of their 4D COV values), with a P-value of 2.16x10-27.
| Pre-selection criteria | Univariate P<0.005, Q<0.04 | Univariate P<0.007, Q<0.045 | Univariate P<0.01, Q<0.05 | Univariate P<0.03, Q<0.07 | Univariate P<0.05, |
|---|---|---|---|---|---|
| 4D COV≤5% | 6.07 x10-8 | 1.28 x10-16 | 2.16 x10-27 | 1.66 x10-7 | 0.037 |
| 4D COV≤10% | 7.93 x10-3 | 9.84 x10-4 | 2.23 x10-4 | 1.94 x10-2 | 0.463 |
| 4D COV≤15% | 7.93 x10-3 | 9.84 x10-4 | 2.23 x10-4 | 1.974 x10-2 | 0.302 |
| 4D COV≤20% | 0.258 | 0.137 | 3.34 x10-2 | 0.154 | 0.396 |
| 4D COV≤25% | 0.500 | 0.500 | 0.500 | 0.087 | 0.016 |