| Literature DB >> 35651590 |
Ting Yan1, Lili Liu1, Zhenpeng Yan1, Meilan Peng1, Qingyu Wang2, Shan Zhang2, Lu Wang1, Xiaofei Zhuang3, Huijuan Liu1, Yanchun Ma1, Bin Wang2, Yongping Cui1.
Abstract
To construct a prognostic model for preoperative prediction on computed tomography (CT) images of esophageal squamous cell carcinoma (ESCC), we created radiomics signature with high throughput radiomics features extracted from CT images of 272 patients (204 in training and 68 in validation cohort). Multivariable logistic regression was applied to build the radiomics signature and the predictive nomogram model, which was composed of radiomics signature, traditional TNM stage, and clinical features. A total of 21 radiomics features were selected from 954 to build a radiomics signature which was significantly associated with progression-free survival (p < 0.001). The area under the curve of performance was 0.878 (95% CI: 0.831-0.924) for the training cohort and 0.857 (95% CI: 0.767-0.947) for the validation cohort. The radscore of signatures' combination showed significant discrimination for survival status. Radiomics nomogram combined radscore with TNM staging and showed considerable improvement over TNM staging alone in the training cohort (C-index, 0.770 vs. 0.603; p < 0.05), and it is the same with clinical data (C-index, 0.792 vs. 0.680; p < 0.05), which were confirmed in the validation cohort. Decision curve analysis showed that the model would receive a benefit when the threshold probability was between 0 and 0.9. Collectively, multiparametric CT-based radiomics nomograms provided improved prognostic ability in ESCC.Entities:
Keywords: computed tomography; esophageal squamous cell carcinoma; nomogram; progression-free survival; radiomics
Year: 2022 PMID: 35651590 PMCID: PMC9149002 DOI: 10.3389/fncom.2022.885091
Source DB: PubMed Journal: Front Comput Neurosci ISSN: 1662-5188 Impact factor: 3.387
Patient and tumor characteristics in the training and validation cohorts.
|
| |||
|---|---|---|---|
| Gender | 0.539 | ||
| Male | 146 (71.6%) | 46 (67.6%) | |
| Female | 58 (28.4%) | 22 (32.4%) | |
| Age | 0.398 | ||
| Median (interquartile range) | 60.22 | 60.44 | |
| ≤ 56 | 63 (30.9%) | 19 (27.9%) | |
| 56–66 | 92 (45.1%) | 27 (39.7%) | |
| ≥66 | 49 (24.0%) | 22 (32.4%) | |
| Location | 0.452 | ||
| Up | 10 (4.9%) | 5 (7.4%) | |
| Mid | 135 (66.2%) | 48 (70.6%) | |
| Down | 59 (28.9%) | 15 (22.1%) | |
| Drinking | 0.662 | ||
| Yes | 75 (36.8%) | 23 (33.8%) | |
| No | 129 (63.2%) | 45 (66.2%) | |
| Smoking | 0.569 | ||
| Yes | 118 (57.8%) | 42 (61.8%) | |
| No | 86 (42.2%) | 26 (38.2%) | |
| Genetic history | 0.880 | ||
| Yes | 64 (31.4%) | 22 (32.4%) | |
| No | 140 (68.6%) | 46 (67.6%) | |
| Invasion degree | 0.887 | ||
| Full layer | 121 (59.3%) | 41 (60.3%) | |
| Non-full layer | 83 (40.7%) | 27 (39.7%) | |
| TNM | 0.556 | ||
| I | 19 (9.3%) | 9 (13.2%) | |
| II | 105 (51.5%) | 36 (52.9%) | |
| III | 80 (39.2%) | 23 (33.8%) | |
| Lymph node metastasis | 0.255 | ||
| Yes | 88 (43.1%) | 24 (35.3%) | |
| No | 116 (56.9%) | 44 (64.7%) |
Radiomics features selection results based on the ANOVA.
|
|
|
|---|---|
| Number of selected features | 221 |
| The best-performance feature | HLL-original_glcm_InverseVariance |
| ( |
Radiomics signature selection results with descriptions.
|
|
|
|---|---|
| HHL_firstorder_Skewness | 0.066 |
| HLH_firstorder_Median | −1.812 |
| HLH_glszm_SmallAreaEmphasis | −13.697 |
| HLH_glszm_ZoneEntropy | 0.092 |
| HLL_glcm_ClusterShade | −0.004 |
| HLL_glcm_InverseVariance | −6.470 |
| HLL_glszm_GrayLevelNonUniformityNormalized | −0.612 |
| HLL_glszm_SizeZoneNonUniformityNormalized | 15.084 |
| HLL_gldm_SmallDependenceHighGrayLevelEmphasis | −0.0008 |
| HLL_ngtdm_Complexity | −0.0007 |
| LHH_glszm_LargeAreaLowGrayLevelEmphasis | 1.02e-06 |
| LHH_gldm_DependenceNonUniformityNormalized | 31.635 |
| LHH_ngtdm_Busyness | 0.001 |
| LHL_glcm_Idn | 12.445 |
| LHL_glszm_LargeAreaHighGrayLevelEmphasis | −1.42e-10 |
| LHL_gldm_SmallDependenceLowGrayLevelEmphasis | −82.462 |
| LLH_firstorder_Energy | 9.77e-10 |
| LLH_glcm_Contrast | 0.029 |
| LLH_glszm_SizeZoneNonUniformity | 1.37e-05 |
| LLH_ngtdm_Complexity | 5.66e-05 |
| LLL_gldm_LargeDependenceHighGrayLevelEmphasis | 2.44e-06 |
Skewness:The asymmetric distribution of the Mean value. Depending on where the tail is elongated and the mass of the distribution is concentrated, it can be positive or negative.
Median:The median gray level intensity within ROI.
Small Area Emphasis (SAE): A measure of the distribution of small size zones, with a greater value indicative of more smaller size zones and more fine textures.
Zone Entropy (ZE): The degree of instability and variation in spatial and regional differences of the image distribution range.
Cluster Shade: A measure of skewness and uniformity of the GLCM. A higher cluster shade implies greater asymmetry about the mean.
Size Zone Non-Uniformity Normalized: The variability of size zone volumes throughout images, with a lower value indicating more homogeneity among zone size volumes in images. it's the normalized version of the SZN formula.
Small Dependence High Gray Level Emphasis (SDHGLE): Measures the joint distribution of small dependence with higher gray-level values.
Large Area Low Gray Level Emphasis (LALGLE): The proportion in images of the joint distribution of larger size zones with lower gray-level values.
Dependence Non-Uniformity Normalized (DNN): Measures the similarity of dependence throughout images, with a lower value indicating more homogeneity among dependencies in images. This is the normalized version of the DLN formula.
Busyness: A measure of the change from a pixel to its neighbor. A high value for busyness indicates a ‘busy’ image, with rapid changes of intensity between pixels and their neighborhood.
IDN (inverse difference normalized): Another measure of a local homogeneity of images. Unlike Homogeneity1, IDN normalizes the difference between neighboring intensity values by dividing over the total number of discrete intensity values.
Large Area High Gray Level Emphasis (LAHGLE): The proportion in images of the joint distribution of larger size zones with higher gray-level values.
Small Dependence Low Gray Level Emphasis (SDLGLE): Measures the joint distribution of small dependence with lower gray-level values.
Energy: The sum of squares of gray level intensity within ROI.
Contrast: A measure of local intensity variation, favoring values away from the diagonal (i=j). A larger value correlates with a greater disparity in intensity values among neighboring voxels.
Large Dependence High Gray Level Emphasis (LDHGLE): Measures the joint distribution of large dependence with higher gray-level values.
Figure 1Radiomics feature selection using LASSO logistic regression model. (A) Identification of the optimal penalization coefficient lambda (λ) in the LASSO model used 10-fold cross-validation and the minimum criterion. As a result, a λ value of 0.022 was selected. (B) LASSO coefficient profiles of the 221 radiomics features.
Figure 2Rad-score for each patient in the training cohort and validation cohort. (A) ROCs were employed to assess the radiomics signature discriminative performance of the survival status. ROC in the training cohort with 0.878 (95% CI: 0.831–0.924, sensitivity = 71.3%, specificity = 90.0%); ROC in the validation cohort with 0.857 (95% CI: 0.767–0.947, sensitivity = 62.9%, specificity = 97.0%). Rad-score for each patient in the training cohort (B) and validation cohort (C). Blue bars show scores for patients who survived without disease progression or were censored, while red bars show scores for those who experienced progression or died.
Figure 3Stratified analyses were performed to estimate PFS in various subgroups, comparing high-risk patients and low-risk patients.
Figure 4(A) A radiomics nomogram integrated the radiomics signature from CT images with the TNM staging system in the training cohort. (B) Calibration curve of the radiomics nomogram. The diagonal dotted line represents an ideal evaluation, while the yellow and red solid lines represent the performance of the nomogram. Closer fit to the diagonal dotted line indicates a better evaluation. (C) Adding Age, gender, invasion degree, location, genetic history, and metastasis to the radiomics nomogram. (D) Calibration curve of the radiomics nomogram with the addition of Age, gender, invasion degree, location, genetic history, metastasis.
Figure 5(A) The DCA of the radiomics-comparison-based nomogram. The black dotted line describes the scheme of no treatment. The green dotted line describes the scheme of treatment. The red line represents our predictive model with only traditional TNM staging combined with clinical features. And the blue line represents our personalized prediction model that added Rad-score. The x-axis is the threshold probability and the y-axis is the net benefit. It can be seen the personalized prediction model with Rad-score added had a better net benefit than the traditional predictive model when the threshold is in the range of 0–0.9. Hence, the patient with ESCC would receive benefit from taking our CT-based radiomics nomogram guidance. (B) Heatmap of associations between selected radiomics features and clinical data. p < 0.05 indicates statistical associations, as determined using t-tests.