| Literature DB >> 35186707 |
Degan Hao1, Qiong Li2, Qiu-Xia Feng2, Liang Qi2, Xi-Sheng Liu2, Dooman Arefan3, Yu-Dong Zhang2, Shandong Wu1,3,4,5.
Abstract
BACKGROUND: Gastric cancer is one of the leading causes of cancer death in the world. Improving gastric cancer survival prediction can enhance patient prognostication and treatment planning.Entities:
Keywords: deep learning - CNN; gastric cancer; multi-modal data analysis; radiomics; survival analysis (source: MeSH NLM)
Year: 2022 PMID: 35186707 PMCID: PMC8847133 DOI: 10.3389/fonc.2021.725889
Source DB: PubMed Journal: Front Oncol ISSN: 2234-943X Impact factor: 6.244
Figure 1Machine learning of multi-modal features for gastric cancer survival prediction and interpretation. The significant clinical variables, radiomics signature, and deep learning signature were integrated in the Cox model for survival prediction, and the effects of these features were measured and analyzed by hazard ratios at pre- and post-operative settings.
Figure 2Deep learning feature extraction from CT images through an attention-guided Variational AutoEncoder (attention-guided VAE) model. (A) model structure. (B) Gastric tumor region (yellow annotations) and the attention regions (highlighted by heatmaps) identified by the attention-guided VAE model.
Patient characteristics (i.e., 16 clinical variables) included for survival modeling.
| Characteristic | Group-A for training (n = 743) | Group-B for independent test (n = 318) | p-value |
|---|---|---|---|
|
| |||
| Age, mean ± Std | 61.8 ± 9.7 | 62.0 ± 9.6 | 0.43 |
| Sex, No. (%) | 0.20 | ||
| Male | 541 (72.8) | 221 (69.5) | |
| Female | 202 (27.2) | 97 (30.5) | |
| CA19-9 < 39 units/milliliter, No. (%) | 0.27 | ||
| Yes | 94 (12.7) | 35 (11.0) | |
| No | 649 (87.3) | 283 (89.0) | |
| CEA < 4.7 nanograms/milliliter, No. (%) | 0.24 | ||
| Yes | 160 (21.5) | 63 (19.8) | |
| No | 583 (78.5) | 255 (80.2) | |
| Biopsy histologic grade, No. (%) | 0.63 | ||
| Well/moderate | 440 (59.2) | 194 (61.0) | |
| Poor/undifferentiated | 303 (40.8) | 124 (39.0) | |
| Location, No. (%) | 0.99 | ||
| Upper | 165 (22.2) | 87 (27.4) | |
| Middle | 232 (31.2) | 96 (30.2) | |
| Lower | 332 (44.7) | 128 (40.3) | |
| Entire | 14 (1.9) | 7 (2.2) | |
| Radiologic T stage, No. (%) | 0.05 | ||
| rT1 stage | 152 (20.5) | 46 (14.5) | |
| rT2 stage | 123 (16.6) | 70 (22.0) | |
| rT3 stage | 286 (38.5) | 122 (38.4) | |
| rT4 stage | 182 (24.5) | 80 (25.2) | |
| Radiologic N stage, No. (%) | 0.83 | ||
| rN0 stage | 281 (37.8) | 113 (35.5) | |
| rN1 stage | 196 (26.4) | 81 (25.5) | |
| rN2 stage | 134 (18.0) | 65 (20.4) | |
| rN3 stage | 70 (9.4) | 34 (10.7) | |
| rN4 stage | 62 (8.3) | 25 (7.9) | |
|
| |||
| Pathological T stage†, No. (%) | 0.99 | ||
| pT1 stage | 202 (27.2) | 86 (27.0) | |
| pT2 stage | 95 (12.8) | 39 (12.3) | |
| pT3 stage | 202 (27.2) | 85 (26.7) | |
| pT4 stage | 244 (32.8) | 108 (34.0) | |
| Pathological N stage†, No. (%) | 0.69 | ||
| pN0 stage | 285 (38.4) | 115 (36.2) | |
| pN1 stage | 102 (13.7) | 51 (16.0) | |
| pN2 stage | 120 (16.2) | 47 (14.8) | |
| pN3a stage | 131 (17.6) | 53 (16.7) | |
| pN3b stage | 105 (14.1) | 52 (16.4) | |
| Surgical histologic grade, No. (%) | 0.44 | ||
| Well/moderate | 418 (56.3) | 170 (53.5) | |
| Poor/undifferentiated | 325 (43.7) | 148 (46.5) | |
| Lauren classification, No. (%) | 0.63 | ||
| Intestinal type | 407 (54.8) | 180 (56.6) | |
| Diffuse/mixed type | 336 (45.2) | 138 (43.4) | |
| Gross appearance, No. (%) | 0.75 | ||
| Borrmann type I-III | 715 (96.2) | 304 (95.6) | |
| Borrmann type IV | 28 (3.8) | 14 (4.4) | |
| Lymphovascular invasion, No. (%) | 0.51 | ||
| Negative | 457 (61.5) | 188 (59.1) | |
| Positive | 286 (38.5) | 130 (40.9) | |
| Perineural invasion, No. (%) | 0.86 | ||
| Negative | 443 (59.6) | 187 (58.8) | |
| Positive | 300 (40.4) | 131 (41.2) | |
| Chemotherapy therapy, No. (%) | 0.55 | ||
| Yes | 334 (45.0) | 150 (47.2) | |
| No | 409 (55.0) | 168 (52.8) |
†According to the eighth edition AJCC Cancer Staging Manual.
Radiomic features selected by random survival forest to generate the radiomics signatures for overall survival and progression-free survival.
| Prediction | Radiomic feature name | Permutation importance | Intra-observer ICC | Inter-observer ICC |
|---|---|---|---|---|
|
| wavelet-HLL_firstorder_MeanAbsoluteDeviation | 0.0068 | 0.999 | 0.999 |
| wavelet-HHH_glszm_SmallAreaLowGrayLevelEmphasis | 0.0048 | 0.823 | 0.818 | |
| log-sigma-5-0-mm-3D_glszm_SmallAreaHighGrayLevelEmphasis | 0.0034 | 0.947 | 0.912 | |
| original_shape_Maximum2DDiameterRow | 0.0033 | 0.999 | 0.999 | |
| original_glszm_ZoneVariance | 0.0029 | 0.999 | 0.999 | |
| wavelet-LLL_firstorder_Energy | 0.0029 | 0.999 | 0.999 | |
| original_shape_MajorAxis | 0.0029 | 0.999 | 0.999 | |
| wavelet-HLH_glszm_LargeAreaEmphasis | 0.0028 | 0.987 | 0.977 | |
| wavelet-HLH_glrlm_LongRunEmphasis | 0.0027 | 0.996 | 0.994 | |
| log-sigma-5-0-mm-3D_glszm_GrayLevelNonUniformity | 0.0027 | 0.999 | 0.999 | |
| wavelet-LHL_firstorder_MeanAbsoluteDeviation | 0.0023 | 0.999 | 0.998 | |
| original_shape_SurfaceVolumeRatio | 0.0022 | 0.994 | 0.991 | |
| original_glszm_SmallAreaEmphasis | 0.0022 | 0.972 | 0.942 | |
| original_firstorder_10Percentile | 0.0021 | 0.996 | 0.992 | |
| log-sigma-2-0-mm-3D_glszm_GrayLevelNonUniformity | 0.0021 | 0.999 | 0.999 | |
| wavelet-LLH_gldm_DependenceNonUniformity | 0.0021 | 0.998 | 0.999 | |
| wavelet-HLH_firstorder_Mean | 0.002 | 0.984 | 0.962 | |
| original_firstorder_Energy | 0.002 | 0.999 | 0.999 | |
| wavelet-HHL_glcm_ClusterTendency | 0.0017 | 0.996 | 0.995 | |
| wavelet-LLH_glrlm_GrayLevelNonUniformityNormalized | 0.0017 | 0.998 | 0.996 | |
|
| original_glszm_SizeZoneNonUniformity | 0.0037 | 0.999 | 0.999 |
| log-sigma-4-0-mm-3D_gldm_LargeDependenceHighGrayLevelEmphasis | 0.0033 | 0.981 | 0.980 | |
| wavelet-LLH_firstorder_RootMeanSquared | 0.0032 | 0.999 | 0.999 | |
| wavelet-LLH_gldm_LargeDependenceEmphasis | 0.003 | 0.995 | 0.992 | |
| original_shape_SurfaceArea | 0.0029 | 0.999 | 0.999 | |
| original_glszm_GrayLevelNonUniformity | 0.0028 | 0.999 | 0.997 | |
| log-sigma-5-0-mm-3D_glszm_LargeAreaHighGrayLevelEmphasis | 0.0026 | 0.999 | 0.991 | |
| wavelet-HHL_glcm_SumSquares | 0.0025 | 0.999 | 0.999 | |
| wavelet-HLL_glszm_ZoneVariance | 0.0025 | 0.992 | 0.983 | |
| wavelet-HHH_glszm_SmallAreaLowGrayLevelEmphasis | 0.0024 | 0.823 | 0.818 | |
| wavelet-HHH_glszm_SizeZoneNonUniformity | 0.0023 | 0.929 | 0.929 | |
| wavelet-HHL_glcm_JointEntropy | 0.0022 | 0.999 | 0.999 | |
| wavelet-LHH_firstorder_RobustMeanAbsoluteDeviation | 0.002 | 0.999 | 0.999 | |
| wavelet-LHL_glszm_LargeAreaLowGrayLevelEmphasis | 0.002 | 0.996 | 0.944 | |
| wavelet-HHL_glrlm_RunLengthNonUniformity | 0.002 | 0.999 | 0.999 | |
| wavelet-LLH_glszm_ZoneVariance | 0.0018 | 0.998 | 0.997 | |
| wavelet-HHH_glrlm_RunPercentage | 0.0018 | 0.995 | 0.988 | |
| original_firstorder_Variance | 0.0018 | 0.995 | 0.991 | |
| wavelet-HHH_glszm_SmallAreaEmphasis | 0.0017 | 0.856 | 0.822 | |
| wavelet-HHL_glszm_GrayLevelNonUniformityNormalized | 0.0017 | 0.973 | 0.948 |
Larger permutation importance values indicate more important features for the prediction task. The intra-class correlation coefficient (ICC) values indicate reliability of the features.
Prediction performance of overall survival and progression-free survival and their comparisons at different settings.
| Tasks | Variable groups | Variable names | Setting 1: Pre-operative / post-operative separate modeling | Setting 2: Combined modeling (full set of variabels in Setting 1) | Setting 3: Combined modeling (only significant variabels in Setting 1) | Setting 4: Combined modeling (applied feature selection to full set in Setting 1) | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Hazard Ratio | p-value | C-index [95% CI] | Hazard Ratio | p-value | C-index [95% CI] | Hazard Ratio | p-value | C-index [95% CI] | Hazard Ratio | p-value | C-index [95% CI] | |||
|
| Pre-operative variables | CEA |
| 0.03 | 0.651 (0.649, 0.653) |
| 0.06 | 0.703 (0.702, 0.706) |
| 0.02 | 0.708 (0.706, 0.709) | 0.721 (0.720, 0.722) | ||
| CA199 |
| 0.51 |
| 0.82 | ||||||||||
| Biopsy histologic grade |
| 0.98 |
| 0.86 | ||||||||||
| rT |
| 0.65 |
| 0.90 |
| 0.65 | ||||||||
| rN |
| 0.10 |
| 0.20 |
| 0.12 | ||||||||
| Deep learning signature |
| <0.005 |
| <0.005 |
| <0.005 |
| <0.005 | ||||||
| Radiomics signature |
| <0.005 |
| <0.005 |
| <0.005 |
| <0.005 | ||||||
| Post-operative variables | pT |
| <0.005 | 0.783 (0.782, 0.783) |
| 0.88 |
| 0.88 |
| 0.43 | ||||
| pN |
| <0.005 |
| 0.05 |
| 0.17 |
| 0.05 | ||||||
| LVI |
| 0.34 |
| 0.60 | ||||||||||
| PNI |
| 0.10 |
| 0.05 | ||||||||||
| Gross appearance |
| 0.04 |
| 0.64 |
| 0.05 | ||||||||
| Surgical histologic grade |
| 0.76 |
| 0.90 |
| 0.61 | ||||||||
| Chemotherapy |
| <0.005 |
| <0.005 |
| <0.005 |
| <0.005 | ||||||
|
| Pre-operative variables | CEA |
| 0.01 | 0.686 (0.685,0.687) |
| 0.01 | 0.743 (0741,0.744) |
| 0.02 | 0.761 (0.759, 0.762) |
| 0.01 | 0.758 (0.757, 0.759) |
| CA199 |
| 0.78 |
| 0.57 | ||||||||||
| Biopsy histologic grade |
| 0.03 |
| 0.25 |
| 0.19 | ||||||||
| rT |
| 0.07 |
| 0.61 |
| 0.61 | ||||||||
| rN |
| 0.06 |
| 0.22 |
| 0.26 | ||||||||
| Deep learning signature |
| <0.005 |
| <0.005 |
| <0.005 |
| <0.005 | ||||||
| Radiomics signature |
| <0.005 |
| 0.01 |
| 0.01 |
| <0.005 | ||||||
| Post-operative variables | pT |
| <0.005 | 0.770 (0.769, 0771) |
| <0.005 |
| <0.005 |
| <0.005 | ||||
| pN |
| <0.005 |
| 0.14 |
| <0.005 |
| 0.03 | ||||||
| LVI |
| 0.82 |
| 0.42 | ||||||||||
| PNI |
| 0.36 |
| 0.29 | ||||||||||
| Gross appearance |
| 0.07 |
| 0.76 |
| 0.48 | ||||||||
| Surgical histologic grade |
| 0.09 |
| 0.14 | ||||||||||
| Chemotherapy |
| <0.005 |
| <0.005 |
| <0.005 |
| <0.005 | ||||||
The length of the color bars in each cell represents the absolute value of the hazard ratios.