| Literature DB >> 30421504 |
Lin Wang1,2,3, Gaochao Dong1, Wenjie Xia1,2, Qixing Mao1,2, Anpeng Wang1,2, Bing Chen1,2, Weidong Ma1,2, Yaqin Wu1,2, Lin Xu1, Feng Jiang1.
Abstract
BACKGROUND: Developments in high-throughput genomic technologies have led to improved understanding of the molecular underpinnings of esophageal squamous cell carcinoma (ESCC). However, there is currently no model that combines the clinical features and gene expression signatures to predict outcomes.Entities:
Keywords: ESCC; nomogram; prediction model; prognosis
Mesh:
Substances:
Year: 2018 PMID: 30421504 PMCID: PMC6312844 DOI: 10.1111/1759-7714.12902
Source DB: PubMed Journal: Thorac Cancer ISSN: 1759-7706 Impact factor: 3.500
Clinicopathologic characteristics of ESCC patients
| Characteristics | No. of patients | % |
|---|---|---|
| Age, years | ||
| ≥ 60 | 88 | 49.2 |
| < 60 | 91 | 50.8 |
| Gender | ||
| Male | 146 | 81.6 |
| Female | 33 | 18.4 |
| Tobacco use | ||
| Yes | 114 | 63.7 |
| No | 65 | 36.3 |
| Alcohol use | ||
| Yes | 106 | 59.2 |
| No | 73 | 40.8 |
| Tumor location | ||
| Upper | 20 | 11.2 |
| Middle | 97 | 54.2 |
| Lower | 62 | 34.6 |
| Tumor grade | ||
| Well | 32 | 17.9 |
| Moderately | 98 | 54.7 |
| Poorly | 49 | 27.4 |
| Invasion of adjacent structure | ||
| Yes | 31 | 17.3 |
| No | 148 | 82.7 |
| Lymphatic metastasis | ||
| Yes | 96 | 53.6 |
| No | 83 | 46.4 |
| TNM stage | ||
| I | 10 | 5.59 |
| II | 77 | 43.0 |
| III | 92 | 51.4 |
| Arrhythmia | ||
| Yes | 43 | 24.0 |
| No | 136 | 76.0 |
| Pneumonia | ||
| Yes | 35 | 19.6 |
| No | 164 | 80.4 |
| Anastomotic leak | ||
| Yes | 12 | 6.70 |
| No | 167 | 93.3 |
| Adjuvant therapy | ||
| Yes | 108 | 60.3 |
| No | 45 | 25.1 |
| Unknown | 26 | 14.6 |
ESCC, esophageal squamous cell carcinoma; TNM, tumor node metastasis.
Figure 1Systematic analysis of differential transcribed genes and bioinformatics analysis of the differentially expressed coding genes. (a) Use of the Limma package (R software) to screen and analyze the differentially expressed genes of paired samples, including coding and non‐coding. (b) The heatmap reveals the significantly differentially expressed coding genes between tumor and normal specimens. (c,d) Bioinformatic analysis of differentially expressed coding genes according to Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway.
Figure 2Systematic analysis of differentially expressed non‐coding genes and the prediction of function. (a) The heatmap comprises significantly differentially expressed non‐coding genes. (b) The classification of differentially expressed long non‐coding RNAs (lncRNAs). (c) The network between the different protein‐coding genes and non‐coding genes based on the WGCNA package. (d) The predictive function of lncRNA according to correlation analysis. () long intergenic non‐coding RNA (lincRNA), () antisense, (), pseudogene, () Processed transcript, () misc RNA, () sense_intronic, () to be experimentally confirmed (TEC), and () unknown.
Univariate and multivariable analyses based on the clinical Cox model
| Parameters | Univariate analysis | Multivariable analysis | ||||
|---|---|---|---|---|---|---|
| HR | 95% CI |
| HR | 95% CI |
| |
| Age (≥ 60) | 1.680 | 1.146–2.461 | 0.008 | 1.575 | 1.006–2.467 | 0.047 |
| Gender (female) | 1.277 | 0.789–2.044 | 0.307 | — | — | — |
| Tobacco use | 1.334 | 0.905–1.967 | 0.145 | — | — | — |
| Alcohol use | 1.158 | 0.788–1.700 | 0.456 | — | — | — |
| Tumor location | 0.257 | — | — | — | ||
| Tumor location (middle vs. upper) | 1.669 | 0.905–3.078 | 0.101 | — | — | — |
| Tumor location (lower vs. upper) | 1.135 | 0.740–1.741 | 0.561 | — | — | — |
| Tumor location (middle vs. lower) | 0.680 | 0.385–1.202 | 0.184 | — | — | — |
| Tumor grade | 0.048 | 0.829 | 0.516–1.330 | 0.436 | ||
| Tumor grade (well vs. poorly) | 0.605 | 0.338–1.082 | 0.090 | |||
| Tumor grade (moderately vs. poorly) | 0.613 | 0.401–0.939 | 0.024 | |||
| Tumor grade (moderately vs. well) | 1.014 | 0.587–1.750 | 0.961 | |||
| Invasion of adjacent structure | 1.628 | 1.017–2.605 | 0.042 | 0.852 | 0.610–1.189 | 0.346 |
| Lymphatic metastasis | 2.129 | 1.420–3.192 | 0.000 | 1.528 | 0.931–2.508 | 0.094 |
| TNM stage | 0.001 | — | — | — | ||
| TNM stage (I vs. III) | 0.276 | 0.087–0.879 | 0.029 | — | — | — |
| TNM stage (II vs. III) | 0.492 | 0.327–0.739 | 0.001 | — | — | — |
| TNM stage (II vs. I) | 1.782 | 0.549–5.788 | 0.336 | — | — | — |
| Arrhythmia | 0.893 | 0.580–1.375 | 0.607 | — | — | — |
| Pneumonia | 0.702 | 0.354–1.390 | 0.310 | — | — | — |
| Anastomotic leak | 0.770 | 0.357–1.658 | 0.504 | — | — | — |
| Adjuvant therapy | 0.442 | 0.256–0.762 | 0.003 | 0.520 | 0.289–0.934 | 0.029 |
Indicated P < 0.05. CI, confidence interval; HR, hazard ratio; TNM, tumor node metasta.
Figure 3Two logistic regression modeling approaches to predict esophageal squamous cell carcinoma (ESCC) survival after surgery. (a) Multivariate analysis of the overall survival of ESCC patients based on the different Cox models, one based exclusively on the clinical variables, the other based on the integration of clinical variables and a 2‐gene score. (b) Kalpan–Meier survival curves of the two logistic regression models. (c) Receiver operating characteristic (ROC) curves of the two models are presented, and reflect the specificity and sensitivity of the two different comprehensive variables. () Low‐risk and () High‐risk; () Low‐risk and () High‐risk. AUC, area under the ROC; CI, confidence interval; HR, hazard ratio.
Univariate analysis of gene expression profiles correlated with overall survival of ESCC patients
| Number | Ensemble name | logFC | adj.P.Val | ENSG | Type | Univariate analysis | ||
|---|---|---|---|---|---|---|---|---|
| HR | 95% CI |
| ||||||
| 1 | CASC2 | −1.05 | 1.23E‐25 | ENSG00000177640 | Antisense | 1.506 | 1.026–2.209 | 0.036 |
| 2 | FLJ40288 | −2.98 | 6.24E‐82 | ENSG00000183470 | lincRNA | 0.649 | 0.441–0.954 | 0.028 |
| 3 | KB‐1183D5.11 | 1.09 | 3.19E‐11 | ENSG00000215498 | Processed_transcript | 0.676 | 0.461–0.992 | 0.046 |
| 4 | RP11‐357H14.2 | 1.85 | 1.88E‐34 | ENSG00000233283 | Processed_transcript | 1.703 | 1.159–2.503 | 0.007 |
| 5 | RP11‐438N16.1 | 2.19 | 6.6E‐27 | ENSG00000249550 | lincRNA | 0.657 | 0.447–0.965 | 0.032 |
| 6 | RP11‐129M6.1 | 1.38 | 2.22E‐14 | ENSG00000251363 | lincRNA | 0.677 | 0.461–0.996 | 0.047 |
| 7 | AC006296.1 | −1.73 | 9.65E‐21 | ENSG00000251412 | lincRNA | 0.644 | 0.438–0.946 | 0.025 |
| 8 | AC007880.1 | −2.22 | 4.47E‐26 | ENSG00000234572 | lincRNA | 0.652 | 0.444–0.957 | 0.029 |
| 9 | AC092168.4 | −1.21 | 6.69E‐32 | ENSG00000228488 | lincRNA | 0.586 | 0.398–0.865 | 0.007 |
| 10 | AC093850.2 | 4.96 | 6.63E‐96 | ENSG00000230838 | lincRNA | 1.471 | 1.002–2.159 | 0.049 |
| 11 | AF003626.1 | −1.18 | 2.87E‐36 | ENSG00000230153 | lincRNA | 0.629 | 0.428–0.925 | 0.018 |
| 12 | AP000344.3 | −2.11 | 1.18E‐32 | ENSG00000234928 | lincRNA | 0.654 | 0.445–0.961 | 0.031 |
| 13 | AP000473.6 | 1.23 | 3.22E‐16 | ENSG00000237735 | lincRNA | 0.605 | 0.410–0.892 | 0.011 |
| 14 | CTD‐2382E5.1 | 1.13 | 8.53E‐12 | ENSG00000246740 | Antisense | 0.644 | 0.439–0.946 | 0.025 |
| 15 | FRMPD2P1 | −1.93 | 3.84E‐33 | ENSG00000150175 | Pseudogene | 0.614 | 0.418–0.904 | 0.013 |
| 16 | LINC00028 | −1.41 | 6.61E‐33 | ENSG00000233354 | lincRNA | 0.582 | 0.395–0.858 | 0.006 |
| 17 | MAMDC2‐AS1 | −1.71 | 6.45E‐54 | ENSG00000204706 | Antisense | 1.685 | 1.144–2.483 | 0.008 |
| 18 | RP11‐120J1.1 | −1.62 | 8.68E‐33 | ENSG00000225472 | Antisense | 0.635 | 0.431–0.936 | 0.022 |
| 19 | RP11‐225N10.1 | −1.58 | 3.08E‐47 | ENSG00000240063 | Antisense | 0.680 | 0.463–0.999 | 0.049 |
| 20 | RP11‐226F19.5 | 1.10 | 3.34E‐21 | ENSG00000259062 | Antisense | 1.486 | 1.013–2.181 | 0.043 |
| 21 | RP11‐242F24.1 | 1.03 | 1.75E‐46 | ENSG00000228750 | lincRNA | 1.513 | 1.028–2.226 | 0.036 |
| 22 | RP1‐12803.4 | −3.71 | 2.95E‐82 | ENSG00000230248 | lincRNA | 0.638 | 0.432–0.936 | 0.022 |
| 23 | RP11‐411K7.1 | −1.30 | 4.81E‐13 | ENSG00000236740 | Processed_transcript | 0.642 | 0.437–0.943 | 0.024 |
| 24 | RP11‐51M18.1 | 1.59 | 2.44E‐17 | ENSG00000253898 | lincRNA | 0.594 | 0.403–0.876 | 0.009 |
| 25 | RP11‐521B24.3 | 1.09 | 2.49E‐21 | ENSG00000251602 | Antisense | 1.740 | 1.181–2.564 | 0.005 |
| 26 | RP11‐526P5.2 | −1.16 | 3.68E‐12 | ENSG00000235281 | lincRNA | 0.655 | 0.446–0.963 | 0.031 |
| 27 | RP11‐71G12.1 | 1.23 | 1.1E‐10 | ENSG00000229961 | lincRNA | 0.653 | 0.445–0.960 | 0.030 |
| 28 | RP11‐768G7.2 | 1.26 | 3.51E‐31 | ENSG00000241213 | lincRNA | 1.694 | 1.150–2.495 | 0.008 |
| 29 | RP11‐89N17.4 | −1.55 | 4.54E‐41 | ENSG00000236494 | lincRNA | 0.573 | 0.389–0.844 | 0.006 |
| 30 | RP11‐726G1.1 | 1.04 | 2.23E‐19 | ENSG00000214776 | Processed_transcript | 1.497 | 1.019–2.200 | 0.040 |
| 31 | RP11‐69C17.1 | −1.97 | 1.75E‐37 | ENSG00000234962 | lincRNA | 0.674 | 0.459–0.989 | 0.044 |
CI, confidence interval; HR, hazard ratio; lincRNA, long intergenic non‐coding RNA.
Multivariate analysis based on the integration of clinical variables and gene expression signatures in a Cox model
| Parameters | Multivariable analysis | ||
|---|---|---|---|
| HR | 95% CI |
| |
| Age (> 60) | 2.029 | 1.173–3.508 | 0.011 |
| Tumor grade (well vs. poorly) | 1.126 | 0.642–1.976 | 0.679 |
| Invasion of adjacent structure | 0.804 | 0.531–1.217 | 0.302 |
| Lymphatic metastasis | 1.589 | 0.836–3.023 | 0.158 |
| Adjuvant therapy | 0.408 | 0.192–0.868 | 0.020 |
| CASC2 | 0.841 | 0.468‐1.510 | 0.561 |
| FLJ40288 | 0.667 | 0.365–1.219 | 0.188 |
| KB‐1183D5.11 | 0.707 | 0.386–1.292 | 0.259 |
| RP11‐357H14.20 | 2.235 | 1.237–4.038 | 0.008 |
| RP11‐438N16.1 | 0.804 | 0.468–1.384 | 0.432 |
| RP11‐129M6.1 | 0.971 | 0.545–1.730 | 0.920 |
| AC006296.1 | 0.389 | 0.117–1.293 | 0.123 |
| AC007880.1 | 2.347 | 0.749–7.358 | 0.143 |
| AC092168.4 | 1.313 | 0.654–2.636 | 0.444 |
| AC093850.2 | 1.011 | 0.545–1.877 | 0.972 |
| AF003626.1 | 0.742 | 0.386–1.425 | 0.370 |
| AP000344.3 | 1.573 | 0.769–3.218 | 0.215 |
| AP000473.6 | 0.792 | 0.457–1.374 | 0.407 |
| CTD‐2382E5.1 | 1.199 | 0.611–2.353 | 0.597 |
| FRMPD2P1 | 0.614 | 0.158–2.383 | 0.481 |
| LINC00028 | 0.784 | 0.429–1.436 | 0.431 |
| MAMDC2‐AS1 | 1.313 | 0.723–2.385 | 0.372 |
| RP11‐120J1.1 | 0.715 | 0.371–1.377 | 0.316 |
| RP11‐225N10.1 | 0.903 | 0.524–1.557 | 0.714 |
| RP11‐226F19.5 | 1.605 | 0.845–3.051 | 0.149 |
| RP11‐242F24.1 | 0.840 | 0.387–1.823 | 0.659 |
| RP1‐12803.4 | 1.055 | 0.540–2.061 | 0.875 |
| RP11‐411K7.1 | 1.021 | 0.550–1.895 | 0.947 |
| RP11‐51M18.1 | 0.827 | 0.479–1.428 | 0.496 |
| RP11‐521B24.3 | 1.167 | 0.595–2.289 | 0.653 |
| RP11‐526P5.2 | 0.733 | 0.390–1.375 | 0.333 |
| RP11‐71G12.1 | 0.819 | 0.467–1.436 | 0.485 |
| RP11‐768G7.2 | 2.215 | 1.258–3.903 | 0.006 |
| RP11‐89N17.4 | 0.598 | 0.300–1.193 | 0.144 |
| RP11‐726G1.1 | 1.698 | 0.990–2.914 | 0.055 |
| RP11‐69C17.1 | 1.646 | 0.766–3.535 | 0.201 |
Indicated P < 0.05. CI, confidence interval; HR, hazard ratio.
Figure 4Nomograms of the two Cox models. (a,c) Two models are shown. An individual patient's value is located on each variable axis, and a line is drawn upward to determine the number of points received for each variable value. The sum of these numbers is located on the Total Points axis, and a line is drawn downward to the survival axes to determine the likelihood of three or five‐year survival. (b,d) The calibration curve for predicting patient survival at three or five years in the former and combined nomograms, respectively. Nomogram‐predicted probability of overall survival is plotted on the x‐axis; actual overall survival is plotted on the y‐axis. OS, overall survival; TNM, tumor node metastasis.