| Literature DB >> 32480397 |
Xi Luo1, Kuan-Jui Su2, Chuan Qiu2, Xing Liu1, Fang Yang1.
Abstract
BACKGROUND The established clinical criteria for gastric cancer prognosis are insufficient due to molecular heterogeneity. Therefore, constructing a robust prognostic model is essential to predict gastric cancer patient survival. MATERIAL AND METHODS A comprehensive method, which combined weighted gene co-expression network analysis (WGCNA) with elastic-net Cox regression, was utilized to identify prognostic long non-coding RNAs (lncRNAs) from Gene Expression Omnibus database for overall survival (OS) prediction. Methods using WGCNA or elastic-net Cox regression alone were treated as "contrast" methods. The univariate and multivariate Cox regression was used to identify independent prognostic clinical factors. We performed 3-year and 5-year area under the curve (AUC) of the time-dependent receiver operating characteristic comparison of 3 different methods in gene and clinical-gene models to explore the prediction ability of the comprehensive method. The optimal model identified in the training set were validated in the validation set. Biological information analysis for the optimal model was also explored. RESULTS The clinical-gene model containing 13 co-expression lncRNAs identified by the comprehensive method and 3 clinical factors including molecular subtype, recurrence status and operation type, was the found to be the optimal model in the study, with 0.832 and 0.830 for the 3-year and 5-year AUC in the training set, and 0.764 and 0.778 in the validation set, respectively. Biological information analysis suggested that lipid metabolism played an important role in the occurrence and development of gastric cancer. CONCLUSIONS We constructed a novel prognostic model containing 13 co-expression lncRNAs and 3 clinical factors for gastric cancer patients.Entities:
Mesh:
Substances:
Year: 2020 PMID: 32480397 PMCID: PMC7288832 DOI: 10.12659/MSM.923295
Source DB: PubMed Journal: Med Sci Monit ISSN: 1234-1010
Basic characteristics of the clinical variables in GC patients.
| Clinical variables | Total | Training dataset | Validate dataset |
|---|---|---|---|
| Sample size | 300 | 210 | 90 |
| Survival status | |||
| Dead | 152 | 102 | 50 |
| Survived | 148 | 108 | 40 |
| Median age (year) | 61.94 | 47.59 | 62.76 |
| Sex | |||
| Female | 101 | 74 | 27 |
| Male | 199 | 136 | 63 |
| T stage | |||
| 2 | 188 | 130 | 58 |
| 3 | 91 | 67 | 24 |
| 4 | 21 | 13 | 8 |
| N stage | |||
| 0 | 38 | 28 | 10 |
| 1 | 131 | 94 | 37 |
| 2 | 80 | 49 | 31 |
| 3 | 51 | 39 | 12 |
| M stage | |||
| 0 | 273 | 191 | 82 |
| 1 | 27 | 19 | 8 |
| AJCC stage | |||
| I | 30 | 21 | 9 |
| II | 97 | 70 | 27 |
| III | 96 | 65 | 31 |
| IV | 77 | 54 | 23 |
| Recurrent status | |||
| Yes | 125 | 110 | 53 |
| No | 157 | 77 | 29 |
| Unknown | 18 | 19 | 8 |
| Tumor cite | 4 | ||
| Antrum | 155 | 110 | 45 |
| Body | 107 | 77 | 30 |
| Cardia | 32 | 19 | 13 |
| Whole | 6 | 4 | 2 |
| WHO classification | |||
| W/D and M/D tubular | 114 | 89 | 25 |
| P/D tubular | 116 | 73 | 43 |
| Signet ring cell | 37 | 25 | 12 |
| Mucinous | 8 | 5 | 3 |
| Papillary | 9 | 6 | 3 |
| Other | 16 | 12 | 4 |
| Lauren type | |||
| Intestinal | 150 | 110 | 40 |
| Diffuse | 142 | 95 | 47 |
| Mixed | 8 | 5 | 3 |
| Molecular subtype | |||
| MSS/TP53− | 107 | 79 | 28 |
| MSS/TP53+ | 79 | 53 | 26 |
| MSI | 68 | 44 | 24 |
| MSS/EMT | 46 | 34 | 12 |
| EBV status | |||
| Positive | 18 | 181 | 76 |
| Negative | 257 | 13 | 5 |
| Missing | 12 | 16 | 9 |
| Adjuvant CCRT | |||
| Completed | 73 | 46 | 27 |
| Not completed | 7 | 5 | 2 |
| Not done | 220 | 129 | 61 |
| OP type | |||
| TG | 135 | 94 | 41 |
| STG | 165 | 116 | 49 |
GC – gastric cancer; AJCC – American Joint Committee on Cancer; WHO – World Health Organization; W/D – well-differentiated; M/D – moderately differentiated; P/D – poorly differentiated; MSS – microsatellite stable; TP53− – tumor protein 53 inactive; TP53+ – tumor protein 53 active; MSI – microsatellite instability; EMT – epithelial-mesenchymal transition; TG – total gastrectomy; STG – subtotal gastrectomy.
Prognostic clinical factors of OS for GC patients estimated by univariate and multivariate Cox regression.
| Clinical variables | Univariate analysis | Multivariate analysis | ||||
|---|---|---|---|---|---|---|
| HR | P value | 95% CI | HR | P value | 95% CI | |
| Sex | 0.80 | 0.311 | 0.53–1.23 | 0.66 | 0.074 | 0.42–1.04 |
| Age | 1.01 | 0.416 | 0.99–1.03 | 1.03 | 0.014 | 1.01–1.05 |
| Molecular subtype | 0.79 | 0.028 | 0.64–0.98 | 0.70 | 0.003 | 0.55–0.88 |
| Lauren subtype | 1.25 | 0.220 | 0.87–1.79 | 1.15 | 0.506 | 0.76–1.75 |
| WHO classification | 1.11 | 0.148 | 0.96–1.28 | 1.03 | 0.752 | 0.87–1.22 |
| T stage | 1.68 | 0.001 | 1.24–2.28 | 1.51 | 0.059 | 0.98–2.32 |
| N stage | 1.99 | 0.000 | 1.55–2.56 | 1.33 | 0.266 | 0.81–2.19 |
| M stage | 3.81 | 0.002 | 1.61–9.04 | 1.83 | 0.257 | 0.64–5.19 |
| AJCC stage | 2.20 | 0.000 | 1.7–2.86 | 1.37 | 0.307 | 0.75–2.49 |
| Tumor site | 1.48 | 0.003 | 1.14–1.91 | 1.18 | 0.352 | 0.83–1.66 |
| Recurrence status | 1.39 | 0.013 | 1.07–1.81 | 1.57 | 0.007 | 1.13–2.17 |
| OP type | 0.53 | 0.001 | 0.36–0.78 | 0.56 | 0.049 | 0.32–0.99 |
| EBV status | 0.58 | 0.441 | 0.14–2.35 | 0.59 | 0.471 | 0.14–2.49 |
| Adjuvant. CCRT | 1.07 | 0.576 | 0.85–1.33 | 1.00 | 1.000 | 0.78–1.28 |
Independent prognostic clinical variables with statistical significance at P<0.05 level both in univariate and multivariate Cox analyses.
OS – overall survival; GC – gastric cancer; HR – hazard ratio; CI – confidence interval; WHO – World Health Organization; AJCC – American Joint Committee on Cancer; OP – operation; EBV – Epstein-Barr virus; CCRT – concurrent chemoradiation therapy.
Figure 1Gene clustering dendrogram and merged module colors based on a dissimilarity measure (1-TOM).
Prognostic modules of OS for GC patients estimated by univariate Cox regression analysis.
| Modules | HR | P-Value | 95%CI | Gene numbers | Hub genes |
|---|---|---|---|---|---|
| Blue | 0.18 | 0.234 | 0.01–3.090 | 826 | – |
| Brown | 1.03 | 0.981 | 0.06–17.34 | 110 | – |
| Turquoise | 1.87 | 0.671 | 0.10–33.44 | 2064 | – |
| Yellow | 1.72 | 0.712 | 0.10–30.39 | 86 | – |
| Green | 3.29 | 0.392 | 0.22–50.32 | 65 | – |
| Red | 42.25 | 0.013 | 2.20–811.4 | 55 | 11 |
| Gray | 0.05 | 0.053 | 0.00–1.040 | 2032 | – |
Modules identified by WGCNA;
prognostic module with statistical significance at P<0.05 level in univariate Cox analysis.
OS – overall survival; GC – gastric cancer; HR – hazard ratio; CI – confidence interval. WGCNA – weighted gene co-expression network analysis.
The candidate LncRNAs identified by 3 different methods.
| RSW | RSe | RSc | |||
|---|---|---|---|---|---|
| lncRNA | Coef | lncRNA | Coef | lncRNA | Coef |
| LOC644656 | 0.0375 | LINC00930 | −1.4153 | LOC644656 | 0.1710 |
| VWA8.AS1 | 0.0435 | AP000550.1 | −0.6437 | VWA8.AS1 | 0.2159 |
| LINC01085 | 0.0216 | AC009052.1 | −1.9817 | LOC101928069 | −0.4790 |
| LINC00606 | 0.0302 | LINC01206 | −0.2111 | ||
| KMT2E.AS1 | 0.0679 | LINC01085 | 0.1170 | ||
| DLG1.AS1 | 0.0186 | KMT2E.AS1 | 0.3330 | ||
| BVES.AS1 | 0.0666 | DAPK1.IT1 | 0.1238 | ||
| ADAMTSL4.AS1 | 0.0275 | AC139713.2 | 0.3901 | ||
| AC139713.2 | 0.0773 | AC023509.1 | −0.7042 | ||
| AC017091.1 | 0.1007 | AC017091.1 | 0.2683 | ||
| PXN.AS1 | 0.0679 | PXN.AS1 | 0.1654 | ||
| PTPRD.AS1 | −0.5058 | ||||
| PRKAG2.AS1 | −0.3523 | ||||
Coef was the corresponding ridge regression coefficient of the lncRNA (long noncoding RNA).
Figure 2The 3-year and 5-year area under the curve (AUC) comparison. (A) The 3-year and 5-year AUC comparison between 3 methods in gene and clinical-gene models in the training set. (B) The 3-year and 5-year AUC comparison between clinical, gene, and clinical-gene models of the comprehensive method in the training set. (C) The 3-year and 5-year AUC comparison between 3 methods in gene and clinical-gene models in the validation set. (D) The 3-year and 5-year AUC comparison between clinical, gene, and clinical-gene models of the comprehensive method in the validation set.
Figure 3Kaplan-Meier curve. (A) Kaplan-Meier curve of high-risk and low-risk groups in the training set (the median of RSc-clinical as a cutoff value for patients grouping); (B) Kaplan-Meier curve of high-risk and low-risk groups in the validation set (the median of RSc-clinical as a cutoff value for patients grouping).
Stratification analysis of the optimal model based on 3 independent clinical factors in the training set.
| Independent clinical factors | 3-year AUC | 5-year AUC |
|---|---|---|
| Molecular type | ||
| MSS/TP53+ | 0.860 | 0.816 |
| MSS/TP53− | 0.843 | 0.800 |
| MSI | 0.677 | 0.717 |
| MSS/EMT | 0.782 | 0.881 |
| Recurrent status | ||
| Yes | 0.664 | 0.748 |
| No | 0.812 | 0.730 |
| Operation type | ||
| TG | 0.765 | 0.804 |
| STG | 0.770 | 0.684 |
AUC – area under the curve; MSS – microsatellite stable; TP53+ – tumor protein 53 active; TP53− – tumor protein 53 inactive; MSI – microsatellite instability; EMT – epithelial-mesenchymal transition; TG – total gastrectomy; STG – subtotal gastrectomy.
Figure 4The most significantly enriched Gene Ontology (GO) annotations and pathways of 13 lncRNAs which we identified in the study. The length of bars and the size of dots represent the numbers of genes; the color of bars and dots corresponds to P value according to legend. (A) Top 10 significantly enriched biological process GO annotations. (B) Top 10 significantly enriched cellular component GO annotations. (C) Top 10 significantly enriched molecular function GO annotations. (D) Top 5 significantly enriched Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway.