| Literature DB >> 30133128 |
Jun Wang1, Peng Gao1, Yongxi Song1, Jingxu Sun1, Xiaowan Chen1, Hong Yu1, Yu Wang1, Zhenning Wang1.
Abstract
Selecting differentially expressed genes (DEGs) based on integrated bioinformatics analyses has been used in previous studies to explore potential biomarkers in gastric cancer (GC) with microarray and RNA sequencing data. However, the genes obtained may be inaccurate because of noisy data and errors, as well as insufficient clinical sample sizes. Thus, we aimed to find robust and strong DEGs with prognostic value for GC, where the robust rank aggregation method was employed to select significant DEGs from eight Gene Expression Omnibus data sets with a total of 140 up-regulated and 206 down-regulated genes. Network data mining was then used to screen hub genes, and 11 genes were filtered using Fisher's exact test. Based on these results, we built a prognostic signature with seven genes (FBN1, MMP1, PLAU, SPARC, COL1A2, COL2A1 and ATP4A) using stepwise multivariate Cox proportional hazard regression. According to the risk score for each patient, we found that high-risk group patients had significantly worse survival results compared with those in the low-risk group (log-rank test P-value < 0.001). This seven-gene signature was then validated with an external data set. Thus, we established a signature based on seven DEGs with prognostic value for GC patients using multi-steps bioinformatics methods, which may provide novel insights and potential biomarkers for prognosis, as well as possibly serving as new therapeutic targets in clinical applications.Entities:
Keywords: gastric cancer; meta-analysis; network mining; prognosis; robust rank aggregation
Mesh:
Substances:
Year: 2018 PMID: 30133128 PMCID: PMC6201382 DOI: 10.1111/jcmm.13823
Source DB: PubMed Journal: J Cell Mol Med ISSN: 1582-1838 Impact factor: 5.310
Figure 1Workflow of our multi‐step strategy for identifying a gene signature with prognostic value in gastric cancer (GC)
Figure 2Establishment of a seven‐gene signature prognostic risk scoring system based on above DEGs. A, Time‐dependent ROC curve for predicting the 5‐y survival. B, Kaplan‐Meier curve for the seven‐gene signature (log‐rank test P‐value < 0.001). The two dotted lines in each group are the level for a two‐sided confidence interval on the survival curve. C, The seven‐gene signature‐based risk score distributions, patient survival results and expression heatmap. D‐F, Expression profiles of , and in different TNM stages of GC