| Literature DB >> 35371973 |
Longxiang Xie1, Qiang Wang1, Zhongyi Yan1, Yali Han1, Xiaoyu Ma1, Huimin Li1, Lu Zhang1, Xianzhe Li2, Xiangqian Guo1.
Abstract
Evaluating the prognostic value of genes of interest in different populations of gastric cancer (GC) is difficult and time-consuming for basic and translational researchers even though many datasets are available in public dataset depositories. In the current study, we developed a robust web-based portal called OSgc (Online consensus Survival analysis of gastric cancer) that enables easy and swift verification of known and novel biomarker candidates in GC. OSgc is composed of gene expression profiling data and clinical follow-up information of 1,824 clinical GC cases, which are collected from 7 public independent datasets derived from Gene Expression Omnibus (GEO) and The Cancer Genome Atlas (TCGA). By OSgc, users input the official gene symbol and will promptly retrieve the Kaplan-Meier survival plot with hazard ratio (HR) and log rank p value on the output webpage, by which users could assess the prognostic value of interesting genes for GC patients. Five survival end points containing overall survival, progression-free survival, progression-free interval, relapse-free survival, and disease-free survival could be measured in OSgc. OSgc can greatly help cancer biologists and clinicians to explore the effect of gene expression on patient survival. OSgc is freely available without restrictions at http://bioinfo.henu.edu.cn/GC/GCList.jsp.Entities:
Keywords: GEO; OSgc; TCGA; gastric cancer; gene expression; prognostic biomarker; survival analysis
Year: 2022 PMID: 35371973 PMCID: PMC8965707 DOI: 10.3389/fonc.2022.856988
Source DB: PubMed Journal: Front Oncol ISSN: 2234-943X Impact factor: 6.244
Figure 1The flowchart of OSgc establishment. GEO, Gene Expression Omnibus; TCGA, The Cancer Genome Atlas.
Clinical characteristics of individual dataset in OSgc.
| ID | GPL | Sample size | Gender Female/male | Number of deaths | Grade | Stage (I/II/III/IV) | Median age (years) | Survival terms | Reference |
|---|---|---|---|---|---|---|---|---|---|
| GSE22377 | GPL570 | 43 | 28/15 | 30 | – | 2/12/19/2 | 64 | OS, DFS | ( |
| GSE26253 | GPL8432 | 432 | – | – | – | 68/167/130/67 | RFS | ( | |
| GSE29272 | GPL96 | 126 | 27/99 | 31 | 1/44/81 | 5/4/108/8 | 59 | OS | ( |
| GSE57303 | GPL570 | 70 | 14/56 | 7 | – | 0/4/34/32 | 68 | OS | ( |
| GSE62254 | GPL570 | 300 | 101/199 | 152 | – | 30/96/95/77 | 64 | OS, DFS | ( |
| GSE84437 | GPL6947 | 433 | 137/296 | 209 | – | 11/38/92/292 | 62 | OS | ( |
| TCGA-GC | RNAseq | 420 | 134/286 | 169 | 10/150/251 | 55/128/202/40 | 67 | OS, RFS, PFI, PFS | ( |
| Total | 1824 | 441/951 | 598 | 11/194/332 | 171/593/677/519 |
9 samples were GX that grade cannot be defined.
Figure 2The association of GC patient survival with gender (A, B) and stage (C, D) in OSgc.
Figure 3An overview of OSgc application. OSgc application mainly contains three sections including input, clinical features, and results.
Test the performance of previously reported prognostic biomarker candidates using OSgc.
| Gene symbol | Literature data | OSgc data | |||||||
|---|---|---|---|---|---|---|---|---|---|
| N | Survival | Prognostic value | Method | Reference | Dataset | HR (95% CI) |
| Validation results | |
| HOXA5 | 30 | OS | Unfavorable | qRT-PCR and Western blot | ( | Combined | 1.3724 (1.114–1.5818) | 0.0015 | √ |
| CAP2 | 436 | OS | Unfavorable | RT-PCR | ( | Combined | 1.5207 (1.2645–1.7858) | <0.0001 | √ |
| LAMA4 | 388 | OS | Unfavorable | qRT-PCR | ( | Combined | 1.4904 (1.2305–1.8053) | <0.0001 | √ |
| MTBP | 352 | OS | Unfavorable | qRT-PCR and Western blot | ( | GSE22377 | 2.7219 (1.2506–5.9242) | 0.0116 | √ |
| RAI14 | 68 | OS | Unfavorable | IHC | ( | Combined | 1.6117 (1.3304–1.9524) | <0.0001 | √ |
| SETD2 | 153 | OS | Favorable | qRT-PCR and IHC | ( | Combined | 0.6946 (1.3304–1.0078) | 0.0603 | × |
| NDRG4 | 286 | OS | Unfavorable | IHC | ( | Combined | 1.4149 (1.1904–1.6818) | 1e-04 | √ |
| SPARC | 227 | OS | Unfavorable | qRT-PCR | ( | Combined | 1.4217 (1.1612–1.7407) | 7e-04 | √ |
| HOXB9 | 190 | OS | Favorable | IHC | ( | Combined | 0.8461 (0.7008–1.0215) | 0.0821 | × |
| DDC | 39 | OS | Favorable | qRT-PCR and IHC | ( | GSE22377 | 0.2805 (0.097–0.8108) | 0.0189 | √ |
| ERCC1 | 106 | OS | Unfavorable | qRT-PCR | ( | GSE62254 | 1.6054 (1.1377–2.2662) | 0.007 | √ |
| STAT3 | 50 | OS | Unfavorable | RT-PCR, Western blot and IHC | ( | TCGA | 1.4306 (1.0138–2.0188) | 0.0416 | √ |
| IGFBP7 | 247 | OS | Unfavorable | qRT-PCR and IHC | ( | Combined | 1.4577 (1.2253–1.7342) | <0.0001 | √ |
| TIMP3 | 17 | OS | Unfavorable | qRT-PCR and IHC | ( | Combined | 1.4779 (1.2283–1.7783) | <0.0001 | √ |
| KLK6 | 66 | OS | Unfavorable | qRT-PCR and IHC | ( | GSE62254 | 1.4878 (1.0523–2.1037) | 0.0246 | √ |
| NNMT | 641 | OS | Unfavorable | qRT-PCR | ( | Combined | 1.3734 (1.1529–1.6362) | 4e-04 | √ |
| ATAD2 | 166 | OS | Unfavorable | qRT-PCR | ( | GSE22377 | 3.4084 (1.5908–7.3027) | 0.0016 | √ |
| CXCR3 | 96 | OS | Favorable | RT-PCR and qRT-PCR | ( | Combined | 0.7203 (0.5923–0.8759) | 0.0010 | √ |
| SMYD3 | 166 | OS | Unfavorable | IHC | ( | Combined | 1.1857 (0.9798–1.435) | 0.0801 | × |
| S100A4 | 434 | OS | Unfavorable | qRT-PCR | ( | Combined | 1.2081 (1.0103–1.4447) | 0.0382 | √ |
HOXA5, homeobox A5; CAP2, cyclase-associated protein 2; LAMA4, laminin a4; MTBP, MDM2 binding protein; RAI14, retinoic acid induced 14; SETD2, SET domain containing; NDRG4, N-Myc downstream-regulated gene 4; SPARC: secreted protein acidic and rich in cysteine; HOXB9: homeodomain-containing transcription factor; DDC: Dopa decarboxylase; ERCC1, excision repair cross complementation group 1; STAT3, signal transducers and activators of transcription; IGFBP7, insulin-like growth factor binding protein 7; KLK6, kallikrein 6; NNMT, nicotinamide N-methyltransferase; ATAD2, ATPase family, AAA domain containing 2; CXCR3, chemokine receptor; SMYD3, MYND domain-containing protein 3; S100A4: S100 calcium binding protein A4.
Figure 4The prognostic value of E2F2 and E2F8 in GC patients (OS in OSgc). (A) KM survival plot for E2F2 suggests that its high expression (red) indicates favorable prognosis (GC patients were separated by the quarter of gene expression level); (B) KM survival plot for E2F8 suggests that its high expression (red) indicates favorable prognosis (GC patients were separated by the quarter of gene expression level); E2F2, E2F transcription factor 2; E2F8, E2F transcription factor 8; GC, gastric cancer; OS, overall survival.