| Literature DB >> 32587519 |
Yang An1, Qiang Wang1, Guosen Zhang1, Fengjie Sun1, Lu Zhang1, Haojie Li1, Yingkun Li1, Yanyu Peng1, Wan Zhu2, Shaoping Ji1, Xiangqian Guo1.
Abstract
Liver hepatocellular carcinoma (LIHC) is one of the most common malignant tumors in the world with an increasing number of fatalities. Identification of novel prognosis biomarker for LIHC may improve treatment and therefore patient outcomes. The availability of public gene expression profiling data offers the opportunity to discover prognosis biomarkers for LIHC. We developed an online consensus survival analysis tool named OSlihc using gene expression profiling and long-term follow-up data to identify new prognosis biomarkers. OSlihc consists of 637 cases from four independent cohorts. As a risk assessment tool, OSlihc generates the Kaplan-Meier survival plot with hazard ratio (HR) and p value to evaluate the prognostic value of a gene of interest. To test the reliability of OSlihc, we analyzed 65 previous reported prognostic biomarkers in OSlihc and showed that all of which have significant prognostic values. Furthermore, we identified four novel potential prognostic biomarkers (ATG9A, WIPI1, CXCL1, and CSNK2A2) for LIHC, the elevated expression of which predict the unfavorable survival outcomes. These genes (ATG9A, WIPI1, CXCL1, and CSNK2A2) may be potentially new biomarkers to identify at-risk LIHC patients when further validated. By OSlihc, users can evaluate the prognostic abilities of genes of their interest, which provides a platform for researchers to identify prognostic biomarkers to further develop targeted therapy strategies for LIHC patients. OSlihc is public and free to the users at http://bioinfo.henu.edu.cn/LIHC/LIHCList.jsp.Entities:
Keywords: gene expression profiling; liver hepatocellular carcinoma; prognosis biomarker; survival analysis; survival outcome
Year: 2020 PMID: 32587519 PMCID: PMC7298068 DOI: 10.3389/fphar.2020.00875
Source DB: PubMed Journal: Front Pharmacol ISSN: 1663-9812 Impact factor: 5.810
Summary of datasets in OSlihc.
| Dataset | Sample size | Data type | Platform | No. of death | Median of OS (months) | Survival terms |
|---|---|---|---|---|---|---|
| TCGA | 361 | RNA-seq | Illumina HiSeqV2 | 129 | 20.03 | OS, DSS, DFI, PFI |
| GSE76427 | 115 | cDNA array | GPL10558 | 23 | 13.92 | OS, RFS |
| GSE20140 | 80 | cDNA array | GPL5474 | 32 | 96.15 | OS |
| GSE27150 | 81 | cDNA array | GPL13128 | 28 | 39 | OS |
| Total | 637 | 212 | 22.43 |
Figure 1Flowchart of web server architecture.
Figure 2Survival analysis of clinical characteristics of the patients included in OSlihc. (A) Pathologic stage, (B) BCLC stage, (C) Serum AFP level.
Figure 3Overview of OSlihc. (A) Screenshot of OSlihc main interface. (B–F) Input and output interface of OSlihc.
Validation of previously published prognostic biomarkers for LIHC in OSlihc.
| Gene symbol | Biomarker name | Clinical survival terms | In OSlihc | In reference | Prognostic outcome (higher expression) | Ref. | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Cut-off | p value | HR | 95%CI | Case | Cut-off | p value | case | Detection level | Validation | |||||
| SPP1 | Osteopontin (OPN) | OS | Upper 25% | 0 | 2.3521 | 1.6394-3.3747 | 361 | / | <0.05 | 110 | serum | Yes, serum biochemical assay | worse | ( |
| DSS | 0.0126 | 1.8422 | 1.1403-2.9763 | / | ||||||||||
| DFI | 0.0183 | 1.5 | 1.071-2.1008 | DFS: p < 0.05 | ||||||||||
| PFI | 0.0078 | 1.5611 | 1.1242-2.1679 | / | ||||||||||
| BIRC5 | Survivin | OS | Upper 25% | 4e-04 | 1.9443 | 1.3457-2.8093 | 361 | upper n = 31/lower n = 41 | / | 72 | protein | Yes, IHC assay | worse | ( |
| DSS | 2e-04 | 2.4062 | 1.5199-3.8095 | / | ||||||||||
| DFI | 9e-04 | 1.753 | 1.2585-2.4419 | DFS: 0.0098 | ||||||||||
| PFI | 0.0012 | 1.7165 | 1.2375-2.3809 | / | ||||||||||
| MKI67 | Ki-67 | OS | Upper 25% | 6e-04 | 1.9063 | 1.3174-2.7586 | 361 | upper n = 47/lower n = 0 | 0.0009 | 67 | protein (Ki-67 labeling index >10% indicated poor prognosis) | Yes, IHC assay | worse | ( |
| DSS | 0.0011 | 2.1783 | 1.3664-3.4729 | / | ||||||||||
| DFI | 0.002 | 1.6821 | 1.2089-2.3404 | DFS: 0.02 | ||||||||||
| PFI | 0.0027 | 1.6475 | 1.1891-2.2827 | / | ||||||||||
Figure 4Validation of the top three high-frequency reported biomarkers in OSlihc. Kaplan–Meier plots for (A) SPP1, (B) MKI67, and (C) BIRC5 in terms of OS. p-value, confidence interval (95%CI) and number at risk are as shown. The y-axis represents survival rate and the x-axis represents survival time (months). p = 0 denotes p < 0.0001.
Evaluation of novel potential prognostic biomarkers for LIHC in OSlihc.
| Gene symbol | In OSlihc | case | ||||
|---|---|---|---|---|---|---|
| cut-off | survival terms | p value | HR | 95%CI | ||
| ATG9A | upper 30% | OS | 0.0157 | 1.4513 | 1.0727–1.9633 | 556 |
| OS | 0.0012 | 1.7919 | 1.2594–2.5496 | 361 | ||
| DSS | 0.0246 | 1.6859 | 1.0693–2.6581 | |||
| DFI | no significance | |||||
| PFI | no significance | |||||
| WIPI1 | upper 25% | OS | <0.0001 | 1.9141 | 1.4027–2.6119 | 556 |
| OS | 0.0176 | 1.5866 | 1.0837–2.323 | 361 | ||
| DSS | no significance | |||||
| DFI | no significance | |||||
| PFI | no significance | |||||
| CXCL1 | upper 30% | OS | no significance | 636 | ||
| OS | 0.0081 | 1.6312 | 1.1356–2.3432 | 361 | ||
| DSS | 0.0454 | 1.6077 | 1.0098–2.5595 | |||
| DFI | no significance | |||||
| PFI | no significance | |||||
| CSNK2A2 | upper 25% | OS | 5e-04 | 1.7411 | 1.2721–2.383 | 556 |
| OS | 0.0348 | 1.5271 | 1.0308–2.2623 | 361 | ||
| DSS | 0.0197 | 1.7965 | 1.0979–2.9396 | |||
| DFI | 0.0208 | 1.4965 | 1.0631–2.1065 | |||
| PFI | 0.0254 | 1.4692 | 1.0485–2.0586 | |||
Figure 5Evaluation of the prognostic values of potential biomarkers in OSlihc. Kaplan–Meier plots for high (red) and low (green) ATG9A, WIPI1, CXCL1, or CSNK2A2-expression cohort in combined dataset (A, C, F) and TCGA dataset (B, D, E, G). p-value, confidence interval (95%CI) and number at risk are as shown. The y-axis represents survival rate and the x-axis represents survival time (months).