| Literature DB >> 27006471 |
Lu Ao1,2, Xuekun Song1, Xiangyu Li2, Mengsha Tong2, You Guo2, Jing Li2, Hongdong Li2, Hao Cai2, Mengyao Li2, Qingzhou Guan2, Haidan Yan2, Zheng Guo1,2.
Abstract
Previously reported prognostic signatures for predicting the prognoses of postsurgical hepatocellular carcinoma (HCC) patients are commonly based on predefined risk scores, which are hardly applicable to samples measured by different laboratories. To solve this problem, using gene expression profiles of 170 stage I/II HCC samples, we identified a prognostic signature consisting of 20 gene pairs whose within-sample relative expression orderings (REOs) could robustly predict the disease-free survival and overall survival of HCC patients. This REOs-based prognostic signature was validated in two independent datasets. Functional enrichment analysis showed that the patients with high-risk of recurrence were characterized by the activations of pathways related to cell proliferation and tumor microenvironment, whereas the low-risk patients were characterized by the activations of various metabolism pathways. We further investigated the distinct epigenomic and genomic characteristics of the two prognostic groups using The Cancer Genome Atlas samples with multi-omics data. Epigenetic analysis showed that the transcriptional differences between the two prognostic groups were significantly concordant with DNA methylation alternations. The signaling network analysis identified several key genes (e.g. TP53, MYC) with epigenomic or genomic alternations driving poor prognoses of HCC patients. These results help us understand the multi-omics mechanisms determining the outcomes of HCC patients.Entities:
Keywords: gene expression; hepatocellular carcinoma; multi-omics; prognostic signature; relative expression orderings
Mesh:
Year: 2016 PMID: 27006471 PMCID: PMC5029687 DOI: 10.18632/oncotarget.8212
Source DB: PubMed Journal: Oncotarget ISSN: 1949-2553
Figure 1The workflow for construction and validation of the prognostic signature
The workflow showed four major analysis steps: the development (step 1) and validation (step 2) of the gene pairs signature; multi-omics characteristics analyses of distinct prognostic groups (step 3) and the SIGNOR network analysis for HCC prognostic genes (step 4).
Description of the datasets used in this study
| HCC170 | HCC60 | HCC314 | ||
|---|---|---|---|---|
| GSE14520 | GSE14520 | E-TABM-36 | TCGA | |
| GPL3921 | GPL571 | IlluminaHiseq-RNAseqV2 | ||
| 170 | 21 | 39 | 314 | |
| 51.5 (21–77) | 47 (33–76) | 68 (18–79) | 61 (18–90) | |
| 53.4 (3.3–67.4) | 34.4 (1.8–63.8) | 35 (3–60) | 10.93 (0–122.5) | |
| Male | 143 (84%) | 20 (95%) | 31 (79%) | 219 (70%) |
| Female | 27 (16%) | 1 (5%) | 8 (21%) | 95 (30%) |
| I | 93 | 3 | 0 | 154 |
| II | 77 | 1 | 0 | 71 |
| III | 0 | 2 | 0 | 65 |
| IV | 0 | 0 | 0 | 4 |
| NA | 0 | 15 | 39 | 20 |
| AVR-CC | 46 | 2 | 0 | 0 |
| CC | 119 | 4 | 0 | 0 |
| NA | 5 | 15 | 39 | 314 |
| yes | 153 | 20 | 0 | 6 |
| no | 17 | 1 | 0 | 3 |
| NA | 0 | 0 | 39 | 305 |
| > 300 ng/mL | 66 | 10 | 0 | 56 |
| < = 300 ng/mL | 101 | 10 | 0 | 178 |
| NA | 3 | 1 | 39 | 80 |
Abbreviations: NA, not available; AVR-CC, active viral replication chronic carrier; CC, chronic carrier.
The 20-gene-pair prognostic signature
| Signature | Gene A | Gene B | C-index | Beta | pFDR |
|---|---|---|---|---|---|
| Pair1 | CAT | EBAG9 | 0.6581 | 1.19 | 4.31 × 10−06 |
| Pair2 | RNF208 | MTMR10 | 0.6121 | 0.96 | 9.20 × 10−05 |
| Pair3 | TUBA8 | MTMR10 | 0.6069 | 0.92 | 1.01 × 10−03 |
| Pair4 | RNF208 | GULP1 | 0.5886 | 0.79 | 5.25 × 10−04 |
| Pair5 | ST18 | ADAMTS3 | 0.5714 | 0.61 | 6.10 × 10−03 |
| Pair6 | OSBPL10 | NPM3 | 0.5679 | 0.74 | 2.81 × 10−03 |
| Pair7 | TUBA8 | NPM3 | 0.5583 | 0.60 | 2.06 × 10−02 |
| Pair8 | TRIM26 | TRMT12 | 0.5565 | 1.20 | 2.17 × 10−04 |
| Pair9 | RNF208 | OSBPL10 | 0.5562 | 1.33 | 8.85 × 10−03 |
| Pair10 | TUBA8 | PF4V1 | 0.5502 | 1.29 | 1.68 × 10−04 |
| Pair11 | ST18 | NTS | 0.5469 | 0.49 | 4.97 × 10−02 |
| Pair12 | CAT | TRMT12 | 0.5431 | 1.74 | 8.87 × 10−05 |
| Pair13 | RNF208 | SLC52A2 | 0.5362 | 1.22 | 2.82 × 10−02 |
| Pair14 | SORBS2 | HSF1 | 0.5359 | 1.34 | 1.26 × 10−03 |
| Pair15 | RNF208 | HSF1 | 0.5342 | 1.97 | 3.55 × 10−02 |
| Pair16 | SORBS2 | SLC52A2 | 0.5337 | 1.82 | 1.31 × 10−04 |
| Pair17 | PYGL | EIF3H | 0.5333 | 1.97 | 3.56 × 10−02 |
| Pair18 | CAT | NOLC1 | 0.5326 | 2.35 | 3.84 × 10−05 |
| Pair19 | RNF208 | SLC2A1 | 0.5269 | 0.70 | 7.22 × 10−02 |
| Pair20 | EYA1 | PF4V1 | 0.5264 | 1.13 | 7.67 × 10−03 |
Note: Beta the parameters calculated by the univariate Cox regression model. Beta represents the risk coefficient of within-sample REO of gene pair (A, B), where Beta > 0 indicates that E < E is a risk factor, and vice versa. pFDR, the adjusted P-value by storey.
Figure 2The Kaplan-Meier curves of disease-free survival and overall survival for prognostic groups predicted by the 20-gene-pair in the training and validation datasets
Kaplan-Meier curves of disease-free survival (A) and overall survival (B) for the training dataset HCC170; Kaplan-Meier curves of disease-free survival (C) and overall survival (D) for the validation dataset HCC60; Kaplan-Meier curves of overall survival (E) for the validation dataset HCC314. A sample was classified into the high-risk group (red line) if and only if at least 10 of the 20 prognostic gene pairs voted for high-risk; otherwise, the low-risk group (blue line).
Univariate and multivariate Cox regression analyses for the 20-gene-pair signature
| Variables | Univariate model | Multivariate model | ||
|---|---|---|---|---|
| HR (95% CI) | HR (95% CI) | |||
| Predictive signature (high vs. low) | 5.97 (3.78–9.44) | 5.92 (3.68–9.53) | ||
| Stage (II vs. I) | 2.03 (1.32–3.11) | 1.62 (1.03–2.54) | ||
| HBV (AVR-CC vs. CC) | 1.41 (0.89–2.22) | 0.14 | 1.14 (0.88–1.81) | 0.58 |
| Cirrhosis (yes vs. no) | 2.14 (0.87–5.28) | 0.09 | 1.60 (0.63–4.05) | 0.32 |
| AFP (> 300 ng/mL vs. < = 300 ng/mL) | 0.92 (0.59–1.44) | 0.73 | 0.73 (0.46–1.14) | 0.17 |
| HCC314 | ||||
| Predictive signature (high vs. low) | 1.94 (1.21–3.14) | 1.87 (1.12–3.11) | ||
| Stage (IV/III vs. II/I) | 1.62 (0.91–2.89) | 0.10 | 1.53 (0.85–2.75) | 0.16 |
NOTE: Bold indicates significant P values.
Abbreviations: HBV, hepatitis B virus; AVR-CC, active viral replication chronic carrier; CC, chronic carrier.
Figure 3The SIGNOR sub-network for HCC prognosis “drivers”
This sub-network included prognosis “drivers”, which have distinct epigenomic and/or genomic alternations with concordant expression changes in the high-risk patients, and other “non-drivers” prognosis-associated DEGs directly linked to the drivers. Circle nodes represent “non-drivers” and the other nodes represent various types of drivers. Rectangle, amplified genes with concordant overexpressions in the high-risk group; Diamond, hypermethylated (or hypomethylated) genes with underexpression (or overexpression) in the high-risk group; Triangle, genes with frequently mutated in the high-risk group; Hexagon, overexpressed genes with hypomethylation and mutation in the high-risk group. The node colors indicate genes’ overexpression (Red) or underexpression (Blue) states in the high-risk group.