| Literature DB >> 29117471 |
Oleg V Grinchuk1, Surya P Yenamandra1, Ramakrishnan Iyer1, Malay Singh1,2, Hwee Kuan Lee1,2, Kiat Hon Lim3, Pierce Kah-Hoe Chow3,4,5, Vladamir A Kuznetsov1.
Abstract
Currently, molecular markers are not used when determining the prognosis and treatment strategy for patients with hepatocellular carcinoma (HCC). In the present study, we proposed that the identification of common pro-oncogenic pathways in primary tumors (PT) and adjacent non-malignant tissues (AT) typically used to predict HCC patient risks may result in HCC biomarker discovery. We examined the genome-wide mRNA expression profiles of paired PT and AT samples from 321 HCC patients. The workflow integrated differentially expressed gene selection, gene ontology enrichment, computational classification, survival predictions, image analysis and experimental validation methods. We developed a 24-ribosomal gene-based HCC classifier (RGC), which is prognostically significant in both PT and AT. The RGC gene overexpression in PT was associated with a poor prognosis in the training (hazard ratio = 8.2, P = 9.4 × 10-6 ) and cross-cohort validation (hazard ratio = 2.63, P = 0.004) datasets. The multivariate survival analysis demonstrated the significant and independent prognostic value of the RGC. The RGC displayed a significant prognostic value in AT of the training (hazard ratio = 5.0, P = 0.03) and cross-validation (hazard ratio = 1.9, P = 0.03) HCC groups, confirming the accuracy and robustness of the RGC. Our experimental and bioinformatics analyses suggested a key role for c-MYC in the pro-oncogenic pattern of ribosomal biogenesis co-regulation in PT and AT. Microarray, quantitative RT-PCR and quantitative immunohistochemical studies of the PT showed that DKK1 in PT is the perspective biomarker for poor HCC outcomes. The common co-transcriptional pattern of ribosome biogenesis genes in PT and AT from HCC patients suggests a new scalable prognostic system, as supported by the model of tumor-like metabolic redirection/assimilation in non-malignant AT. The RGC, comprising 24 ribosomal genes, is introduced as a robust and reproducible prognostic model for stratifying HCC patient risks. The adjacent non-malignant liver tissue alone, or in combination with HCC tissue biopsy, could be an important target for developing predictive and monitoring strategies, as well as evidence-based therapeutic interventions, that aim to reduce the risk of post-surgery relapse in HCC patients.Entities:
Keywords: adjacent non-malignant tissue; co-transcription; hepatocellular carcinoma; personalized prognostic biomarkers; primary tumor; ribosome gene
Mesh:
Substances:
Year: 2017 PMID: 29117471 PMCID: PMC5748488 DOI: 10.1002/1878-0261.12153
Source DB: PubMed Journal: Mol Oncol ISSN: 1574-7891 Impact factor: 6.603
Clinicopathological characteristics in HCC patients’ cohorts used in the present study
| Clinical characteristic | Singapore (training) ( | LCI (validation) ( |
| |
|---|---|---|---|---|
| 1 | HBV infection (yes/no/–/%) | 53/43/19/46 | 187/4/15/91 | < 0.001 |
| 2 | HCV infection (yes/no/–/%) | 20/36/59/18 | – | – |
| 3 | Sex, male (yes/no/%) | 93/22/81 | 183/23/89 | 0.06 |
| 4 | Age (years) (≥50/<50/median) | 104/11/64 | 120/86/51 | < 0.001 |
| 5 | AFP (> 300 ng·mL−1/≤ 300 ng·mL−1/–/%) | 42/68/5/37 | 94/109/3/46 | 0.12 |
| 6 | Cirrhosis (yes/no/%) | 62/53/54 | 189/17/92 | < 0.001 |
| 7 | Multinodular/solitary tumors (yes/no/%) | 26/89/23 | 40/166/19 | 0.5 |
| 8 | Tumor size (> 5 cm/≤ 5 cm/%) | 65/50/57 | 71/134/34 | < 0.001 |
| 9 | Total nodules (yes/%) | |||
| 1 | 89/77 | – | – | |
| 2–7 | 16/14 | – | – | |
| > 7 | 10/9 | – | – | |
| 10 | Microscopic vascular invasion (yes/no/%) | 45/70/39 | – | – |
| 11 | Albumin Child points (yes/%) | |||
| 1 point | 74/64 | – | – | |
| 2 points | 34/30 | – | – | |
| 3 points | 7/6 | – | – | |
| 12 | Bilirubin Child points (yes/%) | |||
| 1 point | 107/93 | – | – | |
| 2 points | 7/6 | – | – | |
| 3 points | 0/0 | – | – | |
| 13 | Child‐Pugh status (yes/%) | |||
| Child‐Pugh A | 101/88 | – | – | |
| Child‐Pugh B | 13/11 | – | – | |
| Child‐Pugh C | 1/1 | – | – | |
| 14 | Milan criteria (beyond/within/%) | 75/40/65 | ||
| 15 | Edmondson tumor grade (3, 4/1, 2/–/%) | 62/50/3/53 | – | – |
| 16 | Platelets score (> 100/≤ 100/%) | 108/7/94 | – | – |
| 17 | Metastasis(imaging) (yes/no/–/%) | 2/110/3/2 | – | – |
| 18 | Presence of tumor capsule (yes/no/–/%) | 40/66/9/34 | – | – |
| 19 | Extra‐hepatic invasion(histology) (yes/no/%) | 1/114/1 | – | – |
| 20 | Portal vein invasion (yes/no/–/%) | 8/102/5/7 | – | – |
| 21 | Positive tumor margins (yes/no/%) | 12/103/10 | – | – |
| 22 | BCLC staging (yes/%) | |||
| 0 | 4/3 | 17/8 | 0.1 | |
| A | 75/65 | 133/65 | 0.8 | |
| B | 27/23 | 19/9 | < 0.001 | |
| C | 9/8 | 22/11 | 0.3 | |
| – | 0/0 | 15/7 | – | |
| 23 | TNM staging (II–IV/I/–/%) | 58/56/1/50 | 105/86/15/51 | 0.9 |
| 24 | Median follow‐up (OS), years (25–75th percentile) | 1.17 (0.45–3.12) | 4.36 (1.36–4.80) | < 0.001 |
| 25 | Overall death, | 25 (22) | 80 (39) | 0.002 |
Number of patients/percentage.
Total nodules, based on histology report, including satellite nodules.
Albumin Child points, Child‐Pugh Category score: 1 point = > 35 g·L−1; 2 points = 28–35 g·L−1; 3 points = < 28 g·L−1; Bilirubin Child points, Child‐Pugh Category score: 1 point = < 34.2 μmol·L−1; 2 points = 34.2–51.3 μmol·L−1; 3 points = > 51.3 μmol·L−1; Edmondson tumor grade: 1 = Grade 1; 2 = Grade 2; 3 = Grade 3; 4 = Grade 4. –, missing data.
Fisher's exact test (two‐sided).
Mann–Whitney test.
Figure 1Identification of CPG candidates for PT and AT as HCC prognostic biomarkers. (A, B) Traditional approaches previously used for identifying prognostic biomarkers in HCC. (B) Biomarker candidates may be pre‐selected using DEG analysis between PT and AT, followed by survival prognostic analysis. (C, D) Scheme for identification of tumor suppressor‐like and pro‐oncogenic CPGs in the Singapore HCC cohort using the 1‐D DDg method for survival prognostic analysis. Only the CPG subsets with identical 1‐D DDg design 1 (tumor suppressor‐like CPGs) or only with 1‐D DDg design 2 (pro‐oncogenic CPGs; see 2) in both PT and AT were selected. (E) An example of pro‐oncogenic CPG RPS3A identified in the Singapore HCC cohort. Kaplan–Meier survival curves were obtained using 1‐D DDg by fitting the expression values to survival data. Analyses were performed independently in PT and AT for each gene in the Singapore (n = 52) HCC cohort. Vertical bars and P‐values show the significant difference in the level of gene expression between the LR and HR patient subgroups (Mann–Whitney test). (F) FA/GO analysis of TER genes in two distinct biological contexts (david bioinformatics software). The results of the FA/GO enrichment analysis are presented for the pro‐oncogenic CPGs subset (1) 1‐D DDg design 2 [the top 1000 survival significant genes obtained according to the scheme shown in (D)] and for the subset of DEGs (2) significantly up‐regulated in PT compared to AT [the top 1000 significantly up‐regulated genes, obtained according the scheme shown in (B)]. Only significant representative FA/GO terms are shown; Fisher test P‐values (P < 0.05) are Benjamini corrected and –log10 transformed.
Pro‐oncogenic ribosomal genes of the 24‐gene HCC prognostic classifier
| Number | Host gene symbol | Illumina probe ID | RNA ID | Host gene description (UCSC genome browser) | Chromosome band |
|---|---|---|---|---|---|
| 1 | RPL9 | ILMN_1750507 |
| Ribosomal protein L9 | 4p13 |
| 2 | RPL12 | ILMN_2116366 |
| Ribosomal protein L12 | 9q34 |
| 3 | RPL26 | ILMN_1731546 |
| Ribosomal protein L26 | 17p13 |
| 4 | RPL37 | ILMN_2191634 |
| Ribosomal protein L37 | 5p13.1 |
| 5 | RPL31 | ILMN_1754195 |
| Ribosomal protein L31 | 2q11.2 |
| 6 | RPL41 | ILMN_2331890 |
| Ribosomal protein L41 | 12q13 |
| 7 | RPL30 | ILMN_1754303 |
| Ribosomal protein L30 | 8q22 |
| 8 | RPS9 | ILMN_1749447 |
| Ribosomal protein S9 | 19q13.4 |
| 9 | RPS15A | ILMN_1787949 |
| Ribosomal protein S15a | 16p12.3 |
| 10 | RPS25 | ILMN_1746516 |
| Ribosomal protein S25 | 11q23.3 |
| 11 | RPS11 | ILMN_1740587 |
| Ribosomal protein S11 | 19q13.3 |
| 12 | RPS4X | ILMN_2166831 |
| Ribosomal protein S4, X‐linked | Xq13.1 |
| 13 | RPL19 | ILMN_1701832 |
| Ribosomal protein L19 | 17q12 |
| 14 | RPL32 | ILMN_2400143 |
| Ribosomal protein L32 | 3q13.3‐q21 |
| 15 | RPS5 | ILMN_1707810 |
| Ribosomal protein S5 | 19q13.4 |
| 16 | RPL34 | ILMN_1706873 |
| Ribosomal protein L34 | 4q25 |
| 17 | RPL3 | ILMN_2319994 |
| Ribosomal protein L3 | 22q13 |
| 18 | RPL36 | ILMN_1685088 |
| Ribosomal protein L36 | 19p13.2 |
| 19 | RPS2 | ILMN_2218277 |
| Ribosomal protein S2 | 16p13.3 |
| 20 | RPL15 | ILMN_1762747 |
| Ribosomal protein L15 | 3p24.1 |
| 21 | RPS13 | ILMN_1777344 |
| Ribosomal protein S13 | 11p |
| 22 | RPL18A | ILMN_2141452 |
| Ribosomal protein l18a | 19p13.11 |
| 23 | RPS12 | ILMN_1782621 |
| Ribosomal protein S12 | 6q23 |
| 24 | RPL17 | ILMN_1655422 |
| Ribosomal protein L17 | 18q21 |
Figure 2Cross‐cohort validation of the RGC in PT. (A, B) The results of the SWV procedure for the selected 24 ribosomal genes in the Singapore (A) and LCI (B) HCC patient cohorts (see Results). Green: LR HCC patients (LR subgroup); blue: HR HCC patients (HR subgroup). (C, D) Kaplan–Meier survival curves for integrated patient partitions in the Singapore and LCI HCC cohorts, respectively. x‐axis: OS, years; y‐axis: patient survival probability. (E) Subclass association matrix obtained as a result summary of the SubMap analysis (see 3). The bottom left red quadrant indicates the significant similarity between the two LR subgroups, the top right quadrant indicates the significant similarity between the two HR subgroups (in the Singapore and LCI cohorts, respectively).
Figure 3RGC‐based stratification in AT and PT. (A, B) Training and cross‐cohort validation of the RGC in AT. Kaplan–Meier survival curves for integrated patient partitions in the Singapore and LCI HCC cohorts, correspondingly. (C) Contingency table for HCC patient stratification in PT and AT in the LCI cohort. The combined LR subgroup LR &A included the patients stratified only as LR using gene expression information from PT and AT (blue). The HR subgroup HR &A included the remaining patients (pink). (D) Combined stratification (cohort validation model) using information from PT and AT in the LCI cohort.
Univariate and multivariate analyses of clinical, pathological and molecular variables for overall survival in the training, validation and overall cohorts
| Univariate analysis | Multivariate analysis | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| Variables | Patients ( | Hazard ratio | 95% CI | Wald | Variables | Patients ( | Hazard ratio | 95% CI | Wald |
| Training cohort (Singapore) ( | |||||||||
| RGC (R1) | 115 | 8.2 | 3.23–20.8 | 9.4 × 10−6 | RGC (R1) | 115 | 6.4 | 2.38–17.19 | 0.0002 |
| Age (> 50 years) | 115 | 1.0 | 0.96–1.04 | 0.8 | – | – | – | – | – |
| Sex (female) | 115 | 0.81 | 0.18–3.54 | 0.8 | – | – | – | – | – |
| AFP (> 300 ng·mL−1) | 115 | 1.49 | 0.65–3.45 | 0.3 | – | – | – | – | – |
| HBV status | 96 | 1.74 | 0.71–4.27 | 0.23 | – | – | – | – | – |
| Tumor size (> 5 cm) | 115 | 1.33 | 0.75–3.31 | 0.5 | – | – | – | – | – |
| Multiple tumors | 115 | 1.71 | 0.69–4.20 | 0.24 | – | – | – | – | – |
| Total nodules | 115 | 1.36 | 0.80–2.30 | 0.25 | – | – | – | – | – |
| Microvascular invasion | 115 | 1.69 | 0.74–3.84 | 0.21 | – | – | – | – | – |
| Albumin child points | 115 | 1.95 | 1.07–3.57 | 0.03 | Albumin child points | 115 | 2.24 | 1.07–4.68 | 0.03 |
| Bilirubin Child points | 115 | 0.68 | 0.095.05 | 0.70 | – | – | – | – | – |
| Cirrhosis | 115 | 1.55 | 0.66–3.66 | 0.32 | – | – | – | – | – |
| Child status | 115 | 2.02 | 0.58–6.96 | 0.26 | – | – | – | – | – |
| Milan criteria | 115 | 0.81 | 0.29–2.24 | 0.68 | – | – | – | – | – |
| Edmondson tumor grade | 112 | 0.93 | 0.52–1.65 | 0.79 | – | – | – | – | – |
| Presence of tumor capsule | 106 | 1.03 | 0.41–2.57 | 0.9 | – | – | – | – | – |
| Portal vein invasion | 110 | 0.90 | 0.12–6.86 | 0.9 | – | – | – | – | – |
| Platelet score | 115 | 0.67 | 0.08–5.08 | 0.7 | – | – | – | – | – |
| Positive tumor margins | 115 | 3.07 | 0.98–9.51 | 0.05 | – | – | – | – | – |
| TNM staging | 115 | 1.72 | 1.16–2.56 | 0.007 | TNM staging | 115 | 1.21 | 0.62–2.40 | 0.66 |
| BCLC staging | 115 | 2.50 | 1.39–4.49 | 0.002 | BCLC staging | 115 | 1.73 | 0.66–4.58 | 0.26 |
| Validation cohort (LCI) ( | |||||||||
| RGC (R1) | 206 | 1.94 | 1.23–3.08 | 0.004 | RGC | 187 | 1.77 | 1.02–3.06 | 0.04 |
| Age (> 50 years) | 206 | 0.99 | 0.97–1.01 | 0.32 | – | – | – | – | – |
| Sex (female) | 206 | 0.95 | 0.55–1.68 | 0.8 | – | – | – | – | – |
| AFP (> 300 ng·mL−1) | 205 | 1.66 | 1.07–2.60 | 0.03 | AFP (> 300 ng·mL−1) | 187 | 1.01 | 0.58–1–68 | 0.97 |
| HBV status | 191 | 1.62 | 0.23–11.68 | 0.6 | – | – | – | – | – |
| Tumor size (> 5 cm) | 205 | 1.75 | 1.12–2.72 | 0.01 | Tumor size (> 5 cm) | 187 | 0.81 | 0.43–1.50 | 0.5 |
| Multiple tumors | 206 | 1.76 | 1.08–2.88 | 0.02 | Multiple tumors | 187 | 0.69 | 0.38–1.26 | 0.22 |
| Cirrhosis | 206 | 4.35 | 1.07–17.72 | 0.04 | Cirrhosis | 187 | 3.43 | 0.82–14.25 | 0.09 |
| TNM staging | 191 | 1.81 | 1.44–2.27 | 4.01 × 10−7 | TNM staging | 187 | 1.37 | 0.98–1.92 | 0.06 |
| BCLC staging | 191 | 2.14 | 1.65–2.79 | 1.4 × 10−8 | BCLC staging | 187 | 1.84 | 1.23–2.75 | 0.003 |
| Overall (Training + Validation cohorts) ( | |||||||||
| RGC | 321 | 2.63 | 1.77–3.92 | 1.6 × 10−6 | RGC | 302 | 2.48 | 1.59–3.85 | 5.5E–05 |
| Age (> 50 years) | 321 | 1.02 | 0.64–1.50 | 0.9 | Age (> 60 years) | – | – | – | – |
| Sex | 321 | 0.66 | 0.33–1.31 | 0.24 | Sex | – | – | – | – |
| AFP (> 300 ng·mL−1) | 318 | 1.44 | 1.06–1.96 | 0.02 | AFP (> 300 ng·mL−1) | 302 | 0.98 | 0.71–1.36 | 0.9 |
| HBV status | 287 | 1.48 | 0.71–3.08 | 0.3 | HBV status | – | – | – | – |
| HCV status | 56 | 0.61 | 0.13–2.90 | 0.5 | HCV status | – | – | – | – |
| Tumor size (> 5 cm) | 320 | 1.54 | 1.045–2.28 | 0.03 | Tumor size (> 5 cm) | 302 | 0.93 | 0.57–1.53 | 0.8 |
| Multiple tumors | 321 | 1.77 | 1.15–2.73 | 0.009 | Multiple tumors | 302 | 0.7 | 0.42–1.17 | 0.17 |
| Cirrhosis | 321 | 2.08 | 1.11–3.89 | 0.02 | Cirrhosis | 302 | 1.69 | 0.85–3.35 | 0.13 |
| TNM staging | 306 | 1.80 | 1.47–2.19 | 6.6 × 10−9 | TNM staging | 302 | 1.37 | 1.03–1.82 | 0.03 |
| BCLC staging | 306 | 2.19 | 1.72–2.78 | 2.1 × 10−10 | BCLC staging | 302 | 1.70 | 1.20–2.42 | 0.003 |
HBV and HCV status was based on serology and/or documented history.
Total nodules, based on histology report, including satellite nodules; total nodules are binned as: 1 = 1 nodule; 2 = 2–7 nodules; 3 = 7 or more nodules. Albumin Child points were binned according to Child‐Pugh Category score: 1 point = > 35 g·L−1; 2 points = 28–35 g·L−1; 3 points = < 28 g·L−1; Bilirubin Child points, Child‐Pugh Category score: 1 point = < 34.2 μmol·L−1; 2 points = 34.2–51.3 μmol·L−1; 3 points = > 51.3 μmol·L−1; Edmondson tumor grade binning: 1 = Grade 1; 2 = Grade 2; 3 = Grade 3; 4 = Grade 4; Child status binning: 1 = Child A; 2 = Child B; 3 = Child C; Platelets score binning: 1 = ≤ 100; 2 = > 100; Milan criteria binning: 1 = beyond; 2 = within.
TNM staging binning: 1 = Stage I (T1, N0, M0); 2 = Stage II (T2, N0, M0); 3 = Stage IIIA (T3a, N0, M0); 4 = Stage IIIb (T3b, N0, M0), Stage IIIc (T4, N0, M0), Stage IVa (Any T, N1, M0), Stage IVb (Any T, Any N, M1).
BCLC staging binning: 1 = Stage 0; 2 = Stage A; 3 = Stage B; 4 = Stage C. NA, data not available. –, excluded from the analysis. OS was used as the endpoint in all analyses.
Figure 4Comparison of biological pathways between the RGC‐derived HCC patient subgroups in PT and AT (david and MetaCore software). (A) david FA/GO heat map analysis (excluding 24 genes comprising the RGC) for the DEG subsets. Up‐regulated and down‐regulated DEGs in HR subgroups (HR and HR) after RGC‐stratification in PT and AT and common for the Singapore and LCI cohorts. (B) Venn diagram and MetaCore cellular compartments heatmap analysis of tissue‐specific and common prognostic DEGs for PT and AT up‐regulated in HR and HR subgroups in the two studied cohorts. Gray cells: nonsignificant FA/GO terms. Color gradient: significant FA/GO terms [Benjamini corrected Fisher test –log10 P‐values (P < 0.05)].
Figure 5The computational analyses and hypothetical data‐driven model suggest the role of the TER pathway in PT and AT of HCC patients. (A, B) Correlation analyses of in the Singapore and LCI cohorts in PT, AT. x‐axis: Kendall's Tau correlation coefficient; y‐axis: cumulative relative frequency. Black circles: correlation coefficients for TER gene sets (‘TER gene set (Singapore)’ and ‘TER gene set (LCI)’; white circles: correlation coefficients for random control gene set (see also the Supporting information: MYC as a key regulator of ribosomal pathway in HCC PT and AT). Dashed lines indicate medians for correlation coefficients distributions. (C) Frequencies of ChIP‐seq binding regions in HepG2 cells in the vicinity of proximal promoters (+200/−500 bp) in TER gene sets and DEGs up‐regulated in HR subgroups. x‐axis: various gene sets; y‐axis: frequency of ChIP‐seq binding regions (%). Differences in the frequencies assessed using Fisher's exact test (see the Supporting information: MYC as a key regulator of ribosomal pathway in HCC PT and AT). (D) MetaCore transcription factors interactors significantly enriched for the DEGs after RGC stratification in PT and AT. x‐axis: −log10 transformation of FDR‐corrected P‐values (P < 0.05); y‐axis: MetaCore terms for transcription factor (TF) interactors. (E) Correlation analysis of MYC with eight representative TER genes and five genes involved in liver metabolism based on quantitative RT‐PCR data from 92 PT samples from the Singapore cohort. PCC, Pearson's correlation coefficient. (F) Hypothetical data‐driven model of the TER pathway in PT and AT of HCC patients. Red and blue arrows: genes up‐regulated and genes down‐regulated in HR HCC patient subgroups, respectively. TER, translation elongation/ribosomal genes; CC/CD, cell cycle/cell division genes; EE and FA, extracellular exosome and focal adhesion genes, respectively. LM, genes involved in liver metabolism.
Figure 6Gene expression heatmap for representative genes of common prognostic pathways in the nonstratified and RGC‐stratified PT and AT. The heatmap before (A, B) and after (C, D) stratification of HCC patients using the RGC. (A, C) Singapore cohort. (B, D) LCI HCC cohort. The pathways for the up‐regulated or down‐regulated DEGs in HR subgroups were selected as gene sets enriched under the specific FA/GO terms (Table S12). TER, genes enriched under the FA/GO term ‘GO:0006414~translational elongation’ in PT; CC/CD: cell cycle/cell division genes enriched under the FA/GO terms ‘GO:0007049~cell cycle’ and ‘cell division’ (SP_PIR_KEYWORDS); EE and FA, extracellular exosome and focal adhesion genes enriched under the ‘GO localization’ terms (MetaCore) ‘extracellular exosome’ and ‘focal adhesion’, respectively. LM, representative mitochondrial and oxidoreductase DEGs down‐regulated in HR subgroups in both PT and AT and enriched under the FA/GO terms ‘GO:0005739~mitochondrion’ and ‘oxidoreductase’. Mitochondria is known as an integrative energy hub of diverse liver metabolism pathways (Degli Esposti et al., 2012). Heat map spectrum displays log2 transformed gene expression values.