Siyu Xiao1, Shanzhou Huang2, Jie Yang3. 1. Department of Clinical Laboratory, The Sixth Affiliated Hospital, Sun Yat-sen University, Guangzhou, People's Republic of China. 2. Department of General Surgery, Guangdong Provincial People's Hospital, Guangdong Academy of Medical Sciences, School of Medicine, South China University of Technology, Guangzhou, People's Republic of China. 3. Guangdong Lung Cancer Institute, Guangdong Provincial People's Hospital, Guangdong Academy of Medical Sciences, School of Medicine, South China University of Technology, Guangzhou, People's Republic of China.
Abstract
PURPOSE: Hepatocellular carcinoma (HCC) is one of the most commonly diagnosed digestive cancers and the fourth leading cause of death worldwide. Long noncoding RNAs (lncRNAs) with key roles in HCC development and progression have emerged and are used in the diagnosis and prognostic prediction of HCC. The lncRNA gradually increased during hepatocarcinogenesis (GIHCG) is a novel lncRNA with aberrant expression in many tumors, but its prognostic value and biological role in HCC have not been fully studied. Thus, the aim of this study was to explore the expression pattern and potential biological role of GIHCG in HCC. PATIENTS AND METHODS: The expression pattern of GIHCG in HCC was analyzed in our HCC cohort and validated in The Cancer Genome Atlas (TCGA) database. To assess the prognostic value of GIHCG, survival analyses and Cox regression analyses were carried out in two HCC cohorts. Functional enrichment analysis was used to predict the gene sets and pathways related to aberrant GIHCG expression. Furthermore, the relationship between GIHCG expression and immune infiltration in HCC was analyzed. RESULTS: GIHCG was highly expressed in HCC tissues compared with normal liver tissues. In addition, high GIHCG expression was significantly correlated with inferior clinicopathological characteristics and shorter survival times. High GIHCG expression was an independent prognostic factor for overall survival and disease-free survival in the HCC cohort in our center and in the TCGA-LIHC cohort. Hallmark terms such as "G2M checkpoint", "MYC targets" and "DNA repair" were enriched in the GIHCG high-expression groups. High GIHCG expression negatively correlated with the infiltration of memory CD4+ and CD8+ T cells, natural killer cells, macrophages, dendritic cells, neutrophils and monocytes. CONCLUSION: The findings of our study indicate that GIHCG is a biomarker that can be used to predict the prognosis of HCC patients and is correlated with immune cell infiltration in HCC.
PURPOSE: Hepatocellular carcinoma (HCC) is one of the most commonly diagnosed digestive cancers and the fourth leading cause of death worldwide. Long noncoding RNAs (lncRNAs) with key roles in HCC development and progression have emerged and are used in the diagnosis and prognostic prediction of HCC. The lncRNA gradually increased during hepatocarcinogenesis (GIHCG) is a novel lncRNA with aberrant expression in many tumors, but its prognostic value and biological role in HCC have not been fully studied. Thus, the aim of this study was to explore the expression pattern and potential biological role of GIHCG in HCC. PATIENTS AND METHODS: The expression pattern of GIHCG in HCC was analyzed in our HCC cohort and validated in The Cancer Genome Atlas (TCGA) database. To assess the prognostic value of GIHCG, survival analyses and Cox regression analyses were carried out in two HCC cohorts. Functional enrichment analysis was used to predict the gene sets and pathways related to aberrant GIHCG expression. Furthermore, the relationship between GIHCG expression and immune infiltration in HCC was analyzed. RESULTS: GIHCG was highly expressed in HCC tissues compared with normal liver tissues. In addition, high GIHCG expression was significantly correlated with inferior clinicopathological characteristics and shorter survival times. High GIHCG expression was an independent prognostic factor for overall survival and disease-free survival in the HCC cohort in our center and in the TCGA-LIHC cohort. Hallmark terms such as "G2M checkpoint", "MYC targets" and "DNA repair" were enriched in the GIHCG high-expression groups. High GIHCG expression negatively correlated with the infiltration of memory CD4+ and CD8+ T cells, natural killer cells, macrophages, dendritic cells, neutrophils and monocytes. CONCLUSION: The findings of our study indicate that GIHCG is a biomarker that can be used to predict the prognosis of HCC patients and is correlated with immune cell infiltration in HCC.
Hepatocellular carcinoma (HCC) is the third most common digestive cancer and the fourth leading cause of cancer death worldwide.1 The 5-year survival rate of HCC is lower than 20%,2 which is primarily attributed to the difficulty of early diagnosis via current biomarkers.3 As a result, most HCC patients are diagnosed at an advanced stage, at which point the opportunity to receive curative treatments is lost.4,5 Moreover, even for those undergoing surgical resection, recurrence remains a major problem, and half of the patients with recurrence die within 1 year. Thus, an urgent need to develop new approaches for early diagnosis, real-time monitoring, and effective treatment of HCC remains.Long noncoding RNAs (lncRNAs) make up a large class of noncoding transcripts that are usually greater than 200 nucleotides in length. Multiple studies have reported that aberrant lncRNA expression affects tumor cell growth, apoptosis, invasion, and metastasis. LncRNAs may be useful in cancer diagnosis and prognostic evaluations and some functional lncRNAs may serve as potential therapeutic targets based on their key roles in vital biological processes of tumor development and progression.6–12One lncRNA known as gradually increased during hepatocarcinogenesis (GIHCG) is a functional lncRNA that was recently identified by Sui et al in HCC.13 The GIHCG gene alias is GIHCG inhibitor of miR-200b/200a/429 expression. The GIHCG gene has 4727 bases and is located on 12q14.1, and the Ensembl ID of GIHCG is ENSG00000257698 (Ensembl version: ENSG00000257698.2). The former transcript name was RP11-620J15.3 (Ensembl version: ENSG00000257698.1). By epigenetically inhibiting miR-200b/a/429 expression, Sui et al showed that GIHCG affects the proliferation, migration, and invasion of HCC cell lines.13 In addition, GIHCG has been reported to be overexpressed in and promote tumor progression of renal cell carcinoma, tongue squamous cell carcinoma, ovarian cancer, cervical cancer, breast cancer, colorectal cancer and gastric cancer.14–21We hypothesized that GIHCG may act as an oncogene during tumorigenesis, however the prognostic value of GIHCG and its possible biological functions in HCC have yet to be fully assessed. Our study aims to assess the prognostic value of GIHCG in both the public HCC database and our clinical HCC cohort and to explore potential pathway correlations with GIHCG expression.
Patients and Methods
SYSU HCC Sample Collection and Follow-Up
The present study included 120 patients with HCC who underwent R0 surgical resection between July 2013 and December 2014 from the First Affiliated Hospital, Sun Yat-sen University (SYSU). One hundred HCC samples were formalin-fixed and paraffin-embedded as the main evaluation cohort. The other 20 fresh HCC tissues with paired adjacent normal tissue (ANT) samples were obtained and stored in RNAlater solution (Invitrogen, USA) immediately after resection and then frozen in liquid nitrogen until further use, as previously described.22 None of the enrolled patients received any radiotherapy or chemotherapy before surgery, and all specimens were confirmed by pathology. Data on the clinical and pathological characteristics, such as tumor stage, differentiation grade, presentation of tumor thrombi, HBsAg status and AFP level, were retrieved from the inpatient database. Overall survival (OS) was defined as the period from resection to death or the last follow-up. Disease-free survival (DFS) was defined as the period between surgery and death or first confirmed relapse until the last follow-up date in December 2018.This study conformed to the 1964 Declaration of Helsinki and was approved by the Ethics Committee of the First Affiliated Hospital, Sun Yat-sen University. Written informed consent was obtained from all patients for the use of their tissue samples.
RNA Extraction and GIHCG Expression Measurement
Total RNA was extracted from approximately 100 mg of tissue using the Qiagen RNeasy Mini Kit according to the manufacturer’s instructions as previously described.23 To detect GIHCG expression in HCC and ANTs, Quantitative real-time PCR (qRT-PCR) was performed using the Power SYBR Green PCR Master Mix (Applied Biosystems, USA) according to the manufacturer’s instructions with gene-specific primers. Primers were designed as follows:GIHCG forward primer, 5′-CTTTCAAGAAGTTTGGCTGTC-3′; GIHCG reverse primer: 5′-GCTCATTCAACGGATAAGTC-3′; GAPDH forward primer: 5′-TGTGGGCATCAATGGATTTGG-3′; GAPDH reverse primer: 5′-ACACCATGTATTCCGGGTCAAT-3′.
Chromogenic in situ Hybridization (CISH)
To evaluate the expression level of GIHCG, CISH was applied to SYSU HCC samples. All slices were fixed for 2–12 hrs with 4% paraformaldehyde (DEPC, Servicebio, Wuhan, China), incubated in proteinase K (Servicebio) for 30 mins and rinsed three times with PBS (Servicebio). Next, the samples were prehybridized in hybridization buffer (Servicebio) at 37°C for 1 hr. Then, fresh hybridization buffer containing 8 ng/mL of the corresponding probe was used to replace the prehybridization buffer, and the samples were incubated at 37°C overnight. Finally, nuclear fast red (Servicebio) staining was performed to visualize the cell nuclei of the samples, and the samples were mounted in neutral balsam (Sinopharm Chemical Reagent Co., Ltd) and examined by bright-field microscopy. Based on the staining intensity, the patients were divided into high- or low-expression groups as previously described.22
Public Datasets Download and Validation
To determine the expression profile of GIHCG in HCC, we downloaded the expression data, clinical information and mutation data from The Cancer Genome Atlas (TCGA) Liver Hepatocellular Carcinoma (TCGA-LIHC) dataset (April 21, 2020) using the R package TCGAbiolinks24 (version 2.14.0). GIHCG expression levels were compared using transcripts per kilobase million (TPM) normalization. After exclusion of repeat samples from the same patient, 371 HCC and 50 paired normal liver tissues were included for expression analysis. In addition, patients with any missing or insufficient data on TNM stage, OS or DFS were excluded from the survival analysis and correlation analysis of GIHCG with clinicopathologic characteristics. Finally, a total of 347 HCC patients were included for the correlation and survival analysis. High- and low-expression patients were divided according to the median expression level of GIHCG. Because a limited number of normal liver samples were available in the TCGA-LIHC dataset, the expression of GIHCG in HCC and normal liver tissue was further validated in GEPIA, a web server for analyzing the RNA-sequencing data from TCGA and thousands of normal samples from GTEx projects ().
Functional Enrichment Analysis for GIHCG
Differentially expressed genes between the high- and low-expression groups were identified using the R package Limma25 (version 3.42.2). Gene Set Enrichment Analysis (GSEA) and Gene Ontology (GO) analyses were conducted with the R package (clusterProfiler,26 version 3.14.3) to explore different molecular mechanisms and involved pathways between high- and low-expression groups in the TCGA-LIHC dataset. Hallmark gene sets and molecular signatures were obtained from the Molecular Signatures Database. Normalized enrichment scores were acquired using gene set permutations 1000 times, and a cutoff P-value of 0.05 was used to filter the significant enrichment results.
Immune Cell Infiltrates Analysis
To evaluate the correlation between GIHCG expression and tumor-infiltrating immune cells (TIICs), we performed single-sample gene set enrichment analysis (ssGSEA),27 a method used to transform the bulk transcriptomic expression data into relative abundance of specific immune cell types, to quantitatively identify the proportion of TIICs in the tumor microenvironment. Feature gene panels for each immune cell type were obtained and referenced from recent publications.28,29 The ssGSEA enrichment scores of each immune cell type were calculated by using TCGA-LIHC expression data and R package GSVA30 (version 1.34.0). The immune cell infiltration profiles and marker genes of different immune cell subsets was compared between the GIHCG high- and low-expression groups. For heatmap visualization, the ssGSEA scores were normalized to show the relative abundance of infiltrating immune cell populations.
Statistical Analysis
All statistical analyses were performed using R software (version 3.6.2) with the corresponding packages. The Shapiro–Wilk method was used for normality tests. The mean ± standard deviation (mean ± SD) is presented for normally distributed variables, and the median is presented for nonnormally distributed variables. The relationships between categorical variables were compared using the χ2 test, and continuous variables were analyzed with Student’s t-test. Kaplan-Meier survival analysis and the Log-rank test were employed to analyze OS and DFS (R package survival, version 3.1–8). Univariate and multivariate Cox regression analyses were performed to evaluate the prognostic value of GIHCG.
Results
GIHCG Expression in HCC and Normal Tissues
GIHCG expression was first investigated in the SYSU HCC cohort. CISH staining revealed that HCC tissues exhibited significantly higher GIHCG expression than the adjacent normal liver tissue. Representative photos showed GIHCG low- and high-expression in normal liver tissue (Figure 1A) and HCC (Figure 1B). The relative GIHCG mRNA expression was significantly higher in HCC tissues compared to normal tissues (Figure 1C).
Figure 1
GIHCG expression in HCC and normal liver samples. (A and B) Representative photos (magnification: 20× and 40×) of GIHCG expression in adjacent normal liver tissues (A) and HCC (B) tissues. (C) GIHCG expression was examined by qRT-PCR in 20 paired HCC tissues and adjacent normal liver tissues. (D and E) Comparison of GIHCG expression in HCC and normal liver using the TCGA-LIHC cohort containing 371 HCC samples and 50 paired adjacent normal liver tissue samples as shown in Panel B; (F) Comparison expression of GIHCG in HCC (TCGA-LIHC: 369 HCC samples, excluding 2 repeated patients) and normal liver (GETx: 160 donation liver samples).
GIHCG expression in HCC and normal liver samples. (A and B) Representative photos (magnification: 20× and 40×) of GIHCG expression in adjacent normal liver tissues (A) and HCC (B) tissues. (C) GIHCG expression was examined by qRT-PCR in 20 paired HCC tissues and adjacent normal liver tissues. (D and E) Comparison of GIHCG expression in HCC and normal liver using the TCGA-LIHC cohort containing 371 HCC samples and 50 paired adjacent normal liver tissue samples as shown in Panel B; (F) Comparison expression of GIHCG in HCC (TCGA-LIHC: 369 HCC samples, excluding 2 repeated patients) and normal liver (GETx: 160 donation liver samples).This expression pattern was then validated in 371 HCC and 50 adjacent normal liver tissues using the TCGA-LIHC dataset. In the TCGA-LIHC cohort, GIHCG expression was significantly higher in tumors than in normal tissues (P-value < 0.001). GIHCG expression in HCC and normal liver samples is plotted in Figure 1D. Furthermore, in the 50 paired adjacent normal tissues from the same HCC patients, GIHCG expression was significantly upregulated in the HCC tissues compared with the adjacent normal liver tissues (Figure 1E). Using the online GEPIA databases, which included many more normal liver tissues from the GTEx projects, GIHCG expression was also significantly upregulated in HCC tissues (n = 369) compared with normal liver tissues (n = 160, Figure 1F). These results demonstrate that GIHCG is highly expressed in HCC tissues compared to normal tissues.
Correlation of GIHCG Expression with Clinicopathological Characteristics
To investigate the association between GIHCG expression and clinical factors in HCC patients, we analyzed the GIHCG expression in different clinicopathological subgroups in both the SYSU HCC and TGCA-LIHC cohorts. We performed CISH staining via semiquantitative measurement of GIHCG in the SYSU HCC cohort. Within all the SYSU HCC samples, 58 (58.0%) displayed high GIHCG expression. The relationship between the GIHCG expression level and HCC clinicopathological features is shown in Table 1. The results showed that GIHCG expression was significantly associated with tumor numbers (P < 0.05), tumor size (P < 0.05), AFP level and tumor thrombus (P < 0.05).
Table 1
Association Between GIHCG Expression and the Clinicopathological Characteristics of Patients in SYSU HCC Cohort
Variables
Total (n)
Low Expression (n=42)
High Expression (n=58)
P value
Gender
0.126
Male
86
33 (78.6)
53 (91.4)
Female
14
9 (21.4)
5 (8.6)
Age, years
0.126
<50
49
26 (61.9)
23 (39.7)
≥50
51
16 (38.1)
35 (60.3)
Tumor nodule number
0.006
Solitary
59
32 (76.2)
27 (46.6)
Multiple (≥2)
41
10 (23.8)
31 (53.4)
Tumor size (cm)
<0.001
≤5
50
31 (73.8)
19 (32.8)
>5
50
11 (26.2)
39 (67.2)
Cancer embolus
<0.001
Absence
66
35 (83.3)
31 (53.4)
Presence
34
7 (16.7)
27 (46.6)
AFP, ng/mL
<0.001
<200
52
31 (73.8)
21 (36.2)
≥200
48
11 (26.2)
37 (63.8)
HBsAg
0.724
Negative
42
19 (45.2)
23 (39.7)
Positive
58
23 (54.8)
35 (60.3)
TNM stage
0.098
Stage I & II
51
26 (61.9)
25 (43.1)
Stage III & IV
49
16 (38.1)
33 (56.9)
Differentiation grade
0.234
Well
66
31 (73.8)
35 (60.3)
Poor
34
11 (26.2)
23 (39.7)
Association Between GIHCG Expression and the Clinicopathological Characteristics of Patients in SYSU HCC CohortIn the TCGA-LIHC cohort, as shown in Figure 2, GIHCG overexpression was positively associated with the TNM stage, histological grade and T stage (Figure 2A–C); however, other clinicopathologic factors, such as gender, AFP levels and status of hepatitis virus infection, had no significant association with GIHCG expression (data not shown). In addition, GIHCG high expression was significantly correlated with TP53 mutation (Figure 2D).
Figure 2
Correlation of GIHCG expression with clinical and pathological characteristics. GIHCG expression profiles in different TNM stages (A), histological grades (B), T stages (C) and TP53 mutation status (D). GIHCG expression is significantly correlated to tumor stage, tumor grade, tumor size and TP53 mutation in HCC.
Correlation of GIHCG expression with clinical and pathological characteristics. GIHCG expression profiles in different TNM stages (A), histological grades (B), T stages (C) and TP53 mutation status (D). GIHCG expression is significantly correlated to tumor stage, tumor grade, tumor size and TP53 mutation in HCC.
High GIHCG Expression Predicts Poor Survival Among Patients with HCC
To verify the prognostic value of GIHCG expression in HCC, we performed a Kaplan-Meier analysis in the SYSU HCC cohort and TCGA-LIHC cohort. In the SYSU HCC cohort, the median OS time of the low-GIHCG-expression group was significantly longer than that of the high-GIHCG-expression group (HR=0.269, 95% CI: 0.133–0.544; P<0.001; Figure 3A), and the median DFS of the low-GIHCG-expression group was significantly longer than that of the high-GIHCG-expression group (HR=0.262, 95% CI: 0.129–0.530; P<0.001; Figure 3B). Consistent with the SYSU HCC cohort, in the TCGA-LIHC cohort, the median OS time of the low-GIHCG-expression group was significantly longer than that of the high-GIHCG-expression group (HR=0.537, 95% CI: 0.378–0.763; P<0.001; Figure 3C), and the median DFS time of the low-GIHCG-expression group was significantly longer than that of high-GIHCG-expression group (HR=0.614, 95% CI: 0.457–0.825; P<0.001; Figure 3D).
Figure 3
High GIHCG expression is associated with poor prognosis. Kaplan-Meier curves show overall survival (A) and disease-free survival (B) curves for 58 GIHCG high-expression patients and 42 GIHCG low-expression patients in the SYSU HCC cohort. Kaplan-Meier plots show overall survival (C) and disease-free survival (D) curves for 185 GIHCG high-expression patients and 186 GIHCG high-expression patients from the TCGA-LIHC cohort.
High GIHCG expression is associated with poor prognosis. Kaplan-Meier curves show overall survival (A) and disease-free survival (B) curves for 58 GIHCG high-expression patients and 42 GIHCG low-expression patients in the SYSU HCC cohort. Kaplan-Meier plots show overall survival (C) and disease-free survival (D) curves for 185 GIHCG high-expression patients and 186 GIHCG high-expression patients from the TCGA-LIHC cohort.Moreover, univariate and multivariate Cox regression analysis was performed to determine the relationships between risk factors and prognosis in the two HCC cohorts. The results revealed that tumor nodule number, tumor size, TNM stage, histological grade, tumor thrombi, AFP level and GIHCG expression level were significantly associated with OS and DFS in the SYSU cohort. In the TCGA-LIHC cohort, TNM stage, T stage, hepatitis virus infection and GIHCG expression were significantly associated with OS and DFS (Tables 2 and 3).
Table 2
Univariate and Multivariate Cox Regression Analyses of Risk Factors Associated with Overall Survival in SYSU HCC and TCGA-LIHC Cohort
Variables
Univariate Analysis
Multivariate Analysis
SYSU HCC cohort (n=100)
HR (95% CI for HR)
P value
HR (95% CI for HR)
P value
Gender
3.19 (0.985–10.3)
0.053
Age
1.21 (0.679–2.15)
0.520
TNM stage
2.95 (1.61–5.41)
0.000
2.245 (1.14–4.421)
0.019
HBV infection
1.55 (0.846–2.83)
0.157
Tumor number
0.498 (0.28–0.884)
0.017
1.182 (0.591–2.364)
0.636
Tumor size
2.65 (1.42–4.95)
0.002
1.381 (0.708–2.694)
0.344
Tumor thrombus
3.27 (1.84–5.82)
< 0.001
2.044 (1.023–4.085)
0.043
AFP level
2.11 (1.17–3.79)
0.013
0.931 (0.462–1.874)
0.841
Differentiation grade
0.506 (0.285–0.898)
0.020
0.784 (0.426–1.443)
0.435
GIHCG expression level
0.269 (0.133–0.544)
< 0.001
0.333 (0.146–0.756)
0.009
TCGA LIHC cohort (n=345)
HR (95% CI for HR)
P value
HR (95% CI for HR)
P value
Gender
0.776 (0.531–1.13)
0.188
Age
1.19 (0.748–1.9)
0.459
TNM stage
2.5 (1.72–3.63)
< 0.001
0.769 (0.105–5.656)
0.797
T stage
2.52 (1.73–3.67)
< 0.001
2.893 (0.393–21.299)
0.297
HBV/HCV infection
2 (1.38–2.91)
< 0.001
1.979 (1.356–2.887)
< 0.001
AFP level
0.984 (0.597–1.62)
0.949
Differentiation grade
1.14 (0.784–1.66)
0.490
GIHCG expression
0.518 (0.356–0.754)
< 0.001
0.551 (0.376–0.807)
0.002
Table 3
Univariate and Multivariate Cox Regression Analyses of Risk Factors Associated with Disease-Free Survival in SYSU HCC and TCGA-LIHC Cohort
Variables
Univariate Analysis
Multivariate Analysis
SYSU HCC cohort (n=100)
HR (95% CI for HR)
P value
HR (95% CI for HR)
P value
Gender
2.28 (0.816–6.39)
0.116
Age
1.11 (0.621–1.98)
0.728
TNM stage
3.26 (1.76–6.06)
< 0.001
2.465 (1.233–4.926)
0.011
HBV infection
1.54 (0.838–2.82)
0.164
Tumor number
0.396 (0.221–0.711)
0.002
0.759 (0.385–1.498)
0.427
Tumor size
3.41 (1.77–6.59)
< 0.001
1.924 (0.966–3.835)
0.063
Tumor thrombus
2.96 (1.65–5.3)
< 0.001
1.379 (0.704–2.701)
0.349
AFP level
2.36 (1.3–4.3)
0.005
0.915 (0.44–1.904)
0.813
Differentiation grade
0.488 (0.273–0.87)
0.015
0.86 (0.447–1.653)
0.65
GIHCG expression level
0.262 (0.129–0.53)
< 0.001
0.365 (0.161–0.828)
0.016
TCGA LIHC cohort (n=345)
HR (95% CI for HR)
P value
HR (95% CI for HR)
P value
Gender
0.933 (0.673–1.29)
0.679
Age
0.991 (0.682–1.44)
0.963
TNM stage
2.21 (1.6–3.07)
< 0.001
1.312 (0.318–5.415)
0.708
T stage
2.21 (1.59–3.07)
< 0.001
1.573 (0.378–6.542)
0.534
HBV/HCV infection
1.45 (1.07–1.98)
0.018
1.379 (1.009–1.884)
0.044
AFP level
0.971 (0.651–1.45)
0.886
Differentiation grade
1.15 (0.841–1.57)
0.380
GIHCG expression
0.619 (0.454–0.844)
0.002
0.666 (0.486–0.911)
0.011
Univariate and Multivariate Cox Regression Analyses of Risk Factors Associated with Overall Survival in SYSU HCC and TCGA-LIHC CohortUnivariate and Multivariate Cox Regression Analyses of Risk Factors Associated with Disease-Free Survival in SYSU HCC and TCGA-LIHC CohortThese significant prognostic risk factors were included in the multivariate Cox regression analysis. The results showed that in the SYSU cohort, tumor thrombi (HR=2.044; 95% CI: 1.023–4.085; P=0.043), TNM stage (HR=2.245; 95% CI: 1.14–4.421; P=0.019) and GIHCG expression level (HR=0.333; 95% CI: 0.146–0.756; P= 0.009) were independent prognostic factors for OS (Table 2, ). Further, TNM stage (HR=2.465; 95% CI: 1.233–4.926; P=0.011) and GIHCG expression level (HR=0.365; 95% CI: 0.161–0.828; P= 0.016) were independent prognostic factors for DFS (Table 3, ). In the TCGA-LIHC cohort, hepatitis virus infection (HR=1.979; 95% CI: 1.356–2.887; P< 0.001) and GIHCG expression (HR=0.551; 95% CI: 0.376–0.807; P=0.002) were independent prognostic factors for OS (Table 2, ) and hepatitis virus infection (HR=1.379; 95% CI: 1.009–1.884; P=0.044) and GIHCG expression (HR=0.666; 95% CI: 0.486–0.911; P=0.011) were independent prognostic factors for DFS (Table 3, ).To test the robustness of GIHCG expression for the prediction of HCC survival, we validated the prognostic significance in different clinicopathological subgroups. The result of the subgroups analysis showed that GIHCG expression can also serve as a prognostic factor for OS and DFS in most subgroups (Figure 4, ). In summary, the above results show that high GIHCG expression is positively correlated with poor survival among HCC patients.
Figure 4
Hazard ratio values for subgroup analyses of GIHCG expression in HCC patients. Forest plots summarizing the hazard ratio (HR) between GIHCG expression and survival in HCC patients as analyzed by the different clinicopathological subgroups. (A) HR values for OS in the SYSU HCC cohort. (B) HR values for DFS in the SYSU HCC cohort. (C) HR values for OS in the TCGA-LIHC cohort. (D) HR values for DFS in the TCGA-LIHC cohort.
Hazard ratio values for subgroup analyses of GIHCG expression in HCC patients. Forest plots summarizing the hazard ratio (HR) between GIHCG expression and survival in HCC patients as analyzed by the different clinicopathological subgroups. (A) HR values for OS in the SYSU HCC cohort. (B) HR values for DFS in the SYSU HCC cohort. (C) HR values for OS in the TCGA-LIHC cohort. (D) HR values for DFS in the TCGA-LIHC cohort.
Functional Enrichment Analysis of High GIHCG Expression
To evaluate potential key molecules and pathways associated with GIHCG expression, we performed GO analysis and GSEA between high- and low-expression groups. The top enriched GO terms in biological processes (BP), cellular components (CC) and molecular function (MF) are shown in Figure 5A and . GO analysis revealed that “RNA catabolic process”, “organic acid catabolic process” and “spindle organization” were the main terms involved in BP; “spindle”, “chromosomal region” and “cytosolic ribosome” were significantly enriched in CC; and “coenzyme binding”, “structural constituent of ribosome” and “catalytic activity, acting on DNA” were the top enriched terms in MF. GSEA enrichment analysis showed that the significantly enriched hallmark terms included “G2M checkpoint”, “E2F targets”, “spermatogenesis”, “MYC targets” and “DNA repair” (Figure 5B and C, ), which were associated with the proliferation of cancer cells. The results of the KEGG pathway analysis showed that 77 pathways were enriched (p<0.05), among which “Cell cycle”, “Ribosome”, “Fanconi anemia pathway”, “Spliceosome” and “DNA replication” were highly associated with high GIHCG expression (Figure 5D, ).
Figure 5
Gene set enrichment analysis between GIHCG high- and low-expression groups. (A) Gene Ontology enrichment analysis includes biological processes, cellular components and molecular function. (B) Enriched hallmark terms correlated to GIHCG expression. (C) Representative hallmark terms enrichment plot. (D) GSEA-based Kyoto Encyclopedia of Genes and Genomes (KEGG) analysis revealed the enriched activated and suppressed pathways between the GIHCG high- and low- expression groups. The circle size indicates the counts of enriched terms and the color represents the P value.
Gene set enrichment analysis between GIHCG high- and low-expression groups. (A) Gene Ontology enrichment analysis includes biological processes, cellular components and molecular function. (B) Enriched hallmark terms correlated to GIHCG expression. (C) Representative hallmark terms enrichment plot. (D) GSEA-based Kyoto Encyclopedia of Genes and Genomes (KEGG) analysis revealed the enriched activated and suppressed pathways between the GIHCG high- and low- expression groups. The circle size indicates the counts of enriched terms and the color represents the P value.
Correlations Between GIHCG Expression and Immune Cell Infiltration and Markers
Because tumor microenvironmental alterations correlate with a poor prognosis among HCC patients,31–34 we investigated the correlation between the GIHCG expression level and the proportions of TIICs in the TCGA-LIHC cohort. Figure 6A shows the relative abundance of 28 TICs populations for 345 HCC tumor samples in the TCGA-LIHC cohort. The results of the ssGSEA showed that antitumor immune cells, including central memory T cells, effector memory T cells, type 1 T helper cells, type 17 T helper cells, CD56bright natural killer cells and natural killer cells, were significantly lower in the high-GIHCG-expression group than the low-GIHCG-expression group (Figure 6B and ). Other TIICs, such as activated CD4 regulatory T cells, CD56dim natural killer cells, neutrophils, plasmacytoid dendritic cells, eosinophils and monocytes, also significantly differed between the high-GIHCG- and low-GIHCG-expression groups. Specifically, GIHCG expression had a significant negative correlation with the proportion of TIICs, including central memory CD4/8 T cells, effector memory CD4/8 T cells, type 1/17 T helper cells, immature dendritic cells, plasmacytoid dendritic cells, neutrophils, eosinophils and monocytes (Figure 7A and ). We next explored the correlation between GIHCG expression and the marker of genes of different immune cell subsets (Figure 7B and ). The GIHCG expression level was positively correlated with the gene markers of T cells (CD8A, CD8B, CD2, CD3D, CD3E), T cell exhaustion (LAG3, CTLA4, GZMB), B cells (CD19), macrophages (CD68, IRF5, VSIG4, MS4A4A), dendritic cells (HLA-DPB1, HLA-DQB1, HLA-DRA, ITGAX), natural killer cells (KIR2DL4, NCAM1) and neutrophils (ITGAM). These results suggest that GIHCG expression correlates with the tumor microenvironment in HCC.
Figure 6
Visualization of the relationships between infiltrating immune cells and GIHCG expression. (A) A heatmap shows the relative infiltration of immune cell proportion for all TCGA-LIHC tumor patients (normalized ssGESA). GIHCG value, age, gender, tumor stage, and AFP are shown as patient annotations. (B) Comparison of infiltration immune cells differences in the GIHCG high- and low-expression groups. Symbol “****”, “***”, “**”, “*”, “ns” indicate a P value <0.0001, <0.001, <0.01, <0.05 and >0.05, respectively.
Figure 7
Correlation of GIHCG expression and immune cell infiltration in HCC. (A) Spearman correlation analysis between GIHCG expression and the proportion of immune cell infiltration. (B) Correlation analysis between GIHCG expression and immune cell marker genes. Only significant results were plotted. The circle size indicates the absolute value of R and the color represents the P value.
Visualization of the relationships between infiltrating immune cells and GIHCG expression. (A) A heatmap shows the relative infiltration of immune cell proportion for all TCGA-LIHC tumor patients (normalized ssGESA). GIHCG value, age, gender, tumor stage, and AFP are shown as patient annotations. (B) Comparison of infiltration immune cells differences in the GIHCG high- and low-expression groups. Symbol “****”, “***”, “**”, “*”, “ns” indicate a P value <0.0001, <0.001, <0.01, <0.05 and >0.05, respectively.Correlation of GIHCG expression and immune cell infiltration in HCC. (A) Spearman correlation analysis between GIHCG expression and the proportion of immune cell infiltration. (B) Correlation analysis between GIHCG expression and immune cell marker genes. Only significant results were plotted. The circle size indicates the absolute value of R and the color represents the P value.
Discussion
Although many advances have been achieved in the management and treatment of HCC, HCC ranks as the third leading cause of tumor-related death worldwide. Identification of meaningful biomarkers associated with HCC progression are still needed. GIHCG acts as an onco-long noncoding RNA in many types of cancer, but its prognostic value, biological functions, and relationship with the tumor microenvironment have not been fully assessed. In this study, we analyzed the prognostic value and potential biological functions of GIHCG in HCC.In the current study, we systematically analyzed the GIHCG expression pattern in HCC, and our results demonstrated that GIHCG expression is significantly increased in HCC tissues compared with normal liver tissues. High GIHCG expression was positively correlated with poor clinicopathologic factors and poor survival among HCC patients. GIHCG expression was an independent prognostic factor of OS and DFS for HCC patients. These results were validated in both public databases and local HCC cohorts, and the results were consistent with those of a previous study.13Additionally, bioinformatics results revealed that high GIHCG levels activated proliferation pathways in HCC, which is consistent with the findings of Sui et al showing that GIHCG promotes HCC cell proliferation and metastasis in vivo by silencing miR-200b/a/429.13 The GO terms suggested that GIHCG overexpression could alter the RNA catabolic process in HCC. KEGG pathway analysis revealed that “Cell cycle” and “DNA replication” were linked to GIHCG overexpression. Previous studies have supported that lncRNAs are involved in vital biological processes, such as the cell cycle and DNA repair.35,36 GSEA showed that GIHCG overexpression is involved in pivotal hallmarks of proliferation, including “G2M checkpoint”, “E2F targets”, “MYC targets” and “DNA repair”. LncRNAs are reported to be regulated by MYC and affect MYC expression in different cancer types, which could influence cancer cell viability and proliferation.37,38
GIHCG may also be involved in these key pathways, but further studies are needed to confirm this hypothesis.LncRNAs are directly and indirectly involved in the crosstalk between immune cells and tumor cells, and dysregulated lncRNA expression in these cells could drive tumorigenesis in HCC.39–42 Our results demonstrate that GIHCG expression is significantly correlated with the tumor microenvironment in HCC. High GIHCG expression negatively correlated with the infiltration of memory CD4+ and CD8+ T cells, natural killer cells, macrophages, dendritic cells, neutrophils and monocytes. By analyzing the correlation between GIHCG and immune cell markers, we found that increased GIHCG expression was significantly associated with the abovementioned immune cells. This finding suggests that GIHCG may play a role in regulating the microenvironment in HCC and influences HCC patient prognosis. This hypothesis and the detailed mechanism warrant further investigation.
Conclusion
In conclusion, our results provide evidence that GIHCG expression is increased in HCC and leads to a poor prognosis, making it possible for GIHCG to be considered a novel prognostic biomarker for HCC. In addition, GIHCG may affect the infiltration and function of immune cells in HCC.