Literature DB >> 29760565

Genome-wide association pathway analysis to identify candidate single nucleotide polymorphisms and molecular pathways associated with TP53 expression status in HBV-related hepatocellular carcinoma.

Xiwen Liao1, Long Yu1, Xiaoguang Liu1,2, Chuangye Han1, Tingdong Yu1, Wei Qin1, Chengkun Yang1, Guangzhi Zhu1, Hao Su1, Tao Peng1.   

Abstract

BACKGROUND: The aim of this investigation was to identify candidate single nucleotide polymorphisms (SNPs) and molecular pathways associated with tumor protein p53 (TP53) expression status in hepatitis B virus (HBV)-related hepatocellular carcinoma (HCC), clarify their potential mechanisms, and generate SNP-to-gene to pathway hypothesis.
MATERIALS AND METHODS: Identify candidate Causal SNPs and Pathways (ICSNPathway) was used to perform pathway analysis based on the results of our previous genome-wide association study of TP53 expression status in 387 HBV-related HCC patients.
RESULTS: Through the ICSNPathway analysis, we identified 18 candidate SNPs and 10 candidate pathways that are associated with TP53 expression status in HBV-related HCC. The strongest mechanism involved the modulation of major histocompatibility complex, class II, DP beta 1 (human leukocyte antigen [HLA]-DPB1-rs1042153), major histocompatibility complex, class II, DQ beta 1 (HLA-DQB1-rs1130399, HLA-DQB1-rs1049056, HLA-DQB1-rs1049059, and HLA-DQB1-rs1049060), and major histocompatibility complex, class II, DR beta 1 (HLA-DRB1-rs35445101). SNPs consequently affected regulatory roles in all the candidate pathways except hematopoietic cell lineage pathways. Association analysis using the GSE14520 data set, Gene Multiple Association Network Integration Algorithm, and Search Tool for the Retrieval of Interacting Genes/Proteins suggests that all genes of the candidate SNPs were associated with TP53. Survival analysis showed that the collagen type VI alpha 3 chain (COL6A3) rs111231885 and COL6A3-rs113155945 and COL6A3 block 4 CC haplotypes with TP53 negative status may have protective effects in HBV-related HCC patients after hepatectomy.
CONCLUSION: Our pathway analysis identified 18 candidate SNPs and 10 candidate pathways that were associated with TP53 expression status in HBV-related HCC. Among these candidate SNPs, the genetic variation of COL6A3 may be a potential prognostic biomarker of HBV-related HCC.

Entities:  

Keywords:  TP53; genome-wide association study; hepatitis B virus; hepatocellular carcinoma; pathway analysis

Year:  2018        PMID: 29760565      PMCID: PMC5937480          DOI: 10.2147/CMAR.S163209

Source DB:  PubMed          Journal:  Cancer Manag Res        ISSN: 1179-1322            Impact factor:   3.989


Introduction

Liver cancer is the third leading cause of cancer death in China, with an age-standardized 5-year relative survival rate of 10.1%.1,2 The majority of liver cancer cases are hepatocellular carcinoma (HCC).3 A high prevalence of hepatitis B virus (HBV) infection and aflatoxin B1 exposure are the main factors of HCC in the Guangxi province of China.4–6 Previous studies have demonstrated that the tumor protein p53 (TP53) mutation is frequently found in HBV-related HCC in patients of Guangxi province.7–10 Therefore, in the Guangxi region, there is a representative population in which the associations between HBV infection and the TP53 gene in HCC can be investigated. Hepatocarcinogenesis is driven by the interaction of genetic and environmental factors.11,12 Genome-wide association studies (GWAS) can be used to identify associations between specific single nucleotide polymorphisms (SNPs) and complex diseases or other traits.13 The roles of the corresponding genes or proteins in the context of the pathway might be altered by trait-related SNPs. Identify candidate Causal SNPs and Pathways (ICSNPathway) is an analytical framework for the comprehensive interpretation of GWAS data by integrating linkage disequilibrium (LD) analysis, functional SNP annotation, and pathway-based analysis (PBA) and can be used to derive the mechanism hypothesis of SNP→gene→pathway(s) for complex disease studies, including cancer.14 In addition, ICSNPathway also is a tool based on PBA algorithm, which is a method for secondary excavation of GWAS results based on prior biological knowledge on gene function and biological metabolic pathways.14 By using the PBA algorithm, more information about the pathway and gene sets with same functions which are associated with the diseases or traits from GWAS results could be obtained. Our previous study has identified several SNPs associated with TP53 expression status in HBV-related HCC in patients of Guangxi by using the GWAS approach.15 In the present study, we further investigated candidate SNPs and molecular pathways associated with TP53 expression status in HBV-related HCC by using the ICSNPathway web server based on the result of our previous GWAS.

Materials and methods

Study population and GWAS data

Our study was approved by the Ethics Committee of the First Affiliated Hospital of Guangxi Medical University with an ethics approval number of 2015 (KY-E-032).15 Written informed consent was obtained from all the participants enrolled in the study. The primary GWAS data set was extracted from our previous study.15 Clinicopathological characteristics and prognosis of the HBV-related HCC patients, genotyping, quality control, and GWAS analysis methods have been described and published in our earlier article.15 A total of 403 patients with serum tests that were HBV surface antigen positive and newly diagnosed with HCC by pathological examination in the First Affiliated Hospital of Guangxi Medical University between 2001 and 2013 were included.15 TP53 staining in HCC tumor tissues was detected by immunohistochemistry.15 The SNPs were genotyped by an Illumina Human Exome BeadChip 12 v1-1 system (Illumina Inc, San Diego, CA, USA). Quality control standards were set as follows: samples were excluded if they had 1) an overall genotyping rate of <95%; 2) ambiguous gender; 3) genome-wide identity by-descent >0.1875; 4) outliers in principal component analysis (PCA) for ancestry and population stratification. SNPs had to meet the following criteria: 1) a call rate of >95%; 2) a Hardy–Weinberg equilibrium P>1×10−6; 3) a minor allele frequency >0.01.15 PCA for ancestry and population stratification suggest that no or mild population stratification was found in the current study population, and similar results were observed in our previous study.15 A total of 387 patients with 28,952 SNPs passed the quality control filters and were included in further investigations.

Identification of candidate SNPs and pathways

The ICSNPathway (http://icsnpathway.psych.ac.cn, accessed February 20, 2017) web server contains a two-stage analysis: 1) preselect candidate SNPs by LD analysis and functional SNP annotation based on the most significant SNPs and 2) annotate the biological mechanisms for the preselected candidate SNPs by using PBA.14 A complete list of GWAS SNP P-values was input for ICSNPathway analysis. The parameters used in the ICSNPathway were 1) threshold to specify the most significant SNPs: P-value <1×10−2; 2) Hap-Map population: Han Chinese in Beijing, China; 3) LD cut-off: r2>0.8; 4) distance for searching LD neighborhoods: 200 kb; 5) rule of mapping SNPs to genes: 500 kb upstream and downstream of gene; 6) pathway/gene set database: Kyoto Encyclopedia of Genes and Genomes; 7) number of genes in each pathway/gene set: minimum 5 and maximum 100; and 8) false discovery rate cutoff for PBA: 0.1.

Association analysis

Based on the results of the ICSNPathway analysis, haplotype analysis among the candidate SNPs was calculated using Haploview version 4.2 (Broad Institute of MIT and Harvard, Cambridge, MA, USA).16 Regional LD plots of the candidate SNPs were generated by SNP Annotation and Proxy Search (SNAP) (http://archive.broadinstitute.org/mpg/snap, accessed February 20, 2017), a tool used for the identification and annotation of proxy SNPs using HapMap.17 Genotype and haplotype distribution of the candidate SNPs in different TP53 expression status groups were determined using a binary logistic regression model. Co-expression analysis of the TP53 gene and the genes of candidate SNPs was performed using GSE14520, a Chinese HBV-related HCC mRNA expression chip data set obtained from Gene Expression Omnibus (https://www.ncbi.nlm.nih.gov/geo, accessed February 20, 2017).18,19 In order to eliminate the batch effect of the expression chip, we only included the Affymetrix HT Human Genome U133A Array data set (Thermo Fisher Scientific, Waltham, MA, USA) of GSE14520 in the co-expression analysis. Gene Multiple Association Network Integration Algorithm20,21 (GeneMANIA; http://genemania.org, accessed February 20, 2017) and Search Tool for the Retrieval of Interacting Genes/Proteins22,23 (STRING; http://string.embl.de/, accessed February 20, 2017) web servers were used for investigating the gene–gene and protein–protein interactions (PPIs) among genes of the candidate SNPs, respectively.

Survival analysis

We further analyzed the association of the candidate SNPs and clinical outcomes. The TP53 expression status and the candidate SNP interactions were analyzed using a joint effects survival analysis. In addition, we also analyzed haplotypes of the candidate SNPs.

Statistical analysis

Pearson correlation coefficient was used to assess co-expression correlation. The odds ratio (OR) and the corresponding 95% confidence interval (CI) of the binary logistic regression model were used to estimate the relative risk of TP53 expression status in HBV-related HCC. Univariate analysis between clinical features and overall survival (OS) were studied using the Kaplan–Meier method with the log-rank test. Cox proportional hazards regression analysis was used to calculate the crude and adjusted hazard ratio (HR) and 95% CI in univariate and multivariate analyses, with adjustment for age, gender, race, body mass index (BMI), smoking status, drinking status, Barcelona Clinic Liver Cancer (BCLC) stage, cirrhosis, radical resection, antiviral therapy, status of tumor capsule, regional invasion, and portal vein tumor thrombus (PVTT). A value of P<0.05 was considered statistically significant. All the statistical analyses were conducted with SPSS version 20.0 software (IBM Corporation, Armonk, NY, USA).

Results

Candidate SNPs and pathways

Using the P-values of the 28,952 GWAS SNPs as input data, the ICSNPathway identified 18 SNPs as candidate SNPs (Table 1) and 10 pathways as candidate pathways (Table 2) that were associated with TP53 expression status in HBV-related HCC. All of the candidate SNPs were non-synonymous coding SNPs, and the alteration of four SNPs (desmoglein 3 [DSG3]-rs16961975, keratin 35 (KRT35)-rs2071601, KRT35-rs743686, and keratin 36 (KRT36)-rs2301354) was deleterious. The strongest mechanism involved the modulation of major histocompatibility complex, class II, DP beta 1 (human leukocyte antigen [HLA]-DPB1-rs1042153), major histocompatibility complex, class II, DQ beta 1 (HLA-DQB1-rs1130399, HLA-DQB1-rs1049056, HLA-DQB1-rs1049059, and HLA-DQB1-rs1049060) and major histocompatibility complex, class II, DR beta 1 (HLA-DRB1-rs35445101), consequently affecting their regulatory roles in all the candidate pathways except the hematopoietic cell lineage pathway. The second strongest hypothetical biological mechanism was that SNPs of major histocompatibility complex, class I, C (HLA-C-rs1131096 and HLA-C-rs1130838) influence the regulatory role of cell adhesion molecules (CAMs), autoimmune thyroid disease, allograft rejection, antigen processing and presentation, type I diabetes mellitus, and graft-versus-host disease pathways. Other multiple mechanisms presented in Tables 1 and 2 indicated that alterations in candidate SNPs of collagen type VI alpha 3 chain (COL6A3)-rs111231885 and COL6A3-rs113155945, desmoglein 3 (DSG3)-rs16961975, keratin 32 (KRT32)-rs3744786), keratin 35 (KRT35)-rs2071601 and KRT35-rs743686, and keratin 36 (KRT36)-rs2301354 affect their regulatory roles in cell communication, whereas legumain (LGMN)-rs118128989 alterations affected the antigen processing and presentation pathway and interleukin 6 receptor (IL6R)-rs2228145, interleukin 7 receptor (IL7R)-rs6897932 alterations affected the hematopoietic cell lineage pathway.
Table 1

Candidate SNPs identified from ICSNPathway analysis

Candidate SNPFunctional classGeneCandidate pathwaya−log10(P)bIn LD withr2D’−log10(P)c in original GWAS
rs1042153non_synonymous_codingHLA-DPB11,2,3,4,5,7,9,102.45rs10421532.45
rs1130399non_synonymous_codingHLA-DQB11,2,3,4,5,7,9,103.153rs11303993.153
rs1049056non_synonymous_codingHLA-DQB11,2,3,4,5,7,9,102.433rs10490562.433
rs1049059non_synonymous_codingHLA-DQB11,2,3,4,5,7,9,102.357rs10490592.357
rs1049060non_synonymous_codingHLA-DQB11,2,3,4,5,7,9,102.467rs10490602.467
rs35445101non_synonymous_codingHLA-DRB11,2,3,4,5,7,8,9,103.591rs354451013.591
rs1131096non_synonymous_codingHLA-C3,4,5,7,9,102.097rs11310962.097
rs1130838non_synonymous_codingHLA-C3,4,5,7,9,102.267rs11308382.267
rs111231885non_synonymous_codingCOL6A362.541rs1112318852.541
rs113155945non_synonymous_codingCOL6A362.044rs1131559452.044
rs16961975non_synonymous_coding (deleterious)DSG362.12rs169619752.12
rs3744786non_synonymous_codingKRT326rs23013540.8470.9453.583
rs2071601non_synonymous_coding (deleterious)KRT356rs23013540.90113.583
rs743686non_synonymous_coding (deleterious)KRT3563.209rs7436863.209
rs2301354non_synonymous_coding (deleterious)KRT3663.583rs23013543.583
rs118128989non_synonymous_codingLGMN72.084rs1181289892.084
rs2228145non_synonymous_codingIL6R82.081rs22281452.081
rs6897932non_synonymous_codingIL7R82.004rs68979322.004

Notes:

The number indicates the index of pathways (listed in Table 2), which are ranked by their statistical significance (FDR).

−log10(P) for candidate SNP in original GWAS. “–” denotes that this SNP is not represented in the original GWAS.

−log10(P) for the SNP (which candidate SNP is in LD with) in original GWAS.

Abbreviations: SNP, single nucleotide polymorphism; ICSNPathway, Identify candidate Causal SNPs and Pathways; LD, linkage disequilibrium; GWAS, genome-wide association studies; HLA-DPB1, major histocompatibility complex, class II, DP beta 1; HLA-DQB1, major histocompatibility complex, class II, DQ beta 1; HLA-DRB1, major histocompatibility complex, class II, DR beta 1; HLA-C, major histocompatibility complex, class I, C; COL6A3, collagen type VI alpha 3 chain; DSG3, desmoglein 3; KRT32, keratin 32; KRT35, keratin 35; KRT36, keratin 36; LGMN, legumain; IL6R, interleukin 6 receptor; IL7R, interleukin 7 receptor; FDR, false discovery rate.

Table 2

Candidate pathways identified from ICSNPathway analysis

IndexCandidate pathwayDescriptionNominal P-valueFDR
1hsa05310Asthma0.0050.026
2hsa05322Systemic lupus erythematosus0.0050.026
3hsa04514CAMs0.0340.056
4hsa05320Autoimmune thyroid disease0.0280.058
5hsa05330Allograft rejection0.0280.058
6hsa01430Cell communication0.0240.058
7hsa04612Antigen processing and presentation0.0410.06
8hsa04640Hematopoietic cell lineage0.0090.069
9hsa04940Type I diabetes mellitus0.0320.077
10hsa05332Graft-versus-host disease0.0320.077

Abbreviations: FDR, false discovery rate; CAMs, cell adhesion molecules; ICSNPathway, Identify candidate Causal SNPs and Pathways.

In the output of candidate SNPs, KRT32-rs3744786 and KRT35-rs2071601 were not present in the original GWAS result. So, only 16 candidate SNPs and their corresponding genes were included in further association and survival analysis. Four haplotype blocks were detected in the haplotypes analysis (block 1 pairwise r2=0.992, constituted by HLA-C-rs1130838 and HLA-C-rs1131096; block 2 pairwise r2=0.121–1.0, constituted by HLA-DRB1-rs35445101, HLA-DQB1-rs1130399, HLA-DQB1-rs1049060, HLA-DQB1-rs1049059, and HLA-DQB1-rs1049056; block 3 pairwise r2=0.973, constituted by KRT35-rs743686 and KRT36-rs2301354; block 4 pairwise r2=0.755, constituted by COL6A3-rs111231885 and COL6A3-rs113155945; Figure 1A). Distribution of candidate SNPs in different TP53 expression status patients is shown in Table S1. After adjusting for age, gender, race, BMI, smoking status, drinking status, BCLC stage, cirrhosis, radical resection, antiviral therapy, status of tumor capsule, regional invasion, and PVTT, all the SNPs were significantly associated with TP53 expression status in HBV-related HCC. Association analysis in four haplotypes demonstrated that GT in block 1, AGTCC and GATCC/other haplotypes in block 2, GA/other haplotypes in block 3, and TT/other haplotypes in block 4 had significantly decreased risk of TP53 expression status in HBV-related HCC, compared to patients with TC, AGAGA, AG, and CC (Table 3), respectively.
Figure 1

Haplotype association of the candidate SNPs and co-expression heat map for the TP53 gene and corresponding genes of the candidate SNPs.

Notes: (A) Patterns of LD plots for 16 candidate SNPs. (B) Co-expression heat map between TP53 and corresponding genes of the candidate SNPs.

Abbreviations: SNP, single nucleotide polymorphism; TP53, tumor protein p53; LD, linkage disequilibrium.

Table 3

Haplotype distribution of the candidate SNPs in patients with different TP53 expression status

HaplotypesTP53 negative (2n=308)TP53 positive (2n=466)Crude OR (95% CI)Crude P-valueAdjusted OR (95% CI)Adjusted P-valuea
Block 1
TC22938711
GT79790.592 (0.416–0.841)0.0030.622 (0.432–0.895)0.011
Block 2
AGAGA11823811
AGTCC1161670.714 (0.516–0.987)0.0410.696 (0.494–0.980)0.038
GATCC+other haplotypes74610.409 (0.273–0.612)<0.0010.395 (0.259–0.603)<0.001
Block 3
AG15629111
GA+other haplotypes1521750.617 (0.461–0.826)0.0010.582 (0.427–0.792)0.001
Block 4
CC27944611
TT+other29200.431 (0.239–0.777)0.0050.376 (0.201–0.703)0.002
haplotypes

Notes:

Adjusted for age, gender, race, body mass index, smoking status, drinking status, BCLC stage, cirrhosis, radical resection, antiviral therapy, status of tumor capsule, regional invasion, and PVTT.

Abbreviations: SNP, single nucleotide polymorphism; TP53, tumor protein p53; BCLC, Barcelona Clinic Liver Cancer; PVTT, portal vein tumor thrombus.

Co-expression analysis in HBV-related HCC expression profile data set from the GSE14520 cohort revealed that the TP53 gene has a significantly weak negative correlation with HLA-C (r=−0.322, P<0.001), IL6R (r=−0.132, P=0.007), and KRT36 (r=−0.186, P<0.001), whereas it had a positive correlation with COL6A3 (r=0.238, P<0.001), HLA-DPB1 (r=0.115, P=0.018), HLA-DQB1 (r=0.176, P=0.0003), and LGMN (r=0.157, P=0.001) at the mRNA level in HBV-related HCC of GSE14520; the co-expression heat map is shown in Figure 1B. The remaining genes were not significantly correlated with TP53 at the mRNA level. Gene and gene co-expression interaction networks constructed by GeneMANIA demonstrated that all the genes of the candidate SNPs exist in a complex gene–gene co-expression interaction network and are directly or indirectly associated with TP53 (Figure 2A). In addition, PPIs determined experimentally and constructed by STRING showed that HLA-DQB1, HLA-C, KRT35, IL7R, and LGMN were associated with TP53 through ubiquitin C (UBC) (Figure 2B).
Figure 2

Gene–gene and protein–protein interaction networks.

Notes: (A) Gene–gene interaction networks constructed by GeneMANIA. (B) Protein–protein interaction networks constructed by STRING.

Abbreviations: GeneMANIA, Gene Multiple Association Network Integration Algorithm; STRING, Search Tool for the Retrieval of Interacting Genes/Proteins; TP53, tumor protein p53; UBC, ubiquitin C; HLA-DPB1, major histocompatibility complex, class II, DP beta 1; HLA-DQB1, major histocompatibility complex, class II, DQ beta 1; HLA-DRB1, major histocompatibility complex, class II, DR beta 1; HLA-C, major histocompatibility complex, class I, C; COL6A3, collagen type VI alpha 3 chain; DSG3, desmoglein 3; KRT32, keratin 32; KRT35, keratin 35; KRT36, keratin 36; LGMN, legumain; IL6R, interleukin 6 receptor; IL7R, interleukin 7 receptor.

Detailed regional LD plots of the four haplotype blocks of the candidate SNPs were generated by SNAP. Regional LD plots for four SNPs of HLA-DQB1 in block 2 are shown in Figure 3A–D, whereas the regional LD plot of HLA-DRB1-rs35445101 in block 2 was not available on the SNAP website. Regional LD plots of block 3 (Figure 4A, B) and block 4 (Figure 4C, D) are shown in Figure 4, whereas the regional LD plot for block 1 of HLA-C-rs1131096 and HLA-C-rs1130838 was not available on the SNAP website. Regional LD plot for these SNPs indicated that there were strong LD loci of these blocks detectable in the region nearby them.
Figure 3

Regional LD plots of block 2 (HLA-DQB1).

Notes: Regional LD plots of (A) HLA-DQB1-rs1049056, (B) HLA-DQB1-rs1049059, (C) HLA-DQB1-rs1049060, and (D) HLA-DQB1-rs1130399.

Abbreviations: LD, linkage disequilibrium; HLA-DQB1, major histocompatibility complex, class II, DQ beta 1; CHB, chronic hepatitis B.

Figure 4

Regional LD plots of block 3 (KRT35 and KRT36) and block 4 (COL6A3).

Notes: Regional LD plots of (A) KRT35-rs743686, (B) KRT36-rs2301354, (C) COL6A3-rs111231885, and (D) COL6A3-rs113155945.

Abbreviations: LD, linkage disequilibrium; KRT35, keratin 35; KRT36, keratin 36; COL6A3, collagen type VI alpha 3 chain.

Survival analysis was used to further investigate the associations between the candidate SNPs and haplotypes with HBV-related HCC prognosis. Clinicopathological characteristics and prognosis information of patients with HBV-related HCC have been described and published in an earlier article15 and shown in Table 4. Survival analysis of COL6A3-rs111231885 showed that patients with the T allele had a shorter median survival time (MST) than those with the C allele (51 vs 33 months for CC vs TT/TC, log-rank P=0.012; Table S2). After adjusting for age, gender, race, BMI, smoking status, drinking status, BCLC stage, cirrhosis, radical resection, antiviral therapy, status of tumor capsule, regional invasion, and PVTT in the Cox proportional hazards regression model, patients with the T allele had a significantly increased risk of death compared to those with the C allele (adjusted P=0.043, HR=1.64, 95% CI=1.015–2.647; Table S2). Similar results were also observed with COL6A3-rs113155945; patients with the TT genotype had a significantly increased risk of death compared to the CC genotype (adjusted P=0.047, HR=4.281, 95% CI=1.017–18.022; Table S2). In addition, joint effects analysis was also used to explore the SNPs and TP53 interaction in HBV-related HCC prognosis. TP53-negative patients with the COL6A3-rs111231885 T allele carriers had a significantly increased risk of death (adjusted P=0.034, HR=1.994, 95% CI=1.052–3.778; Table S3) and a poor clinical outcome (MST: 33 vs 68 months for TT/TC vs CC, log-rank P=0.031; Table S3) in HBV-related HCC, compared to TP53-negative patients with C allele carriers. No other genotypes were significantly associated with OS in single and joint effects analysis.
Table 4

Clinicopathological characteristics of patients with HBV-related HCC after data quality control

VariableGWAS
Survival analysis
TP53 negative (n=154)TP53 positive (n=233)OR (95% CI)P-valuePatients (n=387)MST (months)HR (95% CI)Log-rank P
Age (years)0.313
 ≤601332111344511
 >6021220.66 (0.35–1.247)0.20143411.268 (0.795–2.023)
Sex0.563
 Male1412071348481
 Female13260.734 (0.365–1.477)0.38639420.856 (0.504–1.455)
Race0.837
 Han1031421245471
 Minority51911.294 (0.845–1.983)0.236142500.968 (0.711–1.318)
BMI0.933
 ≤251211811302481
 >2533521.053 (0.643–1.725)0.83685470.985 (0.694–1.399)
Smoking status0.054
 None971471244611
 Ever57860.996 (0.653–1.518)0.984143391.342 (0.992–1.816)
Drinking status0.153
 None911341225511
 Ever63991.067 (0.706–1.613)0.758162411.239 (0.921–1.666)
Child–Pugha0.009
 A1161891305511
 B28300.658 (0.374–1.156)0.14658311.665 (1.131–2.452)
Cirrhosis0.117
 No1227139881
 Yes1422060.645 (0.316–1.315)0.228348481.503 (0.897–2.518)
Radical resectionb0.095
 Yes821331215711
 None67940.865 (0.57–1.313)0.496161411.286 (0.955–1.733)
Portal hypertensionc0.347
 No691241193521
 Yes75870.645 (0.421–0.989)0.044162421.163 (0.847–1.598)
Pathological graded0.665
 Well differentiated1410124791
 Moderately differentiated1261902.111 (0.909–4.901)0.082316481.350 (0.688–2.648)
 Poorly differentiated11014 (1.536–127.621)0.01911NA1.203 (0.370–3.908)
Serum AFPe0.499
 ≤400 (ng/mL)871101197511
 >400 (ng/mL)581051.432 (0.935–2.193)0.099163431.111 (0.817–1.512)
Antiviral therapy0.002
 No951571252401
 Yes59760.779 (0.51–1.192)0.25135NA0.574 (0.398–0.828)
Tumor behavior
Tumor size<0.001
 ≤5 cm72851157751
 >5 cm821481.529 (1.011–2.313)0.044230391.757 (1.278–2.415)
Tumor number<0.001
 Single1111691280581
 Multiple43640.978 (0.62–1.54)0.922107271.788 (1.310–2.439)
Status of tumor capsule0.131
 Complete1262011327501
 Incomplete28320.716 (0.412–1.247)0.23860351.335 (0.914–1.949)
Regional invasion0.421
 Absence1311991330511
 Presence23340.973 (0.549–1.726)0.92657401.185 (0.781–1.800)
BCLC stage<0.001
 A871291216951
 B28380.915 (0.523–1.6)0.75666392.080 (1.395–3.102)
 C39661.141 (0.706–1.845)0.59105272.672 (1.919–3.720)
PVTT<0.001
 No1301891319731
 Yes24441.261 (0.731–2.175)0.40468272.233 (1.615–3.088)

Notes:

Child–Pugh class was unavailable in 24 patients. Information of

radical resection was unavailable in 11 patients,

portal hypertension was unavailable in 32 patients,

pathological diagnosis was unavailable in 36 patients,

serum AFP level was unavailable in 27 patients.

Abbreviations: GWAS, genome-wide association study; BMI, body mass index; AFP, α-fetoprotein; BCLC, Barcelona Clinic Liver Cancer; PVTT, portal vein tumor thrombus; HBV, hepatitis B virus; HCC, hepatocellular carcinoma; NA, not applicable; TP53, tumor protein p53; MST, median survival time.

Survival analysis for haplotypes of candidate SNPs is shown in Figure 5A–D, and indicated that patients with TT/other haplotypes in block 4 had a significantly shorter OS (MST: 33 vs 51 months for TT/other haplotypes vs CC, log-rank P=0.019; Table 5, Figure 5D). Multivariate analyses of the Cox proportional hazards regression model suggest that patients with TT/other haplotypes in block 4 had increased risk of death in HBV-related HCC (adjusted P=0.07, HR=1.450, 95% CI=0.970–2.167; Table 5), with a critically significant P-value, compared to patients with CC haplotypes. Joint effects analysis (Figure 6A–D) indicated that TP53-negative patients with TT/other haplotypes in block 4 had a significantly poor prognosis (MST: 33 vs 68 months for TT/other haplotypes vs CC haplotypes, log-rank P=0.047; Table 6, Figure 6D) and increased risk of death in HBV-related HCC (adjusted P=0.047, HR=1.713, 95% CI =1.006–2.918; Table 6), compared to TP53-negative patients with CC haplotypes. No other haplotypes were significantly associated with OS in single or joint effects analysis.
Figure 5

Survival curves of different haplotypes.

Notes: OS stratified by (A) block 1 haplotypes, (B) block 2 haplotypes, (C) block 3 haplotypes, (D) block 4 haplotypes.

Abbreviations: OS, overall survival.

Table 5

Survival analysis of different haplotypes

HaplotypesPatients (2n=774)MST (months)Crude HR (95% CI)Crude P-valueAdjusted HR (95% CI)Adjusted P-valuea
Block 1
TC6164711
GT158680.913 (0.702–1.187)0.4970.934 (0.714–1.221)0.618
Block 2
AGAGA3565111
AGTCC283480.930 (0.735–1.176)0.5440.915 (0.717–1.170)0.48
GATCC+other haplotypes135411.198 (0.909–1.580)0.1991.153 (0.867–1.533)0.327
Block 3
AG4475811
GA+other haplotypes327411.132 (0.918–1.396)0.2471.121 (0.903–1.392)0.299
Block 4
CC7255111
TT+other haplotypes49331.566 (1.070–2.292)0.0211.450 (0.970–2.167)0.07

Notes:

Adjusted for age, gender, race, body mass index, smoking status, drinking status, BCLC stage, cirrhosis, radical resection, antiviral therapy, status of tumor capsule, regional invasion, and PVTT.

Abbreviations: BCLC, Barcelona Clinic Liver Cancer; PVTT, portal vein tumor thrombus; MST, median survival time.

Figure 6

Joint effects survival analysis of different haplotypes and TP53 expression status.

Notes: OS stratified by (A) block 1 haplotypes and TP53 expression status, (B) block 2 haplotypes and TP53 expression status, (C) block 3 haplotypes and TP53 expression status, (D) block 4 haplotypes and TP53 expression status.

Abbreviations: OS, overall survival; TP53, tumor protein p53.

Table 6

Joint effects survival analysis of different haplotypes and TP53 expression status

GroupHaplotypesTP53 statusPatients (2n=774)MST (months)Crude HR (95% CI)Crude P-valueAdjusted HR (95% CI)Adjusted P-valuea
Block 1
Group 1TCNegative2295111
Group 2TCPositive387431.107 (0.868–1.413)0.4131.104 (0.855–1.425)0.449
Group 3GTNegative79680.845 (0.567–1.260)0.4090.857 (0.570–1.289)0.459
Group 4GTPositive79411.126 (0.771–1.644)0.5391.158 (0.787–1.706)0.456
Block 2
Group AAGAGANegative1186811
Group BAGAGAPositive238411.302 (0.928–1.827)0.1261.206 (0.854–1.704)0.288
Group CAGTCC+other haplotypesNegative190481.171 (0.818–1.675)0.3891.057 (0.733–1.524)0.765
Group DAGTCC+other haplotypesPositive228431.256 (0.892–1.769)0.1911.199 (0.841–1.709)0.316
Block 3
Group iAGNegative1567111
Group iiAGPositive291481.243 (0.920–1.678)0.1561.329 (0.976–1.811)0.071
Group iiiGA+other haplotypesNegative152451.235 (0.875–1.743)0.231.341 (0.943–1.905)0.102
Group ivGA+other haplotypesPositive175401.380 (0.993–1.918)0.0551.361 (0.971–1.908)0.074
Block 4
Group ICCNegative2796811
Group IICCPositive446441.196 (0.953–1.501)0.1231.261 (0.995–1.599)0.055
Group IIITT+other haplotypesNegative29331.677 (0.992–2.834)0.0541.713 (1.006–2.918)0.047
Group IVTT+other haplotypesPositive20321.855 (1.045–3.294)0.0351.651 (0.872–3.126)0.123

Notes:

Adjusted for age, gender, race, body mass index, smoking status, drinking status, BCLC stage, cirrhosis, radical resection, antiviral therapy, status of tumor capsule, regional invasion, and PVTT.

Abbreviations: TP53, tumor protein p53; BCLC, Barcelona Clinic Liver Cancer; PVTT, portal vein tumor thrombus; MST, median survival time.

Discussion

The GWAS approach is increasingly being used to discover the association between genes and disease. However, most GWAS have focused on SNPs with high statistical significance, whereas many other SNPs have received little attention and the full potential of these data have not been fully exploited.24,25 Therefore, it is necessary to perform in-depth data mining with GWAS results. Genome-wide pathway analysis can investigate the GWAS SNPs through a SNP→gene→pathway approach to discover the overrepresented pathways in the GWAS data, which would consider rare variants, multi-omics, and interactions. In the current study, ICSNPathway analysis identified 18 candidate SNPs and 10 candidate pathways that are associated with TP53 expression status in HBV-related HCC. Five hypothetical biological mechanisms can be obtained from ICSNPathway analysis. The strongest hypothetical biological mechanism found that the candidate SNPs of HLA-DPB1, HLA-DQB1, and HLA-DRB1 affected their regulatory roles in all the candidate pathways except hematopoietic cell lineage pathway. The major histocompatibility complex class II molecule is a heterodimer consisting of an alpha (DQA) and a beta chain (DQB), both anchored in the membrane. Previous studies have demonstrated that HLA-DPB1 polymorphisms were significantly associated with the risk of HBV infection susceptibility,26,27 whereas the distribution of the SNPs genotype frequencies was similar in HCC and chronic hepatitis B patients.26,28 A case–control study that compared persistence and natural clearance of HBV infection in a population indicated that the HLA-DPB1-rs9277535 A allele has a major effect on the risk of persistent HBV infection.29 Subsequently, another study also reported that HLA-DPB1-rs9277535 was significantly related to HBV infection risks and increased HBV clearance possibility in a dose-dependent manner.30 Furthermore, the polymorphisms of another HLA class II molecule, HLA-DQB1, were also associated with the development of chronic HBV infection and liver cirrhosis,31 as well as the risk factor of HCC.32,33 In addition, our previous study also showed that HLA-DQB1 polymorphisms have a prognosis predictive value in HBV-related HCC patients undergoing hepatic resection.34 Similar genetic susceptibility research on HLA-DRB1 also demonstrated that polymorphisms in the HLA-DRB1 gene were significantly associated with HCC risk, HBV infection, and progression from CHB to HCC.35–38 The second strongest mechanism was that a candidate SNP of HLA-C-rs1131096 and HLA-C-rs1130838 influenced the regulatory role of CAMs, autoimmune thyroid disease, allograft rejection, antigen processing and presentation, type I diabetes mellitus, and graft-versus-host disease pathways. HLA-C belongs to the HLA class I heavy chain paralogues and its genetic variation can influence the risk of HBV-related HCC development;39 furthermore, HLA-C*15 is also an important host immunogenetic factor that negatively associates with hepatitis C virus viral load in chronic hepatitis C patients.40 Studies of major histocompatibility complex class I and II gene polymorphisms demonstrate a risk factor for hepatitis virus and HCC. In the current study, our findings suggest that genetic variation in HLA-DPB1, HLA-DQB1, HLA-DRB1, and HLA-C were associated with TP53 expression status in HBV-related HCC. These results contribute to a better understanding of the heritability of HLA-DPB1, HLA-DQB1, HLA-DRB1, and HLA-C in HBV-related HCC and subsequently provide hypotheses to clarify their potential mechanisms in HBV and HCC genetic susceptibility. In the remaining genes with candidate SNPs, we only found that IL6R and IL7R were associated with HCC among the previous studies, whereas associations between the other genes and human HCC have not been reported. IL6R encodes a subunit of the IL6R complex, and its dysregulation is related to the pathogenesis of many diseases, including cancer. A study by Deng et al revealed that the IL6R-rs6684439 T allele is associated with a lower susceptibility of HBV-related HCC in the Guangxi population,41 whereas miR-451 plays a suppressive role in tumor angiogenesis via the regulation of the IL6R-signal transducer and activator of transcription 3-vascular endothelial growth factor signaling pathway.42 Research by Midorikawa et al confirms that IL7R is downregulated in well-differentiated tumor tissue in HCC and can serve as a predictor gene of HCC dedifferentiation.43 Our results contribute to a better understanding of genes associated with different HBV-related HCC subgroups. Our association analysis demonstrated that seven genes with candidate SNPs were correlated to TP53 at the HBV-related HCC mRNA level, whereas PPI networks showed that five genes with candidate SNPs were associated with TP53 via UBC through experiments. However, the GeneMANIA gene–gene interaction networks showed complex co-expression networks among those genes, and all genes were directly or indirectly related to the TP53 gene. In the present study, we confirmed that LGMN and TP53 are positively correlated at the mRNA level in HBV-related HCC based on the GSE14520 data set, and our bioinformatics analysis by GeneMANIA also suggests that LGMN and TP53 were co-expressed via the TAP1 gene, whereas LGMN was also related to TP53 via the UBC gene in the PPI networks that were constructed by STRING. A study by Murthy et al has reported that LGMN was significantly upregulated in tumor tissue and its low expression showed a better prognosis in colorectal cancer (CRC); meanwhile, it has a positive correlation with TP53.44 LGMN expression and its enzyme activity can also be regulated by TP53, and knockdown experiments suggest that LGMN and TP53 have a positive correlation in HCT116 cells.45 Our bioinformatics analysis also suggests that IL7R is associated with TP53 in GeneMANIA and PPI networks, and IL7/IL7R prevents apoptosis by regulating bcl-2 expression and the TP53 pathway in A549 and human bronchial epithelial cells.46 Among the 10 candidate pathways, CAMs and cell communication pathways were the most common hypothetical biological mechanisms that involved the majority of candidate SNPs. CAMs play an important role in cell communication47 and are associated with HCC diagnosis and survival prediction.48,49 Our findings suggest a novel hypothetical biological mechanism between CAMs and TP53 expression status in HBV-related HCC. Survival analysis in the current study indicates that the C allele of COL6A3-rs111231885 and COL6A3-rs113155945, and COL6A3 block 4 CC haplotypes with TP53 negative status significantly decrease the risk of death in HBV-related HCC patients after hepatectomy. Previous studies have confirmed that high COL6A3 expression was significantly associated with poor prognosis and its mutation can be used for survival prediction in CRC.50,51 Furthermore, COL6A3 was markedly upregulated in the tumor tissue of gastric cancer,52,53 pancreatic cancer,54,55 and CRC50,56 and can serve as a potential diagnostic biomarker in these cancers. This evidence suggests that COL6A3 may be a potential diagnosis and prognosis marker in CRC and may serve as an oncogene of CRC. Our findings demonstrate that several SNPs of COL6A3 have a prediction value for HBV-related HCC prognosis and provide insight into the clinical utility of HBV-related HCC prognosis. Once validated, COL6A3 may be used for prognosis prediction and decision-making in HCC management. There were limitations in our study that need to be recognized. First, our study evaluates the association between TP53 expression status and candidate SNPs using the GWAS approach, and validates the association between the genes of the candidate SNPs and TP53 using the Gene Expression Omnibus data set, GeneMANIA, and STRING bioinformatics tools that lack confirmation by in vivo and in vitro experiments. Second, all patients in the present study were exclusively from a Guangxi population of HBV-related HCC; therefore, in order to generalize our findings, additional external validation in cohorts from other ethnic populations is necessary to confirm our results. Despite these limitations, our study is the first to explore the association between the SNPs and molecular pathways associated with TP53 expression status in HBV-related HCC by using the genome-wide association pathway analysis approach, and that might have etiology or clinical implications.

Conclusion

Genome-wide association pathway analysis in the current study identified 18 candidate SNPs and 10 candidate pathways that are associated with TP53 expression status in HBV-related HCC and generated five novel SNP-to-gene to pathway hypotheses. These results contribute to a better understanding of the heritability of HBV-related HCC in different TP53 expression subgroups and provide evidence for personalized treatment strategies of different TP53 expression subgroups in HBV-related HCC patients. Additional in vivo and in vitro experimental studies will be necessary to elucidate the role of these pathways in different TP53 expression subgroups of HBV-related HCC. Among these candidate SNPs, the C allele of COL6A3-rs111231885 and COL6A3-rs113155945, and COL6A3 block 4 CC haplotypes with TP53 negative status may have protective effects in HBV-related HCC patients after hepatectomy and can serve as a potential prognostic biomarker. Further well-designed and larger sample size studies are needed to validate the associations between COL6A3 genetic variation and HBV-related HCC prognosis.
  56 in total

1.  SNAP: a web-based tool for identification and annotation of proxy SNPs using HapMap.

Authors:  Andrew D Johnson; Robert E Handsaker; Sara L Pulit; Marcia M Nizzari; Christopher J O'Donnell; Paul I W de Bakker
Journal:  Bioinformatics       Date:  2008-10-30       Impact factor: 6.937

2.  Association between chronic hepatitis B virus infection and HLA-DP gene polymorphisms in the Turkish population.

Authors:  Ersin Akgöllü; Ramazan Bilgin; Hikmet Akkız; Yakup Ülger; Berrin Yalınbaş Kaya; Ümit Karaoğullarından; Yusuf Kemal Arslan
Journal:  Virus Res       Date:  2017-01-22       Impact factor: 3.303

3.  Hepatitis C virus-related hepatocellular carcinoma and B-cell lymphoma patients show a different profile of major histocompatibility complex class II alleles.

Authors:  V De Re; L Caggiari; R Talamini; M Crovatto; S De Vita; C Mazzaro; R Cannizzaro; R Dolcetti; M Boiocchi
Journal:  Hum Immunol       Date:  2004-11       Impact factor: 2.850

4.  Role of several cytokines and adhesion molecules in the diagnosis and prediction of survival of hepatocellular carcinoma.

Authors:  Raim Iliaz; Umit Akyuz; Didem Tekin; Murat Serilmez; Sami Evirgen; Bilger Cavus; Hilal Soydinc; Derya Duranyildiz; Cetin Karaca; Kadir Demir; Fatih Besisik; Sabahattin Kaymakoglu; Filiz Akyuz
Journal:  Arab J Gastroenterol       Date:  2016-12       Impact factor: 2.076

5.  The p53 mutation spectrum in hepatocellular carcinoma from Guangxi, China : role of chronic hepatitis B virus infection and aflatoxin B1 exposure.

Authors:  Lu-Nan Qi; Tao Bai; Zu-Shun Chen; Fei-Xiang Wu; Yuan-Yuan Chen; Bang- De Xiang; Tao Peng; Ze-Guang Han; Le-Qun Li
Journal:  Liver Int       Date:  2014-01-27       Impact factor: 5.828

6.  Role of glycosaminoglycans in cellular communication.

Authors:  Robert J Linhardt; Toshihiko Toida
Journal:  Acc Chem Res       Date:  2004-07       Impact factor: 22.384

7.  Polymorphism of XRCC1 and the frequency of mutation in codon 249 of the p53 gene in hepatocellular carcinoma among Guangxi population, China.

Authors:  Xi Dai Long; Yun Ma; Hong Dong Huang; Jin Guang Yao; De Ying Qu; Yun Long Lu
Journal:  Mol Carcinog       Date:  2008-04       Impact factor: 4.784

8.  Novel recurrently mutated genes and a prognostic mutation signature in colorectal cancer.

Authors:  Jun Yu; William K K Wu; Xiangchun Li; Jun He; Xiao-Xing Li; Simon S M Ng; Chang Yu; Zhibo Gao; Jie Yang; Miao Li; Qiaoxiu Wang; Qiaoyi Liang; Yi Pan; Joanna H Tong; Ka F To; Nathalie Wong; Ning Zhang; Jie Chen; Youyong Lu; Paul B S Lai; Francis K L Chan; Yingrui Li; Hsiang-Fu Kung; Huanming Yang; Jun Wang; Joseph J Y Sung
Journal:  Gut       Date:  2014-06-20       Impact factor: 23.059

9.  Identification of key genes associated with gastric cancer based on DNA microarray data.

Authors:  Hui Sun
Journal:  Oncol Lett       Date:  2015-11-17       Impact factor: 2.967

10.  Genetic variations in STAT4,C2,HLA-DRB1 and HLA-DQ associated with risk of hepatitis B virus-related liver cirrhosis.

Authors:  De-Ke Jiang; Xiao-Pin Ma; Xiaopan Wu; Lijun Peng; Jianhua Yin; Yunjie Dan; Hui-Xing Huang; Dong-Lin Ding; Lu-Yao Zhang; Zhuqing Shi; Pengyin Zhang; Hongjie Yu; Jielin Sun; S Lilly Zheng; Guohong Deng; Jianfeng Xu; Ying Liu; Jinsheng Guo; Guangwen Cao; Long Yu
Journal:  Sci Rep       Date:  2015-11-05       Impact factor: 4.379

View more
  1 in total

1.  Endotrophin, a pro-peptide of Type VI collagen, is a biomarker of survival in cirrhotic patients with hepatocellular carcinoma.

Authors:  Diana Julie Leeming; Signe Holm Nielsen; Roslyn Vongsuvanh; Pruthviraj Uchila; Mette Juul Nielsen; Alexander L Reese-Petersen; David van der Poorten; Mohammed Eslam; Detlef Schuppan; Morten Asser Karsdal; Jacob George
Journal:  Hepat Oncol       Date:  2020-12-18
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.