Literature DB >> 33584989

Integrative analysis of layers of data in hepatocellular carcinoma reveals pathway dependencies.

Mamatha Bhat1, Elisa Pasini2, Chiara Pastrello3, Sara Rahmati3, Marc Angeli2, Max Kotlyar3, Anand Ghanekar4, Igor Jurisica3.   

Abstract

BACKGROUND: The broader use of high-throughput technologies has led to improved molecular characterization of hepatocellular carcinoma (HCC). AIM: To comprehensively analyze and characterize all publicly available genomic, gene expression, methylation, miRNA and proteomic data in HCC, covering 85 studies and 3355 patient sample profiles, to identify the key dysregulated genes and pathways they affect.
METHODS: We collected and curated all well-annotated and publicly available high-throughput datasets from PubMed and Gene Expression Omnibus derived from human HCC tissue. Comprehensive pathway enrichment analysis was performed using pathDIP for each data type (genomic, gene expression, methylation, miRNA and proteomic), and the overlap of pathways was assessed to elucidate pathway dependencies in HCC.
RESULTS: We identified a total of 8733 abstracts retrieved by the search on PubMed on HCC for the different layers of data on human HCC samples, published until December 2016. The common key dysregulated pathways in HCC tissue across different layers of data included epidermal growth factor (EGFR) and β1-integrin pathways. Genes along these pathways were significantly and consistently dysregulated across the different types of high-throughput data and had prognostic value with respect to overall survival. Using CTD database, estradiol would best modulate and revert these genes appropriately.
CONCLUSION: By analyzing and integrating all available high-throughput genomic, transcriptomic, miRNA, methylation and proteomic data from human HCC tissue, we identified EGFR, β1-integrin and axon guidance as pathway dependencies in HCC. These are master regulators of key pathways in HCC, such as the mTOR, Ras/Raf/MAPK and p53 pathways. The genes implicated in these pathways had prognostic value in HCC, with Netrin and Slit3 being novel proteins of prognostic importance to HCC. Based on this integrative analysis, EGFR, and β1-integrin are master regulators that could serve as potential therapeutic targets in HCC. ©The Author(s) 2020. Published by Baishideng Publishing Group Inc. All rights reserved.

Entities:  

Keywords:  Gene expression; Hepatocellular carcinoma; High throughput data; Methylation; Proteomics; miRNA

Year:  2021        PMID: 33584989      PMCID: PMC7856865          DOI: 10.4254/wjh.v13.i1.94

Source DB:  PubMed          Journal:  World J Hepatol


Core Tip: Analyzing all available high-throughput genomic, transcriptomic, miRNA, methylation and proteomic data from human hepatocellular carcinoma tissue, we identified master regulators of key pathways in hepatocellular carcinoma, such as the mTOR, Ras/Raf/MAPK and p53 pathways.

INTRODUCTION

The molecular basis of hepatocellular carcinoma (HCC) has been elusive, given the significant heterogeneity of this tumor that arises in the context of various chronic liver diseases[1]. HCC remains a high-fatality cancer, despite large-scale efforts to better characterize and therapeutically target this malignancy. Since prevalence of cirrhosis due to hepatitis C and fatty liver disease is increasing in North America, HCC continues to rise[2]. Five-year survival remains poor at 18% due to late diagnosis and inability to tolerate chemotherapy in patients with cirrhosis[2]. Consequently, there is an urgent need to better understand the molecular basis of this highly fatal cancer. Clinical management of HCC is optimized based on disease stage[3]. Curative treatment with resection, radiofrequency ablation or transplantation is possible in early stage disease[4]. When HCC is diagnosed at a later stage, sorafenib is the first-line chemotherapy, which is directed against the Ras/Raf/MAPK pathway[4]. This is associated with a very modest improvement in overall survival of 3 additional months as compared to placebo (10.7 mo vs 7.9 mo)[5]. The cancer genome atlas (TCGA) is a large-scale project that has enabled improved characterization of cancers with several layers of data. The TCGA multi-platform analysis of 196 HCC tumors described this cancer as highly heterogeneous and difficult to characterize, although certain key pathways did emerge including the Ras/Raf/MAPK, mTOR, Wnt/B-catenin, and Sonic Hedgehog pathways[1,6]. Integration of various types of data has previously been performed to map interaction networks. By integrating genomic, transcriptomic and proteomic data, one can understand potential interactions that contribute to a disease condition or process[7,8]. These interactions may otherwise not be uncovered, on the basis of a single type of data. This systems biology approach has been especially important in cancer, given that alterations in one gene can have a ripple effect on proteins in the rest of a protein-protein interaction network. Therefore, elucidating the layers of data in a disease can provide additional insights into the pathways that drive cancer[9]. In the current study, we aim to characterize the landscape of high-throughput data profiling in HCC and determine the patterns in key dysregulated genes and pathways across these different layers of data. The patterns that emerge could help in better understanding the pathways that drive HCC and could be considered as therapeutic targets.

MATERIALS AND METHODS

Data collection, analysis and database compiling

We downloaded all available high-throughput genomic, transcriptomic, microRNA, methylation, and proteomic datasets related to human HCC samples from published datasets (PubMed, http://www.ncbi.nlm.nih.gov/PubMed and Gene Expression Omnibus (GEO), https://www.ncbi.nlm.nih.gov/geo). Using PubMed, the following search was performed for whole exome sequencing data on HCC: ("carcinoma, hepatocellular" [MeSH Terms] OR ("carcinoma" [All Fields] AND "hepatocellular" [All Fields]) OR "hepatocellular carcinoma" [All Fields] OR ("hepatocellular" [All Fields] AND "carcinoma" [All Fields])) AND (whole [All Fields] AND ("exome" [MeSH Terms] OR "exome" [All Fields]) AND sequencing [All Fields]). The following MeSH terms were used to identify gene expression papers: ("carcinoma, hepatocellular" [MeSH Terms] OR ("carcinoma" [All Fields] AND "hepatocellular" [All Fields]) OR "hepatocellular carcinoma" [All Fields] OR ("hepatocellular" [All Fields] AND "carcinoma" [All Fields])) AND ("gene expression" [MeSH Terms] OR ("gene" [All Fields] AND "expression" [All Fields]) OR "gene expression" [All Fields]) AND ("humans" [MeSH Terms] OR "humans" [All Fields]) AND English [All Fields] NOT ("review" [Publication Type] OR "review literature as topic" [MeSH Terms] OR "reviews" [All Fields]). To identify suitable papers regarding methylation in HCC, we used the following terms: ("methylation" [MeSH Terms] OR "methylation"[All Fields]) AND ("carcinoma, hepatocellular" [MeSH Terms] OR ("carcinoma" [All Fields] AND "hepatocellular" [All Fields]) OR "hepatocellular carcinoma" [All Fields] OR ("hepatocellular" [All Fields] AND "carcinoma" [All Fields]) AND ("humans" [MeSH Terms] AND English [lang]). Proteomics papers were retrieved using the following search: [("proteomics" [MeSH Terms] OR "proteomics" [All Fields]) AND high [All Fields] AND throughput [All Fields]] AND ("carcinoma, hepatocellular" [MeSH Terms]) OR ("carcinoma" [All Fields] AND "hepatocellular" [All Fields]) OR "hepatocellular carcinoma" [All Fields] OR ("hepatocellular"[All Fields] AND "carcinoma"[All Fields]). MicroRNAs reported in HCC were identified using these MeSH terms: ("micrornas" [MeSH Terms] OR "micrornas"[All Fields] OR "mirna" [All Fields]) AND profile [All Fields] AND ("carcinoma, hepatocellular" [MeSH Terms] OR ("carcinoma" [All Fields] AND "hepatocellular" [All Fields]) OR "hepatocellular carcinoma" [All Fields] OR ("hepatocellular" [All Fields] AND "carcinoma" [All Fields]). We considered for inclusion all datasets available in PubMed. The datasets publicly available on the GEO, a public functional genomics data repository of high-throughput array data (https://www.ncbi.nlm.nih.gov/geo) were retrieved and analyzed using GEO2R (https://www.ncbi.nlm.nih.gov/geo/info/geo2r.html), a web tool available on the portal, identifying genes differentially expressed between samples of HCC and the non-tumoral liver portion. GEO2R compares original submitter-supplied processed data tables using the GEOquery and limma R packages from the Bioconductor project. Following instructions available online at (https://www.ncbi.nlm.nih.gov/geo/info/geo2r.html), we retrieved all dysregulated genes. Only those with an adjusted P value < 0.05, and expression fold change value below ≤ 0.5 or above ≥ 1.5 were considered for further analysis (Table 1, Supplementary Table 1). The genes included in our list from WES papers were reported as affected by nonsynonymous mutations, and synonymous mutations were not considered. Putative microRNA gene targets were identified using an online database, mirDIP 4.1[10], (http://ophid.utoronto.ca/mirDIP). The most stringent predictive search option (top 1%) was used to obtain the list of putative targets of all differentially expressed miRNAs.
Table 1

List of the final 85 selected publications for each layer of data. For each publication the number of hepatocellular carcinoma samples and controls and the platform used for the analysis are reported

Gene expression
No.yearPMIDHCC (n)Controls (n)GEO dataset
12004173935203513GSE6764
2200818504433112GSE6222
32008189231658082GSE10143
42009190989974758GSE14323
52009198615151647GSE17967
62011213204993434GSE20140 (GSE10141, GSE10140)
72011217124454040GSE28248
82013236911391515GSE17548
9201323800896GSE36376_276; GSE25097_211GSE36376_247; GSE25097_283GSE36376, GSE25097
102014244980024646GSE47595
112014245644074545GSE45114
122014250935043940GSE57958
132014251418671111GSE55092
142014253763021818GSE60502
152014255360567272GSE39791
16201525666192132132GSE54236
17201525645722228168GSE63898
182016274999186060GSE64041
192016259640792620GSE54238
Proteomics
No.yearPMIDHCC (n)Controls (n)
120041472649288
22008190038641212
32005157593161010
42005160970301414
52007176279331212
620142362163433
720091956280533
82016267097252412
92013235893622020
102012228138771010
112012220822271111
1220112163110969123
1320102023004655
142010199568372020
152009197156081818
1620091953509533
172009191613268080
182004152217722020
192003146737982121
202003146545282121
212002124812711111
2220132346220777
2320051633595188
242006163422421010
2520112203487233
2620051585230077
2720112191371733
282007172039742528
292007175862771010
Whole exome sequencing
No.yearPMIDHCC (n)Controls (n)GEO dataset
120132391267733N/A
220142405550847N/A
320172832312355N/A
4201424798001231231GSE54504
52012225615172424N/A
Epigenetic_miRNAs
No.yearPMIDHCC (n)Controls (n)GEO dataset
120152619016097N/A
2201424789420109GSE31383
32014245644074545GSE10694
42011212980087373GSE21362
52008186493637810N/A
62012221351592020N/A
72011213199969494N/A
82009194734412020N/A
920091917327735N/A
102007181713461010N/A
112006163312542525N/A
122015260628883030N/A
1320152604678032743N/A
142015258612556666GSE54751
1520152550007566GSE54537
162014248756492424
17201323812667166166GSE31384
18201323390000917GSE40744
192012230820621818N/A
202014245867852929N/A
212013244179707878N/A
Epigenetic methylation
No.yearPMIDHCC (n)Controls (n)GEO dataset
12011215001881312N/A
22014243066624545N/A
32014253762922222N/A
420152594512988GSE59260
52011217471161212GSE29720
62010201658822020GSE18081
72012222349436262GSE37988
8201324012984208GSE44970
92013232080766666GSE54503
102014250935045959GSE57956
112014252948082727GSE60753

HCC: Hepatocellular carcinoma; GEO: Gene Expression Omnibus; N/A: Not applicable.

From the selected 11 methylation datasets, raw data from eight studies were available on the GEO website (https://www.ncbi.nlm.nih.gov/geo/). We selected the CpG sites or genes reported to be hyper-or hypo- methylated in these publications. The genomic region was considered differentially methylated between HCC tissue and the adjacent non-tumoral sample, if the FDR corrected P value < 0.01. Furthermore, we filtered out everything that did not satisfy the criteria: ∆β ≥ 0.20 or ∆β ≤ -0.20, where ∆β = βHCC - βadjacent was the difference in methylation between above specified groups. When the CpG sites were considered, the Illumina HumanMethylation450K and 27K platforms were used for mapping to the genes. When multiple sites or genes were found to have the same sense of differential methylation, the mean value of ∆β was calculated. Only the CpGs in the 5’UTR, 1st Exon, TSS200, TSS1500 or in CpG islands were considered in our analysis. Proteomic results were retrieved and included only if protein abundance was reported as different in HCC liver samples compared to control samples. Figure 1 outlines our study workflow. Papers were excluded from each specific search for the following reasons: Data from cell lines, or animal models, studying efficacy or drugs, or the presence of long non-coding RNA, mechanistic studies not performing high-throughput or evaluating the role of one molecule, papers focused on liver diseases but not HCC or liver tissue, not original data such as review articles, or those studies using already selected datasets, not reporting the modulation of the molecules, and papers without data available.
Figure 1

Flow chart showing the paper selection process and exclusion criteria for each data type: Gene expression, proteomics, whole exome sequencing, microRNAs and methylation.

Flow chart showing the paper selection process and exclusion criteria for each data type: Gene expression, proteomics, whole exome sequencing, microRNAs and methylation. List of the final 85 selected publications for each layer of data. For each publication the number of hepatocellular carcinoma samples and controls and the platform used for the analysis are reported HCC: Hepatocellular carcinoma; GEO: Gene Expression Omnibus; N/A: Not applicable. Available patient data, including etiology of liver disease (hepatitis C, hepatitis B, alcohol, fatty liver disease) on the basis of which the HCC tumors developed, presence of cirrhosis, the Model for End-stage Liver Disease score (MELD score, an assessment of the severity of liver dysfunction), tumor histology, stage of cancer, alpha-fetoprotein level, overall and recurrence-free survival following treatment were also documented (Supplementary Table 2).

Pathway enrichment analysis

The key dysregulated genes from each type of data (genomic, miRNA, methylation, transcriptomic, and proteomic) were fed into the Integrated Interactions Database[11] (IID, http://ophid.utoronto.ca/iid), to obtain a list of the protein-protein interactions. For the miRNA dataset, we determined the target genes of the differentially expressed miRNAs in tumors using the miRNA Data Integration Portal mirDIP v4.1[10]. The individual lists derived from each type of data were then fed into the pathway Data Integration Portal, pathDIP v3.0 (http://ophid.utoronto.ca/pathDIP)[12], in order to determine the significantly dysregulated pathways in HCC. pathDIP integrates data from 20 major pathway databases, and computationally predicts gene association to curated pathways using protein-protein interactions from IID significance of their connectivity[12]. We used this comprehensive pathway enrichment analysis portal to obtain a list of significantly enriched pathways using literature curated (core) pathway memberships P value (FDR: BH-method) less than 0.05. The lists of pathways from each type of data were then assessed for overlap using Venny 2.1, an online tool for Venn diagram design (http://bioinfogp.cnb. csic.es/tools/venny/index.html).

Retrospective validation on independent dataset

In order to determine whether key differentially expressed genes along the overlapping pathways had prognostic value, we used KMplotter, a web-based tool that enables survival analysis across multiple cancers and datasets[13]. Patient samples were split into two groups per autoselection of the best cutoff for each gene, in order to assess its prognostic value. We ran multivariate overall survival analysis based on the high vs low expression of each gene in HCC tumors. The two groups were compared by a Kaplan-Meier survival plot, and the hazard ratio with 95% confidence intervals and log-rank P value were calculated.

Drug identification by CTD

The identification of putative therapeutic agents able to revert the modulation of genes of interest based on their modulation associated with a worse prognosis was obtained using the online Comparative Toxicogenomics Database http://ctdbase.org[14]. This database provides manually curated information about chemical–gene/protein interactions, chemical–disease and gene–disease relationships.

RESULTS

We identified a total of 8733 abstracts retrieved by the search on PubMed on HCC for the different layers of data on human HCC samples, published until December 2016. The flow chart outlining the selection process is detailed in Figure 1. The number of samples included in our analysis are as follows: (1) Whole exome sequencing: 267 HCC and 270 control samples; (2) Gene expression: 870 HCC and 814 control samples; (3) miRNA: 1172 HCC and 771 control samples; (4) Methylation: 354 HCC and 341 control samples; and (5) Proteomics: 421 HCC and 473 control samples. The methodologies and platforms used to obtain these high-throughput data are reported by type of data (genomic, transcriptomic, miRNA, methylation and proteomic) in Table 1. Clinical data, regarding etiology of liver disease (hepatitis C, hepatitis B, alcohol, fatty liver disease) were frequently reported, on the other side serum levels of liver enzymes, AST and ALT, frequently used to assess liver functions were not available. Pathological details relative to differentiation or stage were frequently absent as well as other crucial variables in the clinic setting, such as Child Pugh/MELD score (Supplementary Table 2).

Integrative analysis reveals most important pathways in HCC

There were 188 overlapping dysregulated genes/proteins across the different types of data. Independently for each type of data, we obtained a list of pathways using pathDIP. We merged the list of dysregulated pathways in miRNA and methylation, given that these epigenetically regulate gene expression, in order to assess for overlapping pathways across the datasets. This resulted in a list of 3 common, overlapping pathways among the different types of data: EGFR, β1-integrin, and axon guidance pathways, as depicted in Figure 2. From the previous list of 188 common dysregulated elements in all different layers of data (Figure 3), we were able to identify 35/188 genes that were involved in these 3 shared pathways across the layers of data (Supplementary Table 1).
Figure 2

Venn diagram shows the three common pathways (EGFR, epidermal growth factor, β1-integrin, and axon guidance pathways) across the four different types of data.

Figure 3

From the previous list of 188 common dysregulated elements in all different layers of data. A: Number of genes/proteins identified in each data type; B: Venn diagram showing the 188 genes identified as commonly deregulated across the 4 different type of data.

Venn diagram shows the three common pathways (EGFR, epidermal growth factor, β1-integrin, and axon guidance pathways) across the four different types of data. From the previous list of 188 common dysregulated elements in all different layers of data. A: Number of genes/proteins identified in each data type; B: Venn diagram showing the 188 genes identified as commonly deregulated across the 4 different type of data.

Prognostic value of pathways in HCC

We then examined the prognostic value of the deregulated genes associated to pathways of interest in HCC using TCGA RNA seq dataset, as listed in Table 2. Median survival of 364 patients in the TCGA, which was used for validation purposes regarding the prognostic value is reported. KMplotter HR results from TCGA RNA seq data reflected the altered modulation identified for these 9 genes in the 19 HCC papers relative to the gene expression data (Table 2). Among the five upregulated genes associated with positive HR values, CDK5, was reported with the highest HR value (1.85, P = 0.0035) and involved in cell cycle (Table 3). The other 4/9 genes reported as upregulated, COL2A1, LAMC1, RPS6KA3 and ITGB1 were identified with positive HR value by KM plotter analysis and involved in cellular migration (Table 2 and Table 3).
Table 2

Prognostic value of the 9 dysregulated genes associated with the 3 common dysregulated pathways (EGFR, epidermal growth factor, β1-integrin and axon guidance) among the 4 types of data in obtained with KMplotter

Gene
Modulation in the 19 HCC papers
Probe-ID
HR
CI
Log-Rank P value
Median survival low (mo)
Median survival high (mo)
Estradiol gene modulation predicted by CTD
COL2A1 Up12801.491.05-2.110.022961.754.1N/A
FGA Down22430.520.35-0.770.000949.770.5+
FGG Down22660.560.39-0.790.000938.370.5+
LAMC1 Up39151.430.98-2.090.0656.538.3N/A
CDK5 Up10201.851.22-2.810.003581.96.2N/A
EPHB1 Down20470.720.048-1.080.113554.170.5N/A
RPS6KA3 Up61971.20.8-1.780.374354.156.5-
EGFR Down19560.610.43-0.890.00853170.5+
ITGB1 Up36881.370.95-1.970.092482.949.7N/A

CTD based prediction identified Estradiol to efficiently affect the expression of the 4/9 genes based on their hazard ratios values. HR: Hazard ratios; HCC: Hepatocellular carcinoma; CI: Confidence interval; N/A: Not applicable.

Table 3

Modulation of the 9 dysregulated genes associated with the 3 common dysregulated pathways (EGFR, epidermal growth factor, β1-integrin and axon guidance) identified in the 19 hepatocellular carcinoma gene expression papers. Their genetic alteration in hepatocellular carcinoma and their mechanism in cancer are reported

Gene
Modulation in the 19 HCC papers
PMID
Mutation in HCC (PMID)
Role in cancer (PMID)
COL2A1 Up (2/19)23800896/25666192(rs3917) polymorphism is associated with higher risk of HCC (21665180)COL2A1 promotes migration in HCC (29858962)
FGA Down (9/19)21320499/23800896/25093504/25536056/25141867/25376302/25666192/25645722/25666192Deleted in HCC patients (27511114)FGA is a positive predictor of survival in gastric cancer patients (15756001)
FGG Down 8/1921320499/23800896/25093504/25536056/25141867/25376302/25645722/24498002Allelic loss (16980951)FGG is involved in amino acid and redox metabolism pathway in HCC (28089356)
LAMC1 Up (4/19)23800896/25536056/25141867/25645722Not identifiedLAMC1 promotes tumor cell invasion and migration in HCC (28928891)
CDK5 Up (2/19)25141867/25376302Not identifiedCDK5 promotes proliferation in HCC (29312535)
EPHB1 Down (2/19)23800896/25141867Missense mutation (19469653)EPHB1 inhibits cell migration(22242939)
RPS6KA3 Up 1/1925141867Somatic mutation and copy number variations (22561517)RPS6KA3 increases cell proliferation (15833840)
EGFR Down (2/19)19098997/25141867Missense mutation (26436086)EGFR promotes cell adhesion (31465839)
ITGB1 Up (1/19)25141867Somatic number variations (24512821)ITGB1 promotes migration (30664185)

HCC: Hepatocellular carcinoma.

Prognostic value of the 9 dysregulated genes associated with the 3 common dysregulated pathways (EGFR, epidermal growth factor, β1-integrin and axon guidance) among the 4 types of data in obtained with KMplotter CTD based prediction identified Estradiol to efficiently affect the expression of the 4/9 genes based on their hazard ratios values. HR: Hazard ratios; HCC: Hepatocellular carcinoma; CI: Confidence interval; N/A: Not applicable. Modulation of the 9 dysregulated genes associated with the 3 common dysregulated pathways (EGFR, epidermal growth factor, β1-integrin and axon guidance) identified in the 19 hepatocellular carcinoma gene expression papers. Their genetic alteration in hepatocellular carcinoma and their mechanism in cancer are reported HCC: Hepatocellular carcinoma. Four out of 9 genes were reported as downmodulated in the 19 HCC gene expression papers. Among these four, two genes, FGA and FGG, were identified as the top statistically significantly (P = 0.0009) associated with a protective role in HCC (HR values 0.52 and 0.59, respectively). FGA and FGG were consistently reported as downmodulated in about 45% of our 19 selected gene expression papers (Table 3). The other two downmodulated genes, EPHB1 and EFGR with negative HR values (Table 2) are reported to be affected by missense mutation leading to a loss of their protective role against cell migration.

Estradiol is a therapeutic agent that appropriately targets HCC genes

Using CTD, we found that estradiol was able to appropriately down- or upmodulate 4 out of 9 cancer-related genes (Table 2). Particularly, CTD reported estradiol capabilities to upregulated FGA, FGG and EGFR reported downmodulated in HCC (Table 2) and counteracting the upregulation of RPS6KA3 in HCC, suggesting a possible role for this hormone in HCC treatment.

DISCUSSION

In this study, we evaluate the molecular pathogenesis of HCC using a unique approach, that of combining all publicly available high-throughput data from patient HCC tumors. This encompasses all miRNA, methylation, genomic, transcriptomic and proteomic profiling data present in the literature, and represents the first effort to derive a consensus molecular model of HCC through analysis of these different types of data. Although these datasets originated from different patient cohorts, presented integrative analysis offers the opportunity to explore common key pathway dependencies of HCC. Starting with the initial generation of genomics and whole exome sequencing data, previous high-throughput studies have brought forth different lists of dysregulated genes, depending on the type of data evaluated. Dysregulated genes may affect different parts of a pathway. Therefore, a pathway-based approach when evaluating different types of high-throughput data offers the ability to assess the pathways most commonly affected in a given cancer. Additionally, the integrative analysis in our study encompasses a large number of patient samples. Using this integrative approach, we confirm the importance of EGFR, β1-integrin and axon guidance as pathways critical in hepatocarcinogenesis. EGFR activates the signaling cascades of the Ras/Raf/MAPK and mTOR pathways, two pathways that were identified as key to HCC pathogenesis in the TCGA study[6]. The identification of β1-integrin as being commonly dysregulated in HCC is novel, and its significance is confirmed through its consistent dysregulation across types of data. β1-integrin is a cell surface receptor that senses the extracellular matrix, thereby modulating the hallmarks of cancer such as proliferative signaling with continuous activated cell replication, evasion of growth suppressors, resistance to angiogenesis as well as cancer cell invasion and metastasis[14]. Ras/Raf/MAPK and mTOR are established pathways in hepatocarcinogenesis, and are integrin-dependent signaling pathways[15]. Additionally, β1-integrin is known to crosstalk with EGFR. In fact, the downregulation of β1-integrin was found to decrease phosphorylation of EGFR and c-Met in hepatocytes during liver regeneration[16]. A synergistic relationship between integrins and EGFR has also been demonstrated in tumor progression[17]. The finding of axon guidance pathway-related proteins as being dysregulated across types of data, thereby establishing consistent dysregulation of this pathway in HCC, is also novel. Netrin-1 is the best studied protein in the axon guidance pathway, and is known to be overexpressed in various cancers[13]. It is responsible for regulation of apoptosis, with increased presence of netrin-1 leading to inhibition of apoptosis. The tumor suppressor p53, frequently mutated in the TCGA HCC study, regulates the cell cycle through netrin-1. The axon guidance pathway has previously been identified as a pathway that is significantly mutated in HCC based on integration of all genomic data in HCC[18]. This analysis revealed mutations along the axon guidance pathway as being prognostic of a higher rate of HCC metastasis. We were able to additionally validate the prognostic importance of dysregulated proteins in these pathways proteins using TCGA data. HCC is a cancer that develops in the context of various chronic liver diseases, which may influence the molecular characteristics of HCC. Additionally, the underlying cirrhosis and liver dysfunction that are often concurrent may influence HCC development and behavior[2]. Patients are often diagnosed at an advanced stage of disease, when it is too late for curative treatment. A unique consideration in HCC is the inability to tolerate hepatotoxic chemotherapy in patients with liver dysfunction, as it is often patients with cirrhosis who develop HCC[19,20]. Therefore, liver function must be considered prior to, during, and after any form of treatment for HCC. Thus, especially for HCC, it has been suggested that a multi-pronged approach to HCC therapy jointly targeting different pathways be adopted. Omics technologies are essential in the progress towards elucidating the molecular basis of HCC. The current study represents the largest integration of all publicly available genomic, gene expression, methylation, miRNA and proteomic data in HCC, covering 85 studies and 3355 patient sample profiles. We identified consistently deregulated pathways associated with hepatocarcinogenesis across different types of data using integrative analysis tools, thereby confirming the importance of these genes in HCC pathogenesis. EGFR (activator of Ras/Raf/MAPK and mTOR) and β1-integrin (also modulator of the aforementioned pathways) were clearly identified as pivotal to HCC[5,21-23]. This is in keeping with the efficacy of the Ras/Raf/MAPK inhibitors sorafenib and regorafenib in HCC[24]. Even beyond this, we found these consistently deregulated genes across pathways to be appropriately modulated by estradiol. HCC is less common in women, and there have been clinical studies demonstrating that hormone therapy and female sex are protective against HCC as described earlier in this thesis. Other integrative multi-omics studies have been recently performed for other tumors with high mortality such as breast and ovarian cancer[6,25]. Several breast cancer studies emphasizing how data integration of genomic/transcriptomic and proteomic has improved the molecular characterization of subtypes of breast cancer and elucidate its heterogeneity and its interaction with the microenvironment and aggressiveness[26,27]. A single source of data was used in the ovarian cancer multi-omics mathematical integration performed by Bhardwaj et al[25]. Copy number variation gene expression and methylation data from TCGA data portal were integrated using mathematical algorithm and identified 32 co-expressed genes and 6 pathways associated with survival. The main limitation of our study is the different patient samples represented by the various types of data. Nonetheless, there is a large amount of high-throughput data, which allowed us to detect pathway dependency patterns that are compatible with the current HCC literature. Additionally, HCC tumors arise in the setting of various chronic liver diseases. We could not assess for etiology-specific genes and pathways in this study, given that the clinical and genetic data to evaluate these differences were not fully available for all the studies. Therefore, we could only evaluate gene differences over whole datasets, rather than individual patients, due not complete individual annotation of the samples available on GEO for each specific dataset. The HCC samples in this integrative analysis all came from patients who had undergone hepatectomy. There were no specimens from patients who were candidates for ablation therapy (early stage), those who were undergoing liver transplantation, or those with advanced HCC. One might anticipate that the molecular features of such tumors differ, given the different stages of HCC captured, but there is unfortunately scarcity of data in this regard.

CONCLUSION

In conclusion, our study represents the largest integrative analysis of all publicly available data in HCC, spanning different types of high-throughput data. Pathway enrichment analysis elucidated EGFR, β1-integrin and axon guidance as pathway dependencies in HCC. These are proteins known to serve as master regulators of key pathways in HCC such as Ras/Raf/MAPK, Wnt/β-catenin and mTOR[28], and may serve as potential overarching therapeutic targets in HCC. The axon guidance pathway was identified as being of potential importance to HCC for the first time, with prognostic value suggested in patient sample validation with TCGA. Estradiol affects a large number of deregulated genes across data with appropriate modulation and may be a therapeutic agent that helps in HCC. A combined therapeutic approach conjointly targeting different pathways may be more optimal in the treatment of HCC, especially when underlying hepatic dysfunction compromises the ability to tolerate optimal chemotherapeutic doses.

ARTICLE HIGHLIGHTS

Research background

Hepatocellular carcinoma (HCC) is highly heterogeneous, difficult to characterize and the molecular basis of HCC has been elusive.

Research motivation

The Cancer Genome Atlas is a large-scale project that has enabled improved characterization of cancers with several layers of data. Elucidating the layers of data in a disease can provide additional insights into the pathways that drive cancer.

Research objectives

A novel integrative approach of all publicly available high-throughput data from patient HCC tumors was used to delineate critical pathway dependencies in HCC.

Research methods

A comprehensive analysis and characterization of all publicly available genomic, gene expression, methylation, miRNA and proteomic data in HCC covered 85 studies and 3355 patient sample profiles and identified the key overlapping dysregulated genes and pathways affected.

Research results

We identified the prognostic value of these genes in HCC genes, specifically with Netrin and Slit3 being novel proteins of prognostic importance to HCC.

Research conclusions

Our large integrative analysis of all publicly available data in HCC and our pathway enrichment analysis has elucidated epidermal growth factor, β1-integrin, and axon guidance as pathway dependencies in HCC.

Research perspectives

Based on our integrative analysis, epidermal growth factor, and β1-integrin are master regulators that could be considered as potential therapeutic targets in HCC.

ACKNOWLEDGEMENTS

The authors thank undergraduate students Sujitha Srinathan, Emily Chen, Bishoy Lawendy, Nangi Suo and Amira Abdallah for their help in data curation.
  27 in total

1.  Integrative analysis reveals disrupted pathways regulated by microRNAs in cancer.

Authors:  Gary Wilk; Rosemary Braun
Journal:  Nucleic Acids Res       Date:  2018-02-16       Impact factor: 16.971

Review 2.  The mTOR pathway in hepatic malignancies.

Authors:  Mamatha Bhat; Nahum Sonenberg; Gregory J Gores
Journal:  Hepatology       Date:  2013-04-17       Impact factor: 17.425

3.  Regorafenib for patients with hepatocellular carcinoma who progressed on sorafenib treatment (RESORCE): a randomised, double-blind, placebo-controlled, phase 3 trial.

Authors:  Jordi Bruix; Shukui Qin; Philippe Merle; Alessandro Granito; Yi-Hsiang Huang; György Bodoky; Marc Pracht; Osamu Yokosuka; Olivier Rosmorduc; Valeriy Breder; René Gerolami; Gianluca Masi; Paul J Ross; Tianqiang Song; Jean-Pierre Bronowicki; Isabelle Ollivier-Hourmand; Masatoshi Kudo; Ann-Lii Cheng; Josep M Llovet; Richard S Finn; Marie-Aude LeBerre; Annette Baumhauer; Gerold Meinhardt; Guohong Han
Journal:  Lancet       Date:  2016-12-06       Impact factor: 79.321

4.  Sorafenib in advanced hepatocellular carcinoma.

Authors:  Josep M Llovet; Sergio Ricci; Vincenzo Mazzaferro; Philip Hilgard; Edward Gane; Jean-Frédéric Blanc; Andre Cosme de Oliveira; Armando Santoro; Jean-Luc Raoul; Alejandro Forner; Myron Schwartz; Camillo Porta; Stefan Zeuzem; Luigi Bolondi; Tim F Greten; Peter R Galle; Jean-François Seitz; Ivan Borbath; Dieter Häussinger; Tom Giannaris; Minghua Shan; Marius Moscovici; Dimitris Voliotis; Jordi Bruix
Journal:  N Engl J Med       Date:  2008-07-24       Impact factor: 91.245

5.  Integrated Proteogenomic Characterization of Human High-Grade Serous Ovarian Cancer.

Authors:  Hui Zhang; Tao Liu; Zhen Zhang; Samuel H Payne; Bai Zhang; Jason E McDermott; Jian-Ying Zhou; Vladislav A Petyuk; Li Chen; Debjit Ray; Shisheng Sun; Feng Yang; Lijun Chen; Jing Wang; Punit Shah; Seong Won Cha; Paul Aiyetan; Sunghee Woo; Yuan Tian; Marina A Gritsenko; Therese R Clauss; Caitlin Choi; Matthew E Monroe; Stefani Thomas; Song Nie; Chaochao Wu; Ronald J Moore; Kun-Hsing Yu; David L Tabb; David Fenyö; Vineet Bafna; Yue Wang; Henry Rodriguez; Emily S Boja; Tara Hiltke; Robert C Rivers; Lori Sokoll; Heng Zhu; Ie-Ming Shih; Leslie Cope; Akhilesh Pandey; Bing Zhang; Michael P Snyder; Douglas A Levine; Richard D Smith; Daniel W Chan; Karin D Rodland
Journal:  Cell       Date:  2016-06-29       Impact factor: 41.582

6.  Management of hepatocellular carcinoma: an update.

Authors:  Jordi Bruix; Morris Sherman
Journal:  Hepatology       Date:  2011-03       Impact factor: 17.425

7.  Two-layer modular analysis of gene and protein networks in breast cancer.

Authors:  Alok Srivastava; Suraj Kumar; Ramakrishna Ramaswamy
Journal:  BMC Syst Biol       Date:  2014-07-05

8.  Integrated analysis of mutation data from various sources identifies key genes and signaling pathways in hepatocellular carcinoma.

Authors:  Yuannv Zhang; Zhaoping Qiu; Lin Wei; Ruqi Tang; Baofeng Lian; Yingjun Zhao; Xianghuo He; Lu Xie
Journal:  PLoS One       Date:  2014-07-02       Impact factor: 3.240

9.  A Single-Cell Atlas of the Tumor and Immune Ecosystem of Human Breast Cancer.

Authors:  Johanna Wagner; Maria Anna Rapsomaniki; Stéphane Chevrier; Tobias Anzeneder; Claus Langwieder; August Dykgers; Martin Rees; Annette Ramaswamy; Simone Muenst; Savas Deniz Soysal; Andrea Jacobs; Jonas Windhager; Karina Silina; Maries van den Broek; Konstantin Johannes Dedes; Maria Rodríguez Martínez; Walter Paul Weber; Bernd Bodenmiller
Journal:  Cell       Date:  2019-04-11       Impact factor: 41.582

10.  Comprehensive and Integrative Genomic Characterization of Hepatocellular Carcinoma.

Authors: 
Journal:  Cell       Date:  2017-06-15       Impact factor: 66.850

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.