| Literature DB >> 36147508 |
Jiaying Wang1, Yuting Miao2, Lingmei Li2, Yongqing Wu2, Yan Ren3,4, Yuehua Cui5, Hongyan Cao2,6.
Abstract
Hepatocellular carcinoma (HCC) is a leading malignant liver tumor with high mortality and morbidity. Patients at the same stage can be defined as different molecular subtypes associated with specific genomic disorders and clinical features. Thus, identifying subtypes is essential to realize efficient treatment and improve survival outcomes of HCC patients. Here, we applied a regularized multiple kernel learning with locality preserving projections method to integrate mRNA, miRNA and DNA methylation data of HCC patients to identify subtypes. We identified two HCC subtypes significantly correlated with the overall survival. The patient 3-years mortality rates in the high-risk and low-risk group was 51.0% and 23.5%, respectively. The high-risk group HCC patients were 3.37 times higher in death risk compared to the low-risk group after adjusting for clinically relevant covariates. A total of 196 differentially expressed mRNAs, 2,151 differentially methylated genes and 58 differentially expressed miRNAs were identified between the two subtypes. Additionally, pathway activity analysis showed that the activities of six pathways between the two subtypes were significantly different. Immune cell infiltration analysis revealed that the abundance of nine immune cells differed significantly between the two subtypes. We further applied the weighted gene co-expression network analysis to identify gene modules that may affect patients prognosis. Among the identified modules, the key module genes significantly associated with prognosis were found to be involved in multiple biological processes and pathways, revealing the mechanism underlying the progression of HCC. Hub gene analysis showed that the expression levels of CDK1, CDCA8, TACC3, and NCAPG were significantly associated with HCC prognosis. Our findings may bring novel insights into the subtypes of HCC and promote the realization of precision medicine.Entities:
Keywords: biomarkers; multiple kernel learning; omics data integration; rMKL-LPP; subtype identification
Year: 2022 PMID: 36147508 PMCID: PMC9485934 DOI: 10.3389/fgene.2022.962870
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.772
FIGURE 1Kaplan-Meier survival curves of HCC subtypes identified by rMKL-LPP method. (A) The survival curves drawn based on the initial subtypes in HCC and (B) the survival curves of the regrouped Subtype 1 and Subtype 2.
Clinical characteristics of HCC subtypes.
| Item | Subtype 1 | Subtype 2 |
|---|---|---|
| Cases, | 248 (86.4) | 39 (13.6) |
| Age, years | 58.57 ± 13.00 | 59.15 ± 11.78 |
| Female, | 80 (32.3) | 12(30.8) |
| Pathologic stage, | ||
| Stage I | 133 (53.6) | 16 (41.0) |
| Stage II | 59 (23.8) | 13 (33.3) |
| Stage III | 52 (21.0) | 10 (25.7) |
| Stage IV | 4 (1.6) | 0 |
| Death event, | 48 (19.4) | 14 (35.9) |
Results of Cox regression analysis in 287 patients with HCC.
| Item | Coefficient (SE) | Wald Z | P | HR (95% CI) |
|---|---|---|---|---|
| Subtypes | 1.214(0.323) | 3.756 | <0.001 | 3.369 (1.787,6.349) |
| Age | 0.005(0.011) | 0.413 | 0.680 | 1.005 (0.983,1.026) |
| Gender | -0.191(0.282) | −0.678 | 0.498 | 0.826 (0.475,1.436) |
| Pathologic stage | ||||
| Stage II | 0.010(0.348) | 0.027 | 0.978 | 1.010 (0.511,1.996) |
| Stage III | 0.285(0.312) | 0.912 | 0.361 | 1.330(0.721,2.451) |
| Stage IV | 1.907(0.638) | 2.990 | 0.003 | 6.730 (1.929,23.483) |
Shows the statistical significance at the α = 0.05 level.
FIGURE 2(A) Heatmap of DEmRNAs, DMGs and DEmiRNAs between the two subtypes. Each column corresponds to a patient and each row indicates an individual feature. The relatively high and low expression of genes are shown in red and green color respectively. (B) Venn diagram of differentially expressed gene analysis results in different omics data.
FIGURE 3KEGG enrichment analysis for 56 genes selected.
FIGURE 4GO enrichment analysis for 56 genes selected.
FIGURE 5Boxplots showing the pathway activity for six pathways.
FIGURE 6Boxplots showing the abundance of nine immune cells between the two subtypes.
FIGURE 7(A) Hierarchical clustering dendrogram of identified co-expression modules. (B) Heatmaps of the correlation between modules and clinical traits. Each row represents a module and each column represents a clinical feature. Each cell consists of the correlation and p-value (in parenthesis).
FIGURE 8Network diagram of interaction between hub genes and candidate genes.
FIGURE 9Survival curves (A–D) of the four hub genes sorted in ascending order of p-value.