| Literature DB >> 33784984 |
Qinghong Dai1,2,3,4,5, Tao Liu1, Yongchao Gao2,3,4,5, Honghao Zhou2,3,4,5, Xiong Li6, Wei Zhang7,8,9,10,11.
Abstract
BACKGROUND: Hepatocellular carcinoma (HCC), derived from hepatocytes, is the main histological subtype of primary liver cancer and poses a serious threat to human health due to the high incidence and poor prognosis. This study aimed to establish a multigene prognostic model to predict the prognosis of patients with HCC.Entities:
Keywords: Cox hazard regression; DEGs; HCC; Hub gene; Prognostic model; Risk score
Mesh:
Substances:
Year: 2021 PMID: 33784984 PMCID: PMC8011138 DOI: 10.1186/s12859-021-04095-7
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Sample size of each dataset
| Data set | Non-tumor samples | Tumor samples |
|---|---|---|
| GSE121248 | 37 | 70 |
| GSE40873 | 49 | 0 |
| GSE62232 | 10 | 81 |
| GSE14520 | 241 | 247 |
| TCGA | 50 | 374 |
Fig. 1Overall process of this study. MCODE, a plugin for Cytoscape
Fig. 2Differantially expressed genes. a Merged dataset generated from GSE62232 and GSE40873. b GSE121248. c,d Venn plot of shared gene between merged dataset and GSE121248
Fig. 3Hub genes. a PPI network was consisted of 35 genes, interaction score > = 0.7 was the cutoff value. b,c Subnetwork 1 and subnetwork 2 identified by MCODE. Red represented upregulated gene and green represent downregulated gene. d,e Twenty hub genes were differencially expressed between tumor and nontumor samples in TCGA cohort (p < 0.001). “N” meant nontumor group and “T” meant tumor group
Fig. 4Functional enrichment and univariate regression analysis of hub genes in TCGA cohort. a,b GO enrichment analysis. CC, cellular component. BP, biological process. MF, molecular function. c Circle plot of KEGG pathway. d Univariate hazard regression analysis of hub genes
Information of prognosis model
| Gene_symble | Coef# | HR# | HR.95L | HR.95H | P value |
|---|---|---|---|---|---|
| CDKN3 | − 0.40871 | 0.66451 | 0.47098 | 0.93754 | 0.01996 |
| ZWINT | 0.39124 | 1.47881 | 0.98122 | 2.22874 | 0.06157 |
| NUSAP1 | − 1.07861 | 0.34007 | 0.22714 | 0.50913 | 1.62e−07 |
| DLGAP5 | 0.75777 | 2.13351 | 1.21285 | 3.75304 | 0.00855 |
| HMMR | 0.47968 | 1.61556 | 1.09716 | 2.37891 | 0.01512 |
| KIF20A | 0.65619 | 1.92743 | 1.23469 | 3.00886 | 0.00388 |
# coef, coefficient; HR, hazard ratio
Fig. 5Predictive performance of prognostic model. a KM survival curve. b ROC curve of multiple indicators. c Univariate hazard regression analysis. d Multivariate hazard regression analysis
Fig. 6Risk score analysis. a Samples were sorted according to risk score from low to high. b Correlation between survical time and risk score. c Heatmap of six genes expression involved in prognosis model. d Correlation between risk score and fustate. e Box plot of risk score relative to tumor size
Fig. 7Results of validation cohort. a Comparison of risk score between tumor and normal samples. b KM survival curve, the cutoff divided tumor samples into two groups was the median of risk scores. c Distribution of risk score relative to tumor stage in HCC. d Comparison of risk score between small and large tumors. The diameter of 5 cm was the cutoff value