| Literature DB >> 31897208 |
Weiguo Hong1, Yan Hu1, Zhenping Fan2, Rong Gao1, Ruichuang Yang1, Jingfeng Bi1, Jun Hou1.
Abstract
Hepatocellular carcinoma (HCC) is the second leading cause of cancer-associated mortality worldwide. Transcription factors (TFs) are crucial proteins that regulate gene expression during cancer progression; however, the roles of TFs in HCC relapse remain unclear. To identify the TFs that drive HCC relapse, the present study constructed co-expression network and identified the Tan module the most relevant to HCC relapse. Numerous hub TFs (highly connected) were subsequently obtained from the Tan module according to the intra-module connectivity and the protein-protein interaction network connectivity. Next, E1A-binding protein p400 (EP400) and TIA1 cytotoxic granule associated RNA binding protein (TIA1) were identified as hub TFs differentially connected between the relapsed and non-relapsed subnetworks. In addition, zinc finger protein 143 (ZNF143) and Yin Yang 1 (YY1) were also identified by using the plugin iRegulon in Cytoscape as master upstream regulatory elements, which could potentially regulate expression of the genes and TFs of the Tan module, respectively. The Kaplan-Meier (KM) curves obtained from KMplot and Gene Expression Profiling Interactive Analysis tools confirmed that the high expression of EP400 and TIA1 were significantly associated with shorter relapse-free survival and disease-free survival of patients with HCC. Furthermore, the KM curves from the UALCAN database demonstrated that high EP400 expression significantly reduced the overall survival of patients with HCC. EP400 and TIA1 may therefore serve as potential prognostic and therapeutic biomarkers. Copyright: © Hong et al.Entities:
Keywords: E1A-binding protein p400; Kaplan-Meier survival analysis; TIA1 cytotoxic granule associated RNA binding protein; gene co-expression network; hepatocellular carcinoma; relapse free survival; transcription factor
Year: 2019 PMID: 31897208 PMCID: PMC6924164 DOI: 10.3892/ol.2019.11171
Source DB: PubMed Journal: Oncol Lett ISSN: 1792-1074 Impact factor: 2.967
Figure 1.Gene co-expression network identified by weighted gene co-expression network analysis. (A) Network topology for different soft-thresholding powers. Numbers in the plots indicate the corresponding soft thresholding powers. The approximate scale-free topology can be attained at the soft-thresholding power of 12. (B) Gene dendrogram obtained by clustering the dissimilarity based on topological overlap with the corresponding module colors indicated by the color row. Each colored row represents a color-coded module which contains a group of co-expressed genes. A total of 32 modules were identified.
Figure 2.Relationships between module eigengenes and variables of hepatocellular carcinoma. Each row in the table corresponds to a module and each column corresponds to a clinical trait. The module name is shown on the left side of each cell. Numbers in the table corresponds to the correlations of the corresponding module eigengenes and clinical traits, with the P-values printed below the correlations. Intensity and direction of correlations are indicated on the right side of the heatmap (red, positively correlated; green, negatively correlated).
Expression differences of the hub transcription factors between the tumor and adjacent tissues.
| Tumor vs. adjacent | |||||
|---|---|---|---|---|---|
| Ensembl ID | Log2 fold change | P-value | Adjusted P-value | kME | Gene symbol |
| ENSG00000116001 | 0.28 | 0.04 | 0.08 | 0.314 | TIA1 |
| ENSG00000179912 | 0.33 | 0.02 | 0.04 | −0.281 | R3HDM2 |
| ENSG00000163666 | 0.29 | 0.15 | 0.24 | 0.264 | HESX1 |
| ENSG00000124459 | 0.46 | 0.01 | 0.02 | −0.260 | ZNF45 |
| ENSG00000185670 | 0.28 | 0.04 | 0.07 | −0.256 | ZBTB3 |
| ENSG00000148200 | 0.14 | 0.40 | 0.51 | 0.253 | NR6A1 |
| ENSG00000167081 | 0.39 | 0.04 | 0.09 | 0.239 | PBX3 |
| ENSG00000162086 | 0.12 | 0.43 | 0.54 | −0.200 | ZNF75A |
| ENSG00000089335 | 0.44 | 0.01 | 0.02 | 0.197 | ZNF302 |
| ENSG00000177463 | 0.33 | 0.04 | 0.08 | 0.192 | NR2C2 |
| ENSG00000133111 | 0.55 | <0.01 | <0.01 | 0.189 | RFXAP |
| ENSG00000100307 | 0.34 | 0.07 | 0.13 | 0.186 | CBX7 |
| ENSG00000254004 | 0.62 | <0.01 | <0.01 | −0.172 | ZNF260 |
| ENSG00000173153 | 0.16 | 0.21 | 0.32 | 0.169 | ESRRA |
| ENSG00000137337 | 0.65 | <0.01 | <0.01 | −0.166 | MDC1 |
| ENSG00000068885 | 0.75 | <0.01 | <0.01 | 0.155 | IFT80 |
| ENSG00000166432 | 0.08 | 0.76 | 0.83 | 0.151 | ZMAT1 |
Differential expressions were analyzed using R package DESeq2 (version 1.20.0). kME indicates the eigengene-based connectivity of gene co-expression network.
Differentially connected transcription factors between the relapsed and non-relapsed subnetworks and their expression differences between the tumor and adjacent normal tissues.
| Ensembl ID | DiffK | Log2 fold change | P-value | Adjusted P-value | Gene symbol |
|---|---|---|---|---|---|
| ENSG00000117569 | −0.58 | 0.60 | <0.01 | <0.01 | PTBP2 |
| ENSG00000267680 | −0.55 | 0.47 | 0.01 | 0.02 | ZNF224 |
| ENSG00000076108 | −0.54 | 0.39 | <0.01 | 0.01 | BAZ2A |
| ENSG00000183495 | −0.47 | 0.41 | <0.01 | 0.01 | EP400 |
| ENSG00000185946 | −0.45 | 0.49 | 0.01 | 0.02 | RNPC3 |
| ENSG00000186908 | −0.40 | 0.13 | 0.24 | 0.35 | ZDHHC17 |
| ENSG00000116001 | 0.40 | 0.28 | 0.04 | 0.08 | TIA1 |
| ENSG00000177463 | 0.45 | 0.33 | 0.04 | 0.08 | NR2C2 |
Differential expressions were analyzed using R package DESeq2 (version 1.20.0). DiffK indicates the differential connectivity between the relapsed and non-relapsed subnetwork.
Correlation of zinc finger protein 143 with its target genes in the Tan module.
| Ensembl ID | Correlation coefficient | P-value | q-value | TFs | Gene symbol |
|---|---|---|---|---|---|
| ENSG00000089335 | 0.43 | 0.005 | 0.014 | Yes | ZNF302 |
| ENSG00000109118 | 0.37 | 0.019 | 0.033 | Yes | PHF12 |
| ENSG00000124459 | 0.40 | 0.010 | 0.022 | Yes | ZNF45 |
| ENSG00000125945 | 0.39 | 0.012 | 0.024 | No | ZNF436 |
| ENSG00000127483 | 0.51 | 0.001 | 0.004 | No | HP1BP3 |
| ENSG00000127957 | 0.38 | 0.015 | 0.029 | No | PMS2P3 |
| ENSG00000131115 | 0.50 | 0.001 | 0.006 | Yes | ZNF227 |
| ENSG00000137337 | 0.41 | 0.009 | 0.021 | Yes | MDC1 |
| ENSG00000155428 | −0.39 | 0.014 | 0.027 | No | TRIM74 |
| ENSG00000159905 | 0.38 | 0.015 | 0.028 | No | ZNF221 |
| ENSG00000175787 | 0.42 | 0.007 | 0.017 | Yes | ZNF169 |
| ENSG00000196597 | 0.33 | 0.035 | 0.050 | No | ZNF782 |
| ENSG00000198707 | 0.47 | 0.002 | 0.009 | No | CEP290 |
| ENSG00000254004 | 0.62 | <0.001 | 0.001 | Yes | ZNF260 |
| ENSG00000256294 | 0.40 | 0.010 | 0.022 | No | ZNF225 |
| ENSG00000263002 | 0.50 | 0.001 | 0.006 | No | ZNF234 |
Pearson correction coefficient was calculated using R package (version 3.5.1). q-values are adjusted P-values, which were calculated from the P-values using an optimized false discovery rate approach using R function in the weighted correlation network analysis package. TFs, transcription factors.
Correlation of zinc finger protein 143 with its upstream transcription factors.
| Ensembl ID | Correlation coefficient | P-value | q-value | Gene symbol |
|---|---|---|---|---|
| ENSG00000078399 | 0.45 | 0.003 | 0.011 | HOXA9 |
| ENSG00000100811 | 0.44 | 0.004 | 0.013 | YY1 |
| ENSG00000105866 | 0.49 | 0.001 | 0.007 | SP4 |
| ENSG00000106689 | −0.54 | <0.001 | 0.003 | LHX2 |
| ENSG00000147133 | 0.52 | 0.001 | 0.004 | TAF1 |
| ENSG00000164002 | 0.37 | 0.019 | 0.033 | EXO5 |
| ENSG00000167182 | 0.52 | 0.001 | 0.004 | SP2 |
| ENSG00000168066 | −0.40 | 0.010 | 0.022 | SF1 |
| ENSG00000172845 | 0.40 | 0.011 | 0.024 | SP3 |
| ENSG00000177374 | −0.47 | 0.002 | 0.009 | HIC1 |
| ENSG00000185630 | −0.45 | 0.004 | 0.012 | PBX1 |
Pearson correction coefficient was calculated using R package (version 3.5.1). q-values are adjusted P-values, which were calculated from the P-values using an optimised false discovery rate approach using R function in the weighted correlation network analysis package.
Correlation of Yin Yang 1 with its target transcription factors.
| Ensembl ID | Correlation coefficient | P-value | q-value | Gene symbol |
|---|---|---|---|---|
| ENSG00000005339 | 0.46 | 0.003 | 0.012 | CREBBP |
| ENSG00000076108 | 0.36 | 0.023 | 0.048 | BAZ2A |
| ENSG00000089335 | 0.39 | 0.013 | 0.034 | ZNF302 |
| ENSG00000106261 | 0.37 | 0.018 | 0.042 | ZKSCAN1 |
| ENSG00000117569 | 0.37 | 0.019 | 0.043 | PTBP2 |
| ENSG00000124177 | 0.53 | <0.001 | 0.004 | CHD6 |
| ENSG00000131051 | 0.46 | 0.003 | 0.012 | RBM39 |
| ENSG00000131115 | 0.38 | 0.016 | 0.038 | ZNF227 |
| ENSG00000137337 | 0.52 | 0.001 | 0.004 | MDC1 |
| ENSG00000173575 | 0.37 | 0.019 | 0.043 | CHD2 |
| ENSG00000254004 | 0.46 | 0.003 | 0.013 | ZNF260 |
| ENSG00000166478 | 0.44 | 0.004 | 0.016 | ZNF143 |
Pearson correction coefficient was calculated using R package (version 3.5.1). q-values are adjusted P-values, which were calculated from the P-values using an optimized false discovery rate approach using R function in the weighted correlation network analysis package.
Correlation of Yin Yang 1 with its upstream transcription factors.
| Ensembl ID | Correlation coefficient | P-value | q-value | Gene symbol |
|---|---|---|---|---|
| ENSG00000100811 | 1.00 | <0.001 | <0.001 | YY1 |
| ENSG00000105866 | 0.38 | 0.016 | 0.039 | SP4 |
| ENSG00000112592 | 0.39 | 0.014 | 0.035 | TBP |
| ENSG00000147133 | 0.42 | 0.008 | 0.024 | TAF1 |
| ENSG00000167182 | 0.48 | 0.002 | 0.009 | SP2 |
| ENSG00000172845 | 0.65 | <0.001 | <0.001 | SP3 |
| ENSG00000177374 | −0.58 | <0.001 | 0.001 | HIC1 |
Pearson correction coefficient was calculated using R package (version 3.5.1). q-values are adjusted P-values, which were calculated from the P-values using an optimized false discovery rate approach using R function in the weighted correlation network analysis package.
Figure 3.KM survival curves for TIA1, NR6A1, RFXAP, ZNF260, EP400 and PTBP2. Red color indicates high expression and grey/blue colors indicate low expression. (A) Effect of TIA1, NR6A1, RFXAP, ZNF260, EP400 and PTBP2 expression levels on RFS of patients with HCC (data from KMplot). (B) Effect of TIA1, NR6A1, RFXAP, ZNF260, EP400 and PTBP2 expression levels on DFS of patients with HCC (data from Gene Expression Profiling Interactive Analysis). DFS, disease-free survival; EP400, E1A binding protein p400; NR6A1, nuclear receptor subfamily 6 group A member 1; PTBP2, polypyrimidine tract binding protein 2; RFS, relapse-free survival; RFXAP, regulatory factor X associated protein; TIA1, TIA1 cytotoxic granule associated RNA binding protein; ZNF260, zinc finger protein 260; KM, Kaplan-Meier.