| Literature DB >> 27818995 |
Abstract
Liver hepatocellular carcinoma (HCC) remains a leading cause of cancer-related death. Poor understanding of the mechanisms underlying HCC prevents early detection and leads to high mortality. We developed a random forest model that incorporates copy-number variation, DNA methylation, transcription factor, and microRNA binding information as features to predict gene expression in HCC. Our model achieved a highly significant correlation between predicted and measured expression of held-out genes. Furthermore, we identified potential regulators of gene expression in HCC. Many of these regulators have been previously found to be associated with cancer and are differentially expressed in HCC. We also evaluated our predicted target sets for these regulators by making comparison with experimental results. Lastly, we found that the transcription factor E2F6, one of the candidate regulators inferred by our model, is predictive of survival rate in HCC. Results of this study will provide directions for future prospective studies in HCC.Entities:
Mesh:
Year: 2016 PMID: 27818995 PMCID: PMC5080476 DOI: 10.1155/2016/1035945
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
Figure 1Comparison of the random forest models that exclude one type of feature at a time.
Figure 2A toy example illustrating the calculation of ranks from importance values.
List of candidate regulators.
| Regulator | Avg. rank | logFC |
|
|---|---|---|---|
| Methylation | 86 | — | — |
| CNV | 171 | — | — |
| GLIS3 | 195 | −0.61 | 0.008 |
| TCF3 | 195 | 0.81 | 1.95 |
| HIC2 | 199 | 0.48 | 3 |
| ZEB1 | 205 | −0.06 | 0.629 |
| SPIB | 206 | −1.59 | 0.0019 |
| GLIS2 | 208 | 0.52 | 0.05 |
| TCF4 | 209 | 0.03 | 0.881 |
| REST | 209 | −0.57 | 0.029 |
| PKNOX2 | 212 | −1.12 | 0.00017 |
| GLI2 | 213 | −1.33 | 0.0001 |
| NFIA | 214 | −0.69 | 2.63 |
| ID4 | 214 | −1.24 | 5.69 |
| MEF2A | 215 | −0.36 | 0.002 |
| E2F6 | 216 | 0.15 | 0.187 |
| ELF4 | 218 | −0.25 | 0.28 |
| ZBTB7C | 223 | −0.92 | 0.0002 |
| TEAD4 | 223 | 0.62 | 0.02 |
| PRDM1 | 224 | −0.29 | 0.183 |
| MEIS3 | 225 | 0.05 | 0.89 |
| NR2F1 | 226 | −0.82 | 0.0004 |
| ZNF143 | 228 | 0.47 | 4.58 |
| miR.766.3p | 283 | 0.60 | 0.002 |
| miR.335.3p | 283 | −0.17 | 0.39 |
| miR.122.5p | 284 | −0.50 | 0.083 |
Analysis of predicted target sets for candidate regulators.
| Regulator | Target overlap | Enriched GO terms | Enrich. |
|---|---|---|---|
| TCF3 | 140 | Cell cycle process | 3.7 |
| ZEB1 | 126 | — | — |
| SPIB | 146 | Immune system process | 7 |
| TCF4 | 138 | Extracellular matrix organization | 2.1 |
| REST | 143 | — | — |
| MEF2A | 155 | — | — |
| E2F6 | 165 | Cell cycle process | 3.5 |
| PRDM1 | 142 | Immune system process | 4.5 |
| NR2F1 | 135 | — | — |
| ZNF143 | 158 | Chromosome segregation | 4 |
| miR.122.5p | 169 | — | — |
Figure 3E2F6 expression is predictive of survival rate in HCC patients.