| Literature DB >> 27688708 |
Sunny Tian1, Karina Bertelsmann2, Linda Yu3, Shuying Sun4.
Abstract
Heterogeneous DNA methylation patterns are linked to tumor growth. In order to study DNA methylation heterogeneity patterns for breast cancer cell lines, we comparatively study four metrics: variance, I (2) statistic, entropy, and methylation state. Using the categorical metric methylation state, we select the two most heterogeneous states to identify genes that directly affect tumor suppressor genes and high- or moderate-risk breast cancer genes. Utilizing the Gene Set Enrichment Analysis software and the ConsensusPath Database visualization tool, we generate integrated gene networks to study biological relations of heterogeneous genes. This analysis has allowed us to contribute 19 potential breast cancer biomarker genes to cancer databases by locating "hub genes" - heterogeneous genes of significant biological interactions, selected from numerous cancer modules. We have discovered a considerable relationship between these hub genes and heterogeneously methylated oncogenes. Our results have many implications for further heterogeneity analyses of methylation patterns and early detection of breast cancer susceptibility.Entities:
Keywords: DNA methylation; heterogeneity; hub genes
Year: 2016 PMID: 27688708 PMCID: PMC5032785 DOI: 10.4137/CIN.S40300
Source DB: PubMed Journal: Cancer Inform ISSN: 1176-9351
Figure 1Dot plot of methylation states.
Number and percentage of CpG sites belonging to each methylation state.
| METHYLATION STATE | NUMBER OF CpG SITES | PERCENTAGE OF TOTAL CpG SITES |
|---|---|---|
|
| 2106 | 1.98% |
|
| 17362 | 16.33% |
|
| 12575 | 11.83% |
|
| 40541 | 38.13% |
|
| 5548 | 5.22% |
|
| 205 | 0.19% |
|
| 27980 | 26.32% |
| Total | 106317 | 100.00% |
Notes: The summary is done for chromosome 1, but the percentages are representative of the entire genome. Methylation states are shown in alphabetical order.
Figure 2Boxplots of quantitative metrics vs. methylation states.
Figure 3A scatterplot of entropy against standard deviation.
Figure 4A scatterplot of I2 statistic vs. standard deviation.
Fifteen heterogeneous sample genes represented in top 10 significant cancer modules.
| GENE SYMBOL | CANCER MODULE ID | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| 100 | 66 | 137 | 11 | 220 | 47 | 12 | 41 | 88 | 55 | |
| PDGFRA | X | X | X | – | X | – | X | X | X | X |
| DPYSL3 | X | X | X | X | X | X | X | – | – | – |
| SPOCK1 | X | X | X | X | X | X | – | – | – | – |
| TFAP2B | X | X | X | X | X | – | X | X | X | X |
| PAX6 | X | X | X | X | X | – | X | X | X | X |
| SOX9 | X | X | X | X | X | – | X | – | X | X |
| FEZ1 | X | X | X | X | X | – | X | – | – | – |
| NTRK2 | X | X | X | X | X | – | X | – | – | – |
| CRMP1 | X | X | X | X | X | – | X | – | – | – |
| KAL1 | X | X | X | X | X | – | X | – | – | – |
| NEURL | X | X | X | X | X | – | – | X | X | X |
| NRG2 | X | X | X | X | X | – | – | X | X | X |
| DPYSL4 | X | X | X | X | X | – | – | X | X | X |
| TBR1 | X | X | X | X | X | – | – | X | X | X |
| CBLN1 | X | X | X | X | X | – | – | X | – | – |
Note: “X” indicates that the gene exists in the cancer module.
Figure 5Tumor suppressor gene network including TP53, TP63, and TNF-alpha.
Notes: The three genes (TP53, TP63, and TNF-alpha) are outlined in red. Black node labels represent heterogeneous genes. Blue node labels represent intermediate genes or proteins.
Figure 6Breast cancer susceptibility of high- and intermediate-risk genetic mapping. Note: These breast cancer genes are outlined in red.
Figure 7Networks for hub genes and potential biomarkers for breast cancer.
Notes: All known oncogenes that are part of our heterogeneous list are outlined in red. Highly interactive hub genes are outlined in black.
Carefully selected heterogeneous hub genes from cancer modules.
| GENE HUB | DESCRIPTION OF GENE |
|---|---|
| NCAM1 | Encodes a protein that is involved in cell-to-cell interactions and development and differentiation |
|
| Oncogene; Encodes a member of the NTRK family; mutations in this gene have been linked to cancer |
|
| Oncogene; Encodes a member of the NTRK family that leads to cell differentiation; mutations have not been linked to cancer |
|
| Oncogene; Protein coding gene; Mutations linked to various syndromes |
|
| Oncogene; Protein coding gene; Mutations linked to dwarfism |
| DCLK1 | Encodes a protein that is linked to neurogenesis and neuronal apoptosis |
| PRKCB | Protein coding gene; Serves as a receptor for a class of tumor promoters |
|
| Oncogene; Protein coding gene; Plays a role in tumor progression; Mutations have been linked with a variety of cancers |
|
| Oncogene; Encodes a nuclear transcription factor; Linked to leukemia |
| SH3GL2 | Protein coding gene; Related to identical protein binding and lipid binding |
| SH3GL3 | Protein coding gene; Related to identical protein binding and lipid binding |
| SPEG | Protein coding gene; Lack of this protein affects myocardial development |
| PRKCZ | Protein coding gene; Not a receptor for phorbol ester, tumor promoters |
| ROR2 | Protein coding gene required for cartilage and growth plate development |
| FHL1 | Protein coding gene; Mutations linked with muscular dystrophy |
| CDH2 | Encodes a protein required for establishment of left-right asymmetry |
| FLT1 | Protein coding gene; Related diseases include microcystic meningioma |
| EPHB1 | Encodes a protein that mediates developmental processes |
| EPHB2 | Encodes a protein that mediates developmental processes; Related diseases include various cancers |
Notes: The genetic functions of these genes are provided in the second column. Six known heterogeneous oncogenes are in italic. These hub genes have significant potential as indications of breast cancer susceptibility.