| Literature DB >> 34946378 |
Hanan Aljuaid1, Hanan A Hosni Mahmoud1,2.
Abstract
Epigenetic changes are a necessary characteristic of all cancer types. Tumor cells usually target genetic changes and epigenetic alterations as well. It is most beneficial to identify epigenetic similar features among cancer various types to be able to discover the appropriate treatments. The existence of epigenetic alteration profiles can aid in targeting this goal. In this paper, we propose a new technique applying data mining and clustering methodologies for cancer epigenetic changes analysis. The proposed technique aims to detect common patterns of epigenetic changes in various cancer types. We demonstrated the validation of the new technique by detecting epigenetic patterns across seven cancer types and by determining epigenetic similarities among various cancer types. The experimental results demonstrate that common epigenetic patterns do exist across these cancer types. Additionally, epigenetic gene analysis performed on the associated genes found a strong relationship with the development of various types of cancer and proved high risk across the studied cancer types. We utilized the frequent pattern data mining approach to represent cancer types compactly in the promoters for some epigenetic marks. Utilizing the built frequent pattern item set, the most frequent items are identified and yield the group of the bi-clusters of these patterns. Experimental results of the proposed method are shown to have a success rate of 88% in detecting cancer types according to specific epigenetic pattern.Entities:
Keywords: cancer groups; clustering; data mining; epigenetic pattern; genes
Year: 2021 PMID: 34946378 PMCID: PMC8700852 DOI: 10.3390/healthcare9121652
Source DB: PubMed Journal: Healthcare (Basel) ISSN: 2227-9032
Marks that characterize the investigated cancer types.
| H3K4m1 | Is an epigenetic DNA alteration to the Histone protein H1 |
| H3K4m3 | Is an epigenetic DNA alteration to the Histone protein H3 |
| H3K9m3 | Is a tri-methylation at the 9th residue of the protein H3 |
| H3K27c | Is the acetylation of the residue at terminal distance 27 |
| H3K27m3 | Is the tri-methylation of H3 protein |
| H3K36m3 | Is the tri-methylation at the 36th position of the H3 protein |
Figure 1The flowchart of the proposed three-way clustering technique (3WC) includes preprocessing of the epigenetic marks of various cancer types and detecting the three clusters with coherent epigenetic marks among various cancer types.
Figure 2Epigenetic list of the patterns H3K9m3 in a usual tri-cluster with the same pattern in human erythroleukemic cell line (HEC) cancer.
Figure 3The epigenetic patterns H3K4me3 in a cancer groups CDT, LHC, HEC, and AHC.
Figure 4Experimental results of the bi-clusters thresholds versus various gene patterns. The comparison specifies that the correspondences of the epigenetic patterns are quite different. H3K4me3 has a small number of bi-clusters, while H3K4me1 and H3K27me3 have similar patterns across various cancer types.
Experimental results as related to ground truth.
| Case | Age | Cancer Type | Diagnosis by | Diagnosis by | Genome Detected by the Proposed 3WC | Genome Detected by Physician (Ground Truth) | Status |
|---|---|---|---|---|---|---|---|
| 1 | 35 | CDT | Epigenetic DNA alteration to the Histone protein H1 | Epigenetic DNA alteration to the Histone protein H1 | H3K9m1 | H3K9m1 | Match |
| 2 | 61 | LHC | Epigenetic DNA alteration to the Histone protein H3 | Epigenetic DNA alteration to the Histone protein H3 | H3K4m3 | H3K4m3 | Match |
| 3 | 45 | BLC | Epigenetic DNA alteration to the Histone protein H27 | Epigenetic DNA alteration to the Histone protein H27 | H3K27c | H3K27c | Match |
| 4 | 32 | BLC | Epigenetic DNA alteration to the Histone protein H3 | Epigenetic DNA alteration to the Histone protein H1 | H3K9m1 | H3K9m3 | Miss Match |
| 5 | 25 | CDT | Epigenetic DNA alteration to the Histone protein H1 | Epigenetic DNA alteration to the Histone protein H1 | H3K9m1 | H3K9m1 | Match |
| 6 | 15 | MMC | Epigenetic DNA alteration to the Histone protein H3 | Epigenetic DNA alteration to the Histone protein H3 | H3K27m3 | H3K27m3 | Match |
| 7 | 67 | CDT | Epigenetic DNA alteration to the Histone protein H3 | Epigenetic DNA alteration to the Histone protein H1 | H3K9m1 | H3K9m1 | Match in epigenetic, mismatch in diagnosis |
| 8 | 75 | LHC | Epigenetic DNA alteration to the Histone protein H27 | Epigenetic DNA alteration to the Histone protein H3 | H3K4m3 | H3K4m3 | Match in epigenetic, mismatch in diagnosis |
| 9 | 32 | MMC | Epigenetic DNA alteration to the Histone protein H27 | Epigenetic DNA alteration to the Histone protein H27 | H3K27c | H3K27c | Match |
| 10 | 25 | CDT | Epigenetic DNA alteration to the Histone protein H27 | Epigenetic DNA alteration to the Histone protein H27 | H3K27c | H3K27c | Match |
| 11 | 52 | LHC | Epigenetic DNA alteration to the Histone protein H1 | Epigenetic DNA alteration to the Histone protein H1 | H3K9m1 | H3K9m1 | Match |
| 12 | 25 | LHC | Epigenetic DNA alteration to the Histone protein H3 | Epigenetic DNA alteration to the Histone protein H3 | H3K4m3 | H3K4m3 | Match |
| 13 | 5 | MMC | Epigenetic DNA alteration to the Histone protein H3 | Epigenetic DNA alteration to the Histone protein H3 | H3K4m3 | H3K4m3 | Match |
| 14 | 4 | LHC | Epigenetic DNA alteration to the Histone protein m3 | Epigenetic DNA alteration to the Histone protein m3 | H3K27m3 | H3K27m3 | Match |
| 15 | 6 | MMC | Epigenetic DNA alteration to the Histone protein m3 | Epigenetic DNA alteration to the Histone protein m3 | H3K27m3 | H3K27m3 | Match |
| 16 | 4.5 | CDT | Epigenetic DNA alteration to the Histone protein m3 | Epigenetic DNA alteration to the Histone protein m3 | H3K27m3 | H3K27m3 | Match |
| 17 | 3 | LHC | Epigenetic DNA alteration to the Histone protein H3 | Epigenetic DNA alteration to the Histone protein H3 | H3K9m1 | H3K9m3 | Match |
| 18 | 2.5 | MMC | Epigenetic DNA alteration to the Histone protein H1 | Epigenetic DNA alteration to the Histone protein H1 | H3K9m1 | H3K9m1 | Match |
| 19 | 5 | CDT | Epigenetic DNA alteration to the Histone protein H3 | Epigenetic DNA alteration to the Histone protein H1 | H3K9m1 | H3K9m3 | Miss Match |
| 20 | 25 | LHC | Epigenetic DNA alteration to the Histone protein H1 | Epigenetic DNA alteration to the Histone protein H1 | H3K9m1 | H3K9m1 | Match |
| 21 | 50 | MMC | Epigenetic DNA alteration to the Histone protein H27 | Epigenetic DNA alteration to the Histone protein H3 | H3K4m3 | H3Kc27 | Miss Match |
| 22 | 61 | CDT | Epigenetic DNA alteration to the Histone protein H1 | Epigenetic DNA alteration to the Histone protein H1 | H3K9m1 | H3K9m1 | Match |
Figure 5Patterns of the three clusters (epigenetic tri-clusters). (A) The marks are represented by columns, where rows represent clusters. (B) Columns represent cancer types, while rows represent clusters. Fold ratio is computed as the ratio of the genes count to the clusters in the genes.