| Literature DB >> 30255812 |
So Yeon Kim1, Tae Rim Kim1, Hyun-Hwan Jeong2,3, Kyung-Ah Sohn4.
Abstract
BACKGROUND: Integrative analysis on multi-omics data has gained much attention recently. To investigate the interactive effect of gene expression and DNA methylation on cancer, we propose a directed random walk-based approach on an integrated gene-gene graph that is guided by pathway information.Entities:
Keywords: Breast cancer; DNA methylation; Denoising autoencoder; Gene expression; Integrative analysis; Multi-omics; Pathway; Random walk
Mesh:
Substances:
Year: 2018 PMID: 30255812 PMCID: PMC6157196 DOI: 10.1186/s12920-018-0389-z
Source DB: PubMed Journal: BMC Med Genomics ISSN: 1755-8794 Impact factor: 3.063
Fig. 1Overview of the proposed integrative pathway-based survival prediction method
Fig. 2Classification performance comparison between a single type of feature data and converted pathway profiles by DRW in terms of mean AUC (left) and mean accuracy (right) after 10 repeats of 5-fold cross validation process. Gene and pathway profiles with gene expression and methylation profiles were evaluated. The pathway profiles were obtained by the original DRW method. Error bars represent the standard error of the mean values
Fig. 3Classification performance comparison of the pathway-based prediction methods on the combined feature data. Mean AUC (left) and mean accuracy (right) after 10 repeats of 5-fold cross validation process are shown. Error bars represent the standard error of the mean values
Fig. 4Heat-map for comparing selected pathways by the iDRW and iDRW + DA methods. Each cell represents similarity using Simpson coefficient between two lists of differentially expressed genes and methylated genes from a pair of pathways selected by each method. Note that the rows and columns represent selected pathways by iDRW+DA and the iDRW method, respectively, and are clustered via hierarchical clustering with complete-linkage method
Fig. 5Genetic alterations for the four genes in the dorso-ventral axis formation pathway from the Breast Invasive Carcinoma dataset in cBioPortal (http://www.cbioportal.org/)
Risk-active pathways identified by the proposed method (iDRW+DA)
| Pathway ID | Pathway name | Frequencya | Total genesb | DE genes | DM genes |
|---|---|---|---|---|---|
| map 04320 | Dorso-ventral axis formation | 10/50 | 27 | 4 | 0 |
| map 04972 | Pancreatic secretion | 8/50 | 65 | 26 | 3 |
| map 04722 | Neurotrophin signaling pathway | 7/50 | 90 | 47 | 3 |
| map 05020 | Prion diseases | 7/50 | 30 | 12 | 0 |
| map 00670 | One carbon pool by folate | 5/50 | 33 | 6 | 1 |
| map 00592 | alpha-Linolenic acid metabolism | 5/50 | 23 | 8 | 1 |
| map 00620 | Pyruvate metabolism | 5/50 | 96 | 7 | 1 |
| map 03320 | PPAR signaling pathway | 5/50 | 61 | 13 | 1 |
| map 04660 | T cell receptor signaling pathway | 5/50 | 85 | 52 | 8 |
| map 04510 | Focal adhesion | 5/50 | 148 | 83 | 11 |
| map 03010 | Ribosome | 5/50 | 143 | 1 | 0 |
| map 05214 | Glioma | 5/50 | 52 | 27 | 0 |
| map 04711 | Circadian rhythm - fly | 5/50 | 8 | 4 | 1 |
| map 00960 | Tropane, piperidine, and pyridine alkaloid biosynthesis | 5/50 | 26 | 1 | 0 |
aFrequency: the number of times the pathway has been selected over 10 times of 5-fold cross validation process (50 iterations)
bTotal genes: the number of genes mapped to the pathway in the KEGG database
Note that the number of differentially expressed genes (DE genes) and differentially methylated genes (DM genes) are also shown (p-value of DESeq2 or t-test < 0.05)
Fig. 6Pathway-based gene-gene interaction network between gene expression profiles and DNA methylation features extracted by iDRW + DA. The genes in the top-10 pathways are shown; the hub genes whose degree is greater than 4 in the gene expression data (green) and genes that are detected in differential methylation regions (orange) are emphasized in different colors
Top-5 genes ranked by GDA scores from the DisGeNET database (http://www.disgenet.org/) that are associated with breast-cancer-related diseases
| Disease ID | Disease | Gene | GDA score |
|---|---|---|---|
| C0678222 | Breast Carcinoma | AKT1 | 0.2418 |
| PIK3CD | 0.0448 | ||
| MAPK3 | 0.0118 | ||
| HRAS | 0.0077 | ||
| BCAR1 | 0.0074 | ||
| C0006142 | Malignant neoplasm of breast | AKT1 | 0.2420 |
| PIK3CD | 0.0475 | ||
| KDR | 0.0119 | ||
| MAPK3 | 0.0110 | ||
| PAK1 | 0.0095 | ||
| C3539878 | Triple Negative Breast Neoplasms | PIK3CD | 0.0047 |
| AKT1 | 0.0022 | ||
| AKT3 | 0.0011 | ||
| MAPK3 | 0.0011 | ||
| KDR | 0.0008 |