| Literature DB >> 34335705 |
Linzhuo Fan1, Jinhong Hou1, Guimin Qin1.
Abstract
Breast cancer is one of the most common malignant tumors in women, which seriously endangers women's health. Great advances have been made over the last decades, however, most studies predict driver genes of breast cancer using biological experiments and/or computational methods, regardless of stage information. In this study, we propose a computational framework to predict the disease genes of breast cancer based on stage-specific gene regulatory networks. Firstly, we screen out differentially expressed genes and hypomethylated/hypermethylated genes by comparing tumor samples with corresponding normal samples. Secondly, we construct three stage-specific gene regulatory networks by integrating RNA-seq profiles and TF-target pairs, and apply WGCNA to detect modules from these networks. Subsequently, we perform network topological analysis and gene set enrichment analysis. Finally, the key genes of specific modules for each stage are screened as candidate disease genes. We obtain seven stage-specific modules, and identify 20, 12, and 22 key genes for three stages, respectively. Furthermore, 55%, 83%, and 64% of the genes are associated with breast cancer, for example E2F2, E2F8, TPX2, BUB1, and CKAP2L. So it may be of great importance for further verification by cancer experts.Entities:
Keywords: DNA methylation; WGCNA; breast cancer; differentially expressed genes; stage-specific gene regulatory networks
Year: 2021 PMID: 34335705 PMCID: PMC8321251 DOI: 10.3389/fgene.2021.717557
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
FIGURE 1Workflow of the computational framework for predicting disease genes based on stage-specific gene regulatory network.
FIGURE 2The relationship between DNA methylation and gene expression of each stage.
FIGURE 3Module division results of each stage. (A) Stage I, (B) Stage II, (C) Stage III.
Gene distribution of each module.
| S1_black | 71 | 5 | S2_black | 66 | 4 | S3_black | 99 | 11 |
| S1_blue | 149 | 11 | S2_blue | 125 | 5 | S3_blue | 142 | 8 |
| S1_brown | 120 | 28 | S2_brown | 116 | 3 | S3_brown | 120 | 26 |
| S1_green | 106 | 5 | S2_green | 90 | 9 | S3_green | 110 | 25 |
| S1_grey | 42 | 4 | S2_grey | 40 | 3 | S3_greenyellow | 51 | 14 |
| S1_magenta | 63 | 4 | S2_magenta | 58 | 3 | S3_grey | 30 | 2 |
| S1_pink | 66 | 10 | S2_pink | 58 | 6 | S3_magenta | 61 | 4 |
| S1_purple | 40 | 2 | S2_red | 68 | 3 | S3_pink | 70 | 14 |
| S1_red | 95 | 8 | S2_turquoise | 337 | 21 | S3_purple | 54 | 10 |
| S1_turquoise | 270 | 13 | S2_yellow | 108 | 2 | S3_red | 104 | 15 |
| S1_yellow | 107 | 1 | S3_tan | 45 | 2 | |||
| S3_turquoise | 337 | 47 | ||||||
| S3_yellow | 116 | 8 |
FIGURE 4Joint enrichment analysis of seven specific modules.
Functional enrichment analysis.
| S1_turquoise | GO:0044770 | Cell cycle phase transition | −32.060 | 52 |
| GO:0051301 | Cell division | −31.343 | 50 | |
| GO:0006260 | DNA replication | −21.835 | 30 | |
| S1_blue | GO:0022411 | Cell component disassembly | −8.88 | 18 |
| GO:0001046 | The core promoter sequence specifically binds to DNA | −8.71 | 7 | |
| GO:0070897 | Transcription pre-priming complex assembly | −3.78 | 6 | |
| S1_brown | GO:0001228 | DNA binding transcription activator activity | −9.13 | 15 |
| GO:0001227 | DNA binding transcription repressor activity | −6.08 | 10 | |
| GO:0004879 | Nuclear receptor activity | −5.54 | 5 | |
| S2_turquoise | GO:0044770 | Cell cycle transition | −41.180 | 66 |
| GO:0007059 | Chromosome segregation | −38.573 | 50 | |
| GO:0005819 | Spindle | −27.162 | 42 | |
| S3_turquoise | GO:0044770 | Cell cycle transition | −44.41 | 69 |
| GO:0098687 | Chromosome region | −38.25 | 50 | |
| hsa04110 | Cell cycle | −28.49 | 29 | |
| S3_brown | GO:0006352 | DNA template transcription | −9.100 | 12 |
| GO:0001046 | The core promoter sequence specifically binds to DNA | −7.678 | 6 | |
| GO:0034655 | Catabolism of nucleobase-containing compounds | −6.932 | 14 | |
| S3_green | GO:0016570 | Histone modification | −6.678 | 12 |
| GO:0005697 | Telomerase holoenzyme complex | −5.816 | 4 | |
| GO:0034243 | Macromolecule methylation | −5.194 | 5 |
Candidate disease genes at each stage.
| Stage I | E2F2*, E2F8#, TPX2*, BUB1*, CKAP2L#, CBX3#, CASC5#, KPNA2#, LMNB1, NEK2*, TTK*, SLC25A36, CREBRF, ZC3H6, PAN2, BTAF1, SLC25A39, DDX49, SLC39A1#, MRPS12 |
| Stage II | E2F2*, E2F8#, TPX2*, KPNA2#, CKAP2L#, CBX3#, DDIAS, BUB1*, CCNE2#, CASC5#, SPDL1, TOP2A* |
| Stage III | E2F2*, RAD21#, FBXO5#, CCNE2#, CBX3#, STIL#, CKAP2L#, PCNA*, NEK2*, TTK*, CSE1L#, H2AFZ#, NR2F6, TRAPPC6A, IGSF8, FDXR, SLC39A1#, EXOSC5, RBBP5, KDM5B#, H3F3A, CDC42SE1 |
| Common genes | E2F2, CKAP2L, CBX3 |