| Literature DB >> 31428132 |
Jun-Li Deng1,2,3,4, Yun-Hua Xu1,2,3,4, Guo Wang1,2,3,4.
Abstract
Background: The molecular mechanism of tumorigenesis remains to be fully understood in breast cancer. It is urgently required to identify genes that are associated with breast cancer development and prognosis and to elucidate the underlying molecular mechanisms. In the present study, we aimed to identify potential pathogenic and prognostic differentially expressed genes (DEGs) in breast adenocarcinoma through bioinformatic analysis of public datasets.Entities:
Keywords: GEO; TCGA; bioinformatics; biomarker; breast cancer; differentially expressed genes; survival
Year: 2019 PMID: 31428132 PMCID: PMC6688090 DOI: 10.3389/fgene.2019.00695
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Characteristics of datasets in this study.
| Dataset | Platform | Sample | Tumor type | Country | |
|---|---|---|---|---|---|
| Normal | Tumor | ||||
| GSE21422 | Affymetrix | 5 | 14 | Breast cancer | Germany |
| GSE29431 | Affymetrix | 12 | 54 | Breast cancer | Spain |
| GSE42568 | Affymetrix | 17 | 109 | Breast cancer | Ireland |
| GSE61304 | Affymetrix | 4 | 58 | Breast cancer | Singapore |
| TCGA | IlluminaHiSeq | 113 | 1,105 | Breast cancer | USA |
Primer sequences used for quantitative Real-time PCR (qRT-PCR).
| Gene symbol | Primer sequence |
|---|---|
| CDK1 | F: 5′-AAACTACAGGTCAAGTGGTAGCC-3′ |
| CCNA2 | F: 5′-GGATGGTAGTTTTGAGTCACCAC-3′ |
| TOP2A | F: 5′-TTAATGCTGCGGACAACAAACA-3′ |
| CCNB1 | F: 5′-AATAAGGCGAAGATCAACATGGC-3′ |
| KIF11 | F: 5′-TGTTTGATGATCCCCGTAACAAG-3′ |
| MELK | F: 5′-AACTCCAGCCTTATGCAGAAC-3′ |
| β-Actin | F: 5′-TTGATTTTGGAGGGATCTCGCTC-3′ |
Figure 1Identification of differentially expressed genes (DEGs) between breast malignant and non-malignant tissues. Panels A–D show the volcano plots of differentially expressed genes for dataset GSE21422 (A), GSE29431 (B), GSE42568 (C), and GSE61304 (D), respectively. Panels E–F show the Venn diagrams of the overlapping DEGs, including 230 up-regulated (E) and 130 down-regulated (F), among the four datasets. Panels G–H show the Venn diagrams of a total of 321 DEGs, including 203 up-regulated (E) and 118 down-regulated (F), among the four datasets of Gene Expression Omnibus (GEO) and the Cancer Genome Atlas (TCGA) datasets.
Three hundred twenty-one differentially expressed genes (DEGs) were identified and confirmed from four profile datasets and the Cancer Genome Atlas (TCGA), including 203 up-regulated genes and 118 down-regulated genes in the breast cancer tissues, compared to normal breast tissues.
| Regulation | DEGs (gene symbol) |
|---|---|
| Up-regulated | |
| Down-regulated |
Figure 2GO (Gene Ontology) enrichment analysis for upregulated DEGs. Panels A–B illustrate the top 10 elements significantly enriched in the GO categories: molecular function (A) and biological process (B).
Figure 3GO enrichment analysis for downregulated DEGs. Panels A–B illustrate the top 10 elements significantly enriched in the GO categories: molecular function (A) and biological process (B).
Figure 4KEGG (Kyoto Encyclopedia of Genes and Genomes) pathway enrichment analysis of the DEGs. (A) Top 10 functional network/pathways associated with these upregulated DEGs through KEGG analysis with a p value less than 0.05. (B) Top 10 functional network/pathways associated with these downregulated DEGs through KEGG analysis with a p value less than 0.05.
Figure 5Protein-protein interaction (PPI) network construction. (A) PPI network constructed with the DEGs from the four datasets of GEO and TCGA datasets. (B–C) The significant module identified from the PPI network using the molecular complex detection (MCODE) method with a score of ≥ 5.0. Panel B shows the module 1 with an MCODE score of 46.63. Panel C shows the module 2 with an MCODE score of 5. The red nodes stand for upregulated genes, while the yellow nodes stand for downregulated genes.
Hub genes with high degree of connectivity.
| Gene | Degree | Type | MCODE cluster |
|---|---|---|---|
| CDK1 | 78 | UP | Cluster 1 |
| CCNA2 | 72 | UP | Cluster 1 |
| TOP2A | 68 | UP | Cluster 1 |
| CCNB1 | 68 | UP | Cluster 1 |
| KIF11 | 64 | UP | Cluster 1 |
| MELK | 64 | UP | Cluster 1 |
Figure 6KEGG pathway enrichment analysis of module 1 and module 2. Panels A and B show the top 10 functional network/pathways associated with these genes in module 1 and module 2 through KEGG analysis, respectively, P < 0.05. Significantly enriched pathways of module 1 and module 2 are indicated in Y-axis. Rich factor in the X-axis represents the enrichment levels. The larger value of Rich factor represents the higher level of enrichment. The color of the dot stands for the different P-value and the size of the dot reflects the number of target genes enriched in the corresponding pathway.
Figure 7Kaplan–Meier survival curves of six hub genes in breast cancer patients. Overall survival (OS) by low and high (A) CDK1, (B) CCNA2, (C) TOP2A, (D) CCNB1, (E) KIF11, and (F) MELK expression.
Figure 8Relative expression of six hub genes in normal tissues and breast cancer tissues. (A) CDK1, (B) CCNA2, (C) TOP2A, (D) CCNB1, (E) KIF11, and (F) MELK. Data are mean ± SE. ***P < 0.001.
Figure 10Relative expression of six hub genes in normal tissues and breast cancer tissues with different tumors subclasses. (A) CDK1, (B) CCNA2, (C) TOP2A, (D) CCNB1, (E) KIF11, and (F) MELK. Data are mean ± SE. ***P < 0.001.
Figure 9Relative expression of six hub genes in normal tissues and breast cancer tissues with different tumors stages. (A) CDK1, (B) CCNA2, (C) TOP2A, (D) CCNB1, (E) KIF11, and (F) MELK. Data are mean ± SE. **P < 0.01; ***P < 0.001.
Figure 11The relative expression of six hub genes in the breast cancer samples (n = 22) collected in our clinic were detected using quantitative Real-Time PCR (qRT-PCR). β-Actin was used as an internal reference gene for normalization. (A) CDK1, (B) CCNA2, (C) TOP2A, (D) CCNB1, (E) KIF11, and (F) MELK. Data were analyzed using paired Student’s t-test.
Figure 12Gene-drug interaction network constructed with six hub genes and chemotherapeutic drugs. Panels A–F show available chemotherapeutic drugs decrease or increase the expression levels of hub genes in mRNA or protein. (A) CDK1, (B) CCNA2, (C) TOP2A, (D) CCNB1, (E) KIF11, and (F) MELK. Red arrows: chemotherapeutic drugs increase the expression of hub genes; green arrows: chemotherapeutic drugs decrease the expression of hub genes. The numbers of arrows between chemotherapeutic drugs and hub genes in this network represent the supported numbers of literatures by previous reports.