| Literature DB >> 33499770 |
Yujie Weng1, Wei Liang1, Yucheng Ji1, Zhongxian Li1, Rong Jia1, Ying Liang1, Pengfei Ning1, Yingqi Xu2.
Abstract
Human epidermal growth factor 2 (HER2)+ breast cancer is considered the most dangerous type of breast cancers. Herein, we used bioinformatics methods to identify potential key genes in HER2+ breast cancer to enable its diagnosis, treatment, and prognosis prediction. Datasets of HER2+ breast cancer and normal tissue samples retrieved from Gene Expression Omnibus and The Cancer Genome Atlas databases were subjected to analysis for differentially expressed genes using R software. The identified differentially expressed genes were subjected to gene ontology and Kyoto Encyclopedia of Genes and Genomes pathway enrichment analyses followed by construction of protein-protein interaction networks using the STRING database to identify key genes. The genes were further validated via survival and differential gene expression analyses. We identified 97 upregulated and 106 downregulated genes that were primarily associated with processes such as mitosis, protein kinase activity, cell cycle, and the p53 signaling pathway. Visualization of the protein-protein interaction network identified 10 key genes (CCNA2, CDK1, CDC20, CCNB1, DLGAP5, AURKA, BUB1B, RRM2, TPX2, and MAD2L1), all of which were upregulated. Survival analysis using PROGgeneV2 showed that CDC20, CCNA2, DLGAP5, RRM2, and TPX2 are prognosis-related key genes in HER2+ breast cancer. A nomogram showed that high expression of RRM2, DLGAP5, and TPX2 was positively associated with the risk of death. TPX2, which has not previously been reported in HER2+ breast cancer, was associated with breast cancer development, progression, and prognosis and is therefore a potential key gene. It is hoped that this study can provide a new method for the diagnosis and treatment of HER2 + breast cancer.Entities:
Keywords: HER2+ breast cancer; TPX2; bioinformatics; key gene; prognosis-related gene; therapeutic target
Year: 2021 PMID: 33499770 PMCID: PMC7844453 DOI: 10.1177/1533033820983298
Source DB: PubMed Journal: Technol Cancer Res Treat ISSN: 1533-0338
Figure 1.Volcano plots of DEGs (A. Dataset GSE45827; B. TCGA datasets). DEG: differentially expressed gene; FC, fold-change.
Number of DEGs in Datasets Retrieved From the 2 Databases.
| Database | Upregulated DEGs | Downregulated DEGs | Total |
|---|---|---|---|
| GSE45827 | 542 | 200 | 742 |
| TCGA datasets | 897 | 1166 | 2063 |
Figure 2.Venn diagrams of DEGs (A. Upregulated genes; B. Downregulated genes). DEG, differentially expressed gene.
Figure 3.Functional enrichment analysis of upregulated genes (A. Biological processes; B. Cellular components; C. Molecular functions; D. KEGG pathway enrichment). DEG, differentially expressed gene; KEGG, Kyoto Encyclopedia of Genes and Genomes.
Figure 4.Functional enrichment analysis of downregulated genes (A. Biological processes; B. Cellular components; C. Molecular functions; D. KEGG pathway enrichment). DEG, differentially expressed gene; KEGG, Kyoto Encyclopedia of Genes and Genomes.
Figure 5.Protein-protein interaction (PPI) network for DEGs in HER2+ breast cancer (red nodes represent upregulated genes, while blue nodes represent downregulated genes).
Figure 6.Module 1, comprising 61 nodes (all of which are upregulated genes) and 1,715 edges.
GO and KEGG Analyses for Module 1.
| Category | GOID | GO term | Count | Term P value | Associated genes found |
|---|---|---|---|---|---|
| GO_BP | GO:1903047 | mitotic cell cycle process | 44 | 5.4E-47 | [ANLN, AURKA, AURKB, BIRC5, BUB1B, CCNA2, CCNB1, CCNB2, CCNE2, CDC20, CDCA5, CDK1, CDKN3, CENPF, CENPK, CEP55, CKS2, DLGAP5, ECT2, GTSE1, KIF11, KIF14, KIF20A, KIF23, KIF2C, KIF4A, MAD2L1, MELK, NCAPG, NDC80, NUF2, NUSAP1, PKMYT1, PRC1, RACGAP1, RRM2, TACC3, TOP2A, TPX2, TRIP13, TTK, TYMS, UBE2C, ZWINT] |
| GO_BP | GO:0000280 | nuclear division | 31 | 1.7E-37 | [ASPM, AURKA, AURKB, BIRC5, BUB1B, CCNB1, CCNE2, CDC20, CDCA5, CENPK, CKS2, DLGAP5, KIF11, KIF14, KIF23, KIF2C, KIF4A, MAD2L1, NCAPG, NDC80, NUF2, NUSAP1, PRC1, PTTG1, RACGAP1, TOP2A, TPX2, TRIP13, TTK, UBE2C, ZWINT] |
| GO_BP | GO:0051983 | regulation of chromosome segregation | 18 | 2.5E-24 | [AURKB, BUB1B, CCNB1, CDC20, CDCA5, CENPF, DLGAP5, ECT2, KIF2C, MAD2L1, MKI67, NDC80, PTTG1, RACGAP1, TACC3, TRIP13, TTK, UBE2C] |
| GO_BP | GO:0007088 | regulation of mitotic nuclear division | 20 | 1.8E-23 | [ANLN, AURKA, AURKB, BUB1B, CCNB1, CDC20, CDCA5, CENPF, DLGAP5, KIF11, MAD2L1, MKI67, NDC80, NUSAP1, PKMYT1, PTTG1, TACC3, TRIP13, TTK, UBE2C] |
| GO_CC | GO:0005819 | spindle | 27 | 6.49E-28 | [ASPM, AURKA, AURKB, BIRC5, BUB1B, CCNB1, CDC20, CDK1, CENPF, DLGAP5, ECT2, FAM83D, KIF11, KIF14, KIF20A, KIF23, KIF2C, KIF4A, MAD2L1, NCAPG, NUSAP1, PRC1, RACGAP1, SHCBP1, TACC3, TPX2, TTK] |
| GO_CC | GO:0000779 | condensed chromosome, centromeric region | 15 | 5.24E-19 | [AURKA, AURKB, BIRC5, BUB1B, CCNB1, CENPF, CENPK, CENPU, HJURP, KIF2C, MAD2L1, NCAPG, NDC80, NUF2, ZWINT] |
| GO_CC | GO:0005876 | spindle microtubule | 8 | 1.59E-10 | [AURKA, AURKB, BIRC5, CDK1, KIF11, KIF4A, NUSAP1, PRC1] |
| GO_CC | GO:0000307 | cyclin-dependent protein kinase holoenzyme complex | 6 | 2.88E-08 | [CCNA2, CCNB1, CCNB2, CCNE2, CDK1, CKS2] |
| GO_CC | GO:0097149 | centralspindlin complex | 3 | 3.07E-08 | [ECT2, KIF23, RACGAP1] |
| GO_MF | GO:0003777 | microtubule motor activity | 6 | 9.01E-07 | [KIF11, KIF14, KIF20A, KIF23, KIF2C, KIF4A] |
| GO_MF | GO:0035173 | histone kinase activity | 4 | 3.47E-06 | [AURKA, AURKB, CCNB1, CDK1] |
| GO_MF | GO:0016538 | cyclin-dependent protein serine/threonine kinase regulator activity | 5 | 3.81E-06 | [CCNA2, CCNB1, CCNB2, CCNE2, CKS2] |
| GO_MF | GO:0097472 | cyclin-dependent protein kinase activity | 3 | 3.84E-04 | [CCNA2, CDK1, CDKN3] |
| KEGG | KEGG:04110 | Cell cycle | 11 | 8.36E-14 | [BUB1B, CCNA2, CCNB1, CCNB2, CCNE2, CDC20, CDK1, MAD2L1, PKMYT1, PTTG1, TTK] |
| KEGG | KEGG:04115 | p53 signaling pathway | 6 | 1.10E-07 | [CCNB1, CCNB2, CCNE2, CDK1, GTSE1, RRM2] |
| KEGG | KEGG:00240 | Pyrimidine metabolism | 3 | 8.88E-04 | [RRM2, TK1, TYMS] |
Figure 7.Venn diagram of key genes.
Ten Key Genes Identified in HER2+ Breast Cancer.
| Rank | Gene symbol | Degree |
|---|---|---|
| 1 | CCNA2 | 69 |
| 1 | CDK1 | 69 |
| 3 | CDC20 | 68 |
| 3 | CCNB1 | 68 |
| 5 | DLGAP5 | 66 |
| 5 | AURKA | 66 |
| 5 | BUB1B | 66 |
| 8 | RRM2 | 65 |
| 8 | TPX2 | 65 |
| 8 | MAD2L1 | 65 |
Figure 8.Survival analysis of 5 key genes associated with the prognosis of HER2+ breast cancer with GSE19783 dataset.
Figure 9.Survival analysis of 5 key genes associated with the prognosis of HER2+ breast cancer with GSE21653 dataset.
Figure 10.ROC curve of 5 key genes in predicting 5-year survival rate.
Figure 11.Expression of 5 key genes associated with the prognosis of HER2+ breast cancer (gene expression in breast cancer tissues and normal tissues is marked in red and black, respectively).
Figure 12.Immunohistochemistry of genes in breast cancer and normal tissues.
Figure 13.Logistic regression analysis (A. Nomogram;B.Calibration graph).
Figure 14.Oncomine analysis of TPX2 and DLGAP5 mRNA expression levels (A.TPX2 expression values in different cancer types; B. Expression of TPX2 in different HER2 immunohistochemical levels; C. Expression of TPX2 in patients with breast cancer at different status levels. D. DLGAP5 expression values in different cancer types; E. Expression of DLGAP5atdifferent HER2 immunohistochemical levels; F. Expression of DLGAP5 in patients with breast cancer with different status levels).
Figure 15.TPX2 variation in breast cancer (A.TPX2 mutation types in breast cancer; B. actions of molecules downstream of TPX2; C. scatter plots of TPX2 gene and copy number changes. D. DLGAP5 mutation types in breast cancer; E. actions of molecules downstream to DLGAP5; F. scatter plots of DLGAP5 gene and copy number changes).