| Literature DB >> 34541602 |
Lijian Zhang1,2,3, Luxuan Wang4, Hua Yang5, Chunhui Li2, Chuan Fang1,2,3.
Abstract
Brain metastases (BMs) usually develop in breast cancer (BC) patients. Thus, the molecular mechanisms of breast cancer brain metastasis (BCBM) are of great importance in designing therapeutic strategies to treat or prevent BCBM. The present study attempted to identify novel diagnostic and prognostic biomarkers of BCBM. Two datasets (GSE125989 and GSE100534) were obtained from the Gene Expression Omnibus (GEO) database to find differentially expressed genes (DEGs) in cases of BC with and without brain metastasis (BM). A total of 146 overlapping DEGs, including 103 up-regulated and 43 down-regulated genes, were identified. Functional enrichment analysis showed that these DEGs were mainly enriched for functions including extracellular matrix (ECM) organization and collagen catabolic fibril organization. Using protein-protein interaction (PPI) and principal component analysis (PCA) analysis, we identified ten key genes, including LAMA4, COL1A1, COL5A2, COL3A1, COL4A1, COL5A1, COL5A3, COL6A3, COL6A2, and COL6A1. Additionally, COL5A1, COL4A1, COL1A1, COL6A1, COL6A2, and COL6A3 were significantly associated with the overall survival of BC patients. Furthermore, COL6A3, COL5A1, and COL4A1 were potentially correlated with BCBM in human epidermal growth factor 2 (HER2) expression. Additionally, the miR-29 family might participate in the process of metastasis by modulating the cancer microenvironment. Based on datasets in the GEO database, several DEGs have been identified as playing potentially important roles in BCBM in BC patients.Entities:
Keywords: Brain metastasis; bioinformatics; tumor microenvironments
Mesh:
Substances:
Year: 2021 PMID: 34541602 PMCID: PMC8521534 DOI: 10.1042/BSR20211615
Source DB: PubMed Journal: Biosci Rep ISSN: 0144-8463 Impact factor: 3.840
Figure 1Identification of DEGs related to BCBM in BC patients
(A) Volcano map of DEGs in GSE100534. X-axis: log2FC; Y-axis: −log10 (FDR). Blue: down-regulated genes; red: up-regulated genes; green: non-significant. (B) Heat map of DEGs in GSE100534. X-axis: sample; Y-axis: different genes. Red: high expression; blue: low expression. BCBM and BC samples could be obviously clustered into two clusters. (C) Volcano map of DEGs in GSE125989. X-axis: Log2FC; Y-axis: −log10 (FDR). Blue: down-regulated genes; red: up-regulated genes; green: non-significant. (D) Heat map of DEGs in GSE125989. X-axis: sample; Y-axis: different genes. Red: high expression; blue: low expression. BCBM and BC samples could be obviously clustered into two clusters. (E) Venn diagram of overlapped DEGs between GSE100534 and GSE125989.
Figure 2PPI network and top module of 146 DEGs
(A) PPI network of DEGs in light blue and top one module in orange. (B) Top ten hub genes.
Figure 3PCA of DEGs
The variance in gene expression between BCBM group and BC group.
GO functional enrichment analysis for the DEGs
| Term | Category | UniProt ID | |
|---|---|---|---|
|
| |||
| Extracellular matrix organization | GO:0030198 | 2.66E-15 | COL1A1, COL3A1, COL14A1, LAMA4, COL6A2, COL5A3, COL5A2, COL6A1, COL6A3 |
| Collagen fibril organization | GO:0030199 | 3.10E-09 | COL1A1, COL3A1, COL14A1, COL5A3, COL5A2 |
| Cellular response to amino acid stimulus | GO:0071230 | 1.71E-06 | COL1A1, COL3A1, COL5A2, COL6A1 |
| Cell adhesion | GO:0007155 | 1.72E-06 | COL1A1, COL15A1, LAMA4, COL6A2, COL6A1, COL6A3 |
| Protein heterotrimerization | GO:0070208 | 2.32E-05 | COL1A1, COL6A2, COL6A1 |
| Skin development | GO:0043588 | 1.42E-04 | COL3A1, COL5A3, COL5A2 |
| Skeletal system development | GO:0001501 | 0.00229126 | COL1A1, COL3A1, COL5A2 |
| Blood vessel development | GO:0001568 | 0.020188218 | COL1A1, LAMA4 |
| Cell–matrix adhesion | GO:0007160 | 0.047226948 | COL3A1, COL5A3 |
| Osteoblast differentiation | GO:0001649 | 0.054392332 | COL1A1, COL6A1 |
| Platelet activation | GO:0030168 | 0.059988631 | COL1A1, COL3A1 |
| Regulation of immune response | GO:0050776 | 0.091476749 | COL1A1, COL3A1 |
|
| |||
| Extracellular matrix structural constituent | GO:0005201 | 2.59E-13 | COL1A1, COL3A1, COL15A1, COL14A1, LAMA4, COL5A3, COL5A2 |
| Platelet-derived growth factor binding | GO:0048407 | 1.39E-05 | COL1A1, COL3A1, COL6A1 |
| SMAD binding | GO:0046332 | 0.022698305 | COL3A1, COL5A2 |
| Collagen binding | GO:0005518 | 0.03154496 | COL14A1, COL5A3 |
|
| |||
| Endoplasmic reticulum lumen | GO:0005788 | 2.66E-15 | COL1A1, COL3A1, COL15A1, COL6A2, COL5A3, COL5A2, COL6A1, COL6A3 |
| Collagen trimer | GO:0005581 | 3.10E-09 | COL1A1, COL3A1, COL14A1, LAMA4, COL6A2, COL5A3, COL5A2, COL6A1, COL6A3 |
| Extracellular matrix | GO:0031012 | 1.71E-06 | COL1A1, COL3A1, COL14A1, COL5A3, COL5A2 |
| Extracellular region | GO:0005576 | 1.72E-06 | COL1A1, COL3A1, COL5A2, COL6A1 |
| Proteinaceous extracellular matrix | GO:0005578 | 2.32E-05 | COL1A1, COL15A1, LAMA4, COL6A2, COL6A1, COL6A3 |
| Extracellular space | GO:0005615 | 1.42E-04 | COL1A1, COL6A2, COL6A1 |
| Extracellular exosome | GO:0070062 | 0.00229126 | COL3A1, COL5A3, COL5A2 |
| Sarcolemma | GO:0042383 | 0.020188218 | COL1A1, COL3A1, COL5A2 |
| Collagen type V trimer | GO:0005588 | 0.047226948 | COL1A1, LAMA4 |
| Collagen type VI trimer | GO:0005589 | 2.98E-16 | COL3A1, COL5A3 |
Category refers to the GO functional categories.
Figure 4GO enrichment and KEGG pathway analysis of DEGs in BC and BCBM group
(A) GO categories of MF. (B) GO categories of CC. (C) GO categories of BP. (D) KEGG pathway analysis of the DEGs.
Pathway enrichment analysis for the DEGs
| Term | Category | UniProt ID | |
|---|---|---|---|
| Protein digestion and absorption | hsa04974 | 4.62E-15 | COL1A1, COL3A1, COL15A1, COL14A1, COL6A2, COL5A3, COL5A2, COL6A1, COL6A3 |
| ECM–receptor interaction | hsa04512 | 1.43E-12 | COL1A1, COL3A1, LAMA4, COL6A2, COL5A3, COL5A2, COL6A1, COL6A3 |
| Focal adhesion | hsa04510 | 6.68E-10 | COL1A1, COL3A1, LAMA4, COL6A2, COL5A3, COL5A2, COL6A1, COL6A3 |
| PI3K-Akt signaling pathway | hsa04151 | 2.48E-08 | COL1A1, COL3A1, LAMA4, COL6A2, COL5A3, COL5A2, COL6A1, COL6A3 |
| Amoebiasis | hsa05146 | 6.33E-06 | COL1A1, COL3A1, LAMA4, COL5A3, COL5A2 |
| Platelet activation | hsa04611 | 5.10E-04 | COL1A1, COL3A1, COL5A3, COL5A2 |
Category refers to the pathway functional categories.
Figure 5Prognostic value of ten genes
Figure 6Correlation between HER2 and genes
Figure 7The TF regulatory network
(A) The TF regulatory network of the top ten hub genes. Pink circle means the hub gene and blue circle means the transcription. (B) The TF regulatory network of selected DEGs (COL4A1, COL5A1, and COL6A3). The pink circle means the selected DEGs and blue square means the miRNAs.
The potential TFs of ten hub genes
| Genes | TFs | Counts |
|---|---|---|
|
| ATF71IP, BRD7, USF2, CREB1, GATA1, TP53, GATA2, GATA3, MED31, ZHX1, EEF1A1 | 11 |
|
| SP3, JUN, MYB, NFKB1, POU2F1, RELA, SP1FLI1, EGR1, SMAD1, WFIC | 10 |
|
| MEF2A, FOXO3B, TBP, PBX1, FOXO1, FOXO4, FOXO3, REL | 8 |
|
| SMAD1, IRF1, NFIC, SMAD3, JUN | 5 |
|
| STAT2, ZEB1, CTCF, STAT1 | 4 |
|
| IILF, REL, PPARG, ATF2 | 4 |
|
| REST, RREB1, MEIS1, TFAP4 | 4 |
|
| ZBTB6, SMAD3, WFIC | 3 |
|
| NFIC, TFAP2A | 2 |
|
| SMAD3 | 1 |
Figure 8Model diagnosis diagram, Residuals vs. Leverage diagram
The red dotted line presented the COOK distance, and the points with COOK distance > 0.5 were considered as influential points, which would affect the reliability of the model. There were no influential points in the model.
Figure 9The component plus residual plot of the three genes included in the model
A relatively obvious linear relationship between the horizontal axis and the vertical axis in the graph could be observed, which indicated that the independent variables were suitable for being included in the model.
Figure 10The ROC analysis results of the model