Literature DB >> 34541602

Identification of potential genes related to breast cancer brain metastasis in breast cancer patients.

Lijian Zhang1,2,3, Luxuan Wang4, Hua Yang5, Chunhui Li2, Chuan Fang1,2,3.   

Abstract

Brain metastases (BMs) usually develop in breast cancer (BC) patients. Thus, the molecular mechanisms of breast cancer brain metastasis (BCBM) are of great importance in designing therapeutic strategies to treat or prevent BCBM. The present study attempted to identify novel diagnostic and prognostic biomarkers of BCBM. Two datasets (GSE125989 and GSE100534) were obtained from the Gene Expression Omnibus (GEO) database to find differentially expressed genes (DEGs) in cases of BC with and without brain metastasis (BM). A total of 146 overlapping DEGs, including 103 up-regulated and 43 down-regulated genes, were identified. Functional enrichment analysis showed that these DEGs were mainly enriched for functions including extracellular matrix (ECM) organization and collagen catabolic fibril organization. Using protein-protein interaction (PPI) and principal component analysis (PCA) analysis, we identified ten key genes, including LAMA4, COL1A1, COL5A2, COL3A1, COL4A1, COL5A1, COL5A3, COL6A3, COL6A2, and COL6A1. Additionally, COL5A1, COL4A1, COL1A1, COL6A1, COL6A2, and COL6A3 were significantly associated with the overall survival of BC patients. Furthermore, COL6A3, COL5A1, and COL4A1 were potentially correlated with BCBM in human epidermal growth factor 2 (HER2) expression. Additionally, the miR-29 family might participate in the process of metastasis by modulating the cancer microenvironment. Based on datasets in the GEO database, several DEGs have been identified as playing potentially important roles in BCBM in BC patients.
© 2021 The Author(s).

Entities:  

Keywords:  Brain metastasis; bioinformatics; tumor microenvironments

Mesh:

Substances:

Year:  2021        PMID: 34541602      PMCID: PMC8521534          DOI: 10.1042/BSR20211615

Source DB:  PubMed          Journal:  Biosci Rep        ISSN: 0144-8463            Impact factor:   3.840


Background

Breast cancer (BC) is the most commonly diagnosed cancer in women [1]. The central nervous system (CNS) is one of the most common sites to which BC usually spreads, along with the bones, lungs, and liver. Metastasis of cancer to the brain has potentially devastating clinical consequences, resulting in an estimated survival time of less than 1 year, despite recent breakthroughs in neurologic therapies [2]. CNS metastases occur as a distant recurrence of BC. When cancer cells metastasize to the brain, patients’ prognoses are poor, and they have few therapeutic options [3]. Surgical resection of single brain metastasis (BM) is considered a standard treatment for patients with metastases in accessible locations, good functional status, and absent/controlled extracranial disease. Overall surgical mortality is approximately 0.7–1.9%, while neurological morbidity is 3.9–6% [4]. The rate of postoperative complications, including pneumonia, urinary infections, and venous thrombosis, is approximately 13.9% [5,6]. Further, chemotherapy drugs targeting BC cells in the brain are not effective, because they cannot cross the blood–brain barrier [7,8]. In light of the poor outcomes and limited treatment options following a diagnosis of breast cancer brain metastasis (BCBM), a better understanding of the mechanisms may help improve clinical decision-making. The development of novel biomarkers for BCBM would not only provide deeper insights into its pathology but would also help provide sensitive and efficient approaches for diagnosis and treatment. Previous studies have documented that tumor BM is a complicated process, including steps such as the detachment of cells from the primary tumor, invasion of the extracellular matrix (ECM), travel through the bloodstream to arrest at a secondary site, and extravasation from the vasculature to establish metastasis in a new organ [9]. Numerous animal experiments and clinical trials appear to support Paget’s ‘seed and soil’ hypothesis for non-random metastatic spread [10,11]. Although the initial steps for the metastasis of primary tumor cells may significantly overlap, the circulation patterns, extravasation barriers, and potential to survive in foreign tissue varies with the molecular subtype of the primary tumor [12]. Molecular cross-talk between primary BC and other cells in the surrounding microenvironment, as well as the role of primary BC cells in modulating the brain microenvironment during metastasis, are also key factors driving metastatic outgrowth [13,14]. Evidence has shown that, prior to reaching the brain, some metastatic primary BC cells already express proteins that are essential for the establishment of brain metastases, including serpins [15], matrix metalloproteases [16], and αB-crystallin [17]. BCBM cells overexpress bone morphogenic proteins (BMPs) to promote the differentiation of neural progenitor cells (NPCs) into astrocytes and create a permissive microenvironment which allows cancer cells to colonize and proliferate [18]. Until now, extensive work in identifying driver genes of BCBM has provided some potential explanations for its metastasis to the brain. However, the underlying mechanisms involved in the invasion and metastatic outgrowth of BC cells into the CNS are largely unknown, and a novel option for the prevention or treatment of BM is unavailable [19]. With the development of genomics and bioinformatics, large amount of BC-related data are publicly available in several databases. Bioinformatics is a useful tool for extracting valuable information from existing data at a relatively low cost, and can help identify novel diagnostic biomarkers of BCBM [20,21]. In this study, by screening differentially expressed genes (DEGs) between BC patients with and without BM in two individual Gene Expression Omnibus (GEO) datasets, we aimed to explore potentially important genes associated with BCBM, and to investigate more information about the microenvironmental influence on BCBM development. The present study may help contribute to preventing BCBM at early stages in the future.

Materials and methods

Microarray data

Datasets were obtained from the National Center of Biotechnology Information (NCBI) GEO (https://www.ncbi.nlm.nih.gov/geo/) database. Two datasets (GSE125989 [22] and GSE100534 [23]) were used in the present study. The samples in the GSE125989 and GSE100534 datasets were detected using the Affymetrix Human Genome U133A 2.0 Array and the Affymetrix Human Gene 1.0 ST Array, respectively.

Data processing

The GEO2R (http://www.ncbi.nlm.nih.gov/geo/geo2r/) based on R language limma package was used to analyze the gene expression data and to identify DEGs [24]. Those genes with |logFC| > 1 and P-values <0.05 were selected as DEGs (BC without BM vs. BCBM). There are 16 BC samples and 16 BCBM samples in the GSE125989 dataset, and there are 16 BC samples and 3 BCBM samples in the GSE100534 dataset (after removing the primary meningioma samples). To visualize the overlapping DEGs in both GSE125989 and GSE100534, a Venn diagram was generated using a freely accessible online tool (http://bioinformatics.psb.ugent.be/webtools/Venn/).

Bioinformatics analysis of DEGs

In the Gene Ontology (GO) database, gene functions are divided into three parts, including cellular components (CCs), biological processes (BPs), and molecular functions (MFs). The Kyoto Encyclopedia of Genes and Genomes (KEGG) is a database integrating genome, chemistry, and system function information, which aims to reveal the genetic and chemical blueprint of life. It is a system function knowledge base that is based on computable experiments. We used the The Database for Annotation, Visualization and Integrated Discovery (DAVID) database for GO and KEGG enrichment analyses (http://david.abcc.ncifcrf.gov/). Functional protein–protein interaction (PPI) analysis is important for interpreting the potential molecular mechanisms of key cellular activities in pathogenicity. The Search Tool for the Retrieval of Interacting Genes (STRING) database (http://string-db.org) was used to construct PPI networks of high and low expression genes. An interaction score of 0.4 was treated as the cut-off criteria, and Cytoscape (http://www.cytoscape.org/) was used to visualize the PPI network. Hub genes with connection degrees > 9 were selected. Then, the Molecular Complex Detection (MCODE) in the Cytoscape software was used to obtain the modules within the PPI network with MCODE scores > 2 and number of nodes > 4. Functional enrichment analysis of the genes in each module was also completed using Metascape [25]. Finally, a principal component analysis (PCA) model was visualized for clustering trends based on the variation in the phase of injury using the ‘principal’ function in the R software [26].

Transcription factors and expression correlation analyses

To elucidate the selected DEGs expression regulation on a systematic level, a transcription factor (TF)–miRNA coregulatory network was generated using NetworkAnalyst based on the TF-miRNA coregulatory interaction database. The expression correlation analysis was conducted using Gene Expression Profiling Interactive Analysis (GEPIA), which was based on available The Cancer Genome Atlas (TCGA) datasets. Pearson correlation coefficients and P-values were calculated (http://gepia.cancer‐pku.cn/index.html) [27].

Survival analysis

To evaluate the effects of hub genes on the overall survival of BC patients, the publicly available Kaplan–Meier Plotter (http://kmplot.com/analysis/) was used. According to the median expression levels, the patients were classified into high and low expression groups [28]. Differences between the high and low expression groups were assessed using log‐rank tests. P<0.05 was considered to indicate statistically significant difference.

Construction of BCBM prediction model

Here, we classified all samples into two sets, namely the samples with BM and samples without BM, adopted the expression of genes as a continuous predictor variable by glm package in R software [29]. We set the expression level of the genes as an independent variable, and whether it developed BM as the dependent variable to construct the logistic regression model. Firstly, the expression levels of COL4A1, COL5A1, and COL6A3 were extracted from GSE125989 and GSE100534 datasets. In each dataset, z-score normalization (∼N (0,1) normal distribution) was conducted, and then we merged the two datasets. Subsequently, the z-scores of these genes were used for building a logistic regression prediction model to predict whether BM would develop in BC patients. And the accuracy of the model was determined by receiver operating characteristic (ROC) curve.

Results

Identification of DEGs related to BCBM in BC patients

In order to screen the genes potentially associated with BCBM in BC patients, DEG analysis was performed between BC specimens (without BM) and BCBM specimens, which were conducted in GSE100534 and GSE125989, respectively. In the GSE100534 dataset, BCBM specimens were compared with BC specimens. A total of 2485 DEGs were obtained, including 971 up-regulated and 1593 down-regulated (Figure 1A,B). In the GSE125989 dataset, BCBM specimens were compared with BC specimens. A total of 1417 DEGs were obtained, including 984 up-regulated and 433 down-regulated (Figure 1C,D). A total of 146 overlapping genes (103 up-regulated and 43 down-regulated) were obtained between the two datasets (Figure 1E), which we further analyzed in our research.
Figure 1

Identification of DEGs related to BCBM in BC patients

(A) Volcano map of DEGs in GSE100534. X-axis: log2FC; Y-axis: −log10 (FDR). Blue: down-regulated genes; red: up-regulated genes; green: non-significant. (B) Heat map of DEGs in GSE100534. X-axis: sample; Y-axis: different genes. Red: high expression; blue: low expression. BCBM and BC samples could be obviously clustered into two clusters. (C) Volcano map of DEGs in GSE125989. X-axis: Log2FC; Y-axis: −log10 (FDR). Blue: down-regulated genes; red: up-regulated genes; green: non-significant. (D) Heat map of DEGs in GSE125989. X-axis: sample; Y-axis: different genes. Red: high expression; blue: low expression. BCBM and BC samples could be obviously clustered into two clusters. (E) Venn diagram of overlapped DEGs between GSE100534 and GSE125989.

Identification of DEGs related to BCBM in BC patients

(A) Volcano map of DEGs in GSE100534. X-axis: log2FC; Y-axis: −log10 (FDR). Blue: down-regulated genes; red: up-regulated genes; green: non-significant. (B) Heat map of DEGs in GSE100534. X-axis: sample; Y-axis: different genes. Red: high expression; blue: low expression. BCBM and BC samples could be obviously clustered into two clusters. (C) Volcano map of DEGs in GSE125989. X-axis: Log2FC; Y-axis: −log10 (FDR). Blue: down-regulated genes; red: up-regulated genes; green: non-significant. (D) Heat map of DEGs in GSE125989. X-axis: sample; Y-axis: different genes. Red: high expression; blue: low expression. BCBM and BC samples could be obviously clustered into two clusters. (E) Venn diagram of overlapped DEGs between GSE100534 and GSE125989.

PPI network construction and Hub gene selection

We next constructed a PPI network based on the 146 overlapping DEGs using the STRING database, and visualized it using Cytoscape software. There were 64 nodes and 174 edges in the PPI network (Figure 2A). Among these, the top ten genes with the highest interaction degrees were then identified, including Collagen Type I α 1 Chains (COL1A1), Collagen Type III α 1 Chains (COL3A1), Collagen Type V α 2 Chains (COL5A2), Collagen Type V α 3 Chains (COL5A3), Collagen Type VI α 1 Chains (COL6A1), Collagen Type VI α 2 Chains (COL6A2), Collagen Type VI α 3 Chains (COL6A3), Collagen Type XIV α 1 Chains (COL14A1), Collagen Type XV, α 1 (COL15A1), and Laminin Subunit α 4 (LAMA4) (Figure 2B).
Figure 2

PPI network and top module of 146 DEGs

(A) PPI network of DEGs in light blue and top one module in orange. (B) Top ten hub genes.

PPI network and top module of 146 DEGs

(A) PPI network of DEGs in light blue and top one module in orange. (B) Top ten hub genes.

PCA of the 146 DEGs

We performed PCA based on the expression profiles of DEGs to find DEGs that were highly related to the development of BCBM (Figure 3). Results showed that the first principal component (PC1) accounted for 60.8% of the total variance. The second principal component (PC2) accounted for an additional 11.4% of the total variance. Finally, changes in COL6A1, COL5A3, COL5A1, COL5A2, COL6A1, COL3A, COL6A1, COL14A1, and LAMA4 contributed the most to the separation between the BCBM and BC samples, which were also within top modules of the 146 DEGs. All of these genes are associated with ECM organization, the collagen catabolic process, and cellular adhesion.
Figure 3

PCA of DEGs

The variance in gene expression between BCBM group and BC group.

PCA of DEGs

The variance in gene expression between BCBM group and BC group.

KEGG pathway and GO term enrichment analyses

We next performed functional enrichment analyses on the DEGs to obtain more information about the genes. Detailed results of GO enrichment analysis are shown in Table 1. The most significantly enriched MF terms were ECM organization, collagen catabolic process, and cellular adhesion (Figure 4A). The most noteworthy enriched CC terms were extracellular region and endoplasmic reticulum lumen ECM (Figure 4B). The most significantly enriched BP terms were ECM structural constituent, platelet-derived growth factor binding, and SMAD binding (Figure 4C). Detailed results of KEGG enrichment analysis are also shown in Table 2. The top six KEGG pathways are displayed in Figure 4D, including protein digestion and absorption, the PI3K-Akt signaling pathway, the Focal adhesion pathway, and ECM–receptor interactions.
Table 1

GO functional enrichment analysis for the DEGs

TermCategoryP-valueUniProt ID
BP
Extracellular matrix organizationGO:00301982.66E-15COL1A1, COL3A1, COL14A1, LAMA4, COL6A2, COL5A3, COL5A2, COL6A1, COL6A3
Collagen fibril organizationGO:00301993.10E-09COL1A1, COL3A1, COL14A1, COL5A3, COL5A2
Cellular response to amino acid stimulusGO:00712301.71E-06COL1A1, COL3A1, COL5A2, COL6A1
Cell adhesionGO:00071551.72E-06COL1A1, COL15A1, LAMA4, COL6A2, COL6A1, COL6A3
Protein heterotrimerizationGO:00702082.32E-05COL1A1, COL6A2, COL6A1
Skin developmentGO:00435881.42E-04COL3A1, COL5A3, COL5A2
Skeletal system developmentGO:00015010.00229126COL1A1, COL3A1, COL5A2
Blood vessel developmentGO:00015680.020188218COL1A1, LAMA4
Cell–matrix adhesionGO:00071600.047226948COL3A1, COL5A3
Osteoblast differentiationGO:00016490.054392332COL1A1, COL6A1
Platelet activationGO:00301680.059988631COL1A1, COL3A1
Regulation of immune responseGO:00507760.091476749COL1A1, COL3A1
MF
Extracellular matrix structural constituentGO:00052012.59E-13COL1A1, COL3A1, COL15A1, COL14A1, LAMA4, COL5A3, COL5A2
Platelet-derived growth factor bindingGO:00484071.39E-05COL1A1, COL3A1, COL6A1
SMAD bindingGO:00463320.022698305COL3A1, COL5A2
Collagen bindingGO:00055180.03154496COL14A1, COL5A3
CC
Endoplasmic reticulum lumenGO:00057882.66E-15COL1A1, COL3A1, COL15A1, COL6A2, COL5A3, COL5A2, COL6A1, COL6A3
Collagen trimerGO:00055813.10E-09COL1A1, COL3A1, COL14A1, LAMA4, COL6A2, COL5A3, COL5A2, COL6A1, COL6A3
Extracellular matrixGO:00310121.71E-06COL1A1, COL3A1, COL14A1, COL5A3, COL5A2
Extracellular regionGO:00055761.72E-06COL1A1, COL3A1, COL5A2, COL6A1
Proteinaceous extracellular matrixGO:00055782.32E-05COL1A1, COL15A1, LAMA4, COL6A2, COL6A1, COL6A3
Extracellular spaceGO:00056151.42E-04COL1A1, COL6A2, COL6A1
Extracellular exosomeGO:00700620.00229126COL3A1, COL5A3, COL5A2
SarcolemmaGO:00423830.020188218COL1A1, COL3A1, COL5A2
Collagen type V trimerGO:00055880.047226948COL1A1, LAMA4
Collagen type VI trimerGO:00055892.98E-16COL3A1, COL5A3

Category refers to the GO functional categories.

Figure 4

GO enrichment and KEGG pathway analysis of DEGs in BC and BCBM group

(A) GO categories of MF. (B) GO categories of CC. (C) GO categories of BP. (D) KEGG pathway analysis of the DEGs.

Table 2

Pathway enrichment analysis for the DEGs

TermCategoryP-valueUniProt ID
Protein digestion and absorptionhsa049744.62E-15COL1A1, COL3A1, COL15A1, COL14A1, COL6A2, COL5A3, COL5A2, COL6A1, COL6A3
ECM–receptor interactionhsa045121.43E-12COL1A1, COL3A1, LAMA4, COL6A2, COL5A3, COL5A2, COL6A1, COL6A3
Focal adhesionhsa045106.68E-10COL1A1, COL3A1, LAMA4, COL6A2, COL5A3, COL5A2, COL6A1, COL6A3
PI3K-Akt signaling pathwayhsa041512.48E-08COL1A1, COL3A1, LAMA4, COL6A2, COL5A3, COL5A2, COL6A1, COL6A3
Amoebiasishsa051466.33E-06COL1A1, COL3A1, LAMA4, COL5A3, COL5A2
Platelet activationhsa046115.10E-04COL1A1, COL3A1, COL5A3, COL5A2

Category refers to the pathway functional categories.

GO enrichment and KEGG pathway analysis of DEGs in BC and BCBM group

(A) GO categories of MF. (B) GO categories of CC. (C) GO categories of BP. (D) KEGG pathway analysis of the DEGs. Category refers to the GO functional categories. Category refers to the pathway functional categories. To investigate the impacts of those ten hub genes on the survival rates of BC patients, we used Kaplan–Meier curve analysis. The median expression level was regarded as the cut‐off value for dividing the patients into two groups (high and low expression groups). Results showed that high expression of COL14A1 (HR = 0.56 [0.4–0.79], P=0.00081), COL6A2 (HR = 0.74 [0.6–0.93], P=0.0082), COL6A1 (HR = 0.59 [0.45–0.78], P=0.00012), COL5A1 (HR = 0.76 [0.59–0.98], P=0.034), and collagen type I α 1 chain (COL1A1) (HR = 0.8 [0.65–0.99], P=0.044) were associated with poor overall survival rates for BC patients. LAMA4, COL3A1, and COL5A1 expression had no significant influence on the overall survival rates of BC patients (P>0.05) (Figure 5). Moreover, correlation analysis results indicated that the expression of COL4A1, COL5A1, and COL6A3 was positively correlated with HER2 positive expression (Figure 6).
Figure 5

Prognostic value of ten genes

Figure 6

Correlation between HER2 and genes

TF regulatory network analysis of ten genes

We constructed a gene–TF regulatory network based on the ten identified genes—which included 56 interaction pairs between 10 genes and 45 TFs (Figure 7A). LAMA4 was found to be regulated by 11 TFs, COL1A1 was regulated by 10 TFs, COL5A2 was regulated by 8 TFs, COL3A1 was regulated by 5 TFs, COL4A1 was regulated by 4 TFs, COL5A1 was regulated by 4 TFs, COL5A3 was regulated by 4 TFs, COL6A3 was regulated by 3 TFs, and COL6A2 was regulated by 2 TFs. We further investigated the miRNAs that could regulate COL4A1, COL5A1, and COL6A3 (Table 3). Our findings suggested that the expression of these three genes was significantly correlated with each other, and that they were all also under the regulation of miRNA-29 family, including miR-29a, miR-29b, and miR-29c (Figure 7B).
Figure 7

The TF regulatory network

(A) The TF regulatory network of the top ten hub genes. Pink circle means the hub gene and blue circle means the transcription. (B) The TF regulatory network of selected DEGs (COL4A1, COL5A1, and COL6A3). The pink circle means the selected DEGs and blue square means the miRNAs.

Table 3

The potential TFs of ten hub genes

GenesTFsCounts
LAMA4 ATF71IP, BRD7, USF2, CREB1, GATA1, TP53, GATA2, GATA3, MED31, ZHX1, EEF1A111
COL1A1 SP3, JUN, MYB, NFKB1, POU2F1, RELA, SP1FLI1, EGR1, SMAD1, WFIC10
COL5A2 MEF2A, FOXO3B, TBP, PBX1, FOXO1, FOXO4, FOXO3, REL8
COL3A1 SMAD1, IRF1, NFIC, SMAD3, JUN5
COL4A1 STAT2, ZEB1, CTCF, STAT14
COL5A1 IILF, REL, PPARG, ATF24
COL5A3 REST, RREB1, MEIS1, TFAP44
COL6A3 ZBTB6, SMAD3, WFIC3
COL6A2 NFIC, TFAP2A2
COL6A1 SMAD31

The TF regulatory network

(A) The TF regulatory network of the top ten hub genes. Pink circle means the hub gene and blue circle means the transcription. (B) The TF regulatory network of selected DEGs (COL4A1, COL5A1, and COL6A3). The pink circle means the selected DEGs and blue square means the miRNAs.

Construction and validation of BCBM prediction model

Using the three most important genes (COL4A1, COL5A1, and COL6A3), we constructed a logistic regression prediction model (Y = 0.2987 × COL5A1 + 0.9790 × COL4A1 − 2.6080 × COL6A3 − 0.6142), using the glm function in the R language. When Y ≥ 0.5, the model predicts that BM will occur in BC patients. Furthermore, we found that there was no extreme point affecting the accuracy of the model (Figure 8). All points in the model had good linear relationships (Figure 9). Further, the area under the curve (AUC = 0.91) indicated that the logistic regression prediction model—which was based on the three genes—could predict whether BM would occur in BC patients (Figure 10). Our results suggested that COL4A1, COL5A1, and COL6A3 were potentially crucial genes for the development of BCBM.
Figure 8

Model diagnosis diagram, Residuals vs. Leverage diagram

The red dotted line presented the COOK distance, and the points with COOK distance > 0.5 were considered as influential points, which would affect the reliability of the model. There were no influential points in the model.

Figure 9

The component plus residual plot of the three genes included in the model

A relatively obvious linear relationship between the horizontal axis and the vertical axis in the graph could be observed, which indicated that the independent variables were suitable for being included in the model.

Figure 10

The ROC analysis results of the model

Model diagnosis diagram, Residuals vs. Leverage diagram

The red dotted line presented the COOK distance, and the points with COOK distance > 0.5 were considered as influential points, which would affect the reliability of the model. There were no influential points in the model.

The component plus residual plot of the three genes included in the model

A relatively obvious linear relationship between the horizontal axis and the vertical axis in the graph could be observed, which indicated that the independent variables were suitable for being included in the model.

Discussion

The development of BCBM is a multistep process that is still poorly understood. Despite advances in targeted treatment, patients with BCBM continue to exhibit an undesirable prognosis and impaired quality of life. Patients with a high risk of developing BCBM could potentially benefit from early screening strategies, as earlier diagnoses would lead to more effective BCBM treatments [30]. Thus, in the present study, we compared BC and BCBM samples in the GSE125989 and GSE100534 datasets, and first identified 146 DEGs in total. Then, based on PPI network construction, ten DEGs were selected as the most crucial genes, including LAMA4, COL1A1, COL5A2, COL3A1, COL4A1, COL5A1, COL5A3, COL6A3, COL6A2, and COL6A1. Further PCA results indicated that these genes largely contributed to the differences between BC and BCBM samples, suggesting that the ten genes were potentially important genes related to BCBM. Previous studies have demonstrated that BC could regulate the surrounding stroma or tumor microenvironments, and that various components in the BC microenvironment, including suppressive immune cells, soluble factors, and alterations in the ECM, act together to effectively impede antitumor immunity, and promote BC progression and metastasis [31-34]. In this complicated process, some key genes indeed play vital roles. Many, including BRCA1 [23], TP53 [35], and Sdc1 [36] have been reported widely on. Moreover, the distinct expression patterns between BC and BCBM might be associated with the microenvironment surrounding the primary tumor, as well as its final metastatic sites [37]. Better understandings of the functional role of DEGs is of importance to identify metastasis-associated therapeutic biomarkers [38]. Thus, we performed GO and KEGG enrichment analyses on 146 DEGs. These DEGs were mainly enriched in GO terms including ECM structural constituent, ECM organization, collagen catabolic process, and extracellular region, which might be responsible for brain metastases. As previous studies have reported, the ECM which affects cancer cell characteristics is increasingly recognized as an important regulator in BC [39]. ECM in BC development features numerous changes in composition and organization when compared with the mammary gland under homeostasis [40]. Prior evidence supports our results to an extent. Additionally, most of the overlapping DEGs were enriched in KEGG pathways associated with cancer metastases, the including PI3K-Akt signaling pathway, the focal adhesion pathway, and ECM–receptor interaction pathways. Notably, it has been reported that mutations and activation of the PI3K signaling pathway in BC cells are linked to brain metastases [41]. Additionally, levels of PI3K pathway activity were significantly increased in brain metastases when compared with extracranial metastases [42]. PI3K/AKT/mTOR signaling has also been reported as one of the most important intracellular pathways, and can be considered a master regulator for cancer. However, the functional details of the overlapped DEGs still need to be further clarified. Previous studies have demonstrated that cancer progression is highly associated with an increase in ECM deposition and stiffening [43-45]. As the major component of the ECM, collagen could increase the ability of cancer cells to reduce apoptosis, and to modulate angiogenesis which gradually promotes cancer progression [46,47]. Moreover, Gilkes et al. reported that intertumoral hypoxia could also lead to increased collagen deposition during breast cancer progression [48]. In our study, survival analyses also showed that increased expression of Collagen genes—including COL1A1, COL14A1, COL5A1, COL6A1, and COL6A2—was associated with the poor overall survival rates of BC patients. Our results support the idea that the collagen could contribute to tumor progression and result in poor prognoses. Recent studies have suggested that BCBM occurs most frequently in HER2-positive BC, with an incidence of 20–50% [49,50]. As an oncogene, HER2 may drive brain trophism, as it induces a mesenchymal state in BC cells and increases the invasiveness and metastatic potential [51]. The brain microenvironment involves several HER family ligands, including neuregulins, which result in the dimerization and activation of these receptors in metastatic brain cells [52,53]. Recent studies have demonstrated that the risk of cancer cells spreading to the brain is usually highest for women with more aggressive subtypes of BC, such as HER2 positive BC [54,55]. Thus, in this study, we analyzed the correlation between the top ten hub gene with HER2 positive expression. Our results demonstrated that COL4A1, COL5A1, and COL6A3 were significantly correlated with HER2 positive expression. Further, the TF–miRNA coregulatory network indicated that the gene expression patterns of COL4A1, COL5A1, and COL6A3 were significantly correlated with each other, and they were also under the regulation of the miRNA-29 family. We speculated that those findings might provide more evidence to illustrate the underlying mechanisms of BCBM progression. Additionally, the logistic regression prediction model based on the three genes could predict whether BM could occur in BC patients, further suggesting that COL4A1, COL5A1, and COL6A3 were potentially crucial genes for BCBM. Our results were consistent with some findings from previous studies. A recent study suggested that COL4A1 was a prognostic biomarker for BC patients who had received neoadjuvant chemotherapy [56]. COL5A1 was reported to be involved in the regulation of circRNA ACAP2 (circACAP2) in BC [57]. As for the miR-29 family, it has been reported to regulate tumor-related changes, including metastasis [58]. They might also contribute to the involvement of the multiple fibrosis process in many organs’ loss of ECM regulation, which might also be associated with the promotion of cancer cell migration and metastasis [59]. Thus, we speculated that the miR-29 family participated in the process of metastasis by modulating the cancer microenvironment. Nevertheless, exactly how the COL4A1, COL5A1, COL6A3, and miR-29 family are involved in the underlying mechanisms of BCBM remains to be further investigated. Moreover, the treatment value of modifying collagen conditions in BCBM is also worth exploring. Cancer resistance has substantially hindered the ability to control cancer. Therefore, both cancer cells and tumor microenvironment must be treated, and collagen is a potential target [46]. The result obtained from Awasthi’s group showed that anti-matrix metallopeptidase 9 (MMP-9) antibody therapy could decrease the COLI expression and the metastatic burden in pancreatic ductal adenocarcinoma mouse models [60]. Employing therapeutic agents directly or indirectly targeting collagen might be another promising therapeutic strategy in the treatment of cancers including BC. There are also some limitations to our study. First, the present study was based on bioinformatics analysis of published datasets, and does not have experimental verification of the observations. Second, as we combined only two GEO datasets, the total sample size is relatively small which may result in some biasing of the results. Finally, we only analyzed the correlation between top ten hub genes with HER2 positive expression. It would also be important to investigate the correlations with other BCBM oncogenes, including estrogen receptors (ERs). Future clinical studies with larger cohorts are needed to validate our results.

Conclusions

Here, by comparing the BC and BCBM specimens across two datasets using a series of bioinformatics analyses, we identified 146 DEGs that were potentially related to the BCBM. The DEGs were significantly enriched for ECM organization, collagen catabolic process, and cellular adhesion pathways. Among these, COL4A1, COL5A1, and COL6A3 were potentially crucial genes that were associated with BCBM in BC patients. Our findings should provide more reference information for diagnostic and prognostic investigation of BCBM in the future.
  60 in total

Review 1.  Brain metastasization of breast cancer.

Authors:  Tânia Custódio-Santos; Mafalda Videira; Maria Alexandra Brito
Journal:  Biochim Biophys Acta Rev Cancer       Date:  2017-03-21       Impact factor: 10.680

2.  Syndecan-1 facilitates breast cancer metastasis to the brain.

Authors:  Megan R Sayyad; Madhavi Puchalapalli; Natasha G Vergara; Sierra Mosticone Wangensteen; Melvin Moore; Liang Mu; Chevaunne Edwards; Aubree Anderson; Stefanie Kall; Megan Sullivan; Mikhail Dozmorov; Jaime Singh; Michael O Idowu; Jennifer E Koblinski
Journal:  Breast Cancer Res Treat       Date:  2019-07-20       Impact factor: 4.872

Review 3.  The molecular pathobiology of metastasis to the brain: a review.

Authors:  Kara D Beasley; Steven A Toms
Journal:  Neurosurg Clin N Am       Date:  2011-01       Impact factor: 2.509

4.  Tumour-infiltrating regulatory T cells stimulate mammary cancer metastasis through RANKL-RANK signalling.

Authors:  Wei Tan; Weizhou Zhang; Amy Strasner; Sergei Grivennikov; Jin Q Cheng; Robert M Hoffman; Michael Karin
Journal:  Nature       Date:  2011-02-16       Impact factor: 49.962

Review 5.  Strategy of surgery and radiation therapy for brain metastases.

Authors:  Yoshitaka Narita; Soichiro Shibui
Journal:  Int J Clin Oncol       Date:  2009-08-25       Impact factor: 3.402

Review 6.  The role of miRNA-29 family in cancer.

Authors:  Yang Wang; Xinwei Zhang; Hui Li; Jinpu Yu; Xiubao Ren
Journal:  Eur J Cell Biol       Date:  2013-01-26       Impact factor: 4.492

Review 7.  Brain metastasis: Unique challenges and open opportunities.

Authors:  Frank J Lowery; Dihua Yu
Journal:  Biochim Biophys Acta Rev Cancer       Date:  2016-12-06       Impact factor: 10.680

8.  Predominance of the basal type and HER-2/neu type in brain metastasis from breast cancer.

Authors:  Jochen Gaedcke; Frank Traub; Simone Milde; Ludwig Wilkens; Alexandru Stan; Helmut Ostertag; Mathias Christgen; Reinhard von Wasielewski; Hans H Kreipe
Journal:  Mod Pathol       Date:  2007-06-01       Impact factor: 7.842

9.  Effect of COL4A1 Expression on the Survival of Neoadjuvant Chemotherapy Breast Cancer Patients.

Authors:  Shin-Mae Wang; Po-Ming Chen; Yu-Wen Sung; Wei-Chieh Huang; Hung-Sen Huang; Pei-Yi Chu
Journal:  J Oncol       Date:  2020-05-14       Impact factor: 4.375

Review 10.  The role of collagen in cancer: from bench to bedside.

Authors:  Shuaishuai Xu; Huaxiang Xu; Wenquan Wang; Shuo Li; Hao Li; Tianjiao Li; Wuhu Zhang; Xianjun Yu; Liang Liu
Journal:  J Transl Med       Date:  2019-09-14       Impact factor: 5.531

View more
  6 in total

1.  Delineation of Pathogenomic Insights of Breast Cancer in Young Women.

Authors:  Aswathy Mary Paul; Bijesh George; Sunil Saini; Madhavan Radhakrishna Pillai; Masakazu Toi; Luis Costa; Rakesh Kumar
Journal:  Cells       Date:  2022-06-15       Impact factor: 7.666

2.  Persistent Properties of a Subpopulation of Cancer Cells Overexpressing the Hedgehog Receptor Patched.

Authors:  Álvaro Javier Feliz Morel; Anida Hasanovic; Aurélie Morin; Chloé Prunier; Virginie Magnone; Kevin Lebrigand; Amaury Aouad; Sarah Cogoluegnes; Judith Favier; Claude Pasquier; Isabelle Mus-Veteau
Journal:  Pharmaceutics       Date:  2022-05-05       Impact factor: 6.525

3.  Identification of Key Genes Associated with Brain Metastasis from Breast Cancer: A Bioinformatics Analysis.

Authors:  Cheng Zeng; Mingxi Lin; Yizi Jin; Jian Zhang
Journal:  Med Sci Monit       Date:  2022-03-17

4.  Differences in the Expression Patterns of TGFβ Isoforms and Associated Genes in Astrocytic Brain Tumors.

Authors:  Natalia Kurowska; Barbara Strzalka-Mrozik; Marcel Madej; Klaudia Pająk; Celina Kruszniewska-Rajs; Wojciech Kaspera; Joanna Magdalena Gola
Journal:  Cancers (Basel)       Date:  2022-04-08       Impact factor: 6.575

5.  Comprehensive Analysis Identifies COL1A1, COL3A1, and POSTN as Key Genes Associated with Brain Metastasis in Patients with Breast Cancer.

Authors:  Weidan Wu; Linlin Zheng
Journal:  Evid Based Complement Alternat Med       Date:  2022-08-12       Impact factor: 2.650

6.  Identification of a three-gene expression signature and construction of a prognostic nomogram predicting overall survival in lung adenocarcinoma based on TCGA and GEO databases.

Authors:  Yuwei Zhou; Shenhu Gao; Rong Yang; Chengli Du; Yanli Wang; Yihe Wu
Journal:  Transl Lung Cancer Res       Date:  2022-07
  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.