Literature DB >> 35296631

Identification of Key Genes Associated with Brain Metastasis from Breast Cancer: A Bioinformatics Analysis.

Cheng Zeng1,2, Mingxi Lin1,2, Yizi Jin1,2, Jian Zhang1,2.   

Abstract

BACKGROUND As the second most frequent factor of brain metastasis worldwide, breast cancer and its pathogenesis have been researched intensively. Nevertheless, the molecular mechanisms of brain metastasis from breast cancer (BMBC) remain uncertain. The purpose of this study was to explore the key genes concerning the prognosis of BMBC and identify their predictive value. MATERIAL AND METHODS Obtained from the Gene Expression Omnibus (GEO) database, microarray datasets GSE125989, GSE52604, and GSE159956 were used to identify the differentially expressed genes (DEGs) and perform function enrichment analysis. RESULTS Of a total of 240 DEGs, 113 genes were upregulated and 127 genes were downregulated. The protein-protein interaction (PPI) was performed through STRING, and 29 hub genes were screened through Cytoscape. After being examined through the cBioportal online platform and the Oncomine database, 8 key genes were finally obtained, including COL14A1, COL3A1, COL6A3, THY1, MMP14, GAP43, PTPRN, and SNAP25. In the validation dataset GSE46928, COL14A1 was shown to have predictive significance of brain metastasis in breast cancer. CONCLUSIONS The key genes explored in this article could assist in identifying the molecular mechanism of BMBC. Also, COL14A1, COL3A1, COL6A3, THY1, MMP14, GAP43, PTPRN, and SNAP25 might be candidate targets for diagnosis and treatment of BMBC, and COL3A1 might have predictive value.

Entities:  

Mesh:

Substances:

Year:  2022        PMID: 35296631      PMCID: PMC8939238          DOI: 10.12659/MSM.935071

Source DB:  PubMed          Journal:  Med Sci Monit        ISSN: 1234-1010


Background

In the last decades, bioinformatics analysis has been used to identify genetic alternations to help discover key genes and interrelated pivotal pathways in brain metastasis from breast cancer. The Gene Expression Omnibus (GEO) database, as a well-known public platform, can store chip, second-generation sequencing, and other high-throughput sequencing data. With this database, experimental sequencing data uploaded by other researchers can be retrieved. As the most common malignant lesion in women worldwide, breast cancer is the second most common cause of brain metastasis [1]. Approximately 12% of patients have metastases, and the brain is the initial location [2]. As previously reported, for advanced breast cancer, the incidence rate of brain metastases rises from 10% to 30% [3]. Although it has been verified that local therapies are effective for brain metastases from breast cancer (BMBC), and systematic therapies mitigating extracranial symptoms are becoming increasingly used, the blood-brain barrier could prevent the penetration of anticancer drugs, which could reduce drug delivery to the site of metastasis and lessen drug effects [4]. Therefore, specific measures targeting BMBC remain unsolved, and the prognosis of BMBC is still disappointing. Therefore, it is important to identify the underlying biomarkers of BMBC.

Material and Methods

Microarray Data

Three datasets, GSE125989, GSE52604, and GSE159956, were downloaded from the National Center of Biotechnology Information (NCBI) GEO database (https://www.ncbi.nlm.nih.gov/geo/; GPL571, GPL6480, and GPL2567 platforms). An analysis from the database was designed with the approval of local Ethics Committee of the Fudan University Shanghai Cancer Center.

Identification of Differentially Expressed Genes

The differentially expressed genes (DEGs) between primary breast cancer and BMBC were inspected by the LIMMA package of R in datasets GSE125989 and GSE52604. Genes that lacked specific gene symbols were deleted, and genes with more than 1 probe were averaged. Adjusted P values were used to reduce the false-positive rate. The criteria of differential genes were set as genes with |log2fold change| >1 and an adjusted P value <0.05. For visualizing the DEGs, ggplot2 and Venn Diagram packages of R were used to build volcano plots and Venn diagrams.

Enrichment Analyses of DEGs

The Database for Annotation, Visualization and Integrated Discovery (DAVID; http://david.ncifcrf.gov; version 6.8), as an online bioinformatic database, includes 5 functional annotation tools aiming to provide functional interpretation of series of genes with Gene Ontology (GO) at the cellular level and Kyoto Encyclopedia of Genes and Genomes (KEGG) at the pathway level. P<0.05 was considered to be statistically significant.

Protein–Protein Interaction Network Construction and Module Analysis

With a combined interaction score of >0.4 considered significant, the protein–protein interaction (PPI) network was built to search the pivotal interactions through the Search Tool for the Retrieval of Interacting Genes (STRING; http://string-db.org; version 10.0) online database, which provides insights into the mechanisms of brain metastasis. Also, the inside-software plugin cytoHubba of Cytoscape (version 3.4.0), an open source platform for complex network analysis, was used to identify hub genes among the genes with 12 different arithmetic methods. The top 10 nodes were ranked by every algorithm and were selected in gene collection.

Survival Analysis and Hub Genes Selection

The Kaplan-Meier curve on the cBioPortal (http://www.cbioportal.org) online platform and the box line diagram on the Oncomine database (http://www.oncomine.com) were used to explore the relationship between expression patterns and metastasis and to determine the final hub genes.

Validation Study

The dataset GSE52604 was used to reduce false positives through the LIMMA package of R. Finally, the validation dataset GSE159956 was used to conduct survival analysis to predict the value of key genes with the Survival and Survminer packages of R.

Results

Identification of DEGs

The GSE125989 dataset included 16 paired samples of primary breast cancer and brain metastases; the GSE52604 dataset contained 55 unpaired samples, including 10 primary breast cancer samples, 10 non-neoplastic brain samples, and 35 BMBC samples; and the GSE159956 dataset contained 285 primary breast cancer patients. Differential genes (734 in GSE125989 and 5088 in GSE52604) were identified and represented in the volcano plots, and Venn diagram displayed the overlap between the 2 datasets, containing 240 genes, of which 113 genes showed upregulation and 127 genes showed downregulation between primary breast cancer and BMBC (Figure 1).
Figure 1

(A) Volcano plots and (B) Venn diagram of differentially expressed genes of 2 datasets (GSE52604 and GSE125989) by ggplot2 and Venn Diagram packages of R (version 4.1.1).

To further analyze the DEGs, GO and KEGG analysis were constructed through DAVID. According to the outcomes in the GO database, the changes in biological processes were significantly enriched in extracellular matrix organization, collagen fibril organization, cell adhesion, neurotransmitter uptake, negative regulation of angiogenesis, and reaction to wounding. Additionally, in the case of the molecular function analysis, the genes were mainly interrelated to extracellular matrix structural constituent, integrin binding, platelet-derived growth factor binding, and heparin binding. Moreover, for the cell component analysis, the differential genes were predominantly involved in the extracellular matrix, extracellular space, proteinaceous extracellular matrix, and plasma membrane. Furthermore, KEGG pathway analysis demonstrated that the DEGs were mostly enriched in protein digestion and absorption, extracellular matrix-receptor interaction, focal adhesion, platelet activation, PI3K-Akt signaling pathway, and phagosome. The above results are shown in Figure 2.
Figure 2

Results of Gene Ontology and Kyoto Encyclopedia of Genes and Genomes pathway enrichment analysis of the differentially expressed genes obtained from the Database for Annotation, Visualization and Integrated Discovery (DAVID; http://david.ncifcrf.gov) (version 6.8).

PPI Network Construction

For the PPI network, 36 nodes and 722 edges were exhibited based on the STRING database, with a local clustering coefficient of 0.437 and P<1.0e-16. The 29 related genes were as follows: COL14A1, COL15A1,COL1A1, COL1A2, COL3A1, COL5A1, COL5A2, COL6A3, DCN, GAP43, LOX, LOXL1, LUM, MMP14, NID1, PLAU, POSTN, PTPRN, SNAP25, SPON1, THY1, TPM4, TUBB4A, PAX6, COL8A2, COL10A1, THBS1, MXRA5, and ITGBL1 (Figure 3).
Figure 3

In the protein–protein interaction network exhibited by the Cytoscape database (version 4.3.0), blue nodes represent associated products of differentially expressed genes, pink nodes represent 8 key genes identified below, and edges represent the protein–protein associations.

Subsequently, survival analysis was done to select the hub genes, as shown in Figures 4 and 5. Breast cancer patients with COL14A1, COL3A1, COL6A3, GAP43, MMP14, SNAP25, and THY1 alterations had worse overall survival. Meanwhile, breast cancer patients with GAP43, LOXL1, MMP14, PAX6, PTPRN, and SNAP25 alteration had worse disease-free survival. Overall, among these selected hub genes, 10 genes tested as significant, of which COL14A1, COL3A1, COL6A3, LOXL1, MMP14, and THY1 were downregulated, and GAP63, PAX6, PTPRN, and SNAP25 were upregulated. These 10 genes were further examined in an independent dataset (Symmans Breast 2 dataset) of the Oncomine database. In the Symmans Breast 2 dataset, lower mRNA levels of downregulated genes (COL14A1, COL3A1, COL6A3, THY1, and MMP14) were associated with a metastatic event, while higher mRNA levels of upregulated genes (GAP43, PTPRN, and SNAP25) were related to a metastatic event, which indicated that these 8 genes may play prominent parts in brain metastasis of breast cancer (Figure 6).
Figure 4

Overall survival analysis on cBioPortal online platform.

Figure 5

Disease-free survival analysis on cBioPortal online platform.

Figure 6

Association between expression patterns of hub genes and metastatic event status by the Symmans Breast 2 dataset.

To reduce false positives, the comparison between normal brain tissues and tissues of brain metastases from breast cancer was performed with the GSE52604 dataset, and 4881 differentially expressed genes were identified, in which the final hub genes mentioned above were included. Finally, to judge the predictive value of the 8 identified hub genes, we performed survival analysis on the GSE159956 dataset and found that COL3A1 was statistically significant, as shown in Figure 7.
Figure 7

COL3A1 was shown to have predictive value with statistical significance in the validation study. The survival curve was created with Survival and Survminer packages of R.

Discussion

As the most common carcinoma for women, breast cancer is also the second most common cause of brain metastases [5]. In recent years, BMBC showed a younger trend and more frequently occurs in patients with a larger tumor size or higher grade in some subtypes [6]. However, the mechanisms of BMBC remain poorly understood. Because it is extremely difficult for chemotherapy drugs to infiltrate the blood-brain barrier, novel biological therapy, instead of chemotherapy, is needed to prolong survival [7]. With the progress of bioinformatics analysis, DEGs associated with BMBC have become more accessible, providing an efficacious mode to identify potential targets in dealing with BMBC [5]. In the present study, to discover the DEGs between primary breast cancer and BMBC, 2 mRNA datasets were explored, resulting in 240 DEGs identified between the 2 datasets, of which 113 genes were upregulated and 127 genes were downregulated. According to functional analysis, these differential genes were chiefly related to biological processes, such as extracellular matrix organization, collagen fibril organization, cell adhesion, chemical synaptic transmission, and negative regulation of angiogenesis. In addition, the KEGG pathways of differential genes were mostly enriched in protein digestion and absorption, focal adhesion, platelet activation, PI3K-Akt signaling pathway, and phagosome. Moreover, the PPI network was created, with 29 hub genes being identified. After being examined through the cBioportal online platform and the Oncomine database, 8 key genes were finally obtained, including COL14A1, COL3A1, COL6A3, THY1, MMP14, GAP43, PTPRN, and SNAP25, of which COL3A1 and COL6A3 were also identified to play potentially important roles in BMBC in patients with breast cancer in a recent study [8]. COL14A1 is related to the regulation of fibrillogenesis and interacts with the fibril surface. Compared with in patients without metastases, COL14A1 is highly expressed in patients with metastasis, and further investigation is desired to verify its function in metastases diagnosis and prediction of prognosis [9]. Also, several studies have revealed that COL3A1 is vital for tumorigenesis and metastasis of invasive breast cancer [10-12]. In addition, m6A methylation-mediated COL3A1 upregulation reduces the expression level of METTL3, which assists metastasis of triple-negative breast cancer [13]. The encoded protein of THY1 is bound with cell adhesion and communication in numerous cells, but particularly in immunocytes and neurocytes [14,15]. One study demonstrated that immune-related THY1 is a potential therapeutic target and prognostic marker for breast cancer with brain metastasis [16]. Owing to its function on tumor progression and metastasis, MMP-14 is a vital target with clinical significance in metastatic cancers [17-20]. Since active MMP-14 locates on the cellular surface, the antibody-mediated blockade was shown to be a novel idea in oncotherapy [17]. It has been reported that MMP14 is probably interrelated with breast cancer, and MMP-14 acts as a biological target and can block immunodepression, tumor progression, and metastasis in triple-negative breast cancer in immunotherapies [21-23]. Type VI collagen alpha3 (COL6A3) is a critical component in breast cancer development, and its 6-EMT gene signature at diagnosis can help identify patients with metastasis with triple-negative breast cancer who could not achieve pathological complete response to neoadjuvant chemotherapy and underwent surgery combined with adjuvant therapy [24]. Several studies have indicated that the relationship between BMBC and hub genes, including SNAP25, GAP43, and PTPRN, has not been widely explored. SNAP25 has been reported to be relevant to metastasis in endometrial carcinoma [25]. Autophagy is thought to promote tumor progression by means of activating SNAP25 transcription, showing that SNAP25 has a significant role in the later period of carcinoma [26]. GAP43 is an absolute biomarker with predictive value of non-small cell lung cancer brain metastasis and it could assist metastasis through the Rac1/F-actin pathway [27]. However, GAP43 has rarely been researched in breast cancer. As signaling molecules that regulate multiple processes, including cell growth, differentiation, and oncogenic transformation, the protein encoded by PTPRN has a crucial role in glioblastoma and ovarian cancer, but its function in breast cancer needs to be further studied [27-29]. It has been shown that the prognosis of patients with breast cancer with brain metastases is very poor, and brain metastases shorten survival [6]. However, the progress of research on effective treatments is still slow, mainly because the standard treatment options for brain metastasis are limited [6]. Therefore, it is necessary to identify sensitive biomarkers as new targets to improve the prognosis of patients with metastatic breast cancer. The findings of the present study may provide new insights into the understanding of brain metastases in breast cancer. However, this study has some limitations. First, the clinical data in the GEO database is incomplete, which affected the comprehensive survival analysis. More data libraries should be exploited for clinical data, such as The Cancer Genome Atlas, and a recently published study on this topic is a good example [30]. Future studies can provide larger analyses of real-life data and the risk of a higher rate of brain metastasis at presentation. Second, this study included only 3 datasets, which may have biased the results of the study. Third, the results of this study need further experimental verification. For example, western blotting and enzyme real-time polymerase chain reaction can be used to test the expression of the selected key genes. It is also necessary to construct a nude mouse brain metastasis model of breast cancer and observe the effects of the selected key factors on the phenotypes of genes in BMBC.

Conclusions

In conclusion, 8 key genes were identified as potential biomarkers for BMBC. COL3A1 was significant for predicting brain metastasis from breast cancer. Further in vivo and in vitro studies could confirm our results, and the discovery of DEGs could provide new information regarding cell pathways and targets for new drugs.
  30 in total

1.  Breast cancer-induced bone remodeling, skeletal pain, and sprouting of sensory nerve fibers.

Authors:  Aaron P Bloom; Juan M Jimenez-Andrade; Reid N Taylor; Gabriela Castañeda-Corral; Magdalena J Kaczmarska; Katie T Freeman; Kathleen A Coughlin; Joseph R Ghilardi; Michael A Kuskowski; Patrick W Mantyh
Journal:  J Pain       Date:  2011-04-15       Impact factor: 5.820

Review 2.  Brain metastasis in breast cancer: a comprehensive literature review.

Authors:  Rezvan Rostami; Shivam Mittal; Pooya Rostami; Fattaneh Tavassoli; Bahman Jabbari
Journal:  J Neurooncol       Date:  2016-02-24       Impact factor: 4.130

3.  Organelle Specific O-Glycosylation Drives MMP14 Activation, Tumor Growth, and Metastasis.

Authors:  Anh Tuan Nguyen; Joanne Chia; Manon Ros; Kam Man Hui; Frederic Saltel; Frederic Bard
Journal:  Cancer Cell       Date:  2017-11-13       Impact factor: 31.743

4.  Hijacking a Morphogenesis Proteinase for Cancer Cell Invasion.

Authors:  Nasreen Akhtar
Journal:  Dev Cell       Date:  2018-10-22       Impact factor: 12.270

5.  Matrix metalloproteinase expression and outcome in patients with breast cancer: analysis of a published database.

Authors:  P M McGowan; M J Duffy
Journal:  Ann Oncol       Date:  2008-05-23       Impact factor: 32.976

Review 6.  Investigational chemotherapy and novel pharmacokinetic mechanisms for the treatment of breast cancer brain metastases.

Authors:  Neal Shah; Afroz S Mohammad; Pushkar Saralkar; Samuel A Sprowls; Schuyler D Vickers; Devin John; Rachel M Tallman; Brandon P Lucke-Wold; Katherine E Jarrell; Mark Pinti; Richard L Nolan; Paul R Lockman
Journal:  Pharmacol Res       Date:  2018-03-28       Impact factor: 7.658

7.  Identification of markers associated with brain metastasis from breast cancer through bioinformatics analysis and verification in clinical samples.

Authors:  Yongchang Gao; Jianjing Liu; Xiaolong Qian; Xianghui He
Journal:  Gland Surg       Date:  2021-03

8.  A novel immunotherapy targeting MMP-14 limits hypoxia, immune suppression and metastasis in triple-negative breast cancer models.

Authors:  Binbing Ling; Kathleen Watt; Sunandan Banerjee; Daniel Newsted; Peter Truesdell; Jarrett Adams; Sachdev S Sidhu; Andrew W B Craig
Journal:  Oncotarget       Date:  2017-05-09

9.  Identification of potential genes related to breast cancer brain metastasis in breast cancer patients.

Authors:  Lijian Zhang; Luxuan Wang; Hua Yang; Chunhui Li; Chuan Fang
Journal:  Biosci Rep       Date:  2021-10-29       Impact factor: 3.840

10.  Genomic landscape of the immune microenvironments of brain metastases in breast cancer.

Authors:  Wei-Cheng Lu; Hui Xie; Ce Yuan; Jin-Jiang Li; Zhao-Yang Li; An-Hua Wu
Journal:  J Transl Med       Date:  2020-08-31       Impact factor: 5.531

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.