Literature DB >> 35037689

Identification of potential immunotherapy biomarkers for breast cancer by bioinformatics analysis.

Yao Song1, Meiling Lu2, Lijin Feng3, Qian Chen1, Hua Huang4, Qing Lin1.   

Abstract

Breast cancer is a serious malignancy with a high incidence worldwide and a tendency to relapse. We used integrated bioinformatics analysis to identify potential biomarkers in breast carcinoma in the present study. Microarray data, 127breast tumor samples and 23 non-tumor samples, received from the Gene Expression Omnibus (GEO) dataset; 121 differentially expressed genes (DEGs) were selected. Functional analysis using DAVID revealed that these DEGs were highly gathered in endodermal cell differentiation and proteinaceous extracellular matrix. Five bioactive compounds (prostaglandin J2, tanespimycin, semustine, 5182598, and flunarizine) were identified using Connectivity Map. We used Cytoscape software and STRING dataset to structure a protein-protein interaction (PPI) network. The expression of CD24, MMP1, SDC1, and SPP1 was much higher in breast carcinoma tissue than in Para cancerous tissues analyzed by Gene Expression Profiling Interactive Analysis (GEPIA) and ONCOMINE. Overexpression ofCD24, MMP1, SDC1, and SPP1 indicated the poor prognosis in breast carcinoma patients analyzed by Kaplan-Meier (KM) Plotter. Immunohistochemistry microarray was used to further confirm that protein expression of CD24, MMP1, SDC1, and SPP1 was much higher in tumor sections than in Para cancerous tissues. Hub genes expression at the protein level was correlated tothe breast cancer subtype and grade. Furthermore, immunity analysis showed that CD24, MMP1, SDC1, and SPP1 were potentially associated with five immune cell types infiltration (CD8+ T cells, CD4+ T cells, neutrophils, macrophages,and dendritic cells) by TIMER. Thus, this study indicates potential biomarkers that could have applications in the development of immune therapy for breast cancer. However, further studies are required for verifying these results in vivo and vitro.
© 2022 The Author(s).

Entities:  

Keywords:  Breast cancer; bioinformatics analysis; biomarkers; immunotherapy; prognosis

Mesh:

Substances:

Year:  2022        PMID: 35037689      PMCID: PMC8819662          DOI: 10.1042/BSR20212035

Source DB:  PubMed          Journal:  Biosci Rep        ISSN: 0144-8463            Impact factor:   3.840


Introduction

Breast cancer occurs frequently in non-skin carcinoma and leading to death in America females at the second level, with 268600 new patients and 41760 deaths in 2019, accounting for up to 30% of all new cancers and 15% of cancer deaths [1]. Risk factors for breast cancer include age, environment, smoking, and inheritance; approximately 5–10% of cases are due to genes inherited from the patient’s parents, such as BRAC1 and BRAC2 mutations [2,3]. There are four subtypes of breast cancer based on molecular characteristics: luminal A, luminal B, triple negative and HER2 overexpressing. Besides the traditional treatment methods of surgery, chemotherapy and/or radiation therapy used for breast cancer, endotherapy can be used in HR+ cases. Anti-HER-2 therapy is commonly used in HER2+ patients, and in the past several years, immunotherapy has been commonly used to treat triple-negative breast cancer [4]. Although the survival rate has improved quickly with the development of surgical techniques and exploration of new targeted drugs, the effects of therapies for advanced breast cancer have remained poor, only 27% for a 5-year survival rate [5]. Hence, investigation the mechanism of cancer progression and finding the potential prognostic biomarkers is very important. High-throughput microarray technology and bioinformatics analysis can be used to identify differences in gene expression between cancerous and para- cancerous tissues, analyze the DEGs, and identify the pathways leading to tumorigenesis and cancer progression. To understand the molecular mechanisms associated with breast cancer progression, we performed bioinformatics analysis using the GEO and The Cancer Genome Atlas (TCGA) databases to review all DEGs in breast cancer, for identifying prognostic biomarkers and potential molecular targets. Then, tissue microarray analysis (TMA) was used to validate the protein expression of hub genes. Our results indicated that MMP1, CD24, SDC1, and SPP1 are potential novel prognostic biomarkers and candidate immunotherapy targets for breast cancer.

Materials and methods

Breast cancer data preparation

Microarray platform (GLP570)[6] was used to collect two independent breast cancer gene expression profiles, GSE26910 [7] and GSE42568 [8], which contain 127 breast carcinoma samples and 23 non-cancer samples. These data were analyzed using the Affymetrix Human Genome U133 Plus 2.0 Array [transcript (gene) version; Santa Clara, CA, U.S.A.]. Furthermore, we used 1,105 samples in breast carcinoma and 113 samples in para-cancerous from TCGA dataset for validation.

Identification of DEGs

After downloading the datasets from GEO, the GEO2R online tool was used to find the DEGs between breast cancer tissue and non-tumor tissue. The conditions for screening of the DEGs were P<0.05 and |log (fold change)| > 2. An online Venn diagram tool was used to identify up- and down-regulated genes in the two datasets.

Identification of potential drugs

CMap was used to identify possible drugs that could be used to inhibit tumor progression in breast cancer patients based on targeting the discovered DEGs [9]. First, we arranged the lists of up- and down-regulated genes for both tumor and non-tumor tissues and uploaded the genes to the CMap online tool. Next, an enrichment score, representing the similarity in up- and down-regulated genes, were calculated for each drug. A positive enrichment score indicates that the drug can exacerbate breast cancer progression, whereas a negative enrichment number means that the drug may inhibit tumor formation.

Functional enrichment analysis

Gene ontology (GO) terms and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways correlated to the DEGs were identified by DAVID online tool [10]. GO analysis describes genes in terms of the related molecular functions, biological processes, and cellular components [11]. KEGG pathway analysis was used to check their reference pathways in the indicated genes [12].

PPI network construction

STRING was used for construction a PPI network and molecular function network for the DEGs [13]. After collecting data for the PPI network, visualized the network by Cytoscape, distinguished whether the highly connected modules were molecular complexes or clusters by the plugin Molecular Complex Detection (MCODE).

Selection of hub genes

We checked the 32 selected genes expression in tumor tissues and non-tumor tissues based on TCGA data by GEPIA [14]. The DEGs with significantly differential expression between the two tissue types were selected for further analysis. KM Plotter can analysis the effect of 54675 genes on patient survival in 18674 cancer samples, mainly based on the GEO, TCGA, and EGA databases [15]. The associations of the DEGs verified by GEPIA with survival rates of breast cancer patients were explored by KM Plotter; the DEGs for which higher expression indicated significantly worse survival were selected as hub genes. P<0.05 looked as statistical significance. ONCOMINE dataset, confirmed the focus genes expression between cancer and para cancerous tissues in breast cancer [16].

Gene set enrichment analysis (GSEA)

We separated breast carcinoma tissues in two groups through the hub genes median expression on TCGA dataset. GSEA[17] was used to identify potential functions for hub genes. KEGG pathways associated with the up- and down-regulated genes were selected. The cut-off condition was considered for adjusted P<0.05.

Associations of immune cell infiltration with hub gene expression

TIMER used for systematically analyzing the relation between immune infiltration and hub gene expression [18]. Hub gene expression, the correlations with the infiltration of immune cells were evaluated by “Gene module”.

Immunohistochemistry validation

A total of 657 breast carcinoma samples and paired para-cancerous samples were collected from 2013 to 2018 at Tenth Hospital of Tongji University, China. Pathologic diagnoses and classifications were made according to the UICC Classification of Malignant Tumors. Tissue cores were obtained from the formalin-fixed paraffin-embedded (FFPE) blocks; the diagnosis of breast tumor was based on review by a pathologist after coloration with Hematoxylin and Eosin. TMA analysis (Shanghai Outdo Biotech, Shanghai, China) checked by 2-mm tissue cores two areas of the tumor in each patients(invasive margin and tumor bulk). Ethical approval for this study was obtained from The Human Research Ethics Committee of Tenth Hospital of Tongji University. All patients provided informed consent. Antigen retrieval was conducted by microwave pre-treatment in EDTA buffer (pH 9.0) for 20 min; then, endogenous peroxidase was removed using 3% H2O2, after that blocking for 20 min with avidin. Furthermore, rabbit polyclonal antibodies against MMP1 (ab52631, Abcam, UK; dilution 1:60) incubated on the slides overnight at 4 °C, CD24 (ab31622, Abcam, U.K.; dilution 1:100), SDC1 (ab7280, Abcam, U.K.; dilution 1:500), and SPP1 (ab214050, Abcam, U.K.; dilution 1:500) for 24 h, before being incubated with secondary antibodies at room temperature for 30 min. Tissue sections were incubated with 3,3′-diaminobenzidine for 10 min, then counterstained, dehydrated, and mounted. Reading slides by a Nano-Zoomer 2.0-HT slide scanner (Hamamatsu Photonics K.K., Hamamatsu, Japan), analyzing the images by the NDP. Captured images in TMA slides. The tissue immunostaining was read independently by two trained pathologists. The criteria for immunohistochemistry evaluation were as follows. 0, negative; 1, weak; 2, medium; and 3, strong scored as the staining intensity; 0, 0%; 1, 1–25%; 2, 26–50%; 3, 51–75%; and 4, 76–100% scored as the staining extent, which counted by the percentage of positively stained areas in relation to the whole cancer area. Sum of the extent score and the intensity score as the final expressed results, which was graded as follows: −, score 0–2; +, score 3 or 4; ++, score 5 or 6; or +++, score 7. Here, − and 1+ represent low expression; and 2+ and 3+ represent high expression.

Results

DEGs in breast cancer

GSE26910 and GSE42568 datasets were obtained in the GEO database, containing 127 breast carcinoma tissues and 23 non-tumor samples. There were 325 DEGs (130 up-regulated and 195 down-regulated) obtained from GSE26910, and 1170 DEGs (459 up-regulated and 711 down-regulated) from GSE42568 between tumor and non-tumor tissues. These results are shown as volcano plots in Figure 1A,B. Furthermore, the DEGs were analyzed by Venn diagram; 121 DEGs (31 up-regulated and 90 down-regulated) were found in the two datasets, as shown in Figure 1C,D and Supplementary Table S1.
Figure 1

DEGs in breast cancer and non-cancer tissues

(A) Volcano plot, represented DEGs in breast cancer tissues and non-tumor samples in GSE26910 dataset. (B)Volcano plot, represented DEGs in breast cancer tissues and non-tumor tissues in GSE42568 dataset. Red dots, indicate genes highly induced in breast cancer; green dots indicate genes greatly reduced in breast cancer, blue dots indicate non-DEGs. (C)Venn diagram represented the downregulated overlapping DEGs from GSE26910 and GSE42568 datasets. (D) Venn diagram represented the upregulated overlapping DEGs from GSE26910 and GSE42568 datasets.

DEGs in breast cancer and non-cancer tissues

(A) Volcano plot, represented DEGs in breast cancer tissues and non-tumor samples in GSE26910 dataset. (B)Volcano plot, represented DEGs in breast cancer tissues and non-tumor tissues in GSE42568 dataset. Red dots, indicate genes highly induced in breast cancer; green dots indicate genes greatly reduced in breast cancer, blue dots indicate non-DEGs. (C)Venn diagram represented the downregulated overlapping DEGs from GSE26910 and GSE42568 datasets. (D) Venn diagram represented the upregulated overlapping DEGs from GSE26910 and GSE42568 datasets.

DEGs enrichment analysis

DAVID online was used to further analyze the functions of the overlapping two GEO datasets DEGs in breast cancer tissues. The up-regulated DEGs were mainly enriched in endodermal cell differentiation, collagen catabolic process, and cell adhesion (biological processes); proteinaceous extracellular matrix (ECM; cellular component); and lipid binding (molecular function). Down-regulated DEGs were mainly enriched in negative regulation of cell proliferation, extracellular region, and lipid binding. In the KEGG pathway analysis, DEGs were mainly enriched in ECM–receptor interaction and TNF signaling pathway (Tables 1 and 2).
Table 1

GO analysis of DEGs in breast cancer

ExpressionCategoryTermCount%P-valueFDR
Up-regulated GOTERM_BP_DIRECTGO:0035987∼endodermal cell differentiation49.151.32E-050.017315
GOTERM_BP_DIRECTGO:0030574∼collagen catabolic process35.862.05E-040.268194
GOTERM_BP_DIRECTGO:0007155∼cell adhesion51.142.44E-040.319249
GOTERM_BP_DIRECTGO:0002063∼chondrocyte development36.862.82E-040.368791
GOTERM_BP_DIRECTGO:0001502∼cartilage condensation36.864.20E-040.548872
GOTERM_CC_DIRECTGO:0005578∼proteinaceous extracellular matrix818.301.35E-081.30E-05
GOTERM_MF_DIRECTGO:0008289∼lipid binding64.844.06E-040.516306
GOTERM_MF_DIRECTGO:0001077∼transcriptional activator activity,RNA polymerase II core promoter proximalregion sequence-specific binding75.654.35E-040.056502
Down- regulation GOTERM_BP_DIRECTGO:0008285∼negative regulation of cell proliferation108.076.54E-050.101387
GOTERM_BP_DIRECTGO:0006869∼lipid transport54.043.69E-040.570794
GOTERM_BP_DIRECTGO:0050873∼brown fat cell differentiation43.233.87E-040.597749
GOTERM_BP_DIRECTGO:0045429∼positive regulation of nitric oxide biosynthetic process43.239.29E-041.430151
GOTERM_CC_DIRECTGO:0005576∼extracellular region211.701.50E-050.016859
GOTERM_CC_DIRECTGO:0005615∼extracellular space191.541.67E-050.018833
GOTERM_MF_DIRECTGO:0008289∼lipid binding64.844.06E-040.516306
GOTERM_MF_DIRECTGO:0001077∼transcriptional activator activity,RNA polymerase II core promoter proximalregion sequence-specific binding75.654.35E-040.553146

Abbreviations: BP, biological process; CC, cell component; MF, molecular function.

Table 2

KEGG pathway analysis of DEGs in breast cancer

Pathway IDCategoryCount%P-valueGenes
bta04512ECM–receptor interaction74.181.85E-05 SDC1, CD36, COMP, ITGA7, COL11A1, FN1, SPP1
bta04668TNF signaling pathway63.606.31E-04 FOS, PTGS2, MMP9, CXCL2, CXCL10
bta05200Pathways in cancer95.370.003734 EGFR, FOS, EDNRB, BMP2, PTGS2, MMP9, ZBTB16, MMP1, FN1
bta05144Malaria42.390.004793 SDC1, CD36, COMP, HBB
bta03320PPAR signaling pathway42.390.01037 CD36, FABP4, ADIPOQ, MMP1
bta05205Proteoglycans in cancer63.580.009768 EGFR, SDC1, GPC3, ERBB3, MMP9, FN1
bta04510Focal adhesion63.580.010782 EGFR, COMP, ITGA7, COL11A1, FN1, SPP1
Abbreviations: BP, biological process; CC, cell component; MF, molecular function.

Identification of small molecular drugs

CMap was used to identify small molecular drugs according to the up- and down-regulated DEGs. Prostaglandin J2, tanespimycin, semustine, 5182598, and flunarizine were the five small molecules most significantly negatively correlated with the DEGs in breast cancer. Thus, these molecules are potentially targeted drugs for breast cancer (Table 3).
Table 3

CMap results

RankCMap nameMean n EnrichmentP-value
1Quinpirole0.717150.9450
215-δ prostaglandin J2−0.44262−0.5830
3Tanespimycin−0.2944−0.3780
4Semustine−0.6372−0.9030.00016
55182598−0.7134−0.9880.00036
6Flunarizine−0.6244−0.8820.00048
7Propafenone−0.5884−0.8630.00064
8Securinine−0.6414−0.8420.00109
9Altizide0.4520.830.00127
105224221−0.74−0.9750.00135
11Scopolamine0.38260.8240.00151
12Lanatoside C−0.5424−0.6960.00193
13Pridinol−0.544−0.8160.00209
14Mesoridazine−0.533−0.8160.00209
15Sulfaquinoxaline0.589150.8880.00264

PPI network and module analysis

We identified protein connections between the overlapping DEGs in STRING online database; connection score was 0.2. Next, Cystoscope was used to establish a PPI network consisting of 32 nodes and 134 edges (Figure 2A). MCODE was then used to find two clusters: cluster 1 contained 32 nodes and 134 edges, with a score of 8.645; cluster 2 contained 3 nodes with 3 edges, with a score of 3 (Figure 2B,C).
Figure 2

PPI network and module analysis

(A) PPI network. (B) Top module cluster analyzed by MCODE. (C) Top two module clusters analyzed by MCODE.

PPI network and module analysis

(A) PPI network. (B) Top module cluster analyzed by MCODE. (C) Top two module clusters analyzed by MCODE.

Hub genes survival analysis

GEPIA online tool was used for checking the hub genes expression in breast cancer and non-tumor tissues by TCGA dataset. CD24, MMP1, MMP9, SDC1, COMP, GOLM1, POSTN, FN1, SPP1, CXCL10, WISP1, ERBB3, COL10A1, KRT19 and MMP11 were upregulated in breast cancer tissues compared with normal tissues (Figure 3A). FOSB, FOS, PTGS2, EGFR, EFEMP1, KLF4, CXCL2, GPC3, ITGA7, CHRDL1, CD36, PDK4, BMP2, ATF3, ADIPOQ, and ADAMTS1 expression was much lower in tumor tissues than in normal tissues (Supplementary Figure S1).
Figure 3

Survival analysis for hub genes

(A) 15 hub genes expressed much higher in breast cancer tissues than normal breast tissues by GEPIA. (B)9 hub genes was correlated with OS in breast cancer patients. (C) MMP1, CD24, SDC1, and SPP1 was expression correlated with RFS in breast cancer patients. Abbreviation: OS, overall survival; RFS, relapse-free survival.

*,P<0.01.

Survival analysis for hub genes

(A) 15 hub genes expressed much higher in breast cancer tissues than normal breast tissues by GEPIA. (B)9 hub genes was correlated with OS in breast cancer patients. (C) MMP1, CD24, SDC1, and SPP1 was expression correlated with RFS in breast cancer patients. Abbreviation: OS, overall survival; RFS, relapse-free survival. *,P<0.01. KM Plotter was used to check the prognostic value of the 33 DEGs. Higher MMP1, SDC1, SPP1, and CD24 expression was correlated with worse overall survival (OS) and relapse-free survival (RFS) in breast cancer patients (Figure 3B,C). Other genes showed no significant correlation with survival (Supplementary Figure S2). Hence, we focused on these four genes in the subsequent analysis. In the luminal A subtype, higher MMP1 and CD24 expression levels were correlated with worse OS. In the luminal B subtype, SDC1 and SPP1 expression was significantly correlated with prognosis of breast cancer patients. MMP1 and SDC1 expression was correlated with prognosis in the triple-negative subtype. In HER-2+ breast cancer, MMP1, SDC1, SPP1, and CD24 expression showed no significant association with prognosis (Figure 4A–D). None of the four genes had any significant correlation with tumor stages I–IV, as shown in Supplementary Figure S3.
Figure 4

OS curves for MMP1, CD24, SDC1, and SPP1 expression in breast cancer subtypes

(A) Luminal A; (B) luminal B; (C) triple-negative; (D) HER2+.

OS curves for MMP1, CD24, SDC1, and SPP1 expression in breast cancer subtypes

(A) Luminal A; (B) luminal B; (C) triple-negative; (D) HER2+.

GSEA

To find the potential function of the four genes according to the TCGA breast cancer dataset, we evaluated the KEGG pathways involved in the highly expressed samples by GSEA. The most enriched KEGG pathways were chemokine signaling pathway, cytokine receptor interaction, natural killer cell-mediated cytotoxicity, and T cell receptor signaling pathway (Figure 5).
Figure 5

GSEA was applied to identify enriched biological processes for the four key genes (MMP1, CD24, SDC1 and SPP1) with highly expressed samples

Immune infiltration and hub genes

Immune cell infiltration was analyzed with respect to its associations with MMP1, SDC1, SPP1, and CD24 expression. MMP1 expression was positively associated with infiltration of CD4+ T cells (rho = 0.105, P=1.13e-03), CD8+ T cells (rho = 0.154, P=1.35e-05), macrophages (rho = 0.179, P=1.62e-8), neutrophils (rho = 0.299, P=3.92e-21), and dendritic cells (DCs; rho = 0.325, P=6.58e-25). CD24 expression was positively correlated with infiltration of CD8+ T cells (rho = 0.095, P=2.97e-03) and neutrophils (rho = 0.068, P=3.47e-02). SDC1 expression was significantly positive associated with CD8+ T cells (rho = 0.171, P=6.16e-08), macrophages (rho = 0.189, P=2.17e-09), neutrophils (rho = 0.12, P=2.1e-4), and DCs (rho = 0.169, P=1.69e-07). SPP1 expression higher, the infiltration of CD4+ T cells (rho = 0.098, P=2.26e-03), CD8+ T cells (rho = 0.077, P=1.58e-02), macrophages (rho = 0.407, P=1.78e-40), DC (rho = 0.339, P=5.24e-27), and neutrophils (rho = 0.371, P=2.51e-32) was much more (Figure 6).
Figure 6

Correlations of expression of four key genes (MMP1, CD24, SDC1, and SPP1) with immune cell infiltration

TIMER survival model can be used for analyzing the clinical relevance of one or more tumor immune subsets. In the multivariable Cox proportional hazard model, in BRCA, stage 3 (P=0.000), stage 4 (P=0.000), CD24 (P=0.002), and MMP1 (P=0.042) were significantly correlated with survival. Then, we performed subtype analysis; the BRCA-basal group comprised 118 patients with the following associations with survival: stage (P=0.000), purity (P=0.000), CD4+ T cells infiltration (P=0.000), DC infiltration (P=0.001), SDC1 expression (P=0.025), and SPP1 expression (P=0.037). In the BRCA-luminal group, 591 patients were analyzed; stage (P=0.000), DC infiltration (P=0.008), SDC1 expression (P=0.014), CD24 expression (P=0.028), and MMP1 expression (P=0.001) were correlated with survival. However, in the BRAC-HER2 group, no clinically relevant associations with survival were found among the 65 patients (Tables 4-7). *,P<0.05. *,P<0.05. *,P<0.05. *,P<0.05.

Validation of hub gene expression in different samples

ONCOMINE, used for confirming the selected four DEGs in breast tumor and non-tumor tissues at the mRNA level. MMP1, SDC1, SPP1, and CD24 mRNA declaration was much higher in tumor tissues than in non-tumor tissue (Figure 7).

Evaluation of MMP1, CD24, SDC1, and SPP1 expression in breast cancer tissue and normal tissue

(A) MMP1, CD24, SDC1, and SPP1 expression was up-regulated in breast cancer tissue compared with normal tissue in the Oncomine dataset. (B) MMP1, CD24, SDC1, and SPP1 protein expression was up-regulated in breast cancer tissue compared with normal tissue according to tissue microarray analysis. Furthermore, a tissue microarray was used for verifying the protein levels of these four hub genes in breast cancer and para cancerous tissues. High expression of MMP1 was found in 30.7% of cancer tissues but 1.2% of pan-cancer tissues (P=0.00). The percentages of high protein expression of CD24, SDC1, and SPP1 were 28.6, 34.2, and 19.2% in breast cancer tissue, compared with 1.5, 1.5, and 1.1% in pan-cancer tissue, respectively (P=0.00, P=0.00, and P=0.00) (Table 8, Figure 5B). The protein expression of MMP1, CD24, SDC1, and SPP1 was significantly correlated with tumor grade and subtype but not with tumor stage (Tables 9-12). In the molecular correlation analysis, MMP1 protein was found to be significantly correlated with SPP1 protein expression (P=0.03) but not with that of SDC1 or CD24 (P=0.33 and P=0.74); CD24 protein expression was significantly correlated with SDC1 and SPP1 (P=0.00 and P=0.00) but not MMP1 protein expression (P=0.74); SDC1 protein expression was significantly correlated with SDC1 and SPP1 (P=0.00, P=0.00) but not MMP1 protein expression (P=0.33); and SPP1 protein expression was significantly correlated with MMP1, CD24, and SDC1 protein expression (P=0.03, P=0.00, and P=0.00) (Tables 9-12).
Table 8

MMP1, CD24, SDC1 and SPP1 protein expression in cancer and paracancerous tissues

Paracancerous tissue (%)Cancer tissue (%)χ2 P
SPP1117.3160.00
Low644 (98.9)531 (80.8)
High7 (1.1)126 (19.2)
SDC1237.3890.00
Low641 (98.5)432 (65.8)
High10 (1.5)225 (34.2)
CD24186.6490.00
Low641 (98.5)469 (71.4)
High10 (1.5)188 (28.6)
MMP1211.3850.00
Low643 (98.8)455 (69.3)
High8 (1.2)202 (30.7)
*,P<0.05; **,P<0.01. **,P<0.01. **,P<0.01. *,P<0.05; **,P<0.01.

Discussion

In this study, two microarray datasets GSE26910 and GSE42568 were collected from GEO database, and a total of 121 DEGs between breast cancer and non-cancer tissues were identified, comprising 31 up-regulated and 90 down-regulated genes. Bioinformatics analysis was conducted based on these DEGs. First, several target small molecules that could be used for inhibition of breast cancer development were identified. Prostaglandin J2, the endogenous product of the cyclooxygenase pathway, mediated pro- and anti-inflammatory effects through receptor-dependent or -independent pathways. Moreover, 15-deoxy-δ (12,14)-prostaglandin J (2), one of the main subtypes of prostaglandin J2, inhibited cancer growth through arresting cell growth in G2/M phase and inducing apoptosis of breast cancer cells [20]. However, this requires further exploration through in vivo and clinical studies. Tanespimycin, an inhibitor of heat shock protein 90, combination with trastuzumab showed higher anticancer affection in metastatic breast cancer patients for HER2+ molecular subtype, compared with trastuzumab alone in a phase II study [19]. However, tanespimycin had no effect in metastatic or locally advanced, unresectable breast cancer in a phase II study involving three patients. Hence, further study of tanespimycin in unselected breast cancer patients were not recommended [20]. However, tanespimycin is a potential targeted therapy drug in selected breast cancer populations. Semustine, a 4-methyl derivative of lomustine, is widely used in glioma therapy [21]. The growth of MCF-7 breast carcinoma cells was shown to be inhibited by semustine, but the mechanism and effects of semustine in vivo remain unclear [22]. A benzylisoquinoline alkaloid, 5182598, is considered to be an important anticancer drug because it can repair damaged metabolic pathways in metastatic prostate carcinoma [23]; its effects on breast cancer need to be further clarified. Flunarizine, a selective calcium entry blocker with calmodulin-binding properties and histamine H1-blocking activity has been reported to inhibit tumor cell growth in melanoma and lymphoma through inhibiting the Wnt pathway [24]. Flunarizine mediated cell autophagy by degrading N-Ras induction to inhibit growth of basal-like tumor cells in vitro and in vivo, with low toxicity; thus, it should be the subject of further investigation as a potential targeted therapy [25]. Second, upregulated DEGs mainly associated with endodermal cell differentiation and tumor behavior; generally, tumors with immature differentiation are much more aggressive than those with more mature differentiation. In breast cancer, stem cell activation and inhibition of cell differentiation are associated with tumorigenesis. Hypoxia is correlated with tumor differentiation, and increased protein levels of HIF-1 and HIF-2 are linked to worse prognosis of breast cancer patients [26]. Wnt/β-catenin pathway, a classic pathway in the formation of cancer stem cells, could enhance tumor growth through inhibiting cell differentiation [27]. Cell component GO enrichment analysis showed that the DEGs were mainly associated with the proteinaceous ECM. The ECM is an important factor in tumor growth, migration, and vascular formation in human breast cancer [28]. Cancer-associated fibroblasts have been found in the ECM and shown to have a role in breast cancer growth and chemoresistance. Hoang et al. found that ERK5, a member of the MAPK family, regulated the ECM to induce tumor proliferation and migration in triple-negative breast cancer [29]. Molecular function GO enrichment analysis indicated that the DEGs were mainly associated with lipid binding. Abnormal lipid metabolism is closely linked with tumorigenesis [30]. FAC, a key enzyme in fatty acid biosynthesis, acts as a metabolic oncogene in cancer growth. In breast cancer, HBXIP could modulate abnormal lipid metabolism and tumor growth by activating FAS signaling [31]. The long-chain ω-3 fatty acids have an important role in inflammation resolution, inhibiting breast cancer occurrence through production of lipid mediators. A meta-analysis of 16 cohort studies indicated that ω-3 intake was associated with a reduction in breast cancer risk [32]. The KEGG pathway analysis for DEGs mainly focused on ECM–receptor interactions, which have important roles in tumor growth and migration [33]. Third, four hub genes that could have important key functions in tumor growth in breast cancer, MMP1, SDC1, CD24, and SPP1, were identified using GEO and TCGA public datasets as potential prognostic biomarkers. In breast cancer tissue samples, MMP1, SDC1, CD24, and SPP1 showed higher protein expression compared with that in tumor-adjacent tissues, and their expression was positively correlation with tumor subtype and grade. MMP1, a member of the matrix metalloproteinase family, functions as an interstitial collagenase and fibroblast collagenase. Higher MMP1 expression could predict worse disease-free survival (DFS) and OS in patients with invasive breast cancer, but the mechanism underlying this association is not clear [34]. In the breast cancer xenograft model, reduction of MMP-1 expression significantly inhibited breast cancer growth, brain metastasis, and lung metastasis through reducing TGFa release and phosphor-EGFR expression [35]. In triple-negative breast cancer tissues, MMP1 protein expression positively depended on lymph node metastasis; furthermore, in an in vitro study, knockdown of MMP1 inhibited cell proliferation in triple-negative breast cancer MBA-231 cells [36]. In MCF-7 breast cancer cells, MMP1 is activated by Slug and enhances multidrug resistance; knockdown of Slug reduced MMP1 expression in these cells, further enhancing adriamycin resistance [37]; in immune analysis, in luminal breast cancer, MMP1 is negatively associated with tumor survival. Our results are almost consistent with those of previous studies. MMP1 has potential as a prognostic biomarker and therapeutic target in breast cancer, but more in vivo and clinical studies are required. CD24 is a small and heavily glycosylated mucin-like glycosylphosphatidyl-inositol-linked cell surface protein, was detected in several types of carcinomas but is rarely expressed in normal tissues [38]. In breast cancer, the effects of CD24 on prognosis in terms of OS and DFS are controversial. Meta-analysis with breast cancer tissues indicated that higher CD24 expression was associated with shorter OS and correlated with tumor stage and lymph node metastasis [39]. Moreover, Jing et al. found that CD24 overexpressed in cancer tissues than in normal breast tissue commonly; in addition, CD24 expression was closely correlated with SDC1 mRNA expression, indicating that it could serve as a prognostic indicator for breast cancer [40]. In our study, CD24 protein expression was positively correlated with tumor grade, subtype and SDC1 expression, with higher expression of CD24 in breast cancer tissues compared with tumor-adjacent tissues. CD24 could thus be a prognostic biomarker and therapeutic target in breast cancer, but more in vitro and in vivo studies are needed to clarify its potential. SDC1, a heparin cell surface proteoglycan that functions as a growth factors and chemokines co-receptor, which strongly correlated to the tumor aggressiveness and clinical results [41]. In breast cancer, higher expression of SDC1 is correlated to worse OS and positively correlated with grade. Cui et al. used the database to identify SDC1 as positively associated with PLAU expression and a potential prognostic marker and target in breast cancer [42]; however, the clinical effects and prognostic value were controversial. Knockdown of SDC1 could increase cell adhesion and motility dependent on integrin expression and IL6 secretion [43]. SDC1 is positively correlated with tumor subtype and grade according to our study and could thus become a prognostic biomarker. In basal breast cancer, SDC1, CD4+ T cells, and DCs were negatively correlated with tumor survival; however, the associated molecular mechanism needs to be further verified. SPP1 is a protein overexpressed in breast tumors. Higher plasma levels of OPN with shorter OS in patients through inducing the tumor burden[44]. In our study, SPP1 expressed much higher in breast cancer tissue than in para cancerous tissues. SPP1 expression was correlated with the Luminal B breast cancer tissue. Fourth, the higher expression of MMP1, CD24, SDC1 and SPP1 enriched in NK cell-mediated cytotoxicity and T cell receptor signaling pathway according to GSEA analysis indicated that immune cell infiltration might be correlated with expression of hub genes. High expression of MMP1 was positively correlated with infiltration of CD4+ T cells, CD8+ T cells, DC, neutrophils, and macrophages. In dermatofibrosarcoma protuberance tumor tissues, MMP1 was found to be prominent in tumor-associated macrophages [45]. CD24 expression was positively correlated with CD8+ T cell and neutrophil infiltration. In ovarian and triple-negative breast cancers, CD24 signaling could serve as a target for cancer immunotherapy through enhancing tumor-associated macrophage expression of Siglec-10 [46]. CD24 can aggravate acute liver injury via expression of IFN-γ on CD4+ T cells [47]. CD24 has potential as an immune therapy target in breast cancer, but further in vitro and in vivo studies are needed to clarify this. SDC1 expression was significantly positively correlated with infiltration of CD8+ T cells, macrophages, DC, and neutrophils. In pancreatic cancer, SDC1 regulates micropinocytosis to enhance tumor growth [48]. More studies are needed to explore whether SDC1 could be used as an immune therapy target in breast cancer. SPP1 was positively linked with infiltration of CD4+ T cells, CD8+ T cells, macrophages, neutrophils, and DC. In lung cancer, SPP1 enhanced PD-L1 expression and mediated macrophage polarization to facilitate immune escape [49]. SPP1 could have an important role as an immune therapy target in breast cancer. In our results, high expression of MMP1, CD24, SDC1, and SPP1 correlated to the development of breast cancer, worse OS and immune cell infiltration, indicating that MMP1, CD24, SDC1, and SPP1 might be as the potential prognostic biomarkers and immunotherapy targets for breast tumor. We verified the results in several different datasets, including our own clinical datasets. However, there were some limitations of our study. First, there was a lack of in vitro and in vivo studies to verify the results, some other datasets were not used for verification the hub genes and immunity, like ArrayExpress database, InSilicoDB and METABRIC [50,51]. Second, further study of the roles of the four hub genes in different subtypes of breast cancer is required. Third, the survival results of our own clinical datasets not be analyzed. Finally, the molecular status of the four hub genes with respect to mutation and methylation was not checked in this study. Hence, further study is required to determine whether MMP1, CD24, SDC1, and SPP1 could be used as biomarkers or immune therapy targets in breast cancer. Click here for additional data file.
Table 4

The Cox proportional hazard model of SDC1, SPP1, CD24, MMP1 and tumor-infiltrating immune cells in breast carcinoma (TIMER)

BRCACoefHR95% CI-l95% CI-uP-value
Stage 20.3551.4260.7572.6880.272
Stage 31.2113.3571.7536.4270.000*
Stage 42.41811.2185.03624.9910.000*
Purity0.4121.5090.5424.2040.431
CD8-T cell−1.1260.3240.0185.8390.445
CD4-T cell2.40611.0940.135914.8720.285
Macrophage2.61913.7170.759247.730.076
Neutrophil0.9862.680.008897.3840.74
Dendritic−1.4450.2360.0282.0020.186
SDC1−0.0310.9690.7931.1850.762
SPP1−0.0350.9660.8371.1140.633
CD240.1891.2081.071.3630.002*
MMP10.1081.1141.0041.2370.042*

*,P<0.05.

Table 5

The Cox proportional hazard model of SDC1, SPP1, CD24, MMP1 and tumor-infiltrating immune cells in basal like breast carcinoma (TIMER)

BRCA-basalCoefHR95% CI-l95% CI-uP-value
Stage 29.7551724.455628.552833.270.000*
Stage 311.60110916.8336549.63.26*10⁁50.000*
Stage 413.30259858.153221.186.73*10⁁60.000*
Purity6.405605.0421.9116710.190.000*
CD8-T cell−1.2090.2980.00328.120.602
CD4-T cell18.874157415300412998.975.999*10⁁100.000*
Macrophage4.01555.40.14121703.20.188
Neutrophil1.4174.130.023737.40.592
Dendritic−2.9810.0510.0080.3110.001*
SDC10.6231.8651.0823.2140.025*
SPP10.3521.5221.0221.980.037*
CD24−0.3390.7120.4661.090.118
MMP1−0.1370.8720.7941.0950.237

*,P<0.05.

Table 6

The Cox proportional hazard model of SDC1, SPP1, CD24, MMP1 and tumor-infiltrating immune cells in luminal breast carcinoma (TIMER)

BRCA-luminalCoefHR95% CI-l95% CI-uP-value
Stage 20.0421.0430.5032.1610.91
Stage 30.7452.1070.9454.70.069
Stage 42.158.5833.16423.290.000*
Purity−0.3840.6810.1642.8350.598
CD8-T cell1.5234.5870.047451.930.515
CD4-T cell2.93718.8650.01131150.40.437
Macrophage2.2019.0340.155526.650.289
Neutrophil8.5665248.6480.2361.17*10⁁80.093
Dendritic−5.1140.0060.0000.2560.008*
SDC1−0.3430.710.540.9330.014*
SPP1−0.0320.9690.7971.1770.75
CD240.1681.1831.0181.3750.028*
MMP10.2841.1320.7941.560.001*

*,P<0.05.

Table 7

The Cox proportional hazard model of SDC1, SPP1, CD24, MMP1 and tumor-infiltrating immune cells in HER2+ breast carcinoma (TIMER)

BRCA-HER2CoefHR95% CI-l95% CI-uP value
Stage 215.686.45*10⁁60.000Inf0.999
Stage 319.5743.168*10⁁80.000Inf0.999
Stage 420.4317.466*10⁁80.000Inf0.999
Purity4.751115.70.5252.548*10⁁40.084
CD8-T cell−2.2660.1040.0008.899*10⁁70.829
CD4-T cell14.2131.487*10⁁60.0003.54*10⁁180.328
Macrophage4.79120.340.0001.234*10⁁130.711
Neutrophil−10.2410.0000.0003.602*10⁁120.608
Dendritic−0.1390.870.0001.299*10⁁50.982
SDC1−1.0660.3450.1350.8810.026*
SPP1−0.3770.6860.3011.5650.371
CD240.5211.6840.7613.7260.198
MMP10.1231.1310.5592.2880.732

*,P<0.05.

Table 9

MMP1 expression in carcinoma tissue and the correlation with clinical factors

CharacteristicsMMP1 (low)MMP1 (high)P-value
Age (years)
 ≤500.12
 >50333157
12041
Grade 0.01 *
 0/I66
 II207113
 III19765
Vascular invasion0.76
 N378167
 Y7330
Stage0.24
 I16260
 II19893
 III8341
Subtype 0.00 **
 Luminal A194114
 Luminal B9241
HER2-positive6414
Triple-negative9924
N0.66
 Negative264111
 Positive18785
CD240.74
 Low32851
 High123145
SDC10.33
 Low303124
 High14872
SPP1 0.03 *
 Low376149
 High7547

*,P<0.05;

**,P<0.01.

Table 10

CD24 expression in carcinoma tissue and the correlation with clinical factors

CharacteristicsCD24 (low)CD24 (high)P-value
Age (years)0.12
 ≤50351139
 >5012237
Grade 0.00 **
 0/I111
 II25466
 III17290
Vascular invasion0.78
 N399146
 Y7429
Stage0.36
 I16756
 II21081
 III9635
Subtype 0.00 **
 Luminal A25652
 Luminal B10231
HER2-positive3840
Triple-negative7748
N0.70
 Negative272103
 Positive20171
MMP10.74
 Low328123
 High14551
SDC1 0.00 **
 Low33097
 High14377
SPP1 0.00 **
 Low400125
 High7349

**,P<0.01.

Table 11

SDC1 expression in carcinoma tissue and the correlation with clinical factors

CharacteristicsSDC1 (low)SDC1 (high)P-value
Age (years)0.86
 ≤50322168
 >5010754
Grade0.00**
 0/I111
 II22892
 III152110
Vascular invasion0.07
 N359186
 Y6736
Stage0.36
 I15371
 II187106
 III8844
Subtype 0.00 **
 Luminal A23971
 Luminal B9243
HER2-positive3050
Triple-negative6758
N0.45
 Negative253124
 Positive17698
MMP10.33
 Low303148
 High12472
CD24 0.00 **
 Low330143
 High9777
SPP1 0.00 **
 Low371156
 High5866

**,P<0.01.

Table 12

SPP1 expression in carcinoma tissue and the correlation with clinical factors

CharacteristicsSPP1 (low)SPP1 (high)P-value
Age (years)0.28
 ≤5039298
 >5013526
Grade 0.00**
 0/I120
 II27743
 III19171
Vascular invasion0.37
 N444101
 Y8023
Stage0.36
 I18242
 II23657
 III10923
Subtype 0.00**
 Luminal A27337
 Luminal B10926
HER2-positive5426
Triple-negative9035
N0.29
 Negative30077
 Positive22747
MMP1 0.03*
 Low37675
 High14947
CD24 0.00**
 Low40073
 High12549
SDC1 0.00**
 Low37158
 High15666

*,P<0.05;

**,P<0.01.

  50 in total

1.  clusterProfiler: an R package for comparing biological themes among gene clusters.

Authors:  Guangchuang Yu; Li-Gen Wang; Yanyan Han; Qing-Yu He
Journal:  OMICS       Date:  2012-03-28

Review 2.  Chloroethylating nitrosoureas in cancer therapy: DNA damage, repair and cell death signaling.

Authors:  Teodora Nikolova; Wynand P Roos; Oliver H Krämer; Herwig M Strik; Bernd Kaina
Journal:  Biochim Biophys Acta Rev Cancer       Date:  2017-01-29       Impact factor: 10.680

3.  Flunarizine exhibits in vitro efficacy against lymphoma and multiple myeloma cells.

Authors:  Leonard Christopher Schmeel; Frederic Carsten Schmeel; Young Kim; Sabine Blaum-Feder; Tomoyuki Endo; Ingo G H Schmidt-Wolf
Journal:  Anticancer Res       Date:  2015-03       Impact factor: 2.480

4.  TIMER: A Web Server for Comprehensive Analysis of Tumor-Infiltrating Immune Cells.

Authors:  Taiwen Li; Jingyu Fan; Binbin Wang; Nicole Traugh; Qianming Chen; Jun S Liu; Bo Li; X Shirley Liu
Journal:  Cancer Res       Date:  2017-11-01       Impact factor: 12.701

Review 5.  Biomarker of long-chain n-3 fatty acid intake and breast cancer: Accumulative evidence from an updated meta-analysis of epidemiological studies.

Authors:  Bo Yang; Xiao L Ren; Zhi Y Wang; Liang Wang; Feng Zhao; Xiao J Guo; Duo Li
Journal:  Crit Rev Food Sci Nutr       Date:  2018-07-09       Impact factor: 11.176

6.  Syndecan-1 modulates β-integrin-dependent and interleukin-6-dependent functions in breast cancer cell adhesion, migration, and resistance to irradiation.

Authors:  Hebatallah Hassan; Burkhard Greve; Mauro S G Pavao; Ludwig Kiesel; Sherif A Ibrahim; Martin Götte
Journal:  FEBS J       Date:  2013-01-31       Impact factor: 5.542

7.  Effect of curcumin on the cell surface markers CD44 and CD24 in breast cancer.

Authors:  Gloria M Calaf; Richard Ponce-Cusi; Jorge Abarca-Quinones
Journal:  Oncol Rep       Date:  2018-04-20       Impact factor: 3.906

8.  GEPIA: a web server for cancer and normal gene expression profiling and interactive analyses.

Authors:  Zefang Tang; Chenwei Li; Boxi Kang; Ge Gao; Cheng Li; Zemin Zhang
Journal:  Nucleic Acids Res       Date:  2017-07-03       Impact factor: 16.971

9.  CD24 aggravates acute liver injury in autoimmune hepatitis by promoting IFN-γ production by CD4+ T cells.

Authors:  Chenhong Zheng; Shulei Yin; Yang Yang; Yizhi Yu; Xiaohua Xie
Journal:  Cell Mol Immunol       Date:  2017-01-09       Impact factor: 11.530

10.  MMP-1 is overexpressed in triple-negative breast cancer tissues and the knockdown of MMP-1 expression inhibits tumor cell malignant behaviors in vitro.

Authors:  Qi-Min Wang; Li Lv; Ying Tang; Li Zhang; Li-Fen Wang
Journal:  Oncol Lett       Date:  2018-11-30       Impact factor: 2.967

View more
  2 in total

1.  Identification of Significant Secreted or Membrane-Located Proteins in Laryngeal Squamous Cell Carcinoma.

Authors:  Li Yan; Chunyan Hu; Yangyang Ji; Lifen Zou; Yang Zhao; Yi Zhu; Xiaoshen Wang
Journal:  J Immunol Res       Date:  2022-05-23       Impact factor: 4.493

Review 2.  CD24: A Novel Target for Cancer Immunotherapy.

Authors:  Emmanouil Panagiotou; Nikolaos K Syrigos; Andriani Charpidou; Elias Kotteas; Ioannis A Vathiotis
Journal:  J Pers Med       Date:  2022-07-28
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.