Literature DB >> 33907159

Mining immune-related genes with prognostic value in the tumor microenvironment of breast invasive ductal carcinoma.

Qiang He1, Shuyin Xue2, Qingbiao Wa1, Mei He1, Shuang Feng1, Zhibing Chen1, Wei Chen3, Xinrong Luo3.   

Abstract

ABSTRACT: The tumor microenvironment (TME) plays an important role in the development of breast cancer. Due to limitations in experimental conditions, the molecular mechanism of TME in breast cancer has not yet been elucidated. With the development of bioinformatics, the study of TME has become convenient and reliable.Gene expression and clinical feature data were downloaded from The Cancer Genome Atlas database and the Molecular Taxonomy of Breast Cancer International Consortium database. Immune scores and stromal scores were calculated using the Estimation of Stromal and Immune Cells in Malignant Tumor Tissues Using Expression Data algorithm. The interaction of genes was examined with protein-protein interaction and co-expression analysis. The function of genes was analyzed by gene ontology enrichment analysis, Kyoto Encyclopedia of Genes and Genomes analysis and gene set enrichment analysis. The clinical significance of genes was assessed with Kaplan-Meier analysis and univariate/multivariate Cox regression analysis.Our results showed that the immune scores and stromal scores of breast invasive ductal carcinoma (IDC) were significantly lower than those of invasive lobular carcinoma. The immune scores were significantly related to overall survival of breast IDC patients and both the immune and stromal scores were significantly related to clinical features of these patients. According to the level of immune/stromal scores, 179 common differentially expressed genes and 5 hub genes with prognostic value were identified. In addition, the clinical significance of the hub genes was validated with data from the molecular taxonomy of breast cancer international consortium database, and gene set enrichment analysis analysis showed that these hub genes were mainly enriched in signaling pathways of the immune system and breast cancer.We identified five immune-related hub genes with prognostic value in the TME of breast IDC, which may partly determine the prognosis of breast cancer and provide some direction for development of targeted treatments in the future.
Copyright © 2021 the Author(s). Published by Wolters Kluwer Health, Inc.

Entities:  

Mesh:

Substances:

Year:  2021        PMID: 33907159      PMCID: PMC8084029          DOI: 10.1097/MD.0000000000025715

Source DB:  PubMed          Journal:  Medicine (Baltimore)        ISSN: 0025-7974            Impact factor:   1.817


Introduction

Breast cancer is the most common malignant tumor in women worldwide.[ Although comprehensive treatments such as surgery, chemotherapy, radiotherapy and molecular targeted therapy have effectively reduced the mortality rate of breast cancer, a large number of patients still die from local recurrence and distant metastasis.[ Increasing evidence indicates that failures in treatment may be attributed to dysfunction in the tumor microenvironment (TME).[ The TME refers to the environment in which tumors occur, grow, invade and metastasize, which is divided into tumor components and non-tumor components.[ The latter contains cellular components (such as immune cells, fibroblasts, endothelial cells, inflammatory cells, adipocytes and other stromal cells) and non-cellular components (such as cytokines and extracellular matrix).[ The TME is in a dynamic state and is heterogeneous between different tumors.[ Stromal cells interact with tumor cells by secreting extracellular matrix proteins, chemokines and cytokines, resulting in activation of tumor cells that already have genetic abnormalities.[ Studies have shown that the clinical characteristics and prognoses of patients can be predicted by analyzing the features of gene expression and cell infiltration in the stromal environment.[ Tumor-infiltrating lymphocytes (TILs) are important prognostic indicators for triple-negative breast cancer (TNBC).[ Tomioka et al. found that low levels of TILs and high levels of programmed death-ligand 1 (PD-L1) in the TME are unfavorable to the prognosis of TNBC patients.[ In addition, blockade of some immune checkpoint proteins, such as CTLA-4, PD-1 and PD-L1, shows promising effects in breast cancer treatment.[ Therefore, the study of TME is of great significance for the early diagnosis and targeted treatment of breast cancer. In recent years, with the development of high-throughput genome sequencing technology, many biological information databases of high-quality have emerged, such as The Cancer Genome Atlas (TCGA; https://portal.gdc.cancer.gov), Molecular Taxonomy of Breast Cancer International Consortium (METABRIC; http://www.cbioportal.org), Gene Expression Omnibus (GEO; https://www.ncbi.nlm.nih.gov/geo) and International Cancer Genome Consortium (https://icgc.org), which can be analyzed freely by researchers, providing the support of big data for tumor research. Yoshihara et al. designed the Estimation of Stromal and Immune Cells in Malignant Tumor Tissues Using Expression Data (ESTIMATE) algorithm based on genetic sequencing data of tumor samples to study the infiltration of immune cells and stromal cells in the TME and assess tumor purity.[ The ESTIMATE algorithm generates an immune score, stromal score and estimate score through GSEA, which represent the infiltration of immune cells, the presence of stroma and tumor purity of the TME, respectively.[ At present, the ESTIMATE algorithm has been used to analyze the TME of various tumors, such as breast cancer, glioblastoma and prostate cancer.[ Studies on breast cancer have found that immune and stromal scores are related to the clinical characteristics and prognoses of patients, and that scores differ between races.[ However, the detailed characteristics and molecular mechanisms of TME in breast cancer have not yet been fully elucidated. In this study, the ESTIMATE algorithm was utilized to study the relationship between immune/stromal scores and clinical characteristics of breast invasive ductal carcinoma (IDC) patients. The R project was used to screen for hub genes with clinical significance in the TME of IDC, which may provide clues about the underlying molecular mechanisms active in breast TME.

Methods

Database

All data in this study are from public domain, no approval from the ethics committee is required. Gene expression profiles and clinical data of breast cancer patients were downloaded from TCGA (https://portal.gdc.cancer.gov) and the METABRIC database (http://www.cbioportal.org). Immune scores and stromal scores were calculated using the ESTIMATE algorithm in the R project (www.r-project.org).[ The immune/stromal scores and gene expression levels were compared between groups according to age, molecular subtype, TNM stage, adjuvant treatment, cellularity and menopause state.

Identification of differentially expressed genes (DEGs)

DEGs were selected using R package “limma” with FDR < 0.05 and log2 FC> 1.[ The heatmap was drawn with R package “pheatmap”,[ and the Venn diagram was drawn with R package “VennDiagram”.[

Functional and pathway enrichment analysis of DEGs

Functional and pathway enrichment analysis of DEGs was performed using R packages “clusterProfiler”, “org.Hs.eg.db,” “enrichplot” and “ggplot2”.[ The gene ontology analysis included biological process (BP), molecular function (MF) and cell component (CC). Kyoto Encyclopedia of Genes and Genomes analysis shows the enrichment of DEGs in signaling pathways.[ Gene set enrichment analysis (GSEA) (https://gsea-msigdb.org) was performed to study the enrichment in signaling pathways of the hub genes with data from the METABRIC database.[

Protein-protein interaction (PPI) network analysis

The PPI network analysis of DEGs was performed with STRING (https://string-db.org).[ The minimum required interaction score was 0.7 (high confidence). Edges and nodes of the PPI were counted in screening hub genes. Co-expression analysis was performed to identify the genes most positively or negatively related to the hub genes.[

Prognostic analysis

Patients were divided into low and high expression groups according to the median expression level of genes and Kaplan-Meier survival analysis was used to analyze overall survival (OS) using the R package “survival”.[ The univariate and multivariate Cox regression analyses were performed to select genes with prognostic value and ROC analysis was performed to evaluate the diagnostic capability of genes with the package “survivalROC”.[

Statistical analysis

All statistical analyses were performed using R project (version 3.5) and SPSS 22.0 software (IBM, Armonk, NY, USA). P values were calculated with the log-rank test and P < 0.05 was considered statistically significant.

Results

Immune scores and/or stromal scores correlate with overall survival and clinical characteristics of breast IDC patients

Immune scores and stromal scores, calculated with the ESTIMATE algorithm based on genetic sequencing data of tumor samples, reflect the presence of immune cells and stromal cells, respectively, and tumor purity. Studies have shown that the immune score and stromal score are valuable for tumor diagnosis and prognostic evaluation, including breast cancer.[ IDC and invasive lobular carcinoma (ILC) are the two most common pathological types of breast cancer.[ To compare the immune/stromal scores in these two types of breast cancer, the gene expression profiles of 964 breast cancer patients from TCGA were first downloaded, including 764 IDC and 200 ILC profiles. Then the immune scores and stromal scores were calculated by using the ESTIMATE algorithm. The results showed that the immune scores of IDC ranged from -1129.95 to 3661.56, with an average of 712.72, and the immune scores of ILC ranged from −796.34 to 3142.77, with an average of 941.17. The stromal scores of IDC ranged from −2070.44 to 2058.27, with an average of 401.84, and the stromal scores of ILC ranged from −1143.49 to 2099.46, with an average of 715.75. Therefore, both the immune scores and stromal scores of IDC were significantly lower than those of ILC (Fig. 1A, P < .0001).
Figure 1

Immune scores and stromal scores correlate with clinical features and OS of breast IDC. (A) Comparison of immune/stromal scores between IDC and ILC of breast. (B) Relationship between immune/stromal scores and OS of patients with breast IDC. (C-H) Comparison of immune scores according to age, molecular subtype, AJCC stage and TNM stage of patients with breast IDC. (I-N) Comparison of stromal scores according to age, molecular subtype, AJCC stage and TNM stage of patients with breast IDC.

Immune scores and stromal scores correlate with clinical features and OS of breast IDC. (A) Comparison of immune/stromal scores between IDC and ILC of breast. (B) Relationship between immune/stromal scores and OS of patients with breast IDC. (C-H) Comparison of immune scores according to age, molecular subtype, AJCC stage and TNM stage of patients with breast IDC. (I-N) Comparison of stromal scores according to age, molecular subtype, AJCC stage and TNM stage of patients with breast IDC. Studies have shown that immune and stromal scores are related to clinical characteristics of tumors, such as age at diagnosis, clinical stage, tumor size, local and distant metastasis, pathological type, etc.[ In order to study the relationship between immune/stromal scores and clinical characteristics of IDC, the clinical data of the above 764 IDC patients were downloaded from TCGA, including age, AJCC stage, TMN stage, molecular subtypes and survival information (Table 1). The patients were divided into low and high immune/stromal score groups according to the median of scores, and Kaplan-Meier analysis was performed to compare the OS between the low score and high score groups. The results showed that the OS of patients with high immune scores was significantly longer than that of patients with low immune scores (Fig. 1B, p = 0.004). Next, the correlation between immune/stromal scores and clinical characteristics of IDC patients was analyzed. The results indicated that the immune scores were significantly related to the molecular subtype of breast IDC (Fig. 1D, P < .0001), and the stromal scores were significantly related to the molecular subtype (Fig. 1J, P < .0001), AJCC stage (Fig. 1K, P < .0001) and T stage (Fig. 1L, P = .009) of breast IDC. Other results were not statistically significant (Fig. 1C, E-I, M, and N). Therefore, immune scores and/or stromal scores are closely related to the clinical characteristics and OS of patients with IDC.
Table 1

Clinical characteristics and immune/stromal scores of patients from The Cancer Genome Atlas database with breast invasive ductal carcinoma.

CharacteristicsNumber of case (%) 764Immune score (range)Stromal score (range)
Age
 ≤55343 (44.9)−983.81 to 3115.37−1437.84 to 2058.27
 >55421 (55.1)−1129.95 to 3661.56−2070.44 to 2015.35
AJCC stage
 I139 (18.2)−1002.17 to 3060.80−1166.48 to 1784.36
 II441 (57.7)−1129.95 to 3661.56−2070.44 to 2058.27
 III152 (19.9)−763.283 to 3115.37−1524.09 to 1933.37
 IV16 (2.1)−735.324 to 1728.34653.169 to 718.05
 Unknow16 (2.1)−151.38 to 2170.75−609.76 to 1610.23
T stage
 T1217 (28.4)−1018.74 to 3060.80−1173.81 to 1784.36
 T2456 (59.7)−1129.95 to 3661.56−2070.44 to 2058.27
 T358 (7.6)−735.32 to 3115.37−1093.36 to 1664.96
 T431 (4.1)−554.44 to 2060.92−1524.09 to 1537.87
 Unknow2 (0.2)−353.38 to 117.33−396.75 to -244.47
N stage
 N0350 (45.8)−1018.74 to 3661.56−2070.44 to 2015.35
 N1269 (35.2)−1129.95 to 3115.37−1268.12 to 2058.27
 N293 (12.2)−763.28 to 2800.07−1328.32 to 1933.37
 N338 (5.0)−735.32 to 2136.87−1056.02 to 1635.95
 Unknow14 (1.8)−462.91 to 1441.60−509.84 to 914.78
M stage
 M0662 (86.6)−1129.95 to 3661.56−2070.44 to 2058.27
 M118 (2.4)−735.32 to 1728.34−783.53 to 1627.82
 Unknow84 (11.0)−870.44 to 2805.71−1328.32 to 1699.96
Molecular subtype
 Luminal A181 (23.7)−943.65 to 3076.70−1268.12 to 1754.23
 Luminal B141 (18.5)−1018.74 to 3661.56−2070.44 to 2015.35
 HER2 enriched65 (8.5)−840.66 to 2800.07−1093.36 to 1500.36
 Basal-like118 (15.4)−1129.95 to 3115.37−1173.81 to 1910.33
 Normal-like5 (0.7)635.97 to 2694.14−259.59 to 1933.37
 Unknow254 (33.2)−870.44 to 2805.70−1437.84 to 2058.27
Clinical characteristics and immune/stromal scores of patients from The Cancer Genome Atlas database with breast invasive ductal carcinoma.

DEGs in the TME of breast IDC and their functions

Studies have shown that networks of gene regulation vary in different types of TMEs.[ To study the relationship between immune/stromal scores and gene expression, the gene expression profiles of 764 breast IDC patients were analyzed and the results are displayed in the form of a heatmap (Fig. 2A). The results showed that compared with the low immune score group, the high immune score group had 763 high expression genes and 106 low expression genes (fold change [FC]> 2, false discovery rate [FDR] < 0.05). Compared with the low stromal score group, the high stromal score group had 566 high expression genes and 106 low expression genes (FC> 2, FDR < 0.05). By intersecting these DEGs, we found that there were 164 common high-expressing genes and 15 common low-expressing genes (Fig. 2B).
Figure 2

DEGs according to immune scores and stromal scores. The DEGs were compared between low and high immune/stromal score groups according to the median score. (A) Heatmap showing DEGs according to immune and stromal scores. (B) Venn diagram showing the intersection of DEGs between immune score and stromal score groups. (C) Results of GO functional enrichment analysis of the DEGs, including BP, MF and CC. (D) Top ten pathways from KEGG analysis of the DEGs.

DEGs according to immune scores and stromal scores. The DEGs were compared between low and high immune/stromal score groups according to the median score. (A) Heatmap showing DEGs according to immune and stromal scores. (B) Venn diagram showing the intersection of DEGs between immune score and stromal score groups. (C) Results of GO functional enrichment analysis of the DEGs, including BP, MF and CC. (D) Top ten pathways from KEGG analysis of the DEGs. To further analyze the function of the 179 common DEGs, Gene ontology function enrichment analysis was performed, including BP, MF and CC. The results showed that these genes were mainly enriched in functions of T cell activation, lymphocyte differentiation, immune response, plasma membrane and cytokine receptor activity. Figure 2C shows the top 15 enriched functions (Fig. 2C). We also performed Kyoto Encyclopedia of Genes and Genomes pathway analysis. These results showed that the 179 genes were mainly enriched in cytokine-cytokine receptor interaction, hematopoietic cell lineage and chemokine signaling pathways. Fig. 2D shows the top ten enriched pathways. Thus, these common DEGs are involved in the regulation of the immune process.

Hub genes in the TME of breast IDC

To study the interaction of DEGs in the TME of breast IDC, a protein-protein interaction (PPI) network was constructed through the STRING database. The results showed that there were 223 edges and 101 nodes in the PPI network of these 179 common DEGs (Fig. 3A). The number of nodes of each gene were counted and the top 30 genes with the most nodes are displayed in Figure 3B. In order to identify the genes with clinical significance, Kaplan-Meier survival analysis was performed for these 30 genes in patients with breast IDC. The results showed that CD5, CD3E, CD40LG, CD52, CD27, CD69 and IL7R were significantly related to the OS of patients with breast IDC (Fig. 3C-I). Then univariate and multivariate Cox regression analysis of these seven genes was performed. The results indicated that the hazard ratio (HR) values of CD3E, CD5, CD27, CD40LG and CD52 were under 1 and statistically significant in both univariate and multivariate Cox regression analyses (Fig. 4A and B, Table 2). Thus, these five genes are likely the hub genes in the TME of breast IDC.
Figure 3

Hub genes in the TME of breast IDC. (A) PPI network of the DEGs. (B) Top 30 genes with the most nodes in the PPI network. (C-I) The Kaplan-Meier survival analysis of patients expressing CD5, CD3E, CD27, CD69, CD40LG, IL7R and CD52.

Figure 4

Clinical relevance of hub genes. (A-B) Univariate and multivariate Cox regression analyses of CD3E and CD40LG, respectively. (C-E) Relationship between gene expression levels and clinical characteristics of patients with breast IDC. C-E show CD52, CD5 and CD27, respectively.

Table 2

Univariate and multivariate Cox regression analysis of patients from the cancer genome atlas database with breast invasive ductal carcinoma.

OS
Univariate analysisMultivariable analysis
VariantsHR (95% CI)P-valueHR (95% CI)P-value
Age1.035 (1.017–1.053)<.001
Stage2.123 (1.616–2.789)<.001
T stage1.508 (1.161–1.957).002
N stage1.942 (1.530–2.466)<.001
M stage5.774 (2.965–11.245)<.001
CD3E0.941 (0.904–0.980).0040.952 (0.914–0.991).016
CD50.882 (0.801–0.971).0100.910 (0.830–0.998).045
CD270.908 (0.847–0.972).0060.927 (0.868–0.989).022
CD40LG0.632 (0.456–0.874).0060.710 (0.521–0.966).029
CD690.927 (0.845–1.018).113
IL7R0.952 (0.908–0.999).044
CD520.978 (0.965–0.992).0020.983 (0.969–0.997).014
Hub genes in the TME of breast IDC. (A) PPI network of the DEGs. (B) Top 30 genes with the most nodes in the PPI network. (C-I) The Kaplan-Meier survival analysis of patients expressing CD5, CD3E, CD27, CD69, CD40LG, IL7R and CD52. Clinical relevance of hub genes. (A-B) Univariate and multivariate Cox regression analyses of CD3E and CD40LG, respectively. (C-E) Relationship between gene expression levels and clinical characteristics of patients with breast IDC. C-E show CD52, CD5 and CD27, respectively. Univariate and multivariate Cox regression analysis of patients from the cancer genome atlas database with breast invasive ductal carcinoma.

Validation in the METABRIC database

To verify the clinical significance of these five hub genes, another large breast cancer database, METABRIC, was analyzed.[ The gene expression data and the complete clinical data of 384 patients with breast IDC from METABRIC were downloaded from the cBioPortal for Cancer Genomics database (http://www.cbioportal.org). The patients were divided into different groups according to their clinical features, such as age at diagnosis, cellularity, chemotherapy, Claudin subtype, ER status, and inferred menopausal status. Then the gene expression level between groups was compared, which showed that the expression levels of CD3E, CD5, CD27, CD40LG, and CD52 significantly differed between groups of clinical features (Fig. 4C-E, Table 3). Therefore, the expression levels of these five genes were related to clinical characteristics of breast IDC.
Table 3

Statistically significant clinical features of each hub gene from clinical correlation analysis.

Gene namesStatistically significant clinical features
CD3ECellularity, chemotherapy, Claudin subtype, ER status, inferred menopausal status
CD40LGCellularity, Claudin subtype
CD52Age at diagnosis, chemotherapy, Claudin subtype, ER status, radiotherapy
CD5Cellularity, chemotherapy, Claudin subtype, ER status, HER2 SNPs
CD27Age at diagnosis, chemotherapy, Claudin subtype, ER status, inferred menopausal status
Statistically significant clinical features of each hub gene from clinical correlation analysis. To further study the function and protein-protein correlation of these five genes, GSEA and co-expression analysis were performed with data from METABRIC.[ The results of GSEA showed that the 5 genes were mainly enriched in immune-related and breast cancer-related signaling pathways, such as T and B cell receptor signaling pathways, natural killer cell-mediated cytotoxicity, MAPK, ERBB, WNT, VEGF signaling pathway, and others (Fig. 5A-B and Table 4). The results of co-expression analysis showed the genes with the strongest correlation with CD3E, CD5, CD27, CD40LG and CD52 (Fig. 5C-D and Table 5). Thus, we infer these five genes may play an important role in the TME to some extent.
Figure 5

Enrichment analysis and co-expression analysis of hub genes. (A-B) GSEA of CD5 and CD27, respectively. C-D show co-expression analysis of CD5 and CD27, respectively.

Table 4

The most enriched signaling pathways of hub genes from gene set enrichment analysis.

KEGG pathwaysGenes
KEGG_VEGF_SIGNALING_PATHWAYCD3E/CD5/CD27/CD40LG/CD52
KEGG_JAK_STAT_SIGNALING_PATHWAYCD3E/CD5/CD27/CD40LG/CD52
KEGG_MAPK_SIGNALING_PATHWAYCD3E/CD27/CD40LG
KEGG_PATHWAYS_IN_CANCERCD3E/CD5/CD27/CD40LG/CD52
KEGG_ERBB_SIGNALING_PATHWAYCD5/CD27/CD40LG
KEGG_WNT_SIGNALING_PATHWAYCD3E/CD27/CD40LG
KEGG_T_CELL_RECEPTOR_SIGNALING_PATHWAYCD3E/CD5/CD27/CD40LG/CD52
KEGG_NATURAL_KILLER_CELL_MEDIATED_CYTOTOXICITYCD3E/CD5/CD27/CD40LG/CD52
KEGG_LEUKOCYTE_TRANSENDOTHELIAL_MIGRATIONCD3E/CD5/CD27/CD40LG/CD52
KEGG_B_CELL_RECEPTOR_SIGNALING_PATHWAYCD3E/CD5/CD27/CD40LG/CD52
Table 5

The top 10 related hub genes from co-expression analysis.

GenesPositively relatedNegatively related
CD3EITK/PTPN7/TBC1D10C/ACAP1/ZAP70CDK14/POP5/TTC8/IFT81/CDKL3
CD5CCR7/P2RY8/PTPRCAP/PVRIG/ITKPOP5/OVOL2/CDK14/KCTD3/CGN
CD27CD2/TCRVB/CD3D/CD247/PTPRCAPPOP5/CGN/P4HTM/CELSR2/TUBB2A
CD40LGGPR18/CCR7/TRAF3IP3/CD5/CXCR5CDK14/CGN/OVOL2/KCTD3/DPCD
CD52TCRVB/GZMA/GZMK/HCST/CD3DP4HTM/CELSR2/POMT2/CCDC24/KDM4B
Enrichment analysis and co-expression analysis of hub genes. (A-B) GSEA of CD5 and CD27, respectively. C-D show co-expression analysis of CD5 and CD27, respectively. The most enriched signaling pathways of hub genes from gene set enrichment analysis. The top 10 related hub genes from co-expression analysis.

Discussion

In this study, the immune scores and stromal scores of breast IDC were calculated using data from TCGA, and the relationship between the scores and clinical characteristics was analyzed. Then, DEGs were analyzed according to the immune/stromal scores and the hub genes in the TME of breast IDC were identified with PPI analysis and univariate/multivariate Cox regression analysis. Finally, the clinical significance and the correlation with expression of these hub genes were validated with data from METABRIC. Dysfunction of the TME is one of the most important causes of breast cancer recurrence and metastasis[. An increasing number of studies have recently focused on features of the breast TME. Chen et al. compared the characteristics of the TME of breast cancer in Asian and Western populations from GEO and TCGA databases by using ESTIMATE and CIBERSORT algorithms. Their results showed that the immune scores of Asian populations were higher than Western populations, especially for luminal B and HER2-enriched breast cancer patients.[ Vincent et al. found that the immune/stromal scores of breast cancer cell lines were inconsistent with those of the corresponding breast cancer tissue by using data from TCGA, GEO, UCSC Cancer Genomics Browser and cBio Cancer Genomics Portal databases.[ In a study of the relationship between DNA methylation and TME, Li et al. found that DNA methylation regulated the immune response by suppressing ACDY6 in the TME of luminal-like breast cancer.[ However, the detailed molecular mechanism for this effect in the TME of breast cancer has not yet been elucidated. It has been reported that immune/stromal scores vary in different molecular subtypes and different histological types of breast cancer.[ In the present study, we compared the immune/stromal scores between breast IDC and ILC. The results showed that the immune scores of breast IDC were significantly lower than those of breast ILC, suggesting that different pathological types of breast cancer have different TME characteristics. According to the meaning of immune scores and stromal scores, we infer that this discrepancy is due to the difference in TME between IDC and ILC, that is, the immune and stromal components in the TME of IDC are less numerous than those of ILC. Several studies are consistent with our speculation. Tian et al. analyzed immune cell-type specific signatures of breast luminal (Lum) A IDC and ILC in TCGA and in the Genotype-Tissue Expression Project. Their results showed that LumA ILC had a higher proportion of high-immune phenotypes compared to LumA IDC. In addition, higher expression was observed for several critical immune checkpoint genes, such as CD274 (PD-L1), PDCD1 (PD-1) and CTLA4 in LumA ILC.[ Lee et al. collected 123 samples of invasive mammary carcinomas and observed the pattern of inflammation. They found that a combination of perilobular and perivascular inflammation composed of B and T cells was seen more frequently in ILC than in IDC.[ Zhao et al. performed a gene expression profiling study analyzing 21 ILC and 38 IDC samples. The results showed that stromal and adipose tissue markers, such as FBN1, KRT5, APM1 and FABP4, presented mainly in the ILC samples.[ However, there is one study that contradicts our speculation. Desmedt et al. compared the number of tumor-infiltrating lymphocytes (TILs) in ER-positive/HER2-negative ILC and IDC. The study indicated that the TIL levels were significantly lower in ILC compared with IDC.[ Thus, previous studies have explained our results in this study to some extent, but it is still too early to draw a conclusion and further research should be conducted to cover more subtypes of breast cancer and in a larger sample size. In addition, our results showed that LumA, LumB and HER2-enriched breast cancer have higher stromal scores and lower immune scores compared with basal-like breast cancer. Normal-like breast cancer had the highest immune scores (Fig. 1D), which is similar to previous findings.[ Current studies indicated that immune scores and stromal scores reflect the purity of the tumor and the level of immune response, that is, low immune/stromal scores indicate high tumor purity, relatively weak immune response and poor prognosis to some extent.[ In our study, we found that the stromal scores were significantly related to AJCC stage and T stage of breast IDC (as the stage increases, the stromal scores decrease). In addition, although not statistically significant, the immune/stromal scores were lower in patients younger than 60 years old (Fig. 5C and I) and in patients with distant metastasis (Fig. 5H and N). Therefore, the immune scores and stromal scores reflect the clinical characteristics of patients with breast IDC to a certain extent. With PPI network analysis and univariate/multivariate Cox regression analysis, five hub genes in the TME of breast IDC with prognostic significance were identified: CD5, CD3E, CD40LG, CD52 and CD27. Receiver operating characteristic (ROC) curves were drawn with R package “survivalROC” to evaluate the diagnostic ability of these hub genes.[ However, none of the genes were statistically significant under the curve values (data not shown). Next, the clinical significance of these genes was validated in the METABRIC database. The results showed that the expression level of these five hub genes was related to the clinical features of patients with breast IDC and were enriched in immune-related and breast cancer-related signaling pathways. Finally, genes whose expression levels were most significantly related to these five hub genes were identified. Thus, our study identified five genes with prognostic significance from the TME of breast IDC. The five hub genes, CD40LG, CD5, CD27, CD3E and CD52, are cluster of differentiation (CD) leukocyte surface antigens, which are mainly related to immune function.[Table 6 summarizes current studies on these five hub genes in breast cancer.[ CD40LG, also called CD154 or CD40L, is mainly expressed in activated T lymphocytes and belongs to the tumor necrosis factor family. It regulates the immune response by binding to its ligand CD40.[ Our results from PPI network analysis showed that CD40LG is closely related to many important proteins, including CD69, VCAM1 and TLR7, and its expression level is an independent prognostic factor in patients with breast IDC. Furthermore, the expression of CD40LG is closely related to cellularity and the Claudin subtype of breast IDC, and the results of GSEA indicated that CD40LG was mainly enriched in WNT, ERBB, VEGF, MAPK and JAK signaling pathways. We are mainly interested in CD40LG since several studies have already reported its function and molecular mechanism in the TME. Brummer et al. found that CCR2 signaling promoted breast cancer cell proliferation and invasion by inhibiting CD40LG while activating CCL2. In addition, high expression of CD40LG is a favorable indicator for the recurrence-free survival of patients with breast IDC.[ Mesenchymal stem cells edited with TNF-α and CD40L enhanced the anti-tumor immune function of mice by promoting the function of Th1 cells and inhibiting the function of Th2 and Treg cells.[ From immunotherapy research, CD8+ T cell-based adoptive cell therapy suppresses tumor development by activating dendritic cells and releasing tumor necrosis factor through the CD40/CD40LG signaling pathway.[ Soliman et al. found that a cell vaccine containing granulocyte macrophage-colony stimulating factor and CD40L enhanced anti-tumor ability in a mouse model of breast cancer by increasing infiltration of CD3+ lymphocytes through IL-2 and TNF-γ activation.[ Studies on non-solid tumors have shown that CD40LG and IL4 promote tumor cell proliferation and chemotherapy resistance by activating STAT3, NF-κB, ERK and AKT signaling pathways.[ Thus, CD40L plays an important role in the TME of various tumors, but the functions and molecular mechanism of CD40L in the TME of breast IDC requires further research.
Table 6

Previous studies on five hub genes in breast cancer.

StudyBreast cancer typeGeneBrief summary
Brummer et al.[42]Breast IDCCD40LGCCR2 signaling promoted breast cancer cell proliferation and invasion by inhibiting CD40LG while activating CCL2. In addition, high expression of CD40LG was a favorable indicator for recurrence-free survival of patients with breast IDC.
Tong et al.[43]Breast cancer cell linesCD40LGSoluble recombinant CD40 ligand (CD40L) molecules effectively inhibited the growth of breast cancer in vivo by inducing apoptosis of breast cancer cells.
Gomes et al.[44]Breast cancer cell linesCD40LGCD40LG inhibited the in vitro growth of CD40+ human breast cancer lines by blocking the cell cycle and inducing apoptosis of breast cancer cells.
Pan et al.[45]Breast IDCCD40LGThe expression levels of CD40/CD40L on B cells and T cells in breast IDC patients were significantly increased, and CD40/CD40L levels had a significant positive relationship with pathological grades.
Voorzanger-Rousselot et al.[46]Breast cancer cell linesCD40LGCD40LG reduced the apoptosis of breast cancer cells induced by chemotherapeutic drugs.
Kim et al.[47]Breast cancer cell linesCD40LGThe CD40-CD40L interaction promoted the proliferation of breast cancer cells (MDA-MB-231) by increasing TGF-β production and Th17 differentiation.
Voorzanger-Rousselot et al.[48]Breast cancer cell linesCD40LGCD40L was expressed on a CD40-positive breast cancer cell line (T47D) and induced an antiapoptotic signal when cells were exposed to cytotoxic agents.
Wang et al.[49]Breast cancer cell linesCD40LGCo-expression of Drosophila melanogaster deoxyribonucleoside kinase and CD40L decreased cell proliferation and induced cell apoptosis of MDA-MB-231 and MCF7 cells in vitro and in vivo.
Yu et al.[50]Breast cancer cell linesCD40LGCD4+T cells in cytokine-induced killer cells induced Fas-dependent apoptosis of MDA-MB-231 cells through CD40/CD40L ligation by inhibiting synthesis of c-FLIP.
Shousha et al.[51]Breast IDCCD5Massive infiltration of axillary lymph nodes with CD5-positive B lymphocytes was found in a breast IDC patient. Strong staining for CD5 was also observed in tumor cells within the metastases of breast and lymph nodes.
Walsh et al.[52]Breast cancer (pathological type not mentioned)CD5A negative correlation was found between CD5 positivity and tumor grade in breast cancer patients.
Alotaibi et al.[53]Breast cancer cell linesCD5The use of a function-blocking anti-CD5 monoclonal antibody or knockout of CD5 inhibited tumor growth in a breast cancer mouse model by enhancing the capability of CD8+ T cells.
Xu et al.[54]Breast cancer (pathological type not mentioned)CD27The rs3136550 CT and rs2267966 AT genotypes of CD27 were associated with a decreased risk of breast cancer. In haplotype analysis, the CCGAG haplotype conferred an increased risk of breast cancer. Significant associations were shown between the SNPs of CD27 and lymph node metastasis, and ER and PR status.
Han et al.[55]Breast cancer cell linesCD27CD27 or 4–1BB-costimulated, self-enriched NKG2D CAR-redirected T cells effectively recognized and inhibited the proliferation of MDA-MB-231 cells in vitro and in vivo.
Wang et al.[56]Breast cancer (pathological type not mentioned)CD52The expression level of CD52 was related to the prognosis and pathological stages of BRCA patients. Analysis based on RNA-seq and clinical data from TCGA datasets showed that CD52 was positively correlated with immune response-related pathways and immune metagenes.
Wang et al.[57]Breast cancer (pathological type not mentioned)CD52The expression level of CD52 was related to the prognosis of breast cancer patients. Tumor-infiltrating immune cell analysis showed the relationship between CD52 expression and CD8+ T cells, activated memory CD4+ T cells, macrophage M1, and gamma delta T cells.
Khatibi et al.[59]Breast cancer cell linesCD3EA recombinant anti-CD3E nanobody effectively suppressed angiogenesis and tumor cell proliferation in a breast cancer mouse model.
Moradi-Kalbolandi et al.[58]Breast cancer cell linesCD3EA purified anti-CD3E nanobody effectively inhibited the growth of breast cancer in vivo.
Previous studies on five hub genes in breast cancer. CD5 is a glycoprotein receptor on the surface of T lymphocytes.[ A recent study found that the use of function-blocking anti-CD5 monoclonal antibody or the knockout of CD5 inhibits tumor growth in a breast cancer mouse model by enhancing the capability of CD8+ T cells.[ Hosaka et al. detected the expression of CD5 in both thymic cancer tissue and lymphoid stroma. Their results showed that the expression of CD5 was significantly increased at the junction of tumor and stroma, and its expression in lymphocytes was significantly higher than in tumor cells. Furthermore, high levels of CD5 indicated elevated Ki-67 and activated tumor cell proliferation.[ Another study on pancreatic ductal adenocarcinoma indicated that Bruton's tyrosine kinase promoted the proliferation of cancer cells by activating CD1dhiCD5+ regulatory B cells, which accumulated in the stroma of pancreatic lesions of mice.[ CD27 is a member of the tumor necrosis factor receptor superfamily, which regulates lymph cell activation and immunoglobulin synthesis by binding to its ligand CD70.[ Its potential in immune therapy has been recently noticed.[ Roberts et al. demonstrated that agonistic anti-CD27 antibodies inhibited the growth and metastasis of melanoma in vivo by enhancing the function of tumor-specific CD8+ T cells.[ In addition, clinical trials showed that an agonist of CD27, varlilumab, was effective and well tolerated in patients with advanced solid tumors by enhancing anti-tumor immunity driven by T cells.[ For CD52 and CD3E, there are relatively fewer studies on their activity in the TME. CD52, also known as CAMPATH-1 antigen, is a glycoprotein which presents on the surface of lymphocytes.[ Current research has mainly focused on the anti-tumor effects of the CD 52 antibody alemtuzumab in patients with lymphoma.[ Together with CD3-gamma, -delta and -zeta, CD3-epsilon/CD3E forms the T cell receptor-CD3 complex, which plays a key role in antigen recognition and signal transduction.[ Azadeh et al. found that a recombinant anti-CD3E nanobody effectively suppressed angiogenesis and tumor cell proliferation in a breast cancer mouse model.[ In summary, the TME of breast cancer is affected by many factors, such as race, tumor type, clinical stage and molecular subtype.[ In this study, the correlation between immune/stromal scores and clinical characteristics was analyzed, and the hub genes in the TME of breast IDC were identified. Although similar results have been shown in other studies, they did not focus on breast IDC, and the identified genes were not further studied for their prognostic and clinical significance.[ In addition, previous studies were mainly limited to one database while our study combined two databases to minimize the bias of data.[ Nevertheless, more studies can be expected in the future, such as the analysis of infiltrated immune cells in the TME of breast IDC, and the verification of functions and molecular mechanisms of these hub genes in vitro and in vivo. We hope that our work will provide some new possibilities for the diagnosis and treatment of breast cancer.

Acknowledgments

We thank International Science Editing (http://www.internationalscienceediting.com) for editing this manuscript.

Author contributions

Conceptualization: Qiang He. Data curation: Qiang He. Formal analysis: Qiang He. Investigation: Qiang He. Methodology: Qiang He. Project administration: Qiang He. Resources: Shuyin Xue. Software: Qiang He. Supervision: Mei He. Validation: Qingbiao Wa, Xinrong Luo. Visualization: Wei Chen. Writing – original draft: Qiang He, Shuang Feng. Writing – review & editing: Zhibing Chen, Xinrong Luo.
  71 in total

1.  A conformational epitope expressed upon association of CD3-epsilon with either CD3-delta or CD3-gamma is the main target for recognition by anti-CD3 monoclonal antibodies.

Authors:  A Salmerón; F Sánchez-Madrid; M A Ursa; M Fresno; B Alarcón
Journal:  J Immunol       Date:  1991-11-01       Impact factor: 5.422

Review 2.  Therapeutic targeting of CD70 and CD27.

Authors:  Harald Wajant
Journal:  Expert Opin Ther Targets       Date:  2016-03-15       Impact factor: 6.902

3.  CD5 positive breast carcinoma in a patient with untreated chronic lymphocytic leukaemia: molecular studies of chromosome 13q.

Authors:  S Shousha; C Costello; Y A Luqmani; H D Sinnett
Journal:  J Clin Pathol       Date:  1998-11       Impact factor: 3.411

4.  Coexpression of CD40 and CD40L on B lymphoma and carcinoma cells: an autocrine anti-apoptotic role.

Authors:  Nathalie Voorzanger-Rousselot; Jean-Yves Blay
Journal:  Leuk Lymphoma       Date:  2004-06

5.  Evaluation the potential of recombinant anti-CD3 nanobody on immunomodulatory function.

Authors:  Shima Moradi-Kalbolandi; Azadeh Sharifi-K; Behrad Darvishi; Keivan Majidzadeh-A; Neda Jalili; Solmaz Sadeghi; Marjan Mosayebzadeh; Hassan Sanati; Malihe Salehi; Leila Farahmand
Journal:  Mol Immunol       Date:  2019-12-26       Impact factor: 4.407

6.  Direct Interaction of CD40 on Tumor Cells with CD40L on T Cells Increases the Proliferation of Tumor Cells by Enhancing TGF-β Production and Th17 Differentiation.

Authors:  Hyemin Kim; Yejin Kim; Seyeon Bae; Joo Myoung Kong; Jiwon Choi; Mirim Jang; Jiyea Choi; Jun-Man Hong; Young-Il Hwang; Jae Seung Kang; Wang Jae Lee
Journal:  PLoS One       Date:  2015-05-18       Impact factor: 3.240

7.  Control of triple-negative breast cancer using ex vivo self-enriched, costimulated NKG2D CAR T cells.

Authors:  Yali Han; Wei Xie; De-Gang Song; Daniel J Powell
Journal:  J Hematol Oncol       Date:  2018-07-06       Impact factor: 17.388

8.  Identification of Genes with Prognostic Value in the Breast Cancer Microenvironment Using Bioinformatics Analysis.

Authors:  Haoyu Ren; Daixing Hu; Yu Mao; Xinliang Su
Journal:  Med Sci Monit       Date:  2020-04-06

9.  Assessing breast cancer cell lines as tumour models by comparison of mRNA expression profiles.

Authors:  Krista Marie Vincent; Scott D Findlay; Lynne Marie Postovit
Journal:  Breast Cancer Res       Date:  2015-08-20       Impact factor: 6.466

10.  Alterations in Immune-Related Genes as Potential Marker of Prognosis in Breast Cancer.

Authors:  Bei Li; Rongxin Geng; Qi Wu; Qian Yang; Si Sun; Shan Zhu; Zhiliang Xu; Shengrong Sun
Journal:  Front Oncol       Date:  2020-03-12       Impact factor: 6.244

View more
  1 in total

1.  Galloflavin Relieves the Malignant Behavior of Colorectal Cancer Cells in the Inflammatory Tumor Microenvironment.

Authors:  Li Guo; Yi Yang; Yongjia Sheng; Jin Wang; Wenyan Li; Xiaohong Zhou; Shuiliang Ruan; Chenyang Han
Journal:  Front Pharmacol       Date:  2021-12-10       Impact factor: 5.810

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.