Xiaoyu Wang1, Yan Li2, Jianchang Fu3, Kewen Zhou1, Tinghuai Wang1. 1. Department of Physiology, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, 510080, People's Republic of China. 2. Department of Pathology, The First Affiliated Hospital of Sun Yat-sen University, Guangzhou, 510080, People's Republic of China. 3. Department of Pathology, Sun Yat-sen University Cancer Center, Guangzhou, 510060, People's Republic of China.
Abstract
BACKGROUND: Triple-negative breast cancer (TNBC) is the most aggressive breast cancer subtype and is associated with poor prognosis. The aberrant expression of circadian genes contributes to the origin and progression of breast cancer. The present study was designed to explore the potential function and prognosis value of circadian genes in TNBC. METHODS: The transcriptome data of circadian genes were downloaded from The Cancer Genomic Atlas (TCGA), GSE25066 and GSE31448 datasets. The differential expressed circadian genes between non-TNBC and TNBC patients were analysed by Wilcoxon test. Univariate and multivariate Cox regression analyses were employed to identify the prognostic circadian genes. Gene Ontology (GO), Kyoto Encyclopedia of Genes and Genomes (KEGG) and Gene Set Enrichment Analysis (GSEA) were performed to study the biological functions of ARNTL2. The composition of 22 immune cells in the tumour samples was estimated with CIBERSORT algorithm. The correlations between ARNTL2 expression and tumour-infiltrating immune cells were evaluated by Spearman correlation coefficient. RESULTS: A total of 8 circadian genes were found to be differentially expressed between non-TNBC and TNBC, but only ARNTL2 has prognostic value. Multivariate Cox analysis identified that ARNTL2 was an independent prognosis factor for overall survival and relapse-free survival in TNBC patients. Functionally, ARNTL2 was mainly involved in immune response processes such as positive regulation of cytokine production, regulation of innate immune response, and cellular responses to molecules of bacterial origin. High expression of ARNTL2 was positively correlated with activated CD4 memory T cells, activated mast cells, and neutrophil infiltration and the expression of markers of neutrophils (ITGAM), dendritic cells (HLA-DRA, HLA-DPA1, ITGAM), Th1 (IL1B, STAT1), Th2 (IL13), Th17 (STAT3) and mast cells (TPSB2, TPSAB1). CONCLUSION: ARNTL2 may be linked with the functional modulation of the tumour immune microenvironment and serve as a potential biomarker for predicting the prognosis of TNBC patients.
BACKGROUND: Triple-negative breast cancer (TNBC) is the most aggressive breast cancer subtype and is associated with poor prognosis. The aberrant expression of circadian genes contributes to the origin and progression of breast cancer. The present study was designed to explore the potential function and prognosis value of circadian genes in TNBC. METHODS: The transcriptome data of circadian genes were downloaded from The Cancer Genomic Atlas (TCGA), GSE25066 and GSE31448 datasets. The differential expressed circadian genes between non-TNBC and TNBC patients were analysed by Wilcoxon test. Univariate and multivariate Cox regression analyses were employed to identify the prognostic circadian genes. Gene Ontology (GO), Kyoto Encyclopedia of Genes and Genomes (KEGG) and Gene Set Enrichment Analysis (GSEA) were performed to study the biological functions of ARNTL2. The composition of 22 immune cells in the tumour samples was estimated with CIBERSORT algorithm. The correlations between ARNTL2 expression and tumour-infiltrating immune cells were evaluated by Spearman correlation coefficient. RESULTS: A total of 8 circadian genes were found to be differentially expressed between non-TNBC and TNBC, but only ARNTL2 has prognostic value. Multivariate Cox analysis identified that ARNTL2 was an independent prognosis factor for overall survival and relapse-free survival in TNBC patients. Functionally, ARNTL2 was mainly involved in immune response processes such as positive regulation of cytokine production, regulation of innate immune response, and cellular responses to molecules of bacterial origin. High expression of ARNTL2 was positively correlated with activated CD4 memory T cells, activated mast cells, and neutrophil infiltration and the expression of markers of neutrophils (ITGAM), dendritic cells (HLA-DRA, HLA-DPA1, ITGAM), Th1 (IL1B, STAT1), Th2 (IL13), Th17 (STAT3) and mast cells (TPSB2, TPSAB1). CONCLUSION: ARNTL2 may be linked with the functional modulation of the tumour immune microenvironment and serve as a potential biomarker for predicting the prognosis of TNBC patients.
Breast cancer accounts for approximately 25% of all types of cancers and is the most common malignancy diagnosis among women globally.1 Clinically and scientifically, breast cancer is well established as a heterogeneous disease. It traditionally is categorized as four subtypes according to the expression status of estrogen receptor (ER), progesterone receptor (PR) and the human epidermal growth factor 2 (HER2): luminal A (ER+ and/or PR+, HER2-); luminal B (ER+ and/or PR+, HER2+); HER2 overexpressing (ER-, PR- and HER2+); and triple-negative (ER-, PR- and HER2-).2 Triple-negative breast cancer (TNBC) accounts for 15–20% of all breast cancer, and it is the most aggressive subtype of breast cancer. TNBC exhibits high rates of cellular proliferation and lethality due to visceral and brain metastasis.3 Unlike other breast cancer subtypes with active target agents, such as Tamoxifen for luminal subtype and Trastuzumab for HER2 overexpressing subtype, there is a lack of FDA-approved molecular targeting agents for TNBC, and traditional chemotherapy remains the primary medical treatment option.4 Owing to the inherently aggressive characteristics and lack of an identified therapeutic target, TNBC patients are generally associated with a dismal prognosis. Therefore, identifying novel prognostic biomarkers is a pressing clinical need and essential for developing valid target therapeutic agents and guiding individual patient treatment strategies.The circadian clock is an endogenous time tracking system that can coordinate various physiological behaviours and functional activities and enable living organisms to adapt to the change of environmental circumstances such as light and temperature.5 In mammals, the circadian clock system consists of a master clock located in the suprachiasmatic nucleus of the hypothalamus and the peripheral clock located in almost all peripheral tissues.6 Molecularly, the circadian clock is generated by a core transcription and translation feedback loop comprising CLOCK, BMAL1 and ARNTL, Period proteins (PER1, PER2, PER3) and two Cryptochromes (CRY1, CRY2).7 The transcription factors CLOCK and BMAL1 can form heterodimers that drive the transcription of PERs and CRYs through binding to the E-box response element within their promoters.8 The PERs and CRY proteins are phosphorylated by CSNK1A1 and subsequently translocate into the nucleus, where they negatively regulate the transcriptional activities of CLOCK and BMAL1.9 CLOCK and BMAL1 heterodimers also activate the expression of a large number of clock-control genes (CCG) including but not limit to the proto-oncogene c-Myc, the essential enzyme for nucleotide excision repair XPA as well as the NADPH oxidase complexes and superoxide dismutases, glutathione peroxidases involved in reactive oxygen species (ROS) metabolism.10 They are thus involved in regulating critical cellular activities like cell proliferation, apoptosis, DNA damage repair, oxidative stress, and inflammation.11Current evidence indicates that disruption of the circadian clock plays a critical role in the development and progression of breast cancer.12–14 For example, epidemiological studies reported that shift work and light exposure at night were associated with an increased risk of 19% and 12% for breast cancer, respectively.15,16 Disruption of the circadian rhythm in mice by chronically alternating light cycles increases the risk of formation of mammary tumors.17,18 Furthermore, several circadian genes such as CLOCK, BMAL1, and TIMELESS are over-expressed in breast cancer tissue and promote breast cancer cell proliferation and invasion.19–21 These studies indicate the potential of circadian genes as promising biomarkers to predict prognosis in breast cancer patients.In this study, we aimed to screen the circadian genes with prognostic value in patients with TNBC using the transcriptome profiling and clinical data from TCGA, GSE25066 and GSE31448 datasets and validate its prognosis value in the Cancer Center of Sun Yat-sen University (SYSUCC) cohort. We further investigated its functions by functional enrichment analysis. Our results may provide a potential prognostic biomarker for TNBC in the clinic.
Materials and Methods
Patients and Databases
The level three transcriptome data of breast cancer patients, including 115 TNBC and 1107 non-TNBC, were downloaded from the TCGA data portal () and the corresponding clinical information was collected from cBioportal (TCGA provisional, ). The TNBC samples were selected base on the criteria of ER-, PR- and HER2- by immunohistochemistry. First, the transcriptome data of 1017 non-TNBC and 115 TNBC were used to identify the differentially expressed genes. Then, patients lacking complete clinical information including age, menopausal status, history of other malignancies, surgical procedure first, primary site, and TNM stage were excluded and remain 97 TNBC for further analysis. The microarray data of GSE25066 (non-TNBC, n = 308; TNBC, n = 178) and GSE31448 (non-TNBC, n = 195; TNBC, n = 96) were obtained from Gene Expression Omnibus (GEO; ) database. Furthermore, tumor tissues were also collected from 131 TNBC patients who underwent surgery at the Cancer Center of Sun Yat-sen University (SYSUCC) between 1999 and 2019. All cases were pathologically confirmed as TNBC based on a cut-off point of <1% of ER/PR/HER-2-positive tumour cells by immunohistochemistry.22 Available clinical-pathological parameters, including age, primary site, surgical procedure, TNM stage, radiotherapy, and chemotherapy were collected. The study was approved by the Ethics Committee of Sun Yat-sen University and was conducted in accordance with the Declaration of Helsinki. Written informed consent was obtained from each patient.
Immunohistochemistry
Paraffin-embedded tumor tissues were serially sectioned at a thickness of 4 microns and then subjected to deparaffinization, hydration followed by antigen retrieval in sodium citrate-hydrochloric acid buffer solution (pH = 6.0) for 10 min by microwave heating. The sections were then incubated with anti-ARNTL2 (1:50, Sigma-Aldrich) antibody at 4°C overnight. The sections were incubated with biotin-conjugated secondary antibodies for 1 h at 37°C and stained with 3,3′-diaminobenzidine (DAB), followed by counterstaining with hematoxylin. The sections were evaluated by two experienced pathologists, and the immunohistochemistry scores were calculated by multiplying the staining intensity and the percentage of positively stained cells. The staining intensity was graded as 0, negative; 1, weak; 2, moderate; and 3, strong staining. The ARNTL2 expression was divided into high or low expression groups based on the optimal cut-off value determined by the surv_cutpoint function of the R package survminer, which acquired the optimal cut-off value according to the immunohistochemistry scores, the survival time, and the survival status. ARNTL2 immunohistochemistry scores > optimal cutoff was regarded as high expression and < optimal cutoff as low expression.
Functional Enrichment Analysis
Pearson correlation coefficients were calculated between ARNTL2 expression and those of protein-coding genes (PCGs). The PCGs with correlation coefficients >0.3 were identified as the co-expression PCGs. Gene Ontology (GO), biological process (BP), and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analysis were then carried out for the ARNTL2 co-expression PCGs using R package cluster Profiler. The GO terms and KEGG pathways with a false discovery rate (FDR) <0.05 were considered to be significantly enriched function annotations. The Gene Set Enrichment Analysis (GSEA) was performed to further validate the altered GO and KEGG pathways between the high and low expression of ARNTL2 group with java software GSEA (). Nominal p-value <0.05 after 1000 random permutations are considered statistically significant.
Evaluation of Immune Cell Infiltration
The correlation between ARNTL2 expression and immune cell infiltration in breast cancer and other tumor types was analysed using Tumor Immune Estimation Resource [TIMER, ]. Furthermore, the abundance of infiltrating immune cells in the tumor sample was evaluated with the CIBERSORT deconvolution algorithm.23 CIBERSORT is an analytical tool that can characterize the cell composition of a mixed cell population based on gene expression profiles.24 Using the LM22 signature as a reference with 1000 permutations, we used CIBERSORT to calculate the relative fractions of 22 infiltrating immune cells in each tumor sample. The sum of all immune cell fractions equalled 1 for every sample. The 22 immune cell types included naïve and memory B cells, plasma cells, CD8 T cells, naïve CD4 T cells, resting and activated CD4 memory T cells, T follicular helper cells, T regulatory cells, γδ T cells, resting and activated NK cells, monocytes, M0–M2 macrophages, resting and activated dendritic cells, resting and activated mast cells, eosinophils and neutrophils.
Statistical Analysis
All statistical analyses were performed using R software (Version, 4.0.2). The differentiated circadian genes between non-TNBC and TNBC were identified using the Wilcoxon test. The prognostic values of circadian genes were evaluated by univariate Cox proportional hazards regression analysis using the survival package of R. Kaplan–Meier (KM) survival analysis with Log rank test was employed to compare differences in OS and RFS between the high and low expression of ARNTL2 groups. The survminer package of R was used to determine the optimal cut-off value. Multivariate Cox regression analysis was performed to explore the independent prognostic capacity of ARNTL2. The correlation between ARNTL2 expression and PCGs or immune cells was evaluated by Pearson correlation coefficients and Spearman correlation coefficients, respectively. P-value <0.05 was considered as statistical significance.
Results
The Differential Expression Circadian Genes Between Non-TNBC and TNBC
The expression data of a total of 20 circadian genes including the central components of CLOCK, ARNTL, ARNTL2, PER1, PER2, PER3, CRY1, CRY2 and other regulators in the clock machinery: NR1D1, RORA, CSNK1E, DBP, NFIL3, STRA13, BHLHE41, NPAS2, NR1D2, RORB, RORC, and TIMELESS were obtained from the TCGA,25 GSE25066 and GSE31448 dataset. Note that the GSE25066 and GSE31448 datasets are lack for the expression data of PER1 and NR1D1 and then cover 18 circadian genes. We first compared their gene expression between non-TNBC and TNBC samples in each dataset. In the data from TCGA (non-TNBC, n = 1107; TNBC, n = 115), we found 6 genes (ARNTL2, NFIL3, NPAS2, TIMELESS, CSNK1E, STRA13) were significantly upregulated in TNBC, while 12 genes were downregulated based on the criterion of P-value <0.05 (Figure 1A). In the GSE25066 dataset which contains 308 non-TNBC and 178 TNBC, four genes (NFIL3, ARNTL2, NPAS2, BHLHE41) were upregulated and five genes (PER2, DBP, CRY2, PER3, ARNTL) were downregulated in TNBC (Figure 1B). As for GSE31448, there were 13 differential expression genes between non-TNBC (n = 195) and TNBC (n = 96). Six genes, including NFIL3, ARNTL2, NPAS2, CSNK1E, TIMELESS and ARNTL were significantly upregulated, and seven genes (RORB, PER2, DBP, RORC, CRY2, CLOCK, PER3) were downregulated in TNBC (Figure 1C). A total of 8 differential expression genes (NFIL3, ARNTL2, NPAS2, PER2, DBP, CRY2, PER3, ARNTL) were overlapped in the three datasets (Figure 1D). To evaluate the degree of differential expression observed in the 8 circadian genes with respect to all genes, we performed the differential expression analysis in the whole transcriptome between non-TNBC and TNBC using the TCGA dataset with more strict screen criteria (|log2 FC| ≥1, adjusted P < 0.01). The result demonstrated that 1432 genes were upregulated, and 1621 genes were downregulated in TNBC, respectively (). Among the 8 circadian genes, ARNTL2 was the only significantly upregulated gene in TNBC, although it is not the top rank gene in the gene list (rank 958 of 1432 upregulated genes).
Figure 1
The differential expression circadian genes between non-TNBC and TNBC patients. (A–C) Heat maps showed the differential expression of circadian genes between non-TNBC and TNBC in TCGA (A), GSE25066 (B) and GSE31448 (C) dataset, respectively. (D) The Venn diagram showed the common differential expression genes among the three datasets. *P < 0.05; **P < 0.01; ***P < 0.001.
The differential expression circadian genes between non-TNBC and TNBC patients. (A–C) Heat maps showed the differential expression of circadian genes between non-TNBC and TNBC in TCGA (A), GSE25066 (B) and GSE31448 (C) dataset, respectively. (D) The Venn diagram showed the common differential expression genes among the three datasets. *P < 0.05; **P < 0.01; ***P < 0.001.
ARNTL2 is a Prognostic Circadian Gene in Patients with TNBC
Next, univariate Cox regression analysis was performed to screen out the eight differential expression genes with prognostic predictive capacity in the TCGA dataset. The result showed that only high expression of ARNTL2 was significantly associated with shorter OS (HR = 1.159, P = 0.034) in TNBC patients (Figure 2A). The KM survival curve also demonstrated that patients in the ARNTL2 high expression group had shorter OS time than those in the low expression group (Figure 2D). ARNTL2 was not significantly related to survival in patients with Lum (Figure 2B and E) and HER-2 subtype (Figure 2C) in the univariate COX analysis. However, high expression of ARNTL2 predicted longer OS time in the HER-2 subtype by KM survival curve (Figure 2F). The distribution of ARNTL2 expression, survival status and expression profiles of ARNTL2 in patients with TNBC are shown in Figure 2G. The ROC curve demonstrated that the AUC of ARNTL2 expression for predicting survival was 0.92, demonstrating the good prognostic power of ARNTL2 expression (Figure 2H). Multivariate COX regression analysis further showed that ARNTL2 could be an independent factor for OS (HR = 1.166, 95% CI = 1.020–1.333, P = 0.024) and RFS (HR = 1.148, 95% CI = 1.021–1.291, P = 0.021) after adjustment for the clinical factors, including age, menopause status, history of other malignancy, surgical procedure first, primary site and TNM stage (Table 1). To validate the prognostic potential of ARNTL2 in an independent cohort, we performed IHC staining of ARNTL2 in 131 TNBC samples with detailed clinical information from the SYSUCC. The patients were divided into high and low cohorts of the expression of ARNTL2 according to the optimal cut-off value determined by the survival package of R. The representative samples of low and high ARNTL2 staining are demonstrated in Figure 3A. Kaplan–Meier survival curves combined with the Log rank test showed that patients in the high expression group of ARNTL2 exhibited significantly shorter OS (P = 0.002) and RFS (P = 0.003) than those in the low expression group (Figure 3B and C). Moreover, multivariate Cox analysis again verified the prognostic value of ARNTL2 for OS (HR = 2.520, 95% CI = 1.023–6.209, P = 0.044) and RFS (HR = 3.421, 95% CI = 1.024–11.428, P = 0.046) in TNBC patients independent of conventional clinical parameters (Table 1). No significant correlation was revealed between ARNTL2 expression and clinicopathological characteristics of age, primary site, surgical procedure, stage, radiotherapy, and chemotherapy (Table 2). Collectively, these results indicate that high expression of ARNTL2 was associated with adverse prognosis in patients with TNBC.
Figure 2
The prognostic value of ARNTL2 in TNBC patients. (A–C) Univariate Cox regression analysis showed the association between ARNTL2 expression and overall survival in patients with TNBC from the TCGA dataset. (A), Lum (B) and HER-2 (C) subtype. (D–F) KM survival analysis was performed to evaluate the difference of overall survival time between high and low groups of ARNTL2 expression in patients with TNBC from the TCGA dataset. (D), Lum (E) and HER-2 (F) subtype. (G) ARNTL2 expression distribution and survival status in TNBC patients. Upper panel: The y-axis shows the ARNTL2 expression levels, and the x-axis shows the different patients. The green dots indicate the patients of ARNTL2 low expression, and the red dots represent the patients of ARNTL2 high expression. Lower panel: The y-axis shows the survival time, and the x-axis shows the different patients. The green dots indicate the patients alive, and the red dots represent the patients of the dead. (H) ROC curves of ARNTL2.
Table 1
Univariate and Multivariate Analyses of Survival in TNBC Patients of TCGA and SYSUCC Cohort
The prognostic value of ARNTL2 in the SYSUCC cohort. (A) The representative pictures of low and high ARNTL2 expressions were shown. (B and C) KM survival curve showed the differential overall survival (B) and relapse-free survival (C) between the high and low group of ARNTL2 expression in the SYSUCC cohort.
Table 2
Relationship Between ARNTL2 Expression and Clinicopathological Parameters in Patients with TNBC
Classification
Total
Low ARNTL2 Expression
High ARNTL2 Expression
P
Age (year)
≤60
118
89
29
0.199
>60
13
9
4
Primary_site
0.185
Right
66
53
13
Left
65
45
20
Surgical_procedure
0.661
Lumpectomy
13
10
3
Modified Radical Mastectomy
118
88
30
Stage
0.074
Stage I-Stage II
96
68
28
Stage III
35
30
5
Radiotherapy
0.124
Yes
67
52
15
No
64
46
18
Chemotherapy
0.451
Yes
106
78
28
No
25
20
5
Univariate and Multivariate Analyses of Survival in TNBC Patients of TCGA and SYSUCC CohortNote:
aP < 0.001.Abbreviations: HR, hazard ratio; CI, confidence interval.Relationship Between ARNTL2 Expression and Clinicopathological Parameters in Patients with TNBCThe prognostic value of ARNTL2 in TNBC patients. (A–C) Univariate Cox regression analysis showed the association between ARNTL2 expression and overall survival in patients with TNBC from the TCGA dataset. (A), Lum (B) and HER-2 (C) subtype. (D–F) KM survival analysis was performed to evaluate the difference of overall survival time between high and low groups of ARNTL2 expression in patients with TNBC from the TCGA dataset. (D), Lum (E) and HER-2 (F) subtype. (G) ARNTL2 expression distribution and survival status in TNBC patients. Upper panel: The y-axis shows the ARNTL2 expression levels, and the x-axis shows the different patients. The green dots indicate the patients of ARNTL2 low expression, and the red dots represent the patients of ARNTL2 high expression. Lower panel: The y-axis shows the survival time, and the x-axis shows the different patients. The green dots indicate the patients alive, and the red dots represent the patients of the dead. (H) ROC curves of ARNTL2.The prognostic value of ARNTL2 in the SYSUCC cohort. (A) The representative pictures of low and high ARNTL2 expressions were shown. (B and C) KM survival curve showed the differential overall survival (B) and relapse-free survival (C) between the high and low group of ARNTL2 expression in the SYSUCC cohort.
The Function of ARNTL2 Links with Immune Response in TNBC
To study the biological function of ARNTL2 in TNBC, we determined the genes which showed co-expression with ARNTL2 among the whole transcriptome. Upon calculating the Pearson correlation coefficients, we identified a total of 942 genes significantly co-expression with ARNTL2 (correlation coefficient >0.3). The top 50 and top five positive and negative correlation genes are shown in Figure 4A and B. GO enrichment analysis in terms of biological process (BP), cellular component (CC) and molecular function (MF) revealed that the co-expression genes were mainly associated with GO BP in terms of positive regulation of cytokine production, regulation of innate immune response and cellular responses to molecules of bacterial origin (Figure 4C). KEGG analysis demonstrated that the co-expression genes were mainly involved in Epstein-Barr virus infection, Human T-cell leukemia virus 1 infection, Human cytomegalovirus infection, and the TNF signalling pathway (Figure 4D). These functions and pathways are related to the regulation of immune responses, particularly the innate immune response. ARNTL2 was also involved in other cellular processes closely related to cancer progressions such as mitotic nuclear division, focal adhesion, and damaged DNA binding, suggesting the multiple roles of ARNT2 in breast cancer. To validate these observations, we employed Gene Set Enrichment Analysis (GSEA) to identify the cellular processes and pathways associated with ARTNL2 expression. Again, high expression of ARNTL2 was significantly enriched in GO terms of immune response to tumor cell, positive regulation of cytokine production involved in immune response (), and KEGG terms of antigen processing and presentation and natural killer cell mediated cytotoxicity (). Together, these results indicated that ARNL2 might function by regulating the local immune response in TNBC.
Figure 4
Functional characterize of ARNTL2 in TNBC. (A) Heatmap showed the top 50 co-expression genes of ARNTL2 in the TCGA dataset. The criteria for co-expression genes is Pearson correlation coefficient ≥ 0.3 or ≤ −0.3. (B) The circular plot indicated the top five genes positively and negatively correlated with ARNTL2. Green means negative correlation, and red means positive correlation. (C) GO enrichment analysis of the co-expression genes of ARNTL2. The y-axis shows the enriched GO term, and the x-axis shows the numbers of the core genes. The numbers inside the dots indicate the numbers of core genes, and the colour of the dots represent the adjusted p-value. (D) KEGG analysis of the co-expression genes of ARNTL2. The y-axis indicates the enriched KEGG pathways, and the x-axis shows the numbers of the core genes. The colour of the bars represents the adjusted p-value.
Functional characterize of ARNTL2 in TNBC. (A) Heatmap showed the top 50 co-expression genes of ARNTL2 in the TCGA dataset. The criteria for co-expression genes is Pearson correlation coefficient ≥ 0.3 or ≤ −0.3. (B) The circular plot indicated the top five genes positively and negatively correlated with ARNTL2. Green means negative correlation, and red means positive correlation. (C) GO enrichment analysis of the co-expression genes of ARNTL2. The y-axis shows the enriched GO term, and the x-axis shows the numbers of the core genes. The numbers inside the dots indicate the numbers of core genes, and the colour of the dots represent the adjusted p-value. (D) KEGG analysis of the co-expression genes of ARNTL2. The y-axis indicates the enriched KEGG pathways, and the x-axis shows the numbers of the core genes. The colour of the bars represents the adjusted p-value.
ARNTL2 Correlates with Immune Cell Infiltration in TNBC
To investigate the relationship of ARNTL2 expression to the tumor immune microenvironment, we employed the TIMER database to evaluate the correlation between ARNTL2 expression and the infiltration levels of various immune cells in different breast cancer subtypes. As shown in Figure 5, ARNTL2 was negatively associated with Purity and positively correlated with CD8+ T cell (r = 0.204, P = 0.0235), CD4+ T cell (r = 0.297, P < 0.001), Neutrophil (r = 0.417, P < 0.001), and Dendritic cell (r = 0.303, P = 0.00104) in breast cancer basal subtype, which represents a majority of TNBC. Similar results were observed for HER-2 and luminal subtype, except for B cells, which was significantly positively associated with ARNTL2 in the luminal subtype, but the correlation did not reach a significant difference in HER-2 and TNBC subtypes. For further investigation, the CIBERSORT algorithm was used to calculate the relative fraction of 22 immune cell types in each tumor sample. As shown in Figure 6A, the distribution of 22 immune cell types in each sample varied substantially. ARNTL2 expression was positively associated with activated CD4 memory T cells (r = 0.28, P = 0.011), activated Mast cells (r = 0.22, P = 0.046), and Neutrophils (r = 0.29, P = 0.0072) (Figure 6B–D). No significant correlation was found between ARNTL2 expression and B cells, CD8 T cells, Natural killer (NK) cells, M0-M2 Macrophages, Dendritic cells, and Eosinophils (). We also evaluated the possible correlation between different immune cell subpopulations (Figure 6E). The results from the correlation matrix demonstrated that CD8 T cells was positive correlate with activated CD4 memory T cells (r = 0.51) but negative correlate with M0 Macrophages (r=−0.48) and resting CD4 memory T cells (r=−0.51). To validate the correlation between ARNTL2 expression and immune cell infiltration, we conducted the correlation analysis in the GSE31448 dataset. The results showed that ARNTL2 expression was positively associated with activated CD4 memory T cells (r = 0.37, P < 0.001), M1 Macrophages (r = 0.42, P < 0.001), resting Dendritic cells (r = 0.23, P = 0.039), activated Dendritic cells (r = 0.33, P = 0.002), Eosinophils (r = 0.23, P=0.035), Neutrophils (r=0.28, P=0.01) and negatively associated with T regulatory cells (r=−0.26, P=0.018), M2 Macrophages (r=−0.31, P=0.005), resting Mast cells (r=−0.32, P=0.004). There were no significant correlation between ARNTL2 expression and B cells, CD8 T cells, Natural killer (NK) cells, Monocytes, M0 Macrophages, and activated Mast cells (). Collectively, these data indicated that ARNTL2 is highly involved with the immune cell composition in the tumor microenvironment of TNBC.
Figure 5
Correlation between ARNTL2 expression and immune cell infiltration in data from TIMER. The scatter plot showed the association between ARNTL2 expression and Purity and the immune cell infiltration of B cell, CD8+ T cell, CD4+ T cell, Microphage, Neutrophil, and Dendritic cell in different breast cancer subtypes (basal, luminal, and HER-2).
Figure 6
CIBERSORT analysis of the relationship between ARNTL2 expression and 22 immune cell infiltration in data from TCGA. (A) The stacked bar chart shows the relative fiction of 22 immune cells in each sample. (B–D) The correlation between ARNTL2 expression and activated CD4 memory T cells (B), activated Mast cells (C), and Neutrophils (D). (E) The correlation among different immune cells. The red colour indicates a positive correlation and the blue colour shows a negative correlation.
Correlation between ARNTL2 expression and immune cell infiltration in data from TIMER. The scatter plot showed the association between ARNTL2 expression and Purity and the immune cell infiltration of B cell, CD8+ T cell, CD4+ T cell, Microphage, Neutrophil, and Dendritic cell in different breast cancer subtypes (basal, luminal, and HER-2).CIBERSORT analysis of the relationship between ARNTL2 expression and 22 immune cell infiltration in data from TCGA. (A) The stacked bar chart shows the relative fiction of 22 immune cells in each sample. (B–D) The correlation between ARNTL2 expression and activated CD4 memory T cells (B), activated Mast cells (C), and Neutrophils (D). (E) The correlation among different immune cells. The red colour indicates a positive correlation and the blue colour shows a negative correlation.
Association Between ARNTL2 Expression and Immune Cells Markers
The markers of immune cells are essential for their functional activities such as differentiation, cell–cell adhesion, and immune regulation. To further study the relationship between ARNTL2 and immune cells, we evaluated the correlation between ARTNL2 expression and signature genes for various innate and adaptive immune cells. As shown in Table 3, ARNTL2 expression was positively associated with markers of Neutrophils (ITGAM), Dendritic cells (HLA-DRA, HLA-DPA1, ITGAM), Th1 (IL1B, STAT1), Th2 (IL13), Th17 (STAT3), Mast cells (TPSB2, TPSAB1) and negatively associated with markers of Tfh (BCL6). These results suggested that ARNTL2 may involve in the functional activities of multiple immune cell types.
Table 3
Correlation Between ARNTL2 and Markers of Immune Cells
Description
Gene Markers
Cor
P
CD8+ T cell
CD8A
0.021
0.839
CD8B
−0.075
0.463
T cell (general)
CD3E
−0.023
0.822
CD2
0.027
0.792
CD3D
−0.014
0.889
B cell
CD19
−0.086
0.403
CD79A
−0.110
0.283
Monocyte
CD86
0.170
0.097
CD115(CSF1R)
0.132
0.197
CCL2
0.036
0.730
CD68
0.004
0.973
IL10
0.023
0.825
M1 Macrophage
INOS(NOS2)
−0.14
0.173
IRF5
0.137
0.180
COX2(PTGS2)
−0.044
0.627
M2 Macrophage
CD163
0.16
0.117
VSIG4
0.097
0.345
MS4A4A
0.068
0.508
Neutrophil
CD66b(CEACAM8)
−0.075
0.468
CD11b(ITGAM)
0.26
0.010
CCR7
−0.084
0.412
Natural killer cell
KIR2DL1
0.026
0.804
KIR2DL3
0.086
0.405
KIR2DL4
0.125
0.224
KIR3DL1
0.084
0.412
KIR3DL2
0.112
0.276
KIR3DL3
0.045
0.665
Dendritic cell
HLA-DPB1
0.165
0.106
HLA-DQB1
0.124
0.226
HLA-DRA
0.284
0.005
HLA-DPA1
0.25
0.013
BDCA-1(CD1C)
−0.006
0.951
BDCA-1(NRP1)
0.007
0.947
CD11c(ITGAX)
0.046
0.654
CD11b (ITGAM)
0.26
0.010
Th1
IL12A
0.095
0.357
IL1B
0.472
0.000a
T-bet(TBX21)
0.008
0.941
STAT4
−0.029
0.781
STAT1
0.221
0.029
IFN-gamma(IFNG)
0.102
0.322
TNF-alpha(TNF)
0.175
0.087
Th2
GATA3
0.019
0.854
STAT6
0.136
0.185
STAT5A
0.083
0.419
IL13
0.219
0.031
Tfh
BCL6
−0.237
0.020
IL21
0.029
0.775
Th17
STAT3
0.226
0.026
IL17A
0.136
0.185
Treg
FOXP3
0.064
0.536
CCR8
0.146
0.152
STAT5B
−0.142
0.165
TGFbeta(TGFB1)
−0.077
0.455
CD25(IL2RA)
0.163
0.112
CD4
0.100
0.332
Mast cells
TPSB2
0.407
0.000a
TPSAB1
0.397
0.000a
CPA3
0.025
0.811
MS4A2
−0.077
0.452
HDC
−0.113
0.272
Note:
aP < 0.001.
Correlation Between ARNTL2 and Markers of Immune CellsNote:
aP < 0.001.
ARNTL2 Expression Correlates with Immune Infiltration in Multiple Human Cancers
The data above found that ARNTL2 expression was positively associated with activated CD4 memory T cells, activated Mast cells, and Neutrophils. To extend these findings to other cancer types, we evaluated the correlation of ARNTL2 expression and the infiltration levels of the abovementioned three immune cells in other 32 cancer types using TIMER database. First, the ARNTL2 expression pattern in different cancer types was shown in , which demonstrated that ARNTL2 was differentially expressed significantly in various cancer types, including upregulation in 16 types of cancer and downregulation in 2 types of cancer compared with their normal tissue counterpart. Next, the association of ARNTL2 expression and the infiltration of the three immune cells were examined, as shown in Figure 7A and B, ARNTL2 expression positively correlated with activated CD4 memory T cells in 11 types of cancer, and Neutrophils in 13 types of cancer analysis by CIBERSORT, suggesting a durable consistency of the correlation between ARNTL2 expression and these immune cells. However, ARNTL2 showed no or negative association with activated mast cells in most cancer types. These data suggested that the relationship between ARTNL2 expression and immune cell infiltration may, to some extent, influence by cancer types.
Figure 7
ARNTL2 expression correlates with activated CD4 memory T cell and neutrophil infiltration in multiple cancer types. (A) Heatmap showed the association between ARNTL2 expression and activated CD4 memory T cell infiltration in multiple tumor types. (B) Heatmap showed the association between ARNTL2 expression and Neutrophil and Mast cell infiltration in multiple cancer types. The row name represents the abbreviation and the sample number of 32 cancer types, the column name represents the method used to calculate the infiltration levels of immune cells.
ARNTL2 expression correlates with activated CD4 memory T cell and neutrophil infiltration in multiple cancer types. (A) Heatmap showed the association between ARNTL2 expression and activated CD4 memory T cell infiltration in multiple tumor types. (B) Heatmap showed the association between ARNTL2 expression and Neutrophil and Mast cell infiltration in multiple cancer types. The row name represents the abbreviation and the sample number of 32 cancer types, the column name represents the method used to calculate the infiltration levels of immune cells.
Discussion
Despite the appreciable reduction of breast cancer mortality owing to the advancement of early detection and treatment strategies, the prognosis of TNBC, which is the most aggressive subtype of breast cancer, remains dismal.26 This situation calls for a deeper understanding of the molecular mechanisms underlying the development and progression of TNBC in search of potential treatment targets and predictive biomarkers. A large body of evidence suggests that disturbance of the circadian clock is closely associated with the occurrence and progression of various cancers, including breast cancer.27,28 In 2007, the international agency for research on cancer (IARC) listed shiftwork that involves disruption of the circadian rhythm as a probable human carcinogen, Group 2A.29 ARNTL2 is a central component of the molecular clock. It is the analogue of BMAL1 and can form a heterodimer with CLOCK to drive the transcription of PERs, CYRs, and a large number of clock control genes,30 which are involved in regulating several critical cellular processes. ARNTL2 promotes the migration and invasion of colon and lung cancer cells and is a potential biomarker for aggressiveness in colorectal cancer.31–33 Notably, a recent study conducted in an animal model revealed that ARNTL2 promoted ER-negative breast cancer metastasis,34 suggesting a potential role of ARNTL2 in breast cancer progression. However, the underlying mechanism(s) and the clinical significance of ARNTL2 in breast cancer remained unclear. In the current study, using transcriptome data from TCGA and data from our clinical cohort, we demonstrated that ARNTL2 was highly expressed in TNBC compared with non-TNBC patients and that ARNTL2 can be an independent risk factor for overall survival and relapse-free survival for patients with TNBC. Our study is in line with a previous study showing that ARNTL2 was upregulated in TNBC compared with non-TNBC patients.35 However, their study reported that high expression of ARNTL2 was associated with favourable RFS in TNBC using Kaplan-Meier Plotter online tool. One explanation for the discrepancy would be the highly heterogeneous nature of TNBC, and the relatively low sample number enrolled compared with the ER+ breast cancer subtype. It has been demonstrated that TNBC can be further subdivided into six subtypes with distinct chemotherapy responses based on gene expression profiles.36 Thus, a future study with an enlarged sample size and analysis of the prognosis capacity of ARNTL2 in the different subtypes of TNBC was warranted.The immune cell composition of the tumor microenvironment plays a critical role in the origin, progression, and response to therapeutic agents in cancers.37,38 Intriguingly, the circadian clock has an intimate connection with the immune system.39,40 For example, the number of immune cells exhibits a robust circadian variability,41 resulting in different sensitivity to microbiome infection according to circadian time.42 Consistent with this notion, we found in our study that high ARNL2 expression related to multiple immune response processes, such as antigen processing and presentation of peptide antigen, the defense response to viruses, graft versus host disease and the NOD-like receptor signalling pathway, suggesting that ARNTL2 may function to regulate the immune response in TNBC. Further evaluating the correlation between ARNTL2 expression and multiple immune cells demonstrated that high expression of ARNTL2 was involved in the infiltration of various immune cell types, such as activated CD4 memory T cells, activated Mast cells, and Neutrophils. In addition, ARNTL2 was associated with markers of Neutrophils (ITGAM), Dendritic cells (HLA-DRA, HLA-DPA1, ITGAM), Th1 (IL1B, STAT1), Th2 (IL13), Th17 (STAT3), Mast cells (TPSB2, TPSAB1), and Tfh (BCL6). Strikingly, a positive correlation between ARNTL2 expression and activated CD4 memory T cells and Neutrophils was observed in other human cancer types besides TNBC, for example, Bladder Urothelial Carcinoma (BLCA), Skin Cutaneous Melanoma (SKCM), and Lung adenocarcinoma (LUAD). Moreover, a recent study demonstrated that high expression of ARNTL2 was associated with infiltration of CD8 + T cell, CD4 + memory T cell, Myeloid dendritic cell, macrophage and CD4 + Th2 T cell in clear cell renal cell carcinoma (ccRCC).43 Thus, the results of the present study and others demonstrated that ARNTL2 may functionally link with immune responses in the tumor microenvironment and represent a potential indicator for immune cell infiltration.We found that high expression of ARNTL2 was associated with increased infiltration of innate immune cells such as Mast cells, and Neutrophils in TNBC. Mast cells are unique tissue-resident immune cells and the potential sources of a large variety of cytokines and growth factors.44 Mast cells infiltrate the stroma of numerous solid cancers, and its high infiltration levels were associated with adverse prognosis in pancreatic cancer, nasopharyngeal carcinoma, non-small cell lung cancer, and breast cancer.45–48 Functionally, mast cells promote angiogenesis and suppress cytotoxic T cell responses in the tumor microenvironment.49–51 Neutrophils, also known as polymorphonuclear leukocytes, are regarded as the first line of defence against pathogen infection. Recent studies indicated that Neutrophils have a critical role in tumor progression.52 Epidemiologic data showed that Neutrophil infiltration or a high Neutrophil-to-lymphocyte ratio was associated with a poor prognosis in multiple cancer types.53–55 Biochemical and animal studies suggested that Neutrophils favour breast cancer metastasis by a variety of mechanisms, including but not limited to suppression of the effector function of CD8+ T cells and Natural killer cells, formation of the Neutrophil extracellular traps (NETs) as well as stimulation of tumor growth by releasing bioactive substances.56,57 Based on the above studies, the enrichment of Mast cells and Neutrophils in high ARNTL2 expression patients may explain their worse prognosis documented in our study. However, the causal link between ARNTL2 expression and immune cell infiltration, and the underlying mechanism(s) requires further investigation.
Conclusions
We found in the current study that the circadian gene, ARNTL2, showed higher expression levels in TNBC compared with non-TNBC patients and ARNTL2 may be an independent factor for predicting the prognosis of TNBC. The functional analysis further revealed that ARNTL2 might be involved in regulating the immune microenvironment of TNBC. Further research is required to gain a better understanding of the molecular mechanisms involved.
Authors: F Wang; K L Yeung; W C Chan; C C H Kwok; S L Leung; C Wu; E Y Y Chan; I T S Yu; X R Yang; L A Tse Journal: Ann Oncol Date: 2013-08-23 Impact factor: 32.976
Authors: Jennifer J Brady; Chen-Hua Chuang; Peyton G Greenside; Zoë N Rogers; Christopher W Murray; Deborah R Caswell; Ursula Hartmann; Andrew J Connolly; E Alejandro Sweet-Cordero; Anshul Kundaje; Monte M Winslow Journal: Cancer Cell Date: 2016-04-14 Impact factor: 31.743
Authors: Gianluigi Mazzoccoli; Valerio Pazienza; Anna Panza; Maria Rosa Valvano; Giorgia Benegiamo; Manlio Vinciguerra; Angelo Andriulli; Ada Piepoli Journal: J Cancer Res Clin Oncol Date: 2011-12-24 Impact factor: 4.553