Manal S Fawzy1,2, Eman A Toraih3,4. 1. Department of Biochemistry, Faculty of Medicine, Northern Border University, Arar, Saudi Arabia. 2. Department of Medical Biochemistry and Molecular Biology, Faculty of Medicine, Suez Canal University, Ismailia, Egypt. 3. Department of Surgery, School of Medicine, Tulane University, New Orleans, Louisiana, USA. 4. Genetics Unit, Department of Histology and Cell Biology, Faculty of Medicine, Suez Canal University, Ismailia, Egypt.
Abstract
BACKGROUND: Recently, unexpected autoimmune regulator (AIRE) implication in the scenario of several cancers, including breast cancer (BC), has emerged. This study aims to explore for the first time the possible association between AIRE gene rs2075876 G>A variant and BC risk in a sample of the Middle East population. METHOD: In this case-control study, we genotyped AIRE rs2075876 G>A variant in 200 unrelated patients with BC and 340 cancer-free controls using a real-time allelic discrimination polymerase chain reaction. Odds ratios (ORs) and 95% confidence intervals (CIs) were applied to estimate the strength of association under several genetic models. In silico analysis of AIRE was also executed. RESULTS: The minor allele (A) frequency of the specified variant accounted for 0.28 in the controls. G/G homozygote was significantly more frequent among patients (94%) compared to controls (66%) (P < .001). After adjusting confounding variables, individuals with A allele conferred protection against developing BC under allelic model (OR = 0.33, 95% CI = 0.20-0.55), recessive model (OR = 0.25, 95% CI = 0.10-0.60), dominant model (OR = 0.12, 95% CI = 0.05-0.29), and homozygote comparison (OR = 0.20, 95% CI = 0.08-0.50). In silico analysis revealed AIRE enrichment in several cancer-related pathways. Kaplan-Meier plotter for the cancer databases showed association of AIRE expression with prognosis in triple-negative BC (HR = 2.44, 95% CI = 1.44-4.15, log-rank P-value < .001). CONCLUSION: The AIRE rs2075876 G>A variant showed association with BC risk in the study population. Further large-scale replication studies in different ethnicity are warranted to confirm the findings.
BACKGROUND: Recently, unexpected autoimmune regulator (AIRE) implication in the scenario of several cancers, including breast cancer (BC), has emerged. This study aims to explore for the first time the possible association between AIRE gene rs2075876 G>A variant and BC risk in a sample of the Middle East population. METHOD: In this case-control study, we genotyped AIRErs2075876 G>A variant in 200 unrelated patients with BC and 340 cancer-free controls using a real-time allelic discrimination polymerase chain reaction. Odds ratios (ORs) and 95% confidence intervals (CIs) were applied to estimate the strength of association under several genetic models. In silico analysis of AIRE was also executed. RESULTS: The minor allele (A) frequency of the specified variant accounted for 0.28 in the controls. G/G homozygote was significantly more frequent among patients (94%) compared to controls (66%) (P < .001). After adjusting confounding variables, individuals with A allele conferred protection against developing BC under allelic model (OR = 0.33, 95% CI = 0.20-0.55), recessive model (OR = 0.25, 95% CI = 0.10-0.60), dominant model (OR = 0.12, 95% CI = 0.05-0.29), and homozygote comparison (OR = 0.20, 95% CI = 0.08-0.50). In silico analysis revealed AIRE enrichment in several cancer-related pathways. Kaplan-Meier plotter for the cancer databases showed association of AIRE expression with prognosis in triple-negative BC (HR = 2.44, 95% CI = 1.44-4.15, log-rank P-value < .001). CONCLUSION: The AIRErs2075876 G>A variant showed association with BC risk in the study population. Further large-scale replication studies in different ethnicity are warranted to confirm the findings.
According to the global cancer statistics, breast cancer (BC) is the most commonly diagnosed female cancer worldwide.
,
It is a heterogeneous disease that can be classified into distinct subgroups: luminal A [estrogen receptor (ER)+ and/or progesterone (PR)+/human epidermal receptor 2 (HER2)−], luminal B (ER+ and/or PR+/HER2+), (ER− and PR−/HER2+), or triple‐negative (ER−, PR−, and HER2−) subtypes according to the expression of particular clusters of co‐expressed genes.
Despite the breast cancer etiology has not been completely elucidated, it is thought to be multifactorial, with implication of both environmental and genetic factors.The immune system serves as an important natural barrier to cancer development. Innate and adaptive responses are carefully orchestrated through soluble and membrane‐bound receptors to eliminate precancerous cells and control neoplastic progression.
Avoiding immune destruction has been considered a hallmark of cancer.
,
It is therefore of great interest to identify immune genes that influence susceptibility to breast cancer.One of the best characterized genes that play a crucial role in autoimmunity is the protein‐coding autoimmune regulator (AIRE) gene (Gene ID: 326). This gene encodes a transcriptional regulator that plays an essential role to promote the negative selection of autoreactive T cells in the thymus by regulating the expression of a wide array of self‐antigens that have the commonality of being tissue‐restricted in their expression pattern in the periphery, called tissue‐restricted antigens (https://www.genecards.org/cgi‐bin/carddisp.pl?gene=AIRE). AIRE protein, in addition, was shown to have E3 ubiquitin ligase activity, which mediates proteasome pathway crucial for developing self‐tolerance, cell cycle, DNA repair, and cell trafficking.
Interestingly, not only is AIRE highly expressed in medullary thymic epithelial cells (mTECs), but there is an experimental evidence of its expression in peripheral lymphoid organs.
,
,
Gardner et al, have reported the extra‐thymic AIRE‐expressing cells (eTACs) to be a distinct “bone marrow‐derived antigen‐presenting CD45low” population that induce functional inactivation of effectors cells (CD4+) T cells mediating tolerance through a mechanism involving deficient co‐stimulation.
Very recently, AIRE expression was reported in some solid cancers, including BC cells.
,
,
Bianchi et al, for the first time, have detected an unexpected involvement of AIRE in the BC scenario.
They reported AIRE expression in BC and an association with prognosis, which provides compelling evidence for the relevance of this unusual transcription factor as a new marker for BC.Single nucleotide polymorphisms (SNPs) are the most common genetic variations, which influence inter‐individual predisposition to breast carcinogenesis and prognosis.
SNP association analysis has provided valuable information about the genetic susceptibility of BC.
The humanAIRE gene is located in the chromosome 21q22.3 region. Polymorphisms in this gene were previously shown to be genetically associated with autoimmune diseases, such as rheumatoid arthritis and autoimmune polyendocrine syndrome type 1 among others.
,
The pathogenic variant rs2075876, caused by a point mutation (c.653‐387G>A) in intron 5 (chr21:44289270), is located within exons forming the domain in the middle of protein chain, which functions in chromatin‐dependent transcriptional regulation, nuclear localization, and oligomer formation.As no case‐control study of AIRE gene variant rs2075876 in sporadic BC has been reported and given the association of AIRE expression with breast cancer prognosis reported by Bianchi et al
and confirmed by our in silico analysis (Figure 1), the authors were inspired to examine for the first time the possible association between rs2075876 variant and BC risk in a sample of Middle East population.
Figure 1
Prognostic role of AIRE gene in cancer. A, Alteration frequency of the AIRE gene in 32 pan‐cancer studies in The Cancer Genome Atlas (TCGA). B, Type of somatic mutations and copy number variations. C, Enrichment of the AIRE gene in cancer hallmarks. D‐G, Kaplan‐Meier estimates for overall survival analysis in different molecular intrinsic subtypes
Prognostic role of AIRE gene in cancer. A, Alteration frequency of the AIRE gene in 32 pan‐cancer studies in The Cancer Genome Atlas (TCGA). B, Type of somatic mutations and copy number variations. C, Enrichment of the AIRE gene in cancer hallmarks. D‐G, Kaplan‐Meier estimates for overall survival analysis in different molecular intrinsic subtypes
MATERIALS AND METHODS
Study participants
In this case‐control study, a total of 540 unrelated Egyptian women were enrolled (340 controls and 200 cases with primary BC). The cases were consecutively recruited from the Oncology Diagnostic Unit of Suez Canal University, Ismailia, Egypt. Patients had a histologically confirmed primary BC with relevant stage assigned according to the TNM classification of the Sixth Edition of the American Joint Committee on Cancer (AJCC) Manual for Staging of Cancer
and graded by the Nottingham grading system.
The participants who have pre‐surgical treatment of chemotherapy, radiotherapy, endocrine therapy, or biotherapy were excluded. The unrelated healthy control group was randomly selected from the blood bank donors based on a negative history of any disease interfering with the study and has no family history of breast cancer or any other neoplastic diseases. All participants also had no history of any chronic diseases (eg, heart disease, renal problems, and diabetes mellitus). All relevant clinical and pathological information was revised and completed from the medical files including cancer type, grade and stage at the time of diagnosis, presence of lymph node metastases, and ER, PR, and HER2 expression in breast cancer tissue. The study was conducted in accordance with the guidelines in the Declaration of Helsinki and was approved by the ethics committee of the researchers' university. The informed consent was obtained from all participants before taking part.
Blood sampling
Venous blood samples were collected from all participants under a complete aseptic condition on plain and trisodium ethylene‐diamine‐tetra‐acetic acid (EDTA) (1 mg/mL) tubes. The former tubes were centrifuged at 2500 g for 30 minutes at 4°C, and the serum was divided into aliquots and kept frozen at −80°C until analysis of any required laboratory test that is not available in the patients' medical records. The EDTA tubes were used for genotyping studies.
AIRE mutation analysis in cancer databases
The mutation spectrum of the AIRE gene in cancer was identified from The Cancer Genome Atlas (TCGA, https://cancergenome.nih.gov/), which contains genome maps from >30 cancer types. The role of the AIRE gene in cancer hallmarks was determined using Cancer Hallmarks Analytics Tool (CHAT) (http://chat.lionproject.net/), a web browser for classifying cancer‐related text and text mining–associated biological processes from PubMed articles.
The prognostic value of the AIRE gene in BC patients was explored in Kaplan‐Meier Plotter (https://kmplot.com/analysis/), which includes gene chip and RNA‐seq data from microarray and TCGA data.
The following thirty‐three datasets were included: GSE11121 (N = 200), GSE12093 (N = 136), GSE12276 (N = 204), GSE1456 (N = 159), GSE16391 (N = 55), GSE6446 (N = 120), GSE16716 (N = 47), GSE17705 (N = 196), GSE17907 (N = 54), GSE9195 (N = 77), GSE18728 (N = 61), GSE19615 (N = 115), GSE20194 (N = 45), GSE20271 (N = 96), GSE2034 (N = 286), GSE20685 (N = 327), GSE20711 (N = 90), GSE21653 (N = 240), GSE2063 (N = 99), GSE26971 (N = 276), GSE2990 (N = 102), GSE31448 (N = 71), GSE31519 (N = 67), GSE32646 (N = 115), GSE3494 (N = 251), GSE37946 (N = 41), GSE41998 (N = 279), GSE42568 (N = 121), GSE45255 (N = 139), GSE4611 (N = 153), GSE5327 (N = 58), GSE6532 (N = 82), and GSE7390 (N = 198). Multivariate analysis was performed with proportionate hazards assumption. Cutoff value for gene expression was set at the median, and the following variables were included in the model MK167, ESR1, and HER2 status. Next, stratified analysis by molecular subtyping was performed.
Allelic discrimination of rs2075876 G/A variant
Genomic DNA was extracted from the buffy coat using QIAamp DNA Blood Mini kit (Catalog No. 51 104; Qiagen) in accordance with the manufacturer's instructions. NanoDrop ND‐1000 (NanoDrop Technologies, Inc) was used for assessment of DNA concentration and purity. Genotyping of AIRE gene rs2075876 (G/A) variant was identified using StepOne™ Real‐Time Polymerase Chain Reaction (Applied Biosystems, USA). PCRs were run in a total volume of 20 µL containing genomic DNA (20 ng) with nuclease‐free water up to 9 µL, TaqMan SNP Genotyping Assay Mix (1 µL) (Applied Biosystems, assay ID C__15863141_20), TaqMan Genotyping PCR Master Mix (10‐µL). The thermocycler conditions were followed: denaturation at 95°C for 10 minutes, followed by 40 cycles of 90°C for 15 seconds and 60°C for 1 minutes.
Genotyping was conducted blind to the case/control status. Ten percent of the samples were randomly selected and re‐genotyped with 100% reproducibility in genotyping recall (Figure 2).
Figure 2
Allele discrimination plot of one allele (FAM probe) versus the other allele (VIC probe) of the real‐time PCR run (SNP Assay: rs2075876 G/A) (left panel) and the amplification plot for each sample, which represents fluorescence from both probes FAM/VIC (right panel). Alleles as wild‐type homozygous G/G (upper left blue cluster), heterozygous allele G/A (middle green cluster), mutant homozygous A/A (lower right red cluster), and the negative controls (x) that represent the background fluorescence from samples with no template DNA were retrieved by allele calling software (SDS version 1.3.1; Applied Biosystems)
Allele discrimination plot of one allele (FAM probe) versus the other allele (VIC probe) of the real‐time PCR run (SNP Assay: rs2075876 G/A) (left panel) and the amplification plot for each sample, which represents fluorescence from both probes FAM/VIC (right panel). Alleles as wild‐type homozygous G/G (upper left blue cluster), heterozygous allele G/A (middle green cluster), mutant homozygous A/A (lower right red cluster), and the negative controls (x) that represent the background fluorescence from samples with no template DNA were retrieved by allele calling software (SDS version 1.3.1; Applied Biosystems)
Statistical analysis
Data were analyzed using SPSS (IBM SPSS Statistics for Windows, version 26.0., IBM Corp.). Chi‐square and Fisher's exact tests were used for qualitative variables. Allele and genotype frequencies were calculated as previously described.
The Hardy‐Weinberg equilibrium (HWE) was estimated using the Online Encyclopedia for Genetic Epidemiology (OEGE) software (http://www.oege.org/software/hwe‐mr‐calc.shtml). Adjusted odds ratios (ORs) with a 95% confidence interval (CI) using logistic regression models were calculated for multiple genetic association models.
The study power estimation using G power‐3 software (http://www.gpower.hhu.de/) showed that with the specified case‐control study design and allowable error rate (alpha error = 0.05), the specified sample size for each group can give 99% study power. A two‐tailed P‐value of .05 was considered statistically significant.
RESULTS
Baseline characteristics of the study population
The mean age of controls was 46.1 ± 10.8 and that of BC patients was 45.3 ± 12.3 (P = .557). Demographic, clinical, and pathological characteristics are represented in Table 1.
Table 1
Demographic and clinical characteristics of breast cancer patients
Characteristics
n
%
Risk factors
FH of cancer
Negative
140
70.0
Positive
60
30.0
Degree of relation
First degree
28
14.0
Second degree
24
12.0
Third degree
8
4.0
Menarche
Late
72
36.0
Early
128
64.0
Menopause
Early
90
45.0
Late
10
5.0
OCP usage
Negative
160
80.0
Positive
40
20.0
Parity
Multipara
172
86.0
Nullipara
28
14.0
Late first gravida
Negative
192
96.0
Positive
8
4.0
Clinical presentation
Symptoms
Mastalgia
62
31.0
Breast mass
164
82.0
Skin lesions
32
16.0
Nipple change
36
18.0
Axillary pain
14
7.0
Axillary mass
36
18.0
Side
Left
74
37.0
Right
383
191.5
Site
LOQ
18
9.0
UIQ
38
19.0
UOQ
96
48.0
Retro
48
24.0
No of masses
Single
152
76.0
Multiple
48
24.0
Pathological analysis
HPD
Ductal
76
38.0
Lobular
52
26.0
Medullary
28
14.0
Mucinous
20
10.0
Tubular
12
6.0
Metaplastic
12
6.0
Grade
G2
128
64.0
G3
72
36.0
LVI
Negative
104
52.0
Positive
96
48.0
T stage
T2
100
50.0
T3
44
22.0
T4B
40
20.0
T4D
16
8.0
N stage
N0
56
28.0
N1
68
34.0
N2
64
32.0
N3
12
6.0
M stage
M0
88
44.0
M1
84
42.0
Mx
28
14.0
Site of metastasis
Bone
36
18.0
Liver
12
6.0
Lung
30
15.0
Skin infiltration
Negative
160
80.0
Positive
40
20.0
Receptor status
ER/PR
Negative
80
40.0
Positive
120
60.0
HER2+
Negative
164
82.0
Positive
36
18.0
Molecular subtype
Luminal A
96
48.0
Luminal B
24
12.0
HER2+
12
6.0
TNBC
68
34.0
IHPI score
Good
120
60.0
Moderate
68
34.0
Poor
12
6.0
Prognostic factors
Clinical stage
IIA
36
18.0
IIB
28
14.0
IIIA
24
12.0
IIIB
28
14.0
IV
84
42.0
NPI score
Good
100
50.0
Poor
100
50.0
ESMO score
Low risk
76
38.0
High risk
124
62.0
Recurrence
Negative
96
48.0
Positive
104
52.0
DFS
Prolonged
96
48.0
Short
104
52.0
Data are presented as number and percentage. Chi‐square test was applied.
Abbreviations: FH, family history; OCP, oral contraceptive pills; HRT, hormonal replacement therapy; HPD, histopathological diagnosis; LVI, lymphovascular infiltration; ER/PR, estrogen and progesterone receptor; HER2+, positive for human epidermal growth factor receptor 2/neu receptor; IHPI, Immunohistochemical Prognostic Index estimated based on the three receptor status (HER2, ER, and PR); NPI, Nottingham Prognostic Index, calculated as [0.2 × tumor size in cm] +tumor grade [1‐3] + lymph node stage [1‐3, according to stages A‐C]; ESMO, European Society of Medical Oncology; DFS, disease‐free survival.
Demographic and clinical characteristics of breast cancerpatientsData are presented as number and percentage. Chi‐square test was applied.Abbreviations: FH, family history; OCP, oral contraceptive pills; HRT, hormonal replacement therapy; HPD, histopathological diagnosis; LVI, lymphovascular infiltration; ER/PR, estrogen and progesterone receptor; HER2+, positive for humanepidermal growth factor receptor 2/neu receptor; IHPI, Immunohistochemical Prognostic Index estimated based on the three receptor status (HER2, ER, and PR); NPI, Nottingham Prognostic Index, calculated as [0.2 × tumor size in cm] +tumor grade [1‐3] + lymph node stage [1‐3, according to stages A‐C]; ESMO, European Society of Medical Oncology; DFS, disease‐free survival.
Prognostic value of AIRE from the cancer databases
Screening 10 953 cancerpatients in thirty‐two TCGA pan‐cancer studies revealed an alteration of the AIRE gene in 1.5% of cases (Figure 1A). Somatic mutations included amplification, deep deletions, fusion events, and mutations in various types of cancer (Figure 1B). According to CHAT database, AIRE is enriched in immune destruction, genome instability and mutation, resisting cell death, sustaining proliferative signaling, and tumor‐promoting inflammation (Figure 1C). KM survival curves demonstrated an association of AIRE expression level with prognosis in triple‐negative basal‐like molecular subtype of BC (HR = 2.44, 95% CI = 1.44‐4.15, log‐rank P‐value < .001) (Figure 1G).
Genotyping of AIRE polymorphism
AIRE gene variant rs2075876 is located within intron 5 and caused by substitution of G to A (c.653‐387G>A). Genotype frequencies of rs2075876 were deviated from HWE in the study population (ie, a marked decrease in the heterozygous frequency of the overall study population was observed; hence, the observed genotype frequency was not in line with the expected one) (Supplementary Table S1). MAF (A allele) accounted for 0.28 in controls. According to 1000 Genome Project (https://www.ncbi.nlm.nih.gov/variation/tools/1000genomes/), this figure was significantly different from that of Africans (MAF = 0.35) and East Asians (MAF = 0.37). On comparing between patients and controls, G/G homozygote was significantly more frequent among patients (94%) compared to controls (66%) (P < .001). In contrast, the other homozygote A/A was less represented among cases, accounting for 6% of cases, while heterozygote genotype G/A was absent in patients (Figure 3).
Figure 3
Genotype and allele frequencies of the AIRE variant in the study population. A, Chromosomal and genomic location of the AIRE rs2075876 (G/A) variant within intron 5. B, Genotype frequency of rs2075876 variant. Chi‐square test was applied. C, Allele frequency of rs2075876 variant. Chi‐square test was applied. D, Genetic association models and breast cancer risk. OR, odds ratio; 95% CI, 95% confidence interval. Adjusted by age, family history, early menarche, late menopause, nullipara, late first pregnancy, absent breastfeeding, and hormonal treatment. E, Comparison between the study allele frequency and that in other populations. AMR, American; AFR, African; EAS, East Asian; EUR, European; SAS, South Asians. (*) Significant from allele frequency identified in the study
Genotype and allele frequencies of the AIRE variant in the study population. A, Chromosomal and genomic location of the AIRErs2075876 (G/A) variant within intron 5. B, Genotype frequency of rs2075876 variant. Chi‐square test was applied. C, Allele frequency of rs2075876 variant. Chi‐square test was applied. D, Genetic association models and breast cancer risk. OR, odds ratio; 95% CI, 95% confidence interval. Adjusted by age, family history, early menarche, late menopause, nullipara, late first pregnancy, absent breastfeeding, and hormonal treatment. E, Comparison between the study allele frequency and that in other populations. AMR, American; AFR, African; EAS, East Asian; EUR, European; SAS, South Asians. (*) Significant from allele frequency identified in the study
Genetic association model for disease risk assessment
After adjusting confounding variables, individuals with A allele conferred protection against developing BC under allelic model (OR = 0.33, 95% CI = 0.20‐0.55), recessive model (OR = 0.25, 95% CI = 0.10‐0.60), dominant model (OR = 0.12, 95% CI = 0.05‐0.29), and homozygote comparison (OR = 0.20, 95% CI = 0.08‐0.50) (Figure 3D). Given the low frequency of A/A in patients (6%) and absence of heterozygote genotype, multiple regression analysis was not feasible. Table 2 shows no significant associations between the homozygous minor allele AIRE variant (A/A) with the clinicopathological features of BC patients.
Table 2
Association between clinicopathological characteristics of BC patients and AIRE rs2075876 G>A variant homozygous genotypes
Characteristics
A/A (n = 12)
G/G (n = 188)
P value
Age
Mean ± SD
48.0 ± 15.2
45.1 ± 12.3
.27
FH
8 (66.7)
172 (70.2)
.85
4 (33.3)
56 (29.8)
Skin infiltration
No
10 (83.3)
150 (79.8)
.83
Yes
2 (16.7)
38 (20.2)
Metastasis
M0
4 (33.3)
84 (44.7)
.86
M1
6 (50)
78 (41.5)
Mx
2 (16.7)
26 (13.8)
Clinical stage
IIA
2 (16.7)
34 (18.1)
.88
IIB
0 (0)
28 (14.9)
IIIA
2 (16.7)
22 (11.7)
IIIB
2 (16.7)
26 (13.8)
IV
6 (50)
78 (41.5)
NPI
Good
4 (33.3)
96 (51.1)
.67
Poor
8 (66.7)
92 (48.9)
ESMO
Low risk
4 (33.3)
72 (38.3)
.80
High risk
8 (66.7)
116 (61.7)
ER/PR
Negative
2 (16.7)
78 (41.5)
.39
Positive
10 (83.3)
110 (58.5)
HER2+
Negative
10 (83.3)
154 (81.9)
.93
Positive
2 (16.7)
34 (18.1)
Molecular subtype
Luminal A
8 (66.7)
88 (46.8)
.67
Luminal B
2 (16.7)
22 (11.7)
HER2+
0 (0)
12 (6.4)
TNBC
2 (16.7)
66 (35.1)
.67
IHP
Good
10 (83.3)
110 (58.5)
.37
Moderate
2 (16.7)
66 (35.1)
Poor
0 (0)
12 (6.4)
Recurrence
Negative
4 (33.3)
92 (48.9)
.45
Positive
8 (66.7)
96 (51.1)
Survival
Alive
8 (66.7)
156 (83)
.29
Dead
4 (33.3)
32 (17)
DFS
Mean ± SD
12.5 ± 7.8
14.3 ± 5.8
.45
OS
Mean ± SD
16.0 ± 5.4
16.5 ± 4.2
.75
Data are presented as mean ± SD or number (percentage). The t test and two‐sided chi‐square tests were used.
Abbreviations: FH, family history; NPI, Nottingham Prognostic Index; calculated as [0.2 × tumor size in cm] +tumor grade [1‐3] + lymph node stage [1‐3, according to stages A‐C]; ESMO, European Society of Medical Oncology; ER/PR, estrogen and progesterone receptors; HER2+, positive for human epidermal growth factor receptor 2/neu receptor; IHPI, Immunohistochemical Prognostic Index estimated based on the three receptor status (HER2, ER, and PR); DFS, disease‐free survival; OS, overall survival.
Association between clinicopathological characteristics of BC patients and AIRErs2075876 G>A variant homozygous genotypesData are presented as mean ± SD or number (percentage). The t test and two‐sided chi‐square tests were used.Abbreviations: FH, family history; NPI, Nottingham Prognostic Index; calculated as [0.2 × tumor size in cm] +tumor grade [1‐3] + lymph node stage [1‐3, according to stages A‐C]; ESMO, European Society of Medical Oncology; ER/PR, estrogen and progesterone receptors; HER2+, positive for humanepidermal growth factor receptor 2/neu receptor; IHPI, Immunohistochemical Prognostic Index estimated based on the three receptor status (HER2, ER, and PR); DFS, disease‐free survival; OS, overall survival.
DISCUSSION
Given the newly emerged implication of several AIRE gene variations in multiple cancers (Figure 1B), and as there is no association analysis of the study AIRE variant with BC risk has been documented, this study aimed to investigate whether AIRErs2075876 G>A polymorphism is associated with the development of BC in our population. Here, we identified that the study variant is associated with BC susceptibility under several genetic association models.As a transcriptional and post‐transcriptional regulator in medullary thymic epithelial cells and peripheral tissues, AIRE plays a substantial role in “thymocyte education” and “negative selection” by controlling the peripheral antigen expression in thymus.
Accumulating evidence indicated that AIRE protein is a non‐conventional transcriptional factor as it does not bind directly to any DNA motifs; however, it is implied in releasing the stalled RNA polymerase II at the promoter regions of the target genes to induce gene expression.
Our in silico analysis revealed that this transcriptional regulator can significantly interact with several protein partners implied in many physiological/pathological cellular processes with high confidence level (Figure 4).
These proteins, including the AIRE, have been enriched in nucleosome assembly, chromatin silencing, interleukin‐7‐mediated signaling pathway, telomere organization, regulation of immune system process, response to stress and stimulus, regulation of transcription and RNA metabolic process, and positive regulation of cell growth among other identified processes (Table S2). KEGG (Kyoto Encyclopedia of Genes and Genomes) enrichment analysis also revealed that AIRE is involved in “transcriptional misregulation in cancer” pathway (hsa05202) (Table S2). Interestingly, all the above‐mentioned pathways and processes were identified to play essential roles in BC etiology, progression, and/or outcome,
,
,
,
,
,
,
,
,
which add evidence to AIRE implication in the BC.
Figure 4
STRING analysis of the AIRE protein (accession#: CAA08759) with its experimentally validated functional partners. All the proteins encoded by a single, protein‐coding gene locus are represented as network nodes. Protein‐protein associations (ie, contribute to a shared function) are represented by colored edges, which indicate the type of interaction evidence that is explained in the figure key (right side). HIST2H3PS2, histone cluster 2 H3 pseudogene; HIST1H3A, histone cluster 1 H3 family member a; H3F3A, H3 histone, family 3A. Source, STRING version 11 (https://string‐db.org/)
STRING analysis of the AIRE protein (accession#: CAA08759) with its experimentally validated functional partners. All the proteins encoded by a single, protein‐coding gene locus are represented as network nodes. Protein‐protein associations (ie, contribute to a shared function) are represented by colored edges, which indicate the type of interaction evidence that is explained in the figure key (right side). HIST2H3PS2, histone cluster 2 H3 pseudogene; HIST1H3A, histone cluster 1 H3 family member a; H3F3A, H3 histone, family 3A. Source, STRING version 11 (https://string‐db.org/)Several AIRE variants were previously shown to be associated with many autoimmune diseases,
including rheumatoid arthritis,
,
vitiligo,
systemic sclerosis,
Addison disease,
systemic lupus erythematosus,
and alopecia areata.
However, a limited number of association studies for rs2075876 G>A polymorphism were identified.
,
,
,
,
Furthermore, no previous study explored the relationship of the specified study variant with the BC.As rs2075876 G>A is an intronic variant with low pathogenicity prediction score (6.657) based on the online CADD PHRED score prediction tool for single or combined variantshttps://cadd.gs.washington.edu/snv,
it is difficult to explain the exact biological function of this polymorphism. However, the non‐coding variants at the transcription level may disrupt the transcription factor binding sites, the splice sites, and other functional sites as proposed by Chen et al
and supported recently by Conteduca et al.
According to the GEO (Gene Expression Omnibus) database mining, Terao et al revealed a statistically significant correlation between the risk allele (A) of rs2075876 and AIRE down‐regulation.
Furthermore, the specified variant may be in linkage disequilibrium with distinct genetic markers, which synergistically influence disease/cancer risk that deserve future functional studies to evaluate the role of this putative variant in disease risk.It is worth noting that despite the genotype frequencies of the study variant deviated from the HWE, no standard guidelines for rejecting SNPs that depart from HWE have been developed.
In practice, all variants for which HWE P‐values decrease below a predetermined threshold should be checked manually for genotyping quality.
As the current genotyping of this study was based on a high‐quality real‐time TaqMan allelic discrimination PCR assay with genotype recall rate (100%), and the specified quality control measurements were applied throughout the methodology, the current variant departure from HWE suggested a violation of HWE assumptions in the study groups rather than technical genotyping errors.
,Although this study has enough power to detect the association of the specified polymorphism with BC risk, some potential limitations merit careful consideration: first, as the patients and controls recruited from one center may not be representative of the general population. Second, with a deficit in the number of observed heterozygotes as well as the presence of few cases carrying A/A genotype, this could potentially affect HWE congruence. Thus, our findings should be confirmed by large‐scale multicenter studies in different populations. Third, the authors have to confirm that despite there was a significant association with BC risk conferred by an individual locus, this might have a small impact on the disease risk as genes act multiplicatively. However, this small impact could be included in more advanced multiplicative polygenic risk scores.
Individually, these risk alleles confer a very small increase in BC risk but their joint effect with other lifestyle risk factors could “improve risk‐reduction and screening strategies by targeting those most likely to benefit” as concluded by Lakeman et al.
Fourth, the underlying mechanisms of the implication of this SNP in BC were not investigated. Further functional analyses are required to explore the molecular mechanisms by which this variant could participate in BC susceptibility.In conclusion, our study provided evidence that rs2075876 G>A variant of the AIRE gene may contribute to BC susceptibility. It is worth noting that in combination with other genetic and non‐genetic data, this variant might be helpful in refining the BC patients profile. The efficiency of population‐based preventive programs such as screening mammography could be improved in a near future by targeting women who are at the greatest risk for BC according to their genotyping profile and other environmental risk factors.
CONFLICT OF INTEREST
The authors declare that they have no conflict of interest.Table S1Click here for additional data file.Table S2Click here for additional data file.
Authors: Xiaofen Ye; Alexa A Franco; Hidelita Santos; David M Nelson; Paul D Kaufman; Peter D Adams Journal: Mol Cell Date: 2003-02 Impact factor: 17.970
Authors: Matthieu Giraud; Hideyuki Yoshida; Jakub Abramson; Peter B Rahl; Richard A Young; Diane Mathis; Christophe Benoist Journal: Proc Natl Acad Sci U S A Date: 2011-12-27 Impact factor: 11.205
Authors: Isela Montufar-Robles; José Carlos Robles-Garnica; Daniel Cadena-Sandoval; Rosa Elda Barbosa-Cobos; Daniel David González-Castillo; Juanita Romero-Diaz; Fausto Sánchez-Muñoz; Miguel A Saavedra; Elizabeth Olivares-Martínez; Dafhne Miranda-Hernández; Julian Ramírez-Bello Journal: Cell Immunol Date: 2019-09-11 Impact factor: 4.868
Authors: Eman A Toraih; Manal S Fawzy; Eman A Mohammed; Mohammad H Hussein; Mohamad M El-Labban Journal: Mol Diagn Ther Date: 2016-12 Impact factor: 4.074
Authors: M Faiyaz-Ul-Haque; B Bin-Abbas; A Al-Abdullatif; H Abdullah Abalkhail; M Toulimat; S Al-Gazlan; A M Almutawa; A Al-Sagheir; I Peltekova; F Al-Dayel; S H E Zaidi Journal: Clin Genet Date: 2009-09-15 Impact factor: 4.438
Authors: Freddie Bray; Jacques Ferlay; Isabelle Soerjomataram; Rebecca L Siegel; Lindsey A Torre; Ahmedin Jemal Journal: CA Cancer J Clin Date: 2018-09-12 Impact factor: 508.702
Authors: Enas F Elngar; Mona A Azzam; Ayman A Gobarah; Eman A Toraih; Manal S Fawzy; Nouran B AbdAllah Journal: Front Pediatr Date: 2022-05-20 Impact factor: 3.569