Literature DB >> 30922380

Genomic signature of parity in the breast of premenopausal women.

Julia Santucci-Pereira1, Anne Zeleniuch-Jacquotte2,3, Yelena Afanasyeva2, Hua Zhong2, Michael Slifker4, Suraj Peri4, Eric A Ross4, Ricardo López de Cicco5, Yubo Zhai5, Theresa Nguyen5, Fathima Sheriff5, Irma H Russo5, Yanrong Su5, Alan A Arslan2,6, Pal Bordas7,8, Per Lenner8, Janet Åhman7, Anna Stina Landström Eriksson7, Robert Johansson9, Göran Hallmans10, Paolo Toniolo6, Jose Russo5.   

Abstract

BACKGROUND: Full-term pregnancy (FTP) at an early age confers long-term protection against breast cancer. Previously, we reported that a FTP imprints a specific gene expression profile in the breast of postmenopausal women. Herein, we evaluated gene expression changes induced by parity in the breast of premenopausal women.
METHODS: Gene expression profiling of normal breast tissue from 30 nulliparous (NP) and 79 parous (P) premenopausal volunteers was performed using Affymetrix microarrays. In addition to a discovery/validation analysis, we conducted an analysis of gene expression differences in P vs. NP women as a function of time since last FTP. Finally, a laser capture microdissection substudy was performed to compare the gene expression profile in the whole breast biopsy with that in the epithelial and stromal tissues.
RESULTS: Discovery/validation analysis identified 43 differentially expressed genes in P vs. NP breast. Analysis of expression as a function of time since FTP revealed 286 differentially expressed genes (238 up- and 48 downregulated) comparing all P vs. all NP, and/or P women whose last FTP was less than 5 years before biopsy vs. all NP women. The upregulated genes showed three expression patterns: (1) transient: genes upregulated after FTP but whose expression levels returned to NP levels. These genes were mainly related to immune response, specifically activation of T cells. (2) Long-term changing: genes upregulated following FTP, whose expression levels decreased with increasing time since FTP but did not return to NP levels. These were related to immune response and development. (3) Long-term constant: genes that remained upregulated in parous compared to nulliparous breast, independently of time since FTP. These were mainly involved in development/cell differentiation processes, and also chromatin remodeling. Lastly, we found that the gene expression in whole tissue was a weighted average of the expression in epithelial and stromal tissues.
CONCLUSIONS: Genes transiently activated by FTP may have a role in protecting the mammary gland against neoplastically transformed cells through activation of T cells. Furthermore, chromatin remodeling and cell differentiation, represented by the genes that are maintained upregulated long after the FTP, may be responsible for the lasting preventive effect against breast cancer.

Entities:  

Keywords:  Breast cancer risk; Breast differentiation; Gene expression profiling; Immune response; Normal breast transcriptome; Parous and nulliparous breast transcriptome; Pregnancy

Mesh:

Substances:

Year:  2019        PMID: 30922380      PMCID: PMC6438043          DOI: 10.1186/s13058-019-1128-x

Source DB:  PubMed          Journal:  Breast Cancer Res        ISSN: 1465-5411            Impact factor:   6.466


Background

The association of parity with breast cancer risk is well documented by both epidemiological and experimental data [1-4]. While the relationship is complex, with a transient increase in risk after each full-term pregnancy (FTP), the long-term effect for women who have their first FTP at an early age is a marked reduction in risk [5]. A better understanding of the molecular mechanisms underlying the effects of parity on the breast may help develop strategies to prevent breast cancer. We previously reported the results of a study that assessed gene expression differences in the breast of 67 parous (P) and 40 nulliparous (NP) postmenopausal women who were free of any pathology and had volunteered to undergo a tissue biopsy [6-8]. We reported that in the postmenopausal breast, parity-induced gene expression changes were related to differentiation of this organ [6]. More specifically, we found that genes upregulated in the P breast, as compared to the NP breast, represented biological processes involved in differentiation and development, cell junction, RNA metabolic processes, and splicing machinery. The downregulated genes represented biological processes involved in cell proliferation, regulation of IGF-like growth factor receptor signaling, somatic stem cell maintenance, muscle cell differentiation, and apoptosis [6, 7]. We here report on a study with a similar design and conducted in the same general population, but focusing on premenopausal women. The main objective of this study was to assess the parity-associated gene expression differences in the breast of premenopausal women. Because the breast undergoes involution after pregnancy and there is a short-term increase in breast cancer risk following each FTP, we examined the gene expression differences in P vs. NP women as a function of time since last FTP. Additionally, we conducted a substudy, in which laser capture microdissection (LCM) was used to isolate breast epithelial cells from the stroma to evaluate how the gene expression observed in RNA extracted from whole breast tissue relates to gene expression in RNA extracted from breast epithelial cells and from stroma separately.

Methods

Study population and eligibility criteria

Study subjects were volunteers recruited among healthy women between the ages of 29 and 47 years and residing in Norrbotten County, Sweden. Exclusion criteria included a history of any cancer, complete bilateral oophorectomy, breast biopsy or breast implants, and hormonal treatment for infertility. Women who had completed a FTP or breastfed in the 12 months prior to enrollment, used oral contraceptives in the 6 months prior to enrollment, or used thyroid or steroid hormones, anti-coagulants, or diabetes medications in the 3 months prior to enrollment were also ineligible. The study was approved by the Regional Ethical Review Board for Northern Sweden at the University of Umeå, Sweden. Volunteers who signed informed consent were scheduled for a biopsy. Women who had not had a mammogram within the year preceding enrollment received one prior to the biopsy to exclude breast cancer. Parous (P) women were defined as women who had had one or more full-term pregnancies, defined as a pregnancy lasting at least 37 weeks. The nulliparous (NP) group included women who had never been pregnant or who had no history of pregnancies lasting more than 8 weeks.

Data and breast tissue collection

Eligible volunteers completed a questionnaire that collected data on reproductive history, medical history, height and weight, first-degree family history of breast cancer, history of tobacco use, and current medications. Breast core needle biopsies were performed by two experienced radiologists (P. Bordas and A. Eriksson) at the Mammography Department at Sunderby Hospital, Luleå, Sweden. A 12-Gauge BARD® MONOPTY® core biopsy needle was used, and four to eight biopsies were taken from the upper outer quadrant of one breast. One biopsy specimen was placed in 70% ethanol for histopathological analysis, and the remaining in RNALater® (Ambion) solution. Tissue samples were stripped of all personal identifiers and sent to Fox Chase Cancer Center for analysis. All samples were first reviewed by the study pathologist (J. Russo) to confirm the absence of atypia or cancer. During all the experiments, the researchers at Fox Chase Cancer Center were blinded to the parity status of the samples.

Gene expression microarrays

Total RNA from the biopsies was isolated using the Allprep RNA/DNA Mini Kit (Qiagen, Alameda, CA, USA). Quantity and quality of total RNA were assessed using NanoDrop v3.3.0 (NanoDrop Technologies, Wilmington, DE, USA) and Agilent 2100 Bioanalyzer (Agilent Technologies, CA, USA), respectively. GeneChip Expression 3′-Amplification Two-Cycle cDNA Synthesis Kit (Affymetrix, Santa Clara, CA) was used for sample preparation and hybridization to Affymetrix Human Genome Gene Chip U133 Plus 2.0 arrays. For quality control purposes, we included in each batch (9 to 12 samples) one blinded duplicate sample from another batch. After scanning, all microarrays were subjected to quality control (QC) analysis to ensure that they were in the acceptable ranges for standard Affymetrix quality measures (Scale Factor, Percent Present, and Average Background). In addition, quality was assessed using graphical tools based on Affymetrix probe-level models (PLM) [9]. The Normalized Unscaled Standard Error (NUSE) plot, in particular, was used to disqualify lower quality arrays. Ten arrays (8%—9 P and 1 NP) did not fulfill quality criteria and were not included in the statistical analysis. The concordance correlation coefficients for the QC replicates were greater than 98%.

Data preprocessing and batch adjustment

The Affymetrix data were analyzed using R language for statistical computing (R version 2.14.1) [10] and Bioconductor [11]. Preprocessing methods and filtering criteria were similar to those used in our previous study [6, 7]. Probesets for which both the proportion of present calls was < 75% and the difference in proportion of present calls between P and NP samples was < 25% were filtered out. Probesets with coefficient of variation (CV) across samples falling in the first quartile were also excluded. After filtering, 14,920 probesets (27%) remained in the analysis. To account for inter-batch variability, the data were adjusted for batch using the ComBat method [12] implemented in the Bioconductor package sva [13].

Statistical analysis of gene expression differences between parous and nulliparous women

Linear regression models were used to identify probesets differentially expressed in P vs. NP samples. For each probeset, an unadjusted p value measuring the significance of parity (yes/no) as an independent predictor of the log-transformed normalized gene expression value was calculated using single regression. We also used multiple regression analysis to identify differentially expressed probesets while controlling for potential confounders. The associations of subject characteristics with parity status and with gene expression were examined to identify potential confounders. Multivariate models were adjusted for age, body mass index (BMI) (which was associated with gene expression), and smoking duration (which was associated with both gene expression and parity status). We also adjusted for phase of cycle/use of a hormonal IUD, which could affect gene expression. False discovery rate (FDR) was used to control for multiple comparisons, using QVALUE in the R package version 1.28.0 [14]. In order to identify the most robust parity-associated differences in gene expression, we first analyzed the data using a discovery/validation resampling approach. A discovery dataset was generated by selecting at random 2/3 of the P women and 2/3 of the NP women from the complete dataset. The remaining women formed a corresponding validation dataset. This step was repeated 12 times, leading to 12 discovery/validation dataset pairs. Probesets with FDR < 20% in any discovery dataset and p value < 0.05 in the corresponding validation dataset were considered validated for this dataset pair. We report the probesets (and corresponding genes) that were validated in at least 2 of the 12 dataset pairs. The breast undergoes involution after pregnancy, which is likely to be associated with transient changes in gene expression. Further, although the long-term effect of early parity (before 35 years of age) is a reduction in breast cancer risk for pregnancy [5], it is well documented that there is a short-term increase in risk after each FTP. This suggests that the gene expression pattern in the breast may be different in the first few years after pregnancy than in later years. Therefore, we examined the parity-associated gene expression differences according to time between last FTP and biopsy (time since last pregnancy, TSLP). To optimize our chances of detecting TSLP-related differences in gene expression, we included in these analyses probesets that were differentially expressed (FDR < 10% and at least 1.2-fold change) in the subgroup of P women whose last FTP was ≤ 5 years before biopsy as compared to NP women, in addition to the probesets identified in the overall P-NP comparison. All women were included in these analyses and the patterns of expression were examined using clustering analysis in women classified according to TSLP (< 5, 5–10, or > 10 years). Considering upregulated and downregulated genes separately, K-means cluster analyses were performed using Multiexperiment Viewer software (MeV- v.4.8.1) [15], with Pearson uncentered as distance metric. We examined the clusters formed after randomly setting the number of clusters (K) to 2, 3, 4, or 5.

Mining for functional categories and pathways

Data mining methods were applied to the differentially expressed genes to detect biological processes and pathways affected by parity. Ingenuity® Pathways Analysis (IPA) software version 24390178 (QIAGEN) was used to investigate canonical pathways. Gene Ontology (GO) enrichment analysis was performed using conditional hypergeometric tests in the Bioconductor GOstats package [16]. We carried out analyses separately for each cluster of upregulated genes. GO (gene ontology) terms with p value < 0.01 were considered enriched. To evaluate the GO terms enriched by each cluster of genes, the terms were grouped into broader terms (developmental process, immune response, or others) following the GO hierarchical tree graph view from GO consortium [17]. Few genes were identified as downregulated; therefore, we did not conduct GO analyses for these genes but rather examined the literature to identify their roles. In addition, genes that were validated by real-time RT-PCR were used for analysis into cBioPortal for Cancer Genomics (http://www.cbioportal.org/) [18, 19]. We evaluated whether these genes have been described to be modified in breast cancer cases available in the cBioPortal databank; in addition, an overall survival Kaplan-Meier curve was generated. In total, 11 genes were evaluated among 5796 breast cancer patients.

Validation through real-time RT-PCR

Eleven genes were selected for real-time RT-PCR analyses based on their biological roles in cell differentiation, proliferation, and chromatin remodeling. The assays were performed in the subset of 17 NP and 20 P samples (10 with TSLP < 5, and 10 with TSLP > 5) from whom sufficient RNA remained available, using TaqMan® Gene Expression Master Mix and TaqMan® Gene Expression Assays (Life Technologies). The end point used in the RT-PCR quantification, Ct, was defined as the PCR cycle number at which each assay target passes the threshold. Each assay was normalized to 18S, a housekeeping gene used as endogenous control (ΔCt). The difference between parous and nulliparous were estimated as the difference in mean ΔCt values (−ΔΔCt). To assess the statistical significance of the differences between P and NP, batch-adjusted p values were calculated using linear regression and comparisons with p value < 0.10 and fold change of at least 1.2 were considered statistically significant. Gene expression measured using RT-PCR and Affymetrix arrays were compared in the 37 subjects for whom both assays were used. Fold changes were estimated from multiple regressions using batch-adjusted, RMA (Robust Multi-Array Average)-normalized gene expression intensities, and intraclass and Spearman correlation coefficients were calculated.

Immunohistochemical (IHC) staining

Paraffin sections at 4 μm were deparaffinized and placed in the antigen unmasking solution (Vector Laboratories, Burlingame, CA) and microwaved for 10 min at 100 °C. After cooling for 20 min, slides were quenched with Peroxide Block (BioGenex, Fremont, CA; #HK111) for 10 min, followed by blocking with Power Block (BioGenex, #HK085) for 20 min at room temperature. The sections were then stained with primary antibodies using a i6000 BioGenex Autostainer following standard protocol. The antibodies used were as follows: Cytokeratin 5 (BioGenex, #AN484-10 M, pre-diluted), CD123 (BD Biosciences, #554527; 1:400 dilution), LAMP3 (Abcam, #ab111090), Desmocollin 3 (Abcam, #ab190118; 1:150 dilution), CD2 (Abcam, #ab131276; 1:200 dilution), and CD3D (Abcam, #ab109531; 1:150 dilution). A Super Sensitive TM Polymer-HRP Detection System (BioGenex; #QD430-XAKE) was used to detect the staining. The images were acquired at × 400 magnification using an Aperio Digital Pathology Slide Scanner (Leica Biosystems, Buffalo Grove, IL) and analyzed by Aperio ImageScope Software (Leica Biosystems).

Laser capture microdissection (LCM)

In additional samples, we conducted a substudy to assess how gene expression in RNA extracted from whole breast tissue relates to gene expression in RNA extracted from epithelial and stromal tissues. Breast biopsy tissue fixed in RNA later was frozen and cryostat was used for histological sectioning. The frozen sections were then stained with hematoxylin and eosin specially prepared utilizing RNAse-free water to avoid RNA degradation [20]. LCM was performed using the VERITAS Microdissection Instrument (Arcturus, CA, USA) to select and capture all the epithelial tissue present in each section. The tissue left on the slide was then scrapped into a different tube and classified as stroma. The RNA extraction from the collected cells was performed using the Arcturus® PicoPure®RNA Isolation Kit (Life Technologies). RNA was also extracted from a second core biopsy, in which no LCM was performed (called hereafter whole tissue, WT). For each woman included in this substudy, three microarrays were performed in the same batch, using RNA extracted from (1) the epithelial cells of the mammary gland, (2) the stroma, and (3) WT. RNA amplification and labelling was performed using MessageAmpTM Premier RNA Amplification Kit (Life Technologies), and the arrays were hybridized to Affymetrix Human Genome Gene Chip U133 Plus 2.0 arrays. All arrays were subjected to QC analysis as described earlier. The arrays were RMA pre-processed, and probesets with < 75% of present calls and/or low CV (i.e., CV in the first quartile) were filtered out, resulting in 10,252 probesets for analysis. All values were log-transformed and normalized prior to analysis. Nine subjects (5 NP and 4 P) had successful arrays for whole tissue, epithelial tissue, and stromal tissue. For each subject, a linear regression model was fitted across all genes. The gene expression values in the whole tissue were modeled as a linear function of the gene expression in epithelial and stromal tissues. Gene expression comparison between the breast tissue types was performed for ten subjects (5 P and 5 NP) with successful epithelium and stroma arrays. For each probe, the fold changes were calculated as the median of within subject fold changes [expression in epithelium] / [expression in stroma] for each subject. A paired two-sample t-test was performed for each probe set, and p values were adjusted for multiple comparisons using the FDR approach. Probes with FDR < 10% and fold changes of at least 20% were considered statistically different between epithelium and stroma. GO analysis was performed using the same methodology described above. The small number of samples limited our gene expression analyses between tissue types in function of parity.

Results

Volunteers included in the analysis

A total of 307 women between ages 29 and 47 volunteered between March 2011 and June 2012 (Fig. 1). After exclusions related to eligibility, lack of epithelial structures, or QC failures, samples from 109 women (30 NP and 79 P) were included in the main study comparing P vs. NP, and samples from 10  women were included in the LCM substudy comparing the tissue types (Fig. 1).
Fig. 1

Subject accrual and sample processing summary. Figure shows the number of women who volunteered, reasons for exclusion, and allocation of samples to the P/NP comparison study and LCM substudy

Subject accrual and sample processing summary. Figure shows the number of women who volunteered, reasons for exclusion, and allocation of samples to the P/NP comparison study and LCM substudy P and NP women were similar with respect to most characteristics, such as age of menarche, breast density, and body mass index (BMI) (Table 1). However, the median age and median smoking duration were lower in NP than P women, and a larger proportion of P had a hormonal intra-uterine device (IUD).
Table 1

Descriptive characteristics of the study subjects

CharacteristicsParous (n = 79)Nulliparous (n = 30)p value
Age at visit, years39.9 (30.1, 47.3)35.8 (30.1, 46.2)0.03
Number of FTP
 113 (16%)
 244 (46%)
 3+22 (28%)
Time since last FTP, years
 ≤ 530 (39%)
 6–1029 (37%)
 > 1020 (25%)
Age at first FTP, years26 (18, 27)
Phase of cycle/hormonal IUD10.05
 Luteal21 (28%)14 (52%)
 Ovulatory10 (13%)4 (15%)
 Follicular21 (28%)7 (26%)
 Hormonal IUD23 (31%)2 (7%)
 Missing43
OC ever use74 (94%)27 (90%)0.68
Breast density, BIRADS
 110 (13%)4 (13%)0.38
 216 (20%)2 (7%)
 310 (13%)4 (13%)
 443 (54%)20 (67%)
Age at menarche, years13.0 (11.0, 16.0)13.0 (11.0, 15.0)0.60
Family history of breast cancer7 (9%)2 (7%)0.99
Height, cm165.0 (148.0, 184.0)166.5 (157.0, 174.0)0.77
Weight, kg68.0 (41.0, 115.0)64.0 (46.0, 97.0)0.38
BMI24.2 (18.7, 38.0)23.4 (18.7, 37.1)0.24
Smoking
 Never51 (64%)20 (67%)0.15
 Past23 (29%)5 (17%)
 Current5 (6%)5 (17%)
 Smoking duration, years8.0 (0.5, 20.0)17.0 (7.0, 30.0)0.006
 Years since quitting (past smokers)13.0 (0.1, 26.0)6.0 (1.0, 12.0)0.09

1p value for IUD/ no IUD = 0.01; excluding women having an IUD, there was no statistically significant difference in phase of menstrual cycle between P and NP

Descriptive characteristics of the study subjects 1p value for IUD/ no IUD = 0.01; excluding women having an IUD, there was no statistically significant difference in phase of menstrual cycle between P and NP

Differential gene expression

Using the discovery and validation approach described in “Methods”, 54 probesets, representing 43 genes (Table 2), were differentially expressed between P and NP women. Of the 43 genes, 40 were upregulated and 3 downregulated in the parous premenopausal breast (Table 2). Upregulated genes in the P breast included APOBEC3G, DSC3, FZD8, and EAF2, while FOXQ1 was among the downregulated genes.
Table 2

Genes differentially expressed between parous and nulliparous premenopausal women (discovery/validation approach)

ProbeIDEntrezIDSymbolGeneNameFDRFold change
Genes upregulated in parous women
 206641_at608TNFRSF17Tumor necrosis factor receptor superfamily, member 170.0132.41
 228504_at6332SCN7ASodium channel, voltage-gated, type VII, alpha subunit0.0062.15
 237625_s_at3514IGKCImmunoglobulin kappa constant0.0061.94
 222838_at57823SLAMF7SLAM family member 70.0171.93
 224342_x_at96610LOC96610BMS1 homolog, ribosome assembly protein (yeast) pseudogene0.0321.80
 206121_at270AMPD1Adenosine monophosphate deaminase 10.0211.76
 206033_s_at1825DSC3Desmocollin 30.0061.68
 1555759_a_at6352CCL5Chemokine (C-C motif) ligand 50.0181.67
 206478_at9834KIAA0125KIAA01250.0061.65
 213193_x_at28639TRBC1T cell receptor beta constant 10.0061.60
 207651_at29909GPR171G protein-coupled receptor 1710.0141.60
 206310_at6691SPINK2Serine peptidase inhibitor, Kazal type 2 (acrosin-trypsin inhibitor)0.0061.59
 231647_s_at83416FCRL5Fc receptor-like 50.0121.59
 203130_s_at3800KIF5CKinesin family member 5C0.0231.58
 205831_at914CD2CD2 molecule0.0121.54
 216430_x_at28823IGLV1-44Immunoglobulin lambda variable 1-440.0341.52
 211339_s_at3702ITKIL2-inducible T cell kinase0.0161.50
 206666_at3003GZMKGranzyme K (granzyme 3; tryptase II)0.0301.49
 204562_at3662IRF4Interferon regulatory factor 40.0171.47
 213539_at915CD3DCD3d molecule, delta (CD3-TCR complex)0.0111.45
 206181_at6504SLAMF1Signaling lymphocytic activation molecule family member 10.0141.45
 206150_at939CD27CD27 molecule0.0191.44
 206991_s_at1234CCR5Chemokine (C-C motif) receptor 5 (gene/pseudogene)0.0101.41
 235153_at138065RNF183Ring finger protein 1830.0261.41
 214470_at3820KLRB1Killer cell lectin-like receptor subfamily B, member 10.0381.40
 230011_at150365MEI1Meiosis inhibitor 10.0101.39
 211902_x_at10730YME1L1YME1-like 1 (S. cerevisiae)0.0151.37
 221584_s_at3778KCNMA1Potassium large conductance calcium-activated channel, subfamily M, alpha member 10.0341.36
 219551_at55840EAF2ELL associated factor 20.0151.35
 220402_at63970TP53AIP1Tumor protein p53 regulated apoptosis inducing protein 10.0151.35
 206761_at10225CD96CD96 molecule0.0361.33
 212314_at23231SEL1L3Sel-1 suppressor of lin-12-like 3 (C. elegans)0.0141.31
 204682_at4053LTBP2Latent transforming growth factor beta binding protein 20.0061.31
 207655_s_at29760BLNKB cell linker0.0151.31
 204205_at60489APOBEC3GApolipoprotein B mRNA-editing enzyme, catalytic polypeptide-like 3G0.0141.26
 223322_at83593RASSF5Ras association (RalGDS/AF-6) domain family member 50.0241.25
 227405_s_at8325FZD8Frizzled family receptor 80.0241.24
 211469_s_at10663CXCR6Chemokine (C-X-C motif) receptor 60.0191.22
 233555_s_at55959SULF2Sulfatase 20.0321.20
 209574_s_at753LDLRAD4Low-density lipoprotein receptor class A domain containing 40.0451.15
Genes downregulated in parous women
 227475_at94234FOXQ1Forkhead box Q10.0260.64
 244680_at2743GLRBGlycine receptor, beta0.0460.79
 236399_at54103PIONPigeon homolog (Drosophila)0.0410.86
Genes differentially expressed between parous and nulliparous premenopausal women (discovery/validation approach) Analyses of gene expression according to TSLP identified 286 genes (416 probesets) differentially expressed between P and NP samples (Fig. 2). Among these, 238 genes (352 probesets) were upregulated in P women, while 48 genes (64 probesets) were downregulated (Additional file 1).
Fig. 2

Heatmap of all 416 probesets found differentially expressed between parous and nulliparous. Each column represents a volunteer. Dark blue represents parous women within 5 years since last full-term pregnancy, blue represents parous women with more than 5 years since pregnancy, and cyan represents nulliparous women. Each line corresponds to a probeset, which accordingly with its pattern of expression within the parous breast samples compared to nulliparous, was classified as downregulated (light yellow), transiently upregulated (yellow), long-term changing (orange), and long-term constant upregulated (brown)

Heatmap of all 416 probesets found differentially expressed between parous and nulliparous. Each column represents a volunteer. Dark blue represents parous women within 5 years since last full-term pregnancy, blue represents parous women with more than 5 years since pregnancy, and cyan represents nulliparous women. Each line corresponds to a probeset, which accordingly with its pattern of expression within the parous breast samples compared to nulliparous, was classified as downregulated (light yellow), transiently upregulated (yellow), long-term changing (orange), and long-term constant upregulated (brown) For probesets upregulated in the P women, gene expression differences clustered in three expression patterns as a function of TSLP (Fig. 3). The first cluster consisted of 83 genes (107 probesets) which were upregulated following the last FTP, but whose expression progressively returned to the level of expression observed in NP women (Fig. 3a). These 83 genes were named “transient” genes. The second cluster consisted of probesets for which the P-NP differences were the highest for women with < 5 years since last FTP, decreased with increasing TSLP but remained elevated as compared to NP women. The 95 genes (154 probesets) in this cluster were called “long-term changing” (Fig. 3b). In the last cluster, which included 60 genes (91 probesets), the fold changes between P an NP appeared constant, regardless of the TSLP. Therefore, we called these “long-term constant” genes (Fig. 3c).
Fig. 3

Three expression patterns were identified among the upregulated genes. In the top, the graph shows the fold change ranges of the each cluster: a transient, b long-term changing, and c long-term constant according to time since last full-term pregnancy (< 5 years, 5–10 years, and > 10 years). Heatmaps show the expression patterns of each cluster. d Schematic representation of the expression levels of each cluster in function of time since last pregnancy

Three expression patterns were identified among the upregulated genes. In the top, the graph shows the fold change ranges of the each cluster: a transient, b long-term changing, and c long-term constant according to time since last full-term pregnancy (< 5 years, 5–10 years, and > 10 years). Heatmaps show the expression patterns of each cluster. d Schematic representation of the expression levels of each cluster in function of time since last pregnancy Gene Ontology (GO) enrichment analyses showed an abundance of GO terms associated to developmental processes or immune response (Fig. 4), other less-abundant enriched GO terms were related to proliferation, intracellular transport, and cell death. Among the genes whose expression was transiently affected by parity 55%, including CD8A, XCL1, and GZMA (Table 3), enriched GO terms related to immune response (Additional file 2). Among the genes classified as long-term changing, 32% were related to immune response (e.g., CD2) and 24% were involved in developmental processes (e.g., EAF2) (Table 3, Additional file 3). Notably, of the long-term constant genes, 56% were related to developmental/differentiation processes (Additional file 4), including EGR3, DSC3, KRT5, and FZD8 (Table 3). These data indicate that the proportion of genes involved in immune response decreases among the genes that are upregulated irrespectively of TSLP, while the proportion of genes related to developmental processes increases (chi-squared p value = p < 0.001).
Fig. 4

Graph shows the percentage of genes associated with GO terms related to developmental processes (red) and immune response (blue) for each of the three clusters. 74, 79, and 50 are the total number of genes associated at least with one GO term for each cluster

Table 3

Genes that enriched developmental processes and immune response gene ontologies

Transient genes
 Developmental process
  ANXA1 CAMK4 CD8A CD27 EOMES ITK
  JAK3 LCK PTPN22 PTPRC SASH3 THEMIS
 Immune response
  ANXA1 CAMK4 CCL5 CD8A CD27 CD48
  CD96CD247CORO1ACST7CXCL14CXCL9
  EOMESFASLGFYBGZMAHCSTIGLC1
  IKZF1IL16IL7RITGAL ITK JAK3
  KLHL6LATLAX1 LCK LCP2LY9
  PRKCB PTPN22 PTPRC SASH3 SELPLGSEMA4D
  sSH2D1ASLAMF1 THEMIS TRACXCL1
Long-term changing genes
 Developmental process
  C3 CCR2 CD2 CXCL10 DACT1DKK3
  EAF2EPHA7FGF1FGFR2HCLS1 HLA-DOA
  IL12RB1 IRF4 LAMA2OSR2PDGFRASIPA1L1
  SPHK1
 Immune response
  APOBEC3GBLNKC1S C3 CCR2 CD2
  CD3DCD38CRTAM CXCL10 GPR183 HLA-DOA
  HLA-DPB1IGKC IL12RB1 IRF4 LPXNMZB1
  NFATC2PAWRPOU2AF1PRKCQSAMSN1SLAMF7
  TRBC1
Long-term constant genes
 Developmental process
  BHLHE22 CCL19 CCL2 DCNDSC3EGR3
  EPHB1FHL2FZD8 GLI3 GPR65KCNMA1
  KIF5CKRT5MYLKNFASCPRKCASALL1
  SDC1SULF1SULF2TAGLNTREM2TYMS
  VCAM1WIPF3XDHZIC1
 Immune response
  CCL2 CCL19 GLI3 HLA-DPA1HLA-DRA VCAM1

Italicized genes enriched both developmental and immune response processes

Graph shows the percentage of genes associated with GO terms related to developmental processes (red) and immune response (blue) for each of the three clusters. 74, 79, and 50 are the total number of genes associated at least with one GO term for each cluster Genes that enriched developmental processes and immune response gene ontologies Italicized genes enriched both developmental and immune response processes Evaluation of the canonical pathways represented by the differentially expressed genes also showed that pathways involved in signaling and activation of T cells were enriched by both transient and long-term genes (Fig. 5).
Fig. 5

Pathway representing T cell signaling. Genes in red were upregulated in the parous breast. Genes with a blue border had a long-term effect, while genes with a yellow border were transiently upregulated. Pathway was modified from IPA software (QIAGEN) using Path Designer

Pathway representing T cell signaling. Genes in red were upregulated in the parous breast. Genes with a blue border had a long-term effect, while genes with a yellow border were transiently upregulated. Pathway was modified from IPA software (QIAGEN) using Path Designer The 48 genes (64 probesets) downregulated in the P breast (Table 4) fell into two patterns. Twenty-two genes, including WIF1, EDN1, CXCL1, and FOXQ1, were downregulated in the P breast and remained with lower expression levels compared to NP breast irrespective of the number of years since last FTP. The remaining 26 genes were downregulated in women with TSLP < 5 years; however, the expression increased reaching similar or higher levels than those in NP women when TSLP increased. These genes included DLG5, KDM4B, and TOX3. We did not conduct a GO enrichment analysis for these genes because of their limited number.
Table 4

Genes downregulated within parous breast

EntrezIDProbeIDSymbolGeneName
Genes downregulated constantly
 653205431_s_atBMP5Bone morphogenetic protein 5
 6941559975_atBTG1B cell translocation gene 1, anti-proliferative
 285382242447_atC3orf70Chromosome 3 open reading frame 70
 1277202310_s_atCOL1A1Collagen, type I, alpha 1
 131873242641_atCOL6A6Collagen, type VI, alpha 6
 2919204470_atCXCL1Chemokine (C-X-C motif) ligand 1 (melanoma growth-stimulating activity, alpha)
 55184219951_s_atDZANK1Double zinc ribbon and ankyrin repeat domains 1
 1906222802_atEDN1Endothelin 1
 133121229916_atENPP6Ectonucleotide pyrophosphatase/phosphodiesterase 6
 94234227475_atFOXQ1Forkhead box Q1
 2743205280_atGLRBGlycine receptor, beta
 253012242601_atHEPACAM2HEPACAM family member 2
 3736230849_atKCNA1Potassium voltage-gated channel, shaker-related subfamily, member 1 (episodic ataxia with myokymia)
 79442219949_atLRRC2Leucine-rich repeat containing 2
 9848205442_atMFAP3LMicrofibrillar-associated protein 3-like
 4300204918_s_atMLLT3Myeloid/lymphoid or mixed-lineage leukemia (trithorax homolog, Drosophila); translocated to, 3
 33401554010_atNDST1N-Deacetylase/N-sulfotransferase (heparan glucosaminyl) 1
 11069205651_x_atRAPGEF4Rap guanine nucleotide exchange factor (GEF) 4
 950201647_s_atSCARB2Scavenger receptor class B, member 2
 1317236217_atSLC31A1Solute carrier family 31 (copper transporters), member 1
 116441228489_atTM4SF18Transmembrane 4 L six family member 18
 11197204712_atWIF1WNT inhibitory factor 1
Genes downregulated transiently
 170692230040_atADAMTS18ADAM metallopeptidase with thrombospondin type 1 motif, 18
 267202203_s_atAMFRAutocrine motility factor receptor, E3 ubiquitin protein ligase
 56892218541_s_atC8orf4Chromosome 8 open reading frame 4
 401546229964_atC9orf152Chromosome 9 open reading frame 152
 760209301_atCA2Carbonic anhydrase II
 54102227742_atCLIC6Chloride intracellular channel 6
 1281211161_s_atCOL3A1Collagen, type III, alpha 1
 9231229689_s_atDLG5Discs, large homolog 5 (Drosophila)
 54898213712_atELOVL2ELOVL fatty acid elongase 2
 2069205767_atEREGEpiregulin
 2330205776_atFMO5Flavin containing monooxygenase 5
 2717214430_atGLAGalactosidase, alpha
 26585218469_atGREM1Gremlin 1, DAN family BMP antagonist
 2892230144_atGRIA3Glutamate receptor, ionotropic, AMPA 3
 23704222379_atKCNE4Potassium voltage-gated channel, Isk-related family, member 4
 23030212492_s_atKDM4BLysine (K)-specific demethylase 4B
 5366204286_s_atPMAIP1Phorbol-12-myristate-13-acetate-induced protein 1
 11098202458_atPRSS23Protease, serine, 23
 5744211756_atPTHLHParathyroid hormone-like hormone
 157869214725_atSBSPONSomatomedin B and thrombospondin, type 1 domain containing
 1811206143_atSLC26A3Solute carrier family 26, member 3
 51012229835_s_atSLMO2Slowmo homolog 2 (Drosophila)
 6431206108_s_atSRSF6Serine/arginine-rich splicing factor 6
 27324216623_x_atTOX3TOX high mobility group box family member 3
 7366207392_x_atUGT2B15UDP glucuronosyltransferase 2 family, polypeptide B15
 1511261555801_s_atZNF385BZinc finger protein 385B
Genes downregulated within parous breast Finally, we examined genes reported to be related to chromatin remodeling. Among those upregulated, we observed APOBEC3G, TOX, UHRF1, and NAP1L2, while KDM4B and TOX3 were downregulated.

Validation of microarray results

P/NP differences in gene expression were confirmed for eight out of the 11 genes analyzed by real-time PCR in a subset of 37 samples (20 P and 17 NP); Ct values, ∆Cts, and −∆∆Cts are shown in Additional file 5. WIF1 was downregulated, while EAF2, BHLHE22, APOBEC3G, DSC3, KRT5, EGR3, and RASGRP1 were upregulated in the P breast (Table 5). The intraclass correlation coefficient (ICC) comparing the real-time to microarray measurements varied from 0.35 (EAF2) to 0.92 (WIF1), and Spearman correlation coefficients varied from 0.41 (EAF2) to 0.94 (WIF1) (Table 5).
Table 5

Correlation of real-time RT-PCR and microarray results

GenesAffymetrix probesetTaqMan Assay IDMicroarrayReal timeICCSpearman correlation
FCPFCP
WIF1204712_atHs00183662_m10.390.0550.320.0340.920.94
FOXQ1227475_atHs00536425_s10.640.0200.700.2180.640.69
FZD8227405_s_atHs00259040_s11.30.0031.170.1880.430.42
SDC1201286_atHs00896423_m11.480.0141.190.4560.730.62
EAF2219551_atHs00218407_m11.360.0081.240.0590.350.41
BHLHE22228636_atHs01084964_s11.240.0781.350.0220.410.53
APOBEC3G204205_atHs00222415_m11.440.0001.440.0010.390.48
DSC3206033_s_atHs00170032_m11.700.0021.450.0630.640.51
KRT5201820_atHs00361185_m11.510.0121.540.0620.640.71
EGR3206115_atHs00231780_m11.710.0261.570.0720.760.70
RASGRP1205590_atHs00996727_m11.610.0021.710.0020.660.74

Correlation of real-time RT-PCR and microarray results were calculated using the 20 parous vs. 17 nulliparous samples, for whom enough RNA material was still available. FC mean fold change, P batch-adjusted p value, ICC intraclass correlation coefficient

Correlation of real-time RT-PCR and microarray results Correlation of real-time RT-PCR and microarray results were calculated using the 20 parous vs. 17 nulliparous samples, for whom enough RNA material was still available. FC mean fold change, P batch-adjusted p value, ICC intraclass correlation coefficient We also analyzed the 11 selected genes in 5796 patients of breast cancer available in cBioPortal. One or more of these genes were altered in 770 (13%) breast cancer patients. BHLHE22 was the gene that appears altered in the largest percentage of patients (6.9%); the alterations included amplification, deep deletion, and missense mutations. The other ten genes showed to be altered in 1.5% or less patients (Table 6). An overall Kaplan-Meier survival curve has been generated comparing the group of patients with these 11 genes altered versus patients without the alterations. The group of patients that contain alteration in these genes has shorter overall survival, median of 143.13 versus 168.3 months, logrank test p value = 9.09e−5 (Fig. 6).
Table 6

Gene alteration in 5796 breast cancer subjects

GenesNumber of samples alteredPercent of samples altered
BHLHE224006.9
EGR3871.5
WIF1731.3
FOXQ1731.3
FZD8641.1
DSC3571.0
KRT5551.0
SDC1350.6
EAF2300.5
RASGRP1300.5
APOBEC3G280.5
Fig. 6

Analyses of the parous differentially expressed genes in breast cancer patients. Genes that were selected for real-time RT-PCR validation were also used for evaluation into breast cancer patients using the cBioPortal tool (http://www.cbioportal.org/), overall Kaplan-Meier survival curve comparing between the cases with and without alterations in these genes. Patients that contained alterations had a median of 143.13 months overall survival while patients without alteration in these genes had median of 168.3 (logrank test p value = 9.09e−5)

Gene alteration in 5796 breast cancer subjects Analyses of the parous differentially expressed genes in breast cancer patients. Genes that were selected for real-time RT-PCR validation were also used for evaluation into breast cancer patients using the cBioPortal tool (http://www.cbioportal.org/), overall Kaplan-Meier survival curve comparing between the cases with and without alterations in these genes. Patients that contained alterations had a median of 143.13 months overall survival while patients without alteration in these genes had median of 168.3 (logrank test p value = 9.09e−5) We performed IHC analyses to evaluate whether the changes in gene expression reflected into the protein expression levels. We have evaluated two proteins related to differentiation Desmocollin 3 and Cytokeratin 5. These two markers were presented in all tested samples, and although we have observed slight upregulation of their genes by microarray and RT-PCR, we did not detect protein differences between P and NP breast tissues. Related to immune response, we evaluated markers for dendritic cells because of their role in antigen presentation for T cell activation. Using two markers CD123 and LAMP3, we detected few positive cells with no differences between P and NP samples. We also evaluated CD2 and CD3D, markers of T cell activation (Fig. 7). The percentage of positive cells for CD2 was low in both groups, we observed slight increase in the percentage of CD2-positive cells in parous women (1.6 times; P median = 0.59%, NP median = 0.31%); however, it was not statistically significant (p value = 0.156). The overall percentage of CD3D-positive cells was higher ranging from 0.6 to 9.4%. We confirmed a statistically significant twofold increase in the percentage of CD3D-positive cells in the P breast (P median = 3.28%, NP median = 1.62%, p value = 0.006) (Fig. 7). We also compared whether the percentage of positive cells between P women with TSLP ≤ 5 and > 5 differed. For both CD2 and CD3D, we see a slight larger number of positive cells in TSLP ≤ 5, but were not statistically significant (CD2 p value = 0.16 and CD3D p value = 0.09).
Fig. 7

Immunohistochemical evaluation of CD2 and CD3D for T cell activation. On the top are results regarding CD2, and on the bottom are results of CD3D. Panels on the left show the imunohistochemical reactions (× 400), each panel contains two nulliparous samples (left) and two parous samples (right). On the right, boxplots show  the percentage of positive cells. We observe an increase in the percentage of CD3D-positive cells in the parous group (p value = 0.006)

Immunohistochemical evaluation of CD2 and CD3D for T cell activation. On the top are results regarding CD2, and on the bottom are results of CD3D. Panels on the left show the imunohistochemical reactions (× 400), each panel contains two nulliparous samples (left) and two parous samples (right). On the right, boxplots show  the percentage of positive cells. We observe an increase in the percentage of CD3D-positive cells in the parous group (p value = 0.006)

Laser capture microdissection (LCM) results

We found that a linear function of gene expression levels in RNA from epithelial tissue and RNA from stromal tissue predicted extremely well the level of expression in RNA extracted from the whole tissue (0.91 < R2 < 0.95), i.e., the gene expression in whole tissue was a weighted average of the gene expression in epithelial tissue and in stromal tissue for all nine individuals included in this substudy (Table 7). This was observed regardless of the proportion of each tissue type found on the paraffin sections, which varied across individuals (regression coefficient for epithelial tissue ranged from 0.09 to 0.68).
Table 7

Linear regression coefficients of gene expression levels in breast tissue

Subjectβ coefficients R 2
Epithelial tissueStromal tissue
10.0910.9100.943
20.4610.5030.945
30.4950.4250.905
40.5160.4060.922
50.2870.6240.929
60.3540.6220.953
70.6810.3080.920
80.3420.6540.939
90.1220.8770.926
Linear regression coefficients of gene expression levels in breast tissue Analysis of gene expression differences between epithelium and stroma showed 730 genes (956 probesets) with higher expression levels in the epithelium, while 663 genes (1020 probesets) were more expressed in the stroma (Additional file 6). GO analyses of these genes demonstrated a broader range of biological processes enriched by the genes with higher expression levels in the stroma (306 GO terms), which included cell motility, cell signaling, angiogenesis, development, and lipid process among other processes. Conversely, the genes with higher expression in the epithelium, enriched 12 GO terms, consisting a biological process involved in epithelial development and tight junction among other GOs (Additional file 7).

Discussion

This study evaluated gene expression differences in parous and non-parous breast using biopsies from healthy premenopausal volunteers. Using a discovery/validation approach, we found 43 differentially expressed genes (Table 2). Evaluation of expression differences between NP and P as a function of TSLP identified 238 genes up- and 48 genes downregulated in the P breast. The downregulated genes fell into two patterns of expression (transient and long-term), while the upregulated genes fell into three patterns (transient, long-term changing, and long-term constant) (Fig. 3). Through GO enrichment analyses, we found that genes whose expression was transiently increased after pregnancy were mainly related to immune response. Long-term changing genes included immune- and development-related genes, while genes categorized as long-term constant were mainly involved in cell differentiation and developmental processes (Fig. 4). LCM performed in a small set of samples indicated that the gene expression observed on whole-tissue arrays corresponded to the weighted average of the gene expression observed in the epithelial and stromal tissues. Among the 43 differentially expressed genes (Table 2) found in our discovery/validation analysis were DSC3 and KRT5, whose differential expression was confirmed by RT-PCR. These genes were also found in our previous study conducted in postmenopausal women [6, 7], indicating that the expression of these genes is durably modified by pregnancy. Consistent with this observation, DSC3 and KRT5 fell into the “long-term constant” category in our analysis by TSLP. These two genes are involved in cell communication and epithelial differentiation [21, 22]. Additionally, DSC3 has been suggested to act as a tumor suppressor [23-26], and its silencing is a common event in breast tumors [26]. While a discovery/validation approach is valuable to reduce the chance of false-positive results, our sample size was fairly small, which can lead to unstable results and lack of detection of some associations [27, 28]. This was a concern particularly because the breast undergoes major remodeling after a pregnancy/breastfeeding. Thus, genes may go through successive changes in pattern of expression after pregnancy, and analyses ignoring time since last pregnancy could miss transient expression modifications. We therefore also examined gene expression differences according to time since last pregnancy. To the best our knowledge, this is the first study that used a whole-transcriptome approach to demonstrate a cluster of biological functions following distinct expression patterns in the human breast following pregnancy. The observation that the “long-term constant” genes were mainly involved in cell differentiation and developmental processes (Fig. 4) is consistent with the transcriptomic profile we previously described in the parous postmenopausal breast, in which upregulated genes showed enrichment of similar processes [6-8]. These findings indicate that the parity signature set after pregnancy remains until the postmenopausal years. Other development-related genes upregulated in the P breast and confirmed by RT-PCR were RASGRP1 (RAS guanyl-releasing protein 1calcium and DAG-regulated), EGR3 (early growth response 3), and BHLHE22 (basic helix-loop-helix family member e22). These genes, in addition to differentiation, are known to regulate proliferation and cell growth [29-32]. Among the genes classified as developmental, we also found components of the WNT pathway. WNT canonical and non-canonical pathways participate in cell fate determination, cell polarity, adhesion, and motility [33, 34], all important functions in the differentiation of the breast. Differentiation induced by parity has been demonstrated to alter WNT/Notch signaling in mice [35], and we have described alterations in the methylation profile of genes belonging to this pathway and its regulation in the postmenopausal breast [36]. In the current study, we observed two genes of this pathway upregulated in the P breast, FZD8 (frizzled family receptor 8) which is a transmembrane receptor transducing WNT signals, and EAF2 (ELL-associated factor 2), which is important for early embryonic development and critical for adult tissue homeostasis and prevention against tumor initiation [37, 38]. WNT inhibitory factor 1, WIF1, was downregulated in P women, and although methylation of WIF1 has been observed in several tumors [39], including breast cancer [40], this can be an indication that the WNT pathway has an important role in the shifting of the stem cells to a more differentiated status in the P breast, as demonstrated earlier [8, 41]. Yet related to WNT pathway, we found FOXQ1 or forkhead box Q1, constantly downregulated in the P breast. This gene is a direct target of the canonical WNT pathway and its overexpression has been associated with different tumors and cancer cell lines [42]. Suppression of FOXQ1 inhibits cell proliferation, motility/invasion, and epithelial-mesenchymal transition phenotypes in cancer cells [43-45]; a similar effect could be predicted in the breast of P women. Also consistent with our previous study in postmenopausal women, we observed several genes with roles in chromatin remodeling [8]. In the current study, there were four long-term upregulated genes involved in chromatin remodeling: APOBEC3G, TOX, UHRF1, and NAP1L2. These genes interact with chromatin, either by binding with histones or recruiting histone modifiers, influencing cell proliferation and differentiation among other biological processes [46-52]. In addition, APOBEC3G (apolipoprotein B mRNA-editing enzyme, catalytic polypeptide-like 3G) is involved in RNA editing [53], and deletion in APOBEC3 gene has been correlated to breast cancer risk [54]. The upregulation of these genes in the breast of P women years after delivery indicates chromatin remodeling is enabling a permanent differentiation of the breast epithelial cells. We also evaluated whether selected genes from this study are modified in breast cancer cases. Of the genes evaluated, BHLHE22 was the most commonly modified in breast cancer (6.9% of the cases). The other genes were altered in a small percentage of breast cancer cases (Table 6). Of interest is that patients who have some alteration (amplification, deep deletion, or missense mutation) in the tested genes have a lower overall survival (Fig. 6). Although this analysis can indicate that these genes are associated with breast cancer, our interpretation is limited because the parity status of these breast cancer cases is not available. In addition to genes involved in development and chromatin remodeling, we observed a large number of genes known to participate in immune response. The immune system has several roles in the mammary gland, being important not only for protection against pathogens, but also it is secreted into the milk and participates at the different stages of the gland development, including involution [55-58]. Of great interest, most of these immune-related genes were only upregulated in the biopsies collected within 5 years since pregnancy. This observation is consistent with our previous work in postmenopausal women, in which we did not observe enrichment of immune response in an older population [6, 7]. It is also in agreement with previous studies on molecular profiling of pregnancy performed in younger populations that reported changes in immune-response genes [59, 60]. Previous studies have showed differences in expression patterns of immune-related genes at distinct mammary developmental stages, before and/or after pregnancy in humans [59] and rodents [61, 62]. Rodent studies demonstrated that in the first days of mammary gland involution there is activation of genes related to acute-phase response and inflammation [61, 62], followed by activation of monocyte and lymphoid chemokines and immunoglobulin genes [61]. The inflammatory-like environment observed during the breast involution has been proposed as one of the mechanisms that could explain the transient increase in breast cancer risk observed after each pregnancy [57, 63]. In this study, we observed that most of the genes that underwent transient changes in expression enriched biological processes related to activation and development of lymphocytes, mainly T cells (Additional file 2). Only one GO term related to inflammation (GO:0006925: inflammatory cell apoptotic process) was enriched among the transient genes (Additional file 2). This may be due to the fact that biopsies were collected at least 1 year after FTP and/or breastfeeding; thus, we may have missed an early inflammation stage. Both human and murine postpartum mammary glands have an organized influx of immune cells; however, these cells are not observed after 1 year postpartum in human, or 12 weeks in murine [57]. We have performed immunohistochemistry reactions for dendritic cells and T cell activation markers on a subset of our samples. Using antibodies anti-LAMP3 and anti-CD123 for dendritic cells, antigen presenters, we detected few positive cells and no differences between P and NP samples (data not shown). When using anti-CD2 and anti-CD3D, markers of T cell activation, we observed an increase in cells positive for CD3D in P breast. This data suggest that there are differences in the number of activated T cells between P and NP breast (Fig. 7). Normal pregnancy is characterized by an early expansion of regulatory T cells [64], which modulate immune tolerance during pregnancy [64]. In addition, microchimeric cells of fetal origin that persist in the maternal circulation after delivery are postulated to provide immune surveillance with the contribution of T cells, protecting against breast cancer [65-67]. Evidence shows that microchimeric cells are more frequent in healthy women than in breast cancer-affected women, and breast cancers without circulating microchimeric cells are more aggressive [65-70]. In this work, we found several upregulated genes related to activation of T cells after FTP (Additional files 2, 3, and 4). This suggests that the activation of T cells in the breast tissue of P women could trigger an early response against transformed cells, destroying them and protecting the mammary gland from neoplastic transformation. Tumor surveillance by the immune system and its impact on disease outcomes in cancer patients, and in breast cancer patients in particular, have been documented [71-75]. Finak et al. studied the stromal gene expression and clinical outcomes in breast cancer and observed that the gene set expressed predominantly in the good-outcome cluster was enriched by elements of the T helper type 1 [71]. Eight (CD2, CD8A, XCL1, CD3D, GZMA, CD247, CD48, CD52) of the genes present in this cluster [71] were also upregulated in the parous women. Poor-outcome cluster [71] showed downregulation of CXCL14, which was upregulated in the parous breast, and upregulation of CXCL1 and EDN1, which were downregulated in the parous breast. This overlap between Finak study and ours indicates that the activation of the T cell response is a beneficial mechanism against transformed cells. Winslow et al. described gene sets with better prognostics for triple-negative breast cancers [72]. These gene sets involved cytotoxic immune response, including the genes mentioned above, and HLA encoding genes [72]. HLA-DRA (major histocompatibility complex, class II, DR alpha) and HLA-DPA1 (major histocompatibility complex, class II, DP alpha 1) were constantly activated in our study. The upregulation of HLA-DPA1 has been associated with a benign behavior of certain neurological tumors and related to the immune defense-associated genes [76]. Therefore, a similar role could be attributed to the upregulation of these two genes in the cancer preventative effect of pregnancy in the premenopausal women. Using tissue from nine women whom arrays could be conducted in epithelial tissue, stromal tissue, and whole tissue, we observed that a linear regression of gene expression in epithelial and stromal tissues predicted gene expression in the whole tissue extremely well (R2 ≥ 0.90 for all subjects). We also observed that the proportion of each tissue in the whole-tissue biopsy sample (as estimated by the linear regression coefficient) varied substantially across women (range for epithelial tissue, 0.09–0.68) (Table 7). This suggests that the P/NP gene expression differences we observed either are present in each of the two tissues (epithelial and stromal) or are present in only one of the two tissues but are of such magnitude that they are observed in whole tissue biopsies, despite the fact that some of the individual biopsies may contain only a small proportion of this tissue type. When comparing gene expression in stroma versus epithelia, we observed a large number of genes differentially expressed among these two tissues. The epithelia, being a more differentiated tissue, enriched fewer GO terms, and these were mainly involved in epithelial development and tight junction (Additional file 7). The stroma showed enrichment of a broader range of functions, lipid homeostasis, lipid storage, metabolic process, vascularization, and migration to cite a few (Additional file 7). This extensive list of biological process is expected because the stroma is constituted of different types of cells (e.g., adipocytes, fibroblasts, endothelial cells). Among the genes found in the stroma, we did not observe enrichment of many immune-related GO terms, which is consistent with the fact we did not observe a large number of immune cells in the histopathology of these samples. Furthermore, parity status was not included in the comparison of the epithelia vs. stroma. Among the samples used in the LCM analyses (5 NP and 5 P), only one had the FTP less than 5 years before the biopsy, the group in which we found significant enrichment of immune response genes in the P/NP main study. The limited number of successful LCM microarrays did not allow us to perform analyses to understand whether parity induced more changes in expression of genes present in the stromal or epithelial cells of the breast. This study has several strengths. A major strength is that all women were volunteers from the general population who were free of any breast abnormality. Also, all biopsies were histologically examined to confirm normality of tissue and presence of breast epithelium and stroma. Histological evaluations and gene expression experiments were performed on unidentifiable samples to prevent bias. Our study also had limitations. Because our main analysis was based on whole breast tissue microarrays, we were not able to characterize parity-associated differences in gene expression separately for epithelial and stromal tissues. While this would be of interest, it would require large biopsies in order to obtain a sufficient amount of each tissue type, which is difficult to justify in studies of healthy volunteers. The variability in composition of breast tissue among women adds to the challenge of understanding the mechanisms responsible for the preventive effect of parity and comparison of different studies. Because of the limited sample size, we could not conduct our analysis by time since last pregnancy partitioning our data in discovery/validation. Lastly, these results were generated based on a relatively homogenous population; therefore, confirmation of these results including a more ethnically heterogeneous population is needed.

Conclusions

Altogether, parous premenopausal breast shows a specific transcriptome profile, in which genes controlling chromatin remodeling and cell differentiation are activated after FTP and stay upregulated for many years, supporting the data from postmenopausal parous women previously published [6-8]. Of great novelty is that this work shows that the genes involved in immune response were in majority related to T cell activation and these were activated soon after the FTP; however, their expression return to levels similar to those observed in the nulliparous breast five or more years after FTP. These transcripts may work in protecting the mammary gland against neoplastically transformed cells through T cells. However, because this immune surveillance appears transitory, we infer that cell differentiation, activated by the genes whose expression was permanently affected by parity, may be the main molecular mechanism responsible for the preventive effect of parity against breast cancer. Complete list of differentially expressed probesets (416 probes). (XLSX 101 kb) Complete list of GO terms enriched by the transient genes. (XLSX 23 kb) Complete list of GO terms enriched by the long-term changing genes. (XLSX 16 kb) Complete list of GO terms enriched by the long-term constant genes. (XLSX 20 kb) Real-time RT-PCR raw data. (XLSX 17 kb) Genes differentially expressed between epithelial and stromal tissues. (XLSX 189 kb) Complete list of GO terms enriched by the genes differentially expressed in epithelial and stromal tissues. (XLSX 61 kb)
  73 in total

1.  Transient increase in breast cancer risk after giving birth: postpartum period with the highest risk (Sweden).

Authors:  Qin Liu; Joanne Wuu; Mats Lambe; Shu-Feng Hsieh; Anders Ekbom; Chung-Cheng Hsieh
Journal:  Cancer Causes Control       Date:  2002-05       Impact factor: 2.506

2.  The sva package for removing batch effects and other unwanted variation in high-throughput experiments.

Authors:  Jeffrey T Leek; W Evan Johnson; Hilary S Parker; Andrew E Jaffe; John D Storey
Journal:  Bioinformatics       Date:  2012-01-17       Impact factor: 6.937

3.  Adjusting batch effects in microarray expression data using empirical Bayes methods.

Authors:  W Evan Johnson; Cheng Li; Ariel Rabinovic
Journal:  Biostatistics       Date:  2006-04-21       Impact factor: 5.899

4.  Fetal microchimerism in women with breast cancer.

Authors:  Vijayakrishna K Gadi; J Lee Nelson
Journal:  Cancer Res       Date:  2007-10-01       Impact factor: 12.701

5.  Distinct desmocollin isoforms occur in the same desmosomes and show reciprocally graded distributions in bovine nasal epidermis.

Authors:  A J North; M A Chidgey; J P Clarke; W G Bardsley; D R Garrod
Journal:  Proc Natl Acad Sci U S A       Date:  1996-07-23       Impact factor: 11.205

6.  Influence of terminal differentiation and PACAP on the cytokine, chemokine, and growth factor secretion of mammary epithelial cells.

Authors:  Katalin Csanaky; Wolfgang Doppler; Andrea Tamas; Krisztina Kovacs; Gabor Toth; Dora Reglodi
Journal:  J Mol Neurosci       Date:  2013-12-10       Impact factor: 3.444

7.  Tumor suppressor U19/EAF2 regulates thrombospondin-1 expression via p53.

Authors:  F Su; L E Pascal; W Xiao; Z Wang
Journal:  Oncogene       Date:  2009-10-12       Impact factor: 9.867

8.  The role of forkhead box Q1 transcription factor in ovarian epithelial carcinomas.

Authors:  Min Gao; Ie-Ming Shih; Tian-Li Wang
Journal:  Int J Mol Sci       Date:  2012-10-26       Impact factor: 5.923

9.  FOXQ1, a novel target of the Wnt pathway and a new marker for activation of Wnt signaling in solid tumors.

Authors:  Jon Christensen; Susanne Bentz; Thierry Sengstag; V Prasad Shastri; Pascale Anderle
Journal:  PLoS One       Date:  2013-03-26       Impact factor: 3.240

10.  Parity-related molecular signatures and breast cancer subtypes by estrogen receptor status.

Authors:  Melissa Rotunno; Xuezheng Sun; Jonine Figueroa; Mark E Sherman; Montserrat Garcia-Closas; Paul Meltzer; Tyisha Williams; Sallie Smith Schneider; D Joseph Jerry; Xiaohong R Yang; Melissa A Troester
Journal:  Breast Cancer Res       Date:  2014-07-08       Impact factor: 6.466

View more
  12 in total

Review 1.  Postpartum Involution and Cancer: An Opportunity for Targeted Breast Cancer Prevention and Treatments?

Authors:  Virginia F Borges; Traci R Lyons; Doris Germain; Pepper Schedin
Journal:  Cancer Res       Date:  2020-02-19       Impact factor: 12.701

2.  Serum hormone levels and normal breast histology among premenopausal women.

Authors:  Mark E Sherman; Thomas de Bel; Michael G Heckman; Launia J White; Joshua Ogony; Melody Stallings-Mann; Tracy Hilton; Amy C Degnim; Robert A Vierkant; Tanya Hoskin; Matthew R Jensen; Laura Pacheco-Spann; Jill E Henry; Anna Maria Storniolo; Jodi M Carter; Stacey J Winham; Derek C Radisky; Jeroen van der Laak
Journal:  Breast Cancer Res Treat       Date:  2022-05-03       Impact factor: 4.624

3.  Towards defining morphologic parameters of normal parous and nulliparous breast tissues by artificial intelligence.

Authors:  Joshua Ogony; Thomas de Bel; Derek C Radisky; Jeroen van der Laak; Mark E Sherman; Jennifer Kachergus; E Aubrey Thompson; Amy C Degnim; Kathryn J Ruddy; Tracy Hilton; Melody Stallings-Mann; Celine Vachon; Tanya L Hoskin; Michael G Heckman; Robert A Vierkant; Launia J White; Raymond M Moore; Jodi Carter; Matthew Jensen; Laura Pacheco-Spann; Jill E Henry; Anna Maria Storniolo; Stacey J Winham
Journal:  Breast Cancer Res       Date:  2022-07-11       Impact factor: 8.408

Review 4.  Key steps for effective breast cancer prevention.

Authors:  Kara L Britt; Jack Cuzick; Kelly-Anne Phillips
Journal:  Nat Rev Cancer       Date:  2020-06-11       Impact factor: 60.716

5.  The role of gene to gene interaction in the breast's genomic signature of pregnancy.

Authors:  Pedro J Gutiérrez-Díez; Javier Gomez-Pilar; Roberto Hornero; Julia Martínez-Rodríguez; Miguel A López-Marcos; Jose Russo
Journal:  Sci Rep       Date:  2021-01-29       Impact factor: 4.379

Review 6.  Postpartum breast cancer: mechanisms underlying its worse prognosis, treatment implications, and fertility preservation.

Authors:  Hanne Lefrère; Liesbeth Lenaerts; Virginia F Borges; Pepper Schedin; Patrick Neven; Frédéric Amant
Journal:  Int J Gynecol Cancer       Date:  2021-03       Impact factor: 3.437

7.  Enhancing Fatty Acid Catabolism of Macrophages Within Aberrant Breast Cancer Tumor Microenvironment Can Re-establish Antitumor Function.

Authors:  Yucui Gu; Xingjian Niu; Lei Yin; Yiran Wang; Yue Yang; Xudong Yang; Qingyuan Zhang; Hongfei Ji
Journal:  Front Cell Dev Biol       Date:  2021-04-15

8.  Sericin inhibits MDA‑MB‑468 cell proliferation via the PI3K/Akt pathway in triple‑negative breast cancer.

Authors:  Lin Niu; Songhe Yang; Xueying Zhao; Xiaochao Liu; Lina Si; Meng Wei; Lei Liu; Luyang Cheng; Yuebing Qiao; Zhihong Chen
Journal:  Mol Med Rep       Date:  2020-12-14       Impact factor: 2.952

9.  Menstrual and reproductive characteristics and breast cancer risk by hormone receptor status and ethnicity: The Breast Cancer Etiology in Minorities study.

Authors:  Esther M John; Amanda I Phipps; Lisa M Hines; Jocelyn Koo; Sue A Ingles; Kathy B Baumgartner; Martha L Slattery; Anna H Wu
Journal:  Int J Cancer       Date:  2020-02-29       Impact factor: 7.396

10.  Exposure to Propylparaben During Pregnancy and Lactation Induces Long-Term Alterations to the Mammary Gland in Mice.

Authors:  Joshua P Mogus; Charlotte D LaPlante; Ruby Bansal; Klara Matouskova; Benjamin R Schneider; Elizabeth Daniele; Shannon J Silva; Mary J Hagen; Karen A Dunphy; D Joseph Jerry; Sallie S Schneider; Laura N Vandenberg
Journal:  Endocrinology       Date:  2021-06-01       Impact factor: 4.736

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.