| Literature DB >> 35139887 |
Natascia Marino1,2, Rana German3, Ram Podicheti4, Douglas B Rusch4, Pam Rockey3, Jie Huang4, George E Sandusky5, Constance J Temm5, Sandra Althouse6, Kenneth P Nephew7, Harikrishna Nakshatri8, Jun Liu4, Ashley Vode3, Sha Cao6, Anna Maria V Storniolo3,9.
Abstract
BACKGROUND: Genome-wide association studies have identified several breast cancer susceptibility loci. However, biomarkers for risk assessment are still missing. Here, we investigated cancer-related molecular changes detected in tissues from women at high risk for breast cancer prior to disease manifestation. Disease-free breast tissue cores donated by healthy women (N = 146, median age = 39 years) were processed for both methylome (MethylCap) and transcriptome (Illumina's HiSeq4000) sequencing. Analysis of tissue microarray and primary breast epithelial cells was used to confirm gene expression dysregulation.Entities:
Keywords: Cancer risk; DNA methylation; Normal breast; Transcriptome
Mesh:
Substances:
Year: 2022 PMID: 35139887 PMCID: PMC8830042 DOI: 10.1186/s13148-022-01239-1
Source DB: PubMed Journal: Clin Epigenetics ISSN: 1868-7075 Impact factor: 6.551
Fig. 1Transcriptome profiling of breast tissues from women at either high- or average risk of breast cancer. A Schematics of the study design. Cancer-free breast tissue cores (N = 146) were divided in either high-risk or average-risk group according to the Tyrer-Cuzick breast cancer risk evaluation score (20% used as threashold). The tissues were processed and analyzed for whole transcriptome and methylome profiling and differentially expressed genes (DEG) and differentially methylated sites between high- and average-risk samples were identified. Thirty five women (10 high risk and 25 average risk) donated also a second biopsy (D2) allowing to detect age-dependent aberrations. B Pathway analysis of the transcripts differentially expressed (FDR < 0.05) between average and high- risk breasts. C Major molecular network of the differentially expressed transcripts between the two experimental groups. Genes upregulated in high-risk breasts are in red, while downregulated genes are in green. D FAM83A and NEK2 transcription level in breast tissues from women at either average- or high-risk of developing breast cancer. E Upper panel: Representative image of the immunofluorescence staining of primary breast epithelial cells with the epithelial marker, E-Cadherin (red), mesenchymal marker, Vimentin (green) as control, and nuclear staining, DAPI (blue). E-Cadherin and Vimentin staining of primary cells revealed that isolated primary cells are epithelial in nature. Lower panel: FAM83A and NEK2 expression in primary epithelial cells isolated from either average-risk (n = 4) and high-risk breast (n = 3). F FAM83A and NEK2 expression in primary and h-TERT immortalized isogenic breast epithelial cells (n = 7) from the GSE108541 dataset. G) Representative images of immunohistochemical staining for FAM83A and NEK2 are shown at 20X magnification. Staining quantification is expressed as positivity and H-score. Data are shown as mean ± standard error. #FDR < 0.005, *p < 0.05,**p < 0.001, ***p < 0.0001. Pvalue is calculated using either unpaired nonparametric Mann–Whitney test or nonparametric Wilcoxon test
Gene expression differences in high- versus average-risk breasts (FC > 2; FDR < 0.05)
| Gene name | Description | log2fca | FDR | % genetic alterationsb | Tumor/ Normal expression ( | Copy number variation (CNV)d | Oncoscore | |
|---|---|---|---|---|---|---|---|---|
| CNV = 2 (%), | CNV = − 2 (%), | |||||||
| MEPE | Matrix extracellular phosphoglycoprotein | 2.28 | 2E-02 | 0.7 | 0.02 (n.s.) | 12 (0.6), | 2 (0.1), < 0.001 | 15.6 |
| OPRPN | Opiorphin prepropeptide | 2.10 | 3E-03 | 1.3 | N.A | 31 (1.4), | 0 (0) | N.A |
| CXCL13 | C-X-C motif chemokine ligand 13 | 2.07 | 4E-03 | 1.3 | 6.6 (0,003) | 26 (1.2), | 0 (0) | 33.7 |
| APELA | Apelin receptor early endogenous ligand | 1.87 | 8E-04 | 0.3 | N.A | N.A | 0 (0) | N.A |
| CA6 | Carbonic anhydrase 6 | 1.78 | 6E-04 | 0.8 | 0 (n.s.) | 2 (0.1), | 3 (0.1), < 0.001 | 14.4 |
| DIO2 | Iodothyronine deiodinase 2 | 1.60 | 2E-03 | 0.6 | 1.94 (n.s.) | 13 (0.6), | 0 (0) | 7.7 |
| FEZF2 | FEZ family zinc finger 2 | 1.55 | 7E-03 | 0.7 | 0.04 (< 0.001) | 3 (0.1), | 0 (0) | 16.1 |
| TNNT1 | Troponin T1%2C slow skeletal type | 1.52 | 9E-03 | 2.3 | 51.87 (n.s.) | 36 (1.7), | 0 (0) | 12.3 |
| MMP3 | Matrix metallopeptidase 3 | 1.43 | 2E-02 | 1.8 | 5.66 (< 0.001) | 26 (1.2), | 1 (0.04), < 0.001 | 31.9 |
| SERPINA12 | Serpin family A member 12 | 1.42 | 2E-02 | 0.9 | 1.26 (< 0.001) | 12 (0.6), | 1 (0), < 0.001 | 11.9 |
| C8B | Complement C8 beta chain | 1.42 | 3E-02 | 1.8 | 0.014 (n.s.) | 37 (1.7), | 1 (0), < 0.001 | 7.3 |
| KCNJ13 | Potassium voltage-gated channel subfamily J member 13 | 1.41 | 3E-03 | 0.6 | 0.16 (0.03) | 2 (0.1), | 1 (0), < 0.001 | 9.0 |
| CXCL6 | C-X-C motif chemokine ligand 6 | 1.37 | 5E-03 | 2.2 | 0.10 (0.04) | 43 (2), | 0 (0), | 31.0 |
| SLC12A1 | Solute carrier family 12 member 1 | 1.33 | 1E-02 | 0.9 | 0.48 (< 0.001) | 4 (0.2), | 1 (0), < 0.001 | 5.6 |
| CYP24A1 | Cytochrome P450 family 24 subfamily A member 1 | 1.33 | 3E-02 | 7.0 | 0.22 (n.s.) | 164 (7.5), < 0.001 | 1 (0), n.s | 30.2 |
| ASB5 | Ankyrin repeat and SOCS box containing 5 | 1.29 | 4E-03 | 1.3 | 0.01 (n.s.) | 6 (0.3), n.s | 5 (0.2), < 0.001 | 0.0 |
| NPY2R | Neuropeptide Y receptor Y2 | 1.27 | 3E-02 | 1.0 | 0.003 (< 0.001) | 10 (0.5), n.s | 0 (0) | 7.9 |
| C2CD4A | C2 calcium dependent domain containing 4A | 1.26 | 2E-02 | 0.6 | 0.9 (< 0.001) | 12 (0.6), n.s | 1 (0), < 0.001 | 11.2 |
| GABRR1 | gamma-aminobutyric acid type A receptor rho1 subunit | 1.26 | 3E-02 | 1.1 | 1.03 (0.03) | 13 (0.6), n.s | 5 (0.2), < 0.001 | 8.7 |
| KIAA1210 | KIAA1210 | 1.25 | 7E-03 | 1.6 | 0.43 (n.s.) | 18 (0.8), n.s | 3 (0.1), < 0.001 | 0.0 |
| MMP10 | Matrix metallopeptidase 10 | 1.23 | 2E-02 | 1.6 | 7.07 (< 0.001) | 26 (1.2), n.s | 1 (0), < 0.001 | 38.2 |
| FAM83A | Family with sequence similarity 83 member A | 1.22 | 5E-03 | 16.0 | 1.23 (< 0.001) | 503(23.1), < 0.001 | 0 (0) | 74.5 |
| LPO | Lactoperoxidase | 1.21 | 1E-02 | 7.0 | 0.5 (< 0.001) | 168 (7.7),2E-24 | 1 (0), n.s | 11.5 |
| CRISP2 | Cysteine rich secretory protein 2 | 1.19 | 3E-02 | 1.5 | 0.06 (0.01) | 31 (1.4), 3E-05 | 0 (0) | 8.2 |
| NMU | Neuromedin U | 1.19 | 2E-02 | 0.8 | 3.6 (< 0.001) | 18 (0.8), n.s | 1 (0), < 0.001 | 41.6 |
| MAGEB4 | MAGE family member B4 | 1.18 | 9E-03 | 0.8 | 8.6 (0.004) | 10 (0.5), n.s | 2 (0.1), < 0.001 | 55.9 |
| MAG | Myelin associated glycoprotein | 1.17 | 4E-02 | 2.3 | 5.3 (< 0.001) | 42 (1.9), 0.001 | 0 (0) | 13.2 |
| DAPL1 | Death associated protein like 1 | 1.17 | 5E-03 | 0.7 | 0.09 (n.s.) | 10 (0.5), n.s | 0 (0) | 14.0 |
| PRSS51 | Serine protease 51 | 1.16 | 2E-02 | 1.6 | N.A | 0 (0) | 0 (0) | N.A |
| PBK | PDZ binding kinase | 1.14 | 4E-03 | 3.0 | 15.7 (< 0.001) | 20 (0.9), n.s | 15(0.7), < 0.001 | 28.3 |
| KRT77 | Keratin 77 | 1.13 | 4E-02 | 0.8 | 0.04 (n.s.) | 12 (0.6), n.s | 0 (0) | 0.0 |
| CALML3 | Calmodulin like 3 | 1.12 | 3E-02 | 4.0 | 0.15 (n.s.) | 108 (5), < 0.001 | 0 (0) | 37.7 |
| ACBD7 | Acyl-CoA binding domain containing 7 | 1.12 | 3E-03 | 2.3 | 1.13 (0.002) | 78 (3.6), < 0.001 | 0 (0) | 0.0 |
| UNC5D | Unc-5 netrin receptor D | 1.11 | 2E-02 | 8.0 | 0.001 (n.s.) | 152 (7), n.s | 6 (0.3), < 0.001 | 44.8 |
| ESCO2 | Establishment of sister chromatid cohesion N-acetyltransferase 2 | 1.11 | 2E-03 | 3.0 | 8.02 (< 0.001) | 20 (0.9), n.s | 14(0.6), < 0.001 | 25.1 |
| BARX1 | BARX homeobox 1 | 1.09 | 4E-02 | 5.0 | 1.54 (9E-08) | 9 (0.4), n.s | 1 (0), < 0.001 | 22.3 |
| CTXND1 | Cortexin domain containing 1 | 1.09 | 3E-02 | 0.0 | N.A | 0 (0) | 0 (0) | N.A |
| SYT13 | Synaptotagmin 13 | 1.08 | 4E-03 | 1.3 | 4.6 (< 0.001) | 36 (1.7), < 0.001 | 1 (0), n.s | 38.8 |
| PRAME | Preferentially expressed antigen in melanoma | 1.06 | 2E-02 | 1.2 | 1.8 (< 0.001) | 21 (1), n.s | 1 (0), < 0.001 | 82.6 |
| SLC39A12 | Solute carrier family 39 member 12 | 1.05 | 4E-03 | 2.4 | 0.18 (n.s.) | 72 (3.3), < 0.001 | 1 (0), n.s | 12.0 |
| IGHV2-26 | Immunoglobulin heavy variable2-26 | 1.04 | 4E-02 | 0.1 | N.A | 0 (0) | 0 (0) | N.A |
| APLN | Apelin | 1.04 | 7E-04 | 0.6 | 0.93 (n.s.) | 16 (0.7), n.s | 2 (0.1), < 0.001 | 13.8 |
| IGHV3-30 | Immunoglobulin heavy variable3-30 | 1.04 | 2E-02 | 0.1 | N.A | 0 (0) | 0 (0) | 48.0 |
| LPAR3 | Lysophosphatidic acid receptor 3 | 1.04 | 8E-03 | 0.9 | 0.28 (n.s.) | 13 (0.6), n.s | 0 (0) | 12.9 |
| ECEL1 | Endothelin converting enzyme like1 | 1.03 | 2E-02 | 0.8 | 0.9 (n.s.) | 1 (0), n.s | 1 (0), < 0.001 | N.A |
| DCX | Doublecortin | 1.03 | 6E-03 | 0.5 | 0.1 (0.02) | 13 (0.6), n.s | 2 (0.1), < 0.001 | 8.7 |
| NEK2 | NIMA related kinase 2 | 1.02 | 7E-03 | 12.0 | 25.78 (< 0.001) | 473 (21.8), < 0.001 | 0 (0) | 61.4 |
| CWH43 | Cell wall biogenesis 43 C-terminal homolog | 1.02 | 3E-02 | 1.0 | 0.5 (< 0.001) | 6 (0.3), n.s | 0 (0) | 12.9 |
| PRSS21 | Serine protease 21 | 1.01 | 3E-02 | 5.0 | 0.2 (n.s.) | 154 (7.1),5E-102 | 0 (0) | 46.3 |
| FOXI3 | Forkhead box I3 | 1.01 | 2E-02 | 0.3 | 0.01 (< 0.001) | 10 (0.5), n.s | 0 (0) | 8.5 |
| FCER2 | Fc fragment of IgE receptor II | -0.98 | 1E-03 | 1.3 | 0.07 (0.04) | 11 (0.5), n.s | 2 (0.1), < 0.001 | 17.1 |
| DACH2 | Dachshund family transcription factor 2 | − 1.01 | 1E-02 | 0.8 | 0.3 (n.s.) | 17 (0.8), n.s | 9 (0.4), < 0.001 | 25.3 |
| LILRB5 | Leukocyte immunoglobulin like receptor B5 | − 1.02 | 8E-04 | 2.1 | 0.15 (< 0.001) | 39 (1.8), n.s | 0 (0) | 0.0 |
| SBK3 | SH3 domain binding kinase family member 3 | − 1.03 | 7E-03 | 2.3 | N.A | 48 (2.2), n.s | 0 (0) | 0.0 |
| TRDN | Triadin | − 1.03 | 3E-02 | 2.3 | 0.02 (n.s.) | 41 (1.9), n.s | 1 (0), < 0.001 | 1.0 |
| NXF3 | nuclear RNA export factor 3 | − 1.04 | 3E-03 | 0.6 | 0.9 (n.s.) | 8 (0.4), n.s | 4 (0.2), < 0.001 | 32.2 |
| LILRA6 | leukocyte immunoglobulin like receptor A6 | − 1.05 | 2E-03 | 2.1 | 1 (n.s.) | 39 (1.8), n.s | 1 (0), n.s | 0 |
| SYNDIG1L | synapse differentiation inducing 1 like | − 1.07 | 9E-03 | 0.5 | N.A | 8 (0.4), n.s | 1 (0), < 0.001 | 0 |
| ARPP21 | cAMP regulated phosphoprotein 21 | − 1.13 | 2E-02 | 1.1 | 0.42 (n.s.) | 11 (0.5), n.s | 1 (0), < 0.001 | 24.04 |
| SLC22A12 | solute carrier family 22 member 12 | − 1.13 | 2E-02 | 1.1 | 0.9 (< 0.001) | 20 (0.9), n.s | 0 (0) | 8.9 |
| CCL24 | C–C motif chemokine ligand 24 | − 1.17 | 1E-02 | 0.7 | 0.98 (< 0.001) | 21 (1), n.s | 0 (0) | 16.2 |
| TPSD1 | tryptase delta 1 | − 1.17 | 2E-02 | 5.0 | 0.55 (0.04) | 170 (7.8), < 0.001 | 0 (0) | 0 |
| PROK2 | prokineticin 2 | − 1.19 | 2E-02 | 0.7 | 0.24 (0.01) | 5 (0.2), n.s | 1 (0), 0.001 | 18.8 |
| HBG2 | hemoglobin subunit gamma 2 | − 1.59 | 4E-02 | 1.0 | 0.2 (n.s.) | 19 (0.9), n.s | 0 (0) | 11.3 |
| FGF8 | fibroblast growth factor 8 | − 1.68 | 3E-04 | 0.3 | 0.88 (0.005) | 2 (0.1), n.s | 1 (0), < 0.001 | 14.3 |
| SULT1C2 | sulfotransferase family1C member2 | − 1.74 | 2E-03 | 0.5 | 1.6 (0.02) | 9 (0.4), n.s | 0 (0) | 21.8 |
| MS4A6E | membrane spanning 4-domains A6E | − 2.24 | 4E-02 | 0.9 | N.A | 23 (1.1),n.s | 0 (0) | 0 |
N.A., not available; a, log fold change; b, breast cancer data from cBioportal; c: from UALCAN portal; d: data retrieved from the METABRIC, number of samples with either CNV = 2 or − 2 (%), p value
The 20 most differentially methylated regions between the high- and average-risk breast tissues
| Genomic Locus | Overlapping Gene Feature | Gene Name | Description | ∆Z# | FDR |
|---|---|---|---|---|---|
| Chr8:120,669,501 | Intron | SNTB1 | syntrophin beta 1 | 2.4 | 7E-53 |
| Chr18:6,803,751 | Intron | ARHGAP28 | Rho GTPase activating protein 28 | 2.0 | 3E-35 |
| Chr6:12,944,751 | Intron | PHACTR1 | phosphatase and actin regulator 1 | 1.9 | 1E-31 |
| Chr21:30,743,501 | promoter | KRTAP21-4P | keratin associated protein 21–4 2C pseudogene | 1.9 | 6E-31 |
| Chr2:115,663,751 | Intron | DPP10 | dipeptidyl peptidase like 10 | 1.8 | 2E-30 |
| Chr4:87,111,001 | Intron | AFF1 | AF4/FMR2 family member 1 | 1.8 | 2E-29 |
| Chr3:33,638,251 | Intron | CLASP2 | cytoplasmic linker associated protein 2 | 1.8 | 2E-29 |
| Chr14:106,498,501 | Intron | LINC01881 | long intergenic non-protein coding RNA1881 | 1.8 | 2E-29 |
| Chr6:129,416,001 | Intron | LAMA2 | laminin subunit alpha 2 | 1.8 | 4E-29 |
| Chr14:31,750,251 | Intron | NUBPL | nucleotide binding protein like | 1.8 | 2E-28 |
| Chr8:37,842,001 | Coding | ADGRA2 | adhesion G protein-coupled receptor A2 | − 1.2 | 1E-12 |
| Chr1:44,724,501 | Coding | C1orf228 | chromosome 1 open reading frame 228 | − 1.2 | 9E-13 |
| ChrX:46,575,001 | Coding | CHST7 | carbohydrate sulfotransferase 7 | − 1.2 | 5E-13 |
| Chr14:104,729,501 | Coding | ADSSL1 | adenylosuccinate synthase like 1 | − 1.3 | 1E-15 |
| Chr2:202,774,251 | Intron/promoter | ICA1L | islet cell autoantigen 1 like | − 1.3 | 1E-15 |
| Chr1:155,190,001 | Coding | MUC1 | mucin 1, 2C cell surface associated | − 1.4 | 2E-17 |
| Chr20:3,751,751 | Coding | HSPA12B | heat shock protein family A (Hsp70) member 12B | − 1.4 | 8E-18 |
| Chr11:58,141,001 | Intron | OR9Q1 | olfactory receptor family 9 subfamily Q member 1 | − 1.6 | 4E-22 |
| Chr1:45,803,751 | Coding | MAST2 | microtubule associated serine/threonine kinase 2 | − 1.6 | 3E-23 |
| Chr7:636,001 | Intron | PRKAR1B | protein kinase cAMP-dependent type I regulatory subunit beta | − 1.7 | 1E-24 |
#High- versus average-risk value
Fig. 2Methylome profiling of breast tissues from women at either high- or average risk of breast cancer. A Chromosomal distribution of the DNA methylation aberrations observed in high-risk versus average-risk group. B Heatmap of the 20 highest differentially methylated regions in high-risk breasts as compared with average-risk breasts at FDR < 0.05. The overlapping gene name is indicated on the left. C Genomic localization (intron, coding, promoter or UTR) of the DNA methylation aberrations including regions either hypo- or hyper-methylated in high-risk versus average-risk breasts. Data are shown as percentage of each genomic localizaton versus the total number of sites. D Pathway analysis of the genes affected by DNA methylation aberrations (FDR < 0.05) in high-risk breasts as compared with breast from women at average risk for breast cancer. E One of the molecular networks including the genes affected by DNA hypermethylation
Fig. 3Correlation between degree of DNA methylation and gene expression. A Pearson’s correlation analysis between DNA methylation value and expression of the genes found differentially expressed between high- and average-risk breasts. B Pearson’s correlation analysis of the DNA methylation and expression of PHACTR1, hypermethylated in the breasts of high-risk women. r is the correlation coefficient and p is pvalue
Fig. 4Age-related transcriptome and DNA methylation changes in healthy breast tissues. A Differentially expressed genes between the first (D1) and second (D2) donation time point in the breast tissues from average (blue bars) and high- (orange bars) risk women. Ratio between D2 and D1 is shown. B Pearson’s correlation test between DNA methylation and transcription of GRIA4 and DNM3 in average- and high-risk breasts at the two time points, D1 and D2, C Number of genomic locations (intron, coding regions, promoter, UTR) of the age-related DNA methylation events. N.A.: not available. D Venn diagram of the DNA methylation changes associated with age comparing our data set (D2/D1) with Horvath’ epigenetic clock (353 CpGs) or Johnson’s age-associated loci (787 CpGs) E Differentially methylated regions between the first (D1) and second (D2) donation time point in the breast tissues from average (blue bars) and high- (orange bars) risk women. Ratio between D2 and D1 is shown. *p < 0.05; **p < 0.001