| Literature DB >> 26510686 |
Garth H Rauscher1, Jacob K Kresovich2, Matthew Poulin3, Liying Yan4, Virgilia Macias5, Abeer M Mahmoud6, Umaima Al-Alem7, Andre Kajdacsy-Balla8, Elizabeth L Wiley9, Debra Tonetti10, Melanie Ehrlich11.
Abstract
BACKGROUND: Breast cancer formation is associated with frequent changes in DNA methylation but the extent of very early alterations in DNA methylation and the biological significance of cancer-associated epigenetic changes need further elucidation.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26510686 PMCID: PMC4625569 DOI: 10.1186/s12885-015-1777-9
Source DB: PubMed Journal: BMC Cancer ISSN: 1471-2407 Impact factor: 4.430
List of studied gene regions and number of CpGs covered, Breast Cancer Care in Chicago study (2005-2008)
| Gene/RNA isoforma | Test region | Test region coordinates (hg19) | Distance from TSS (bp)b | CGIc | # CpGsd | TSGe |
|---|---|---|---|---|---|---|
| Promoter region | ||||||
| BRCA1 | Exon 1 (extended promoter) | chr17: 41277463-41277365 | +37 to +135 | No | 11 | Yes |
| CD44 | Promoter | chr11: 35160374-35160443 | -43 to +26 | Yes | 8 | No |
| ESR1 | Exon 2 (extended promoter) | chr6: 152129110 - 152129167 | +656 to +713 | Yes | 5 | Unclear |
| GSTM2 | Promoter | chr1: 110210582-110210641 | -62 to -3 | Yes | 8 | No |
| GSTP1 | Exon 1 (extended promoter) | chr11: 67351205-67351215 | +139 to +149 | Yes | 4 | Yes |
| MAGEA1f | Promoter | chrX: 152486180-152486129 | -13 to -64 | No | 6 | No |
| MSI1 | Promoter | chr12: 120807571-120807474 | -588 to -491 | No | 5 | |
| NFE2L3 | Exon 1 (Extended promoter) | chr7: 26192663-26192744 | +816 to +897 | Yes | 14 | No |
| RASSF1A | Exon 1 (extended promoter) | chr3: 50378293-50378233 | +74 to +134 | Yes | 9 | Yes |
| RUNX3 | Exon 1 (extended promoter) | chr1: 25256198-25256306 | +464 to +572 | Yes | 28 | Yes |
| SIX3 | Exon 1 (extended promoter) | chr2: 45169609-45169529 | +492 to +572 | Yes | 12 | Unclear |
| TFF1 | Promoter | chr21: 43786664-43786628 | -20 to +16 | No | 5 | Unclear |
| Upstream of promoter | ||||||
| EN1 | Upstream of promoter | chr2: 119611385-119611338 | -5579 to -5626 | Yes | 6 | No |
| PAX3g | Far upstream | chr2: 223170608-223170643 | -6928 to -6893 | Yes | 5 | No |
| PITX2f,h | Far upstream or intron I | chr4: 111562566-111562677 | -18312 to -18413/+602 to +713 | No | 10 | No |
| SGK1h | Far upstream/ alt. exon 1 | chr6: 134638893-134638831 | -14823 to -14761/+303 to +365 | Yes | 6 | Unclear |
| Introns | ||||||
| APC | Intron 1 or promoter | chr5: 112073426-112073445 | +30224 to +30243/-130 to -111 | No | 4 | Yes |
| EGFR | intron 1 | chr7: 55088080-55088104 | +1355 to +1379 | Yes | 4 | No |
| LHX2 | Intron 3 | chr9: 126777854-126777983 | +3966 to +4095 | Yes | 11 | No |
| RFX1f | Intron 7 | chr19: 14089984-14089969 | +27150 to +27165 | No | 4 | No |
| SOX9 | Intron 2 | chr17: 70119151-70119195 | +1990 to +2034 | Yes | 4 | No |
| DNA Repeats | ||||||
| LINE-1 | N.A. | DNA Repeat | N.A. | 4 | N.A. | |
| Sat2 | N.A. | DNA Repeat | N.A. | 2 | N.A. | |
aWhere there are multiple RefSeq RNA isoforms and expression in HMEC cells by RNA-seq (ENCODE/Cold Spring Harbor), the RNA isoform closest to the predominant HMEC RNA was used in this table to determine the TSS. The isoforms for calculation of the distance from the TSS are given in Additional file 1: Tables S1 and S2
bTSS, transcription start site for the indicated RefSeq isoform. N.A., not applicable
cCGI, CpG island overlapping the test region
dThe number of CpG dinucleotide pairs in the test region (the amplicon used for pyrosequencing minus the primer regions)
eTSG, Tumor suppressor gene
fAlthough the sequences were in regions that did not make the criteria to be classified as CGI [13], the regions were rich in CpG compared to the average for human DNA
gThere is a little expressed, primate specific gene, CCDC140, between PAX3 and the test region whose 5’ end overlaps the 5’ end of PAX3
hThere are distant alternative 5’ ends of these genes
Fig. 1Example of how some gene regions were chosen for examination in this study on the basis of available RRBS DNA methylation profiles for breast cancer cell lines and normal cell cultures and tissues visualized in the UCSC Genome Browser [13]. a The EN1 gene structure with exons as heavy horizontal bars; b, the aligned CpG islands in the illustrated region.; c, DNA methylation (ENCODE/RRBS/HudsonAlpha) profiles for the indicated cell cultures and normal tissues using an 11-color, semi-continuous scale (see color key) to indicate the average DNA methylation levels at each monitored CpG site; d, aligned transcription results indicating that the non-transformed breast cancer cell line is not transcribing this gene irrespective of its lack of DNA methylation. Paradoxically, normal myoblasts are transcribing it despite some upstream DNA methylation. All data are from ENCODE [19]
Characteristics of the 129 breast cancer patients with adjacent normal and/or invasive samples, Breast Cancer Care in Chicago study (2005-2008)
| Patient characteristic | No. | Percent |
|---|---|---|
| Age | ||
| <50 | 44 | 34 |
| 50-59 | 36 | 28 |
| 60-69 | 32 | 25 |
| 70-79 | 17 | 13 |
| Race/Ethnicity | ||
| non-Hispanic White | 42 | 33 |
| non-Hispanic Black | 57 | 44 |
| Hispanic | 30 | 23 |
| Stage at Diagnosis | ||
| Stage I | 60 | 47 |
| Stage II-IV | 68 | 53 |
| Histologic Grade | ||
| Low | 39 | 32 |
| Intermediate | 42 | 34 |
| High | 42 | 34 |
| ER/PR status | ||
| One or both positive | 82 | 67 |
| Double negative | 40 | 33 |
Mean percent methylation by gene and tissue type from the Breast Cancer Care in Chicago study
| DNA region | Controla | Adjacentb | Invasivec | Adjacent vs. control | Invasive vs. control | Invasive vs. adjacent | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| N | Mean | SD | N | Mean | SD | N | Mean | SD | Diff. |
| Diff. |
| Diff. |
| |
| Promoter region | |||||||||||||||
| BRCA1 | 15 | 1.6 | 1.1 | 105 | 1.4 | 4.3 | 105 | 3.0 | 9.6 | -0.2 | NS | 1.4 | NS | 1.6 | 0.031 |
| CD44f | 18 | 1.6 | 0.6 | 37 | 1.8 | 0.9 | 37 | 2.6 | 2.7 | 0.2 | NS | 1.0 | NS | 0.8 | NS |
| ESR1f | 18 | 5.9 | 3.1 | 37 | 7.9 | 6.0 | 37 | 6.7 | 7.8 | 2.0 | NS | 0.8 | NS | -1.2 | NS |
| GSTM2 | 16 | 1.8 | 2.0 | 107 | 3.0 | 6.1 | 107 | 19.3 | 22.9 | 1.2 | NS | 17.5 | 0.004 | 16.3 | <0.0001 |
| GSTP1 | 17 | 2.5 | 2.0 | 118 | 1.6 | 3.7 | 118 | 9.9 | 16.8 | -0.9 | NS | 7.4 | NS | 8.3 | <0.0001 |
| MAGEA1f | 17 | 84.8 | 4.9 | 32 | 84.2 | 4.8 | 37 | 67.0 | 16.2 | -0.6 | NS | -17.8 | 0.0001 | -17.2 | <0.0001 |
| MSI1f | 18 | 3.8 | 2.1 | 36 | 4.2 | 2.5 | 37 | 9.2 | 7.4 | 0.4 | NS | 5.3 | 0.0001 | 5.0 | 0.0003 |
| NFE2L3f | 17 | 28.9 | 14.7 | 36 | 26.9 | 13.7 | 37 | 33.7 | 24.4 | -2.0 | NS | 4.8 | NS | 6.8 | NS |
| RASSF1A | 18 | 2.8 | 2.2 | 124 | 9.7 | 10.7 | 124 | 33.3 | 23.4 | 6.9 | <0.0001 | 30.5 | <0.0001 | 23.6 | <0.0001 |
| RUNX3 | 17 | 3.3 | 1.2 | 119 | 4.0 | 3.3 | 119 | 11.4 | 12.7 | 0.7 | NS | 8.1 | 0.061 | 7.4 | <0.0001 |
| SIX3 | 17 | 5.8 | 2.8 | 115 | 5.3 | 4.0 | 115 | 15.7 | 14.9 | -0.5 | NS | 9.9 | 0.026 | 10.4 | <0.0001 |
| TFF1 | 18 | 81.8 | 5.4 | 122 | 72.0 | 16.9 | 122 | 49.2 | 22.3 | -9.8 | 0.008 | -32.6 | <0.0001 | -22.8 | <0.0001 |
| Upstream of promoter | |||||||||||||||
| EN1 | 18 | 17.8 | 5.3 | 122 | 20.0 | 10.2 | 122 | 32.9 | 15.3 | 2.2 | NS | 15.1 | <0.0001 | 12.9 | <0.0001 |
| PAX3 | 17 | 2.9 | 1.2 | 121 | 4.0 | 5.2 | 121 | 10.8 | 11.1 | 1.1 | NS | 7.9 | 0.0003 | 6.8 | <0.0001 |
| PITX2f | 17 | 26.5 | 6.1 | 35 | 27.6 | 8.6 | 36 | 36.1 | 11.3 | 1.1 | NS | 9.6 | 0.001 | 8.5 | <0.0001 |
| SGK1 | 18 | 1.6 | 1.2 | 124 | 3.9 | 3.7 | 124 | 13.0 | 12.2 | 2.3 | <0.0001 | 11.4 | <0.0001 | 9.1 | <0.0001 |
| Introns | |||||||||||||||
| APC | 18 | 2.0 | 1.9 | 114 | 2.9 | 8.3 | 114 | 14.8 | 19.6 | 0.9 | NS | 12.8 | 0.058 | 11.9 | <0.0001 |
| EGFR | 18 | 4.5 | 1.3 | 126 | 7.3 | 5.2 | 126 | 19.4 | 14.8 | 2.8 | 0.006 | 14.9 | <0.0001 | 12.1 | <0.0001 |
| LHX2f | 18 | 21.1 | 5.3 | 36 | 25.8 | 13.1 | 37 | 36.1 | 12.6 | 4.7 | NS | 15.0 | <0.0001 | 10.3 | 0.0007 |
| RFX1 | 18 | 18.0 | 5.3 | 126 | 19.3 | 9.7 | 126 | 39.4 | 13.1 | 1.3 | NS | 21.4 | <0.0001 | 20.1 | <0.0001 |
| SOX9 | 18 | 8.4 | 4.3 | 123 | 9.1 | 7.4 | 123 | 15.2 | 14.3 | 0.7 | NS | 6.8 | NS | 6.1 | 0.002 |
| DNA Repeats | |||||||||||||||
| LINE-1 | 18 | 68.7 | 1.4 | 129 | 72.8 | 2.6 | 129 | 71.2 | 4.3 | 4.1 | <0.0001 | 2.5 | 0.0003 | -1.6 | 0.001 |
| Sat2 | 18 | 52.6 | 8.0 | 128 | 57.4 | 12.7 | 128 | 52.0 | 13.4 | 4.8 | 0.002 | -0.6 | NS | -5.4 | <0.0001 |
aReduction mammoplasty samples from women unaffected with breast cancer
bSamples from histologically normal tissue adjacent to the tumor
cSamples from the cancer component of the tumor
dFrom an independent sample Wilcoxon Rank Sum test comparing control mammoplasty vs. adjacent samples
eFrom a dependent sample Wilcoxon Sign Rank test. P-values > 0.10 are suppressed. Diff, difference in mean methylation; SD, standard deviation
fThese seven assays were not pursued beyond the pilot phase and, therefore, had 32-37 paired cancer and adjacent samples instead of 105-129
Differences were determined to be statistically significant at p < 0.001 for the complete sample set and p < 0.01 for regions only examined in the pilot study
Fig. 2Mean percent methylation and 95 % error bars by gene and tissue type for the DNA regions listed in Table 1. a DNA methylation analysis of samples from the Breast Cancer Care in Chicago study (2005-2008) as determined by our bisulfite pyrosequencing. Control samples (reduction mammoplasty) from unaffected women are represented by green bars, cancer-adjacent, histologically normal samples by blue bars and cancer samples by red bars. b Bioinformatic analysis of DNA methylation of breast cancer samples and paired non-cancerous adjacent samples from The Cancer Genome Atlas (TCGA). Paired non-cancerous adjacent samples are represented by blue bars and cancer samples by red bars. In both panels, promoter sequences are displayed first, followed by upstream sequences, then introns and lastly, DNA repeats
Adjusted differences in mean % methylation comparing adjacent (referent) to cancer tissue, overall and stratified by ER/PR status
| All samples | ER/PR Positive | ER/PR Negative | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Na | Diff.b | 95 % CIc |
| Na | Diff.b | 95 % CIc |
| Na | Diff.b | 95 % CIc |
| |
| Promoter regions | ||||||||||||
| BRCA1 | 198 | 1.7 | (0, 4) | NS | 136 | 1 | (-1, 4) | NS | 62 | 3 | (0, 8) | 0.09 |
| CD44e | 54 | 0.5 | (0, 1) | NS | 32 | 0 | (-0.4, 1) | NS | 22 | 0 | (0,2) | NS |
| ESR1e | 54 | -1.5 | (-3, 0) | 0.099 | 32 | -1 | (-4, 1) | NS | 22 | -2 | (-4, 1) | NS |
| GSTM2 | 212 | 16.8 | (13, 21) | < 0.0001 | 146 | 10 | (6, 16) | < 0.0001 | 66 | 32 | (24, 38) | < 0.0001 |
| GSTP1 | 227 | 8.3 | (6, 12) | < 0.0001 | 159 | 9 | (6, 14) | < 0.0001 | 68 | 6 | (2, 12) | 0.016 |
| MAGEA1e | 54 | -14.5 | (-22, -9) | < 0.0001 | 32 | -22 | (-30, -14) | < 0.0001 | 22 | -4 | (-12, 3) | NS |
| MSI1e | 54 | 4.7 | (2, 8) | 0.005 | 32 | 4 | (1, 10) | 0.057 | 22 | 5 | (1, 9) | 0.01 |
| NFE2L3e | 54 | 5.6 | (-4, 16) | NS | 32 | 19 | (9, 31) | 0.001 | 22 | -15 | (-26, -4) | 0.01 |
| RASSF1A | 234 | 23.5 | (19, 28) | < 0.0001 | 160 | 26 | (21, 31) | < 0.0001 | 74 | 18 | (12, 25) | < 0.0001 |
| RUNX3 | 227 | 6.6 | (4, 9) | < 0.0001 | 156 | 9 | (6, 11) | < 0.0001 | 71 | 2 | (-1, 7) | NS |
| SIX3 | 221 | 10.9 | (9, 14) | < 0.0001 | 151 | 10 | (7, 14) | < 0.0001 | 70 | 13 | (8, 19) | < 0.0001 |
| TFF1 | 230 | -21.6 | (-26, -17) | < 0.0001 | 159 | -25 | (-30, -19) | < 0.0001 | 71 | -14 | (-23, -5) | 0.002 |
| Upstream of promoter | ||||||||||||
| EN1 | 230 | 13.1 | (10, 17) | < 0.0001 | 158 | 12 | (8, 16) | < 0.0001 | 72 | 16 | (9, 24) | < 0.0001 |
| PAX3 | 230 | 7.3 | (5, 10) | < 0.0001 | 159 | 6 | (4, 9) | < 0.0001 | 71 | 10 | (6, 15) | < 0.0001 |
| PITX2e | 54 | 7.8 | (4, 11) | < 0.0001 | 32 | 7 | (3, 10) | < 0.0001 | 22 | 10 | (1, 16) | 0.026 |
| SGK1 | 233 | 9.8 | (8, 12) | < 0.0001 | 160 | 9 | (7, 12) | < 0.0001 | 73 | 11 | (7, 16) | < 0.0001 |
| Introns | ||||||||||||
| APC | 221 | 12.3 | (9, 16) | < 0.0001 | 153 | 12 | (8, 16) | < 0.0001 | 68 | 15 | (9, 22) | < 0.0001 |
| EGFR | 235 | 12 | (9, 15) | < 0.0001 | 161 | 15 | (11, 18) | < 0.0001 | 74 | 6 | (2, 11) | 0.009 |
| LHX2e | 54 | 8.1 | (2, 15) | 0.017 | 32 | 8 | (3, 13) | 0.006 | 22 | 9 | (-5, 22) | NS |
| RFX1 | 235 | 19.8 | (17, 23) | < 0.0001 | 161 | 20 | (16, 23) | < 0.0001 | 74 | 21 | (16, 26) | < 0.0001 |
| SOX9 | 231 | 6.2 | (3, 9) | < 0.0001 | 158 | 6 | (3, 10) | < 0.0001 | 73 | 6 | (1, 13) | 0.031 |
| DNA Repeats | ||||||||||||
| LINE-1 | 238 | -1.5 | (-2, -1) | < 0.0001 | 162 | -2 | (-3, -1) | < 0.0001 | 76 | -1 | (-3, 0) | 0.074 |
| Sat2 | 237 | -5.4 | (-9, -2) | 0.001 | 161 | -6 | (-10, -2) | 0.003 | 76 | -5 | (-10, 1) | 0.109 |
aNumber of samples analyzed; the small differences in numbers of samples in this table compared to Table 3 are due to missing data on ER/PR status
bDifference in mean percent methylation (cancer and adjacent) was estimated via linear regression model with generalized estimating equations to account for within-patient covariance
cBias corrected, bootstrapped 95 % confidence intervals (CI) were estimated via 1000 bootstrap replications to account for skewed methylation distributions
dApproximate p-value estimated from a Wald test of the normal-based bootstrapped estimate over its standard error. P-values > 0.1 are suppressed
eThese seven assays were not pursued beyond the pilot phase and therefore have considerably fewer cancer and adjacent tissue samples analyzed
All estimates of mean percent methylation are adjusted for age, race/ethnicity, stage at diagnosis, tumor grade, and either adjusted for or stratified by ER/PR status
Methylation comparing cancer to paired adjacent samples, and correlation of methylation in invasive breast cancer samples with gene expression
| This study | TCGA database, within study region +/- 100 bp | TCGA database, within study region | |||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Pyroseqa | Illum. 450 k: Mean % methylation | Assn. with expr. | Illum. 450 k: Mean % Methylation | Assn. with expr. | |||||||||||
| Gene | # CpG | # CpG | Adjacentb (N = 96) | Cancerc (N = 96) | Diffd |
| ρf |
| # CpG | Adjacentb (N = 96) | Cancerc (N = 96) | Diffd |
| ρf |
|
| Promoter region | |||||||||||||||
| BRCA1 | 11 | 13 | 4 | 4 | 0 | NS | -0.19 | 0.0000 | 9 | 3 | 4 | 0 | NS | -0.20 | 0.0000 |
| CD44 | 8 | 5 | 4 | 9 | 5 | 0.0001 | -0.18 | 0.0000 | 1 | 6 | 10 | 5 | 0.004 | -0.18 | 0.0000 |
| ESR1 | 5 | 2 | 67 | 40 | -28 | 0.0000 | -0.52 | 0.0000 | 2 | 67 | 40 | -28 | 0.0000 | -0.52 | 0.0000 |
| GSTM2 | 8 | 3 | 4 | 16 | 13 | 0.0000 | -0.13 | 0.0003 | 1 | 2 | 2 | -1 | 0.0000 | -0.13 | 0.0002 |
| GSTP1 | 4 | 2 | 7 | 24 | 17 | 0.0000 | -0.43 | 0.0000 | 0 | --- | --- | --- | --- | --- | |
| MAGEA1 | 6 | 2 | 97 | 77 | -21 | 0.0000 | -0.23 | 0.0000 | 2 | 97 | 77 | -20 | 0.0000 | -0.23 | 0.0000 |
| MSI1 | 5 | 2 | 3 | 6 | 3 | 0.0000 | -0.20 | 0.0000 | 0 | --- | --- | --- | --- | --- | |
| NFE2L3 | 14 | 1 | 39 | 48 | 9 | 0.0001 | -0.55 | 0.0000 | 0 | --- | --- | --- | --- | --- | |
| RASSF1 | 9 | 2 | 12 | 47 | 35 | 0.0000 | -0.18 | 0.0000 | 1 | 9 | 49 | 41 | 0.0000 | -0.16 | 0.0000 |
| RUNX3 | 28 | 1 | 2 | 23 | 21 | 0.0000 | -0.28 | 0.0000 | 0 | --- | --- | --- | --- | --- | |
| SIX3 | 12 | 4 | 6 | 24 | 18 | 0.0000 | -0.21 | 0.0000 | 2 | 4 | 26 | 22 | 0.0000 | -0.20 | 0.0000 |
| TFF1 | 5 | 4 | 77 | 46 | -31 | 0.0000 | -0.20 | 0.0000 | 1 | 74 | 47 | -27 | 0.0000 | -0.19 | 0.0000 |
| Upstream of promoter | |||||||||||||||
|
| 6 | 2 | 23 | 32 | 8 | 0.0000 | 0.12 | 0.001 | 0 | --- | --- | --- | --- | --- | |
| PAX3 | 5 | 2 | 6 | 30 | 24 | 0.0000 | -0.12 | 0.004 | 1 | 9 | 29 | 19 | 0.0000 | -0.10 | 0.003 |
|
| 10 | 2 | 23 | 38 | 15 | 0.0000 | 0.22 | 0.0000 | 0 | --- | --- | --- | --- | --- | |
| SGK1 | 6 | 4 | 9 | 21 | 12 | 0.0000 | -0.10 | 0.003 | 2 | 5 | 17 | 12 | 0.0000 | -0.11 | 0.003 |
| Introns | |||||||||||||||
|
| 4 | 11 | 6 | 28 | 22 | 0.0000 | 0.13 | 0.002 | 3 | 5 | 28 | 23 | 0.0000 | 0.12 | 0.004 |
| EGFR | 4 | 1 | 9 | 37 | 29 | 0.0000 | -0.15 | 0.0000 | 1 | 9 | 37 | 29 | 0.0000 | -0.15 | 0.0000 |
|
| 11 | 1 | 17 | 32 | 15 | 0.0000 | 0.24 | 0.0000 | 0 | --- | --- | --- | --- | --- | |
| RFX1 | 4 | 0 | --- | --- | --- | --- | --- | 0 | --- | --- | --- | --- | --- | ||
| SOX9 | 4 | 1 | 5 | 15 | 11 | 0.0000 | -0.34 | 0.0000 | 0 | --- | --- | --- | |||
aPyrosequencing (Pyroseq) assay coordinates are given in Table 1
bNon-cancer tissue adjacent to paired breast cancer from 96 patients in the TCGA Illumina Methylation450 database for genome-wide DNA methylation [14]
cSamples from the invasive component of the breast cancer from patients in the TCGA Methylation450 and expression (expr.; RNA-seq) databases [16, 17]
dDifference in mean % methylation for cancer minus that for the paired adjacent tissue for the 96 patients with both in the TCGA database
eFrom a dependent sample Wilcoxon Sign Rank test; P-values > 0.10 are suppressed
fSpearman correlation coefficient for methylation levels vs expression levels among all the invasive breast cancer samples in the TCGA database
gThe four underlined genes were the only ones that had a positive association of cancer methylation in non-promoter regions with expression levels of the associated gene