| Literature DB >> 26884818 |
Elizaveta V Benevolenskaya1, Abul B M M K Islam2, Habibul Ahsan3, Muhammad G Kibriya3, Farzana Jasmine3, Ben Wolff4, Umaima Al-Alem5, Elizabeth Wiley6, Andre Kajdacsy-Balla6, Virgilia Macias6, Garth H Rauscher5.
Abstract
BACKGROUND: We examined whether differences in tumor DNA methylation were associated with more aggressive hormone receptor-negative breast cancer in an ethnically diverse group of patients in the Breast Cancer Care in Chicago (BCCC) study and using data from The Cancer Genome Atlas (TCGA).Entities:
Keywords: Breast cancer; DNA methylation; ER/PR hormone receptor status
Mesh:
Substances:
Year: 2016 PMID: 26884818 PMCID: PMC4754852 DOI: 10.1186/s13148-016-0184-7
Source DB: PubMed Journal: Clin Epigenetics ISSN: 1868-7075 Impact factor: 6.551
Patient and tumor characteristics by hormone receptor status in the BCCC dataset
| Total ( | ER/PR positive | |||
|---|---|---|---|---|
|
| % | % |
| |
| Age at diagnosis | 0.88 | |||
| <50 | 20 | 27 | 75 | |
| 50–59 | 20 | 27 | 70 | |
| 60–79 | 35 | 47 | 69 | |
| Race/ethnicity | 0.44 | |||
| nH Black | 31 | 41 | 65 | |
| nH White | 21 | 28 | 81 | |
| Hispanic | 23 | 31 | 70 | |
| Pathological stage | 0.39 | |||
| 1 | 23 | 31 | 83 | |
| 2 | 33 | 44 | 67 | |
| 3 | 18 | 24 | 61 | |
| 4 | 1 | 1 | 100 | |
| Histologic grade | 0.10 | |||
| Low | 5 | 7 | 100 | |
| Moderate | 31 | 42 | 77 | |
| High | 38 | 51 | 61 | |
| 1st degree familial breast cancer | 0.25 | |||
| None | 50 | 68 | 68 | |
| Moderate | 18 | 24 | 67 | |
| Strong (<50/multiple affected) | 6 | 8 | 100 | |
Fig. 1Distribution of DNA methylation between ER/PR-negative and ER/PR-positive samples in the training dataset (BCCC). a Box plot of mean β values. The level of methylation at each CpG site was defined by β values. β values close to 0 indicated low level of DNA methylation, and β values close to 1 indicated high level of DNA methylation. The next levels of analysis were conducted at the gene level. Mean β values were averaged for all CpG sites on the array for each individual gene. Statistical significance of difference in β values for each gene between the two groups was determined by ks and Wilcox tests. b Scatter plot analysis of mean β values
Number of associations between DNA methylation and hormone receptor-positive breast cancer in the BCCC dataset
| Analysis | Number of coefficientsa | No. of positive associations | No. of inverse associations | % positiveb | 95 % CI | Ratioc | 95 % CI |
|---|---|---|---|---|---|---|---|
| Training dataset ( | |||||||
| All associations | 806 | 548 | 258 | 68 | (66, 72) | 2.2 | (1.9, 2.6) |
| Associations with | 180 | 146 | 34 | 81 | (75, 86) | 4.3 | (3.0, 6.4) |
| Associations with | 46 | 38 | 8 | 83 | (70, 93) | 4.8 | (2.4, 12.7) |
aNumber of logistic regression coefficients involved in each analysis
bPercentage of coefficients that represent a positive association between methylation and hormone receptor-positive breast cancer.
cRatio of the number of positive divided by the number of negative associations
Validation of genes with differential DNA methylation as predictors of hormone status from the Illumina BCCC (training dataset) using TCGA (validation dataset)
| Training dataset | Validation dataset | Correlation with expression | ||||
|---|---|---|---|---|---|---|
| Gene | Associationa | SAM | Associationa | SAM |
|
|
|
| Positive | 3.94 | Positive | 8.65 | −0.44 | 1.4E−15 |
|
| Positive | 2.95 | Positive | 2.14 | −0.13 | 2.1E−02 |
|
| Positive | 2.70 | Positivee | 1.11 | −0.27 | 1.4E−06 |
|
| Positive | 2.66 | Positive | 5.61 | −0.28 | 4.9E−07 |
|
| Positive | 2.64 | Positive | 7.60 | −0.15 | 8.5E−03 |
|
| Positive | 2.64 | Positive | 4.56 | −0.27 | 2.0E−06 |
|
| Positive | 2.53 | Positive | 9.51 | −0.29 | 3.8E−07 |
|
| Positive | 2.52 | Positive | 2.09 | −0.30 | 1.1E−07 |
|
| Positive | 2.51 | Positive | 6.37 | −0.22 | 1.1E−04 |
|
| Positive | 2.50 | Positive | 5.69 | −0.30 | 5.2E−08 |
|
| Positive | 2.50 | Not-associated | 0.00 | NA | NA |
|
| Positive | 2.45 | Positive | 2.65 | −0.69 | 2.2E−44 |
|
| Positive | 2.44 | Positive | 5.39 | 0.34 | 1.3E−09 |
|
| Positive | 2.43 | NA | NA | NA | NA |
|
| Positive | 2.43 | Positive | 7.78 | −0.05 | 4.2E−01 |
|
| Positive | 2.43 | Positive | 4.62 | −0.05 | 4.2E−01 |
|
| Positive | 2.38 | Positive | 2.48 | −0.12 | 2.0E−02 |
|
| Positive | 2.32 | Positive | 4.15 | 0.08 | 1.7E−01 |
|
| Positive | 2.32 | Positive | 2.22 | −0.13 | 5.7E−05 |
|
| Positive | 2.29 | Positive | 7.06 | −0.13 | 2.0E−02 |
|
| Positive | 2.23 | Positive | 2.15 | −0.45 | 2.6E−16 |
|
| Positive | 2.21 | Positivee | 0.17 | NA | NA |
|
| Positive | 2.21 | Positive | 9.32 | −0.52 | 2.6E−22 |
|
| Positive | 2.20 | Positive | 3.61 | −0.10 | 8.0E−02 |
|
| Positive | 2.19 | Positive | 4.66 | −0.37 | 1.3E−11 |
NA gene was absent in the dataset
aIndicates whether gene hypermethylation was associated with increased likelihood of ER/PR-positive breast cancer versus ER/PR-negative breast cancer (“Positive”)
b d scores from SAM analysis using Δ of 0.7 on the GoldenGate dataset
c d scores from SAM analysis using Δ of 3 on the TCGA dataset. In cases where several probes per gene were present, the data is shown for the probe with the highest SAM d score
dPearson correlation coefficient between methylation and expression from TCGA and the corresponding P value
eNon-significant association
Patient and tumor characteristics by hormone receptor status in the TCGA dataset
| Total ( | ER/PR positive | |||
|---|---|---|---|---|
|
| % | % |
| |
| Age at diagnosis | 0.008 | |||
| <50 | 72 | 24 | 68 | |
| 50–59 | 79 | 26 | 73 | |
| 60–79 | 155 | 51 | 85 | |
| Race/ethnicity | 0.001 | |||
| nH White | 182 | 59 | 77 | |
| nH Black | 22 | 7 | 59 | |
| Hispanic | 1 | 0 | 100 | |
| Asian | 21 | 7 | 57 | |
| Unknown | 80 | 26 | 91 | |
| Pathological stage | 0.271 | |||
| 1 | 55 | 18 | 82 | |
| 2 | 180 | 59 | 84 | |
| 3 | 51 | 17 | 74 | |
| 4 | 9 | 3 | 80 | |
| Missing | 11 | 4 | 100 | |
Fig. 2Training (BCCC) and validation (TCGA) datasets stratified based on DNA methylation data for disease predictor genes. a Sample level data for DNA methylation (GoldenGate) in the BCCC dataset. Results of SAM-supervised classification of ER/PR status from β values are shown. β values for disease predictors (one probe per gene listed in Table 3) are presented as a heat map. Data are shown grouped for hormone receptor-negative samples (N = 22) and hormone receptor-positive samples (N = 53). P values from the t test for the difference between ER/PR-negative and ER/PR-positive disease (Additional file 1: Table S1) are presented on the right. b Sample level data for DNA methylation in the TCGA. Mean β values (when multiple probes were present) are reported for Table 3 genes, stratifying samples according to ER/PR status. P values from the t test are shown on the right
Fig. 3Correlation of DNA methylation with the level of gene expression in the validation dataset (TCGA). a Significance of the correlation between the level of DNA methylation and expression for each gene as determined by Pearson correlation coefficient (PCC). The correlation test was run for all samples, only ER/PR-negative samples, or only ER/PR-positive samples. b Correlation test (methylation versus expression) for predictors (Table 3). c Sample level data for gene expression of predictors (Table 3). Data is presented for 22 ER/PR-negative tumors and 53 ER/PR-positive tumors. The PCC across the BCCC samples and TCGA samples (methylation versus expression) is shown on the right