| Literature DB >> 26235388 |
Sehrish Rafique1,2, Jeremy S Thomas3, Duncan Sproul4,5, Wendy A Bickmore6.
Abstract
BACKGROUND: Epigenetic changes are being increasingly recognized as a prominent feature of cancer. This occurs not only at individual genes, but also over larger chromosomal domains. To investigate this, we set out to identify large chromosomal domains of epigenetic dysregulation in breast cancers.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26235388 PMCID: PMC4536608 DOI: 10.1186/s13059-015-0719-9
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Fig. 1Identifying RER in breast tumors and cancer cell lines. a, c Transcription correlation score (TCS) maps for chromosomes 3 and 16 using data from breast tumors [22] (a) and breast cancer cell lines [20] (c). The horizontal dotted line indicates the significance threshold. Arrows indicate regions containing genes with significant TCSs. b Boxplots showing the distribution of TCSs generated for all genes and for RER genes using two independent breast tumor datasets [22, 23]. d Ideograms showing the location of the 45 RER regions identified in breast tumors (red) and the 71 RER regions identified in breast cancer cell lines (blue)
Fig. 2Gene expression changes in tumors and normal tissue. Box plots show the distribution of mean centered z scores of gene expression in tumor samples and normal breast tissue and breast organoids [28] for genes in the RER regions at 2p24.2-p25.1 (a), 12q15-q21.33 (b) and 16p11.2 (c). Data for tumors are separated according to ER status and Wilcoxon tests were used to determine whether or not there was a significant difference between tumor and normal samples taken together
Fig. 3Properties of RER regions and tumor subtypes. a Unsupervised hierarchical cluster analysis of breast tumor samples for RER regions at 3p14-p21.31 (left) and 16q12.2-q24.1 (right). Heat maps of gene expression z scores with hierarchical clustering of samples (red high expression, green low expression). Genes are ordered by their position in the genome. Subtype information [22] for each tumor sample is identified by the color-coded matrix: luminal A (blue), luminal B (turquoise), ERBB2 (purple), basal (red), normal-like (green). Only genes in the regions with significant TCSs are shown. b As in (a) but for mean expression (mean z score of genes with significant TCSs) for all RER regions in each breast tumor sample, showing clustering of RER regions into three groups. Both the RER regions and samples were subject to hierarchical clustering. c Box plots showing mean expression (mean z score of genes with significant TCSs) of RER regions from clusters 1, 2 and 3 in breast tumors of different subtypes; luminal A (LumA, blue), luminal B (LumB, turquoise), ERBB2 (purple), basal-like (red). **p < 0.01, ***p < 0.001, Wilcoxon test)
Cytogenetic band(s) where RER regions common to breast tumors and breast cancer cell lines are located
| Cytogenetic location | Size (Mb) | Number of significant TCSs (tumor/cell line) | Number of other significantly correlated genes (tumor/cell line) | Genes in RER regions with significant TCSs | Gene expression (tumor versus normal) |
|---|---|---|---|---|---|
| 6p22.2 | 0.89 | 3/34 | 14/28 |
| Up |
| 6q23.2-q23.3 | 3.43 | 12/3 | 26/10 |
| Down |
| 6q25.1-q25.3 | 6.71 | 12/3 | 26/7 |
| Down |
| 8p11.21-q11.23 | 13.41 | 6/6 | 17/8 |
| Down |
| 8q21.13-q21.3 | 10.54 | 32/6 | 33/11 |
| No change |
| 8q22.1-q23.1 | 11.99 | 32/11 | 33/3 |
| No change |
| 9p13.3 | 1.13 | 2/6 | 20/11 |
| No change |
| 10p13-p12.31 | 6.71 | 5/3 | 22/10 |
| Down |
| 10q26.11-q26.13 | 6.34 | 3/5 | 21/10 |
| No change |
| 11q12.3-q13.1 | 0.75 | 5/2 | 27/9 |
| Up |
| 11q13.2 | 0.36 | 6/2 | 18/9 |
| Up |
| 12q21.31-q21.33 | 10.39 | 2/4 | 17/12 |
| Down |
| 16p13.3 | 2.16 | 23/23 | 33/22 |
| Up |
| 16p13.3 | 1.45 | 4/5 | 16/12 |
| Up |
| 16p12.3 | 0.38 | 3/2 | 13/6 |
| No change |
| 16p11.2 | 1.36 | 6/25 | 22/13 |
| Up |
| 16q22.1-q22.2 | 4.9 | 22/11 | 71/17 |
| Down |
| 17p11.2 | 1.69 | 17/3 | 26/6 |
| Down |
| 17q11.2 | 0.53 | 17/4 | 26/6 |
| Up |
| 17q21.2 | 0.28 | 17/6 | 13/9 |
| No change |
| 17q25.1 | 0.9 | 24/4 | 20/6 |
| Up |
| 17q25.3 | 0.58 | 13/4 | 18/13 |
| Up |
| 22q11.23-q12.1 | 5.93 | 5/13 | 15/20 |
| Down |
| 22q12.2-q12.3 | 1.01 | 3/2 | 15/11 |
| No change |
| 22q12.3 | 0.24 | 3/4 | 15/12 |
| No change |
| 22q13.1-q13.2 | 1.37 | 2/6 | 16/10 |
| No change |
The RER region size (measured as the first to last significant TCS in the region), the number of genes with significant TCSs in the tumor or cell line data, the number of other genes in the region which have significantly correlated expression with the TCS genes (again for tumors and cell lines), and the list of those genes with significant TCSs. The gene expression of the region in tumors compared with normal samples is also indicated
Fig. 4Properties of RER regions in breast cancer cell lines. Analysis of mean expression (mean z score of genes with significant TCSs) levels in breast cancer cell lines for cluster 1 RER regions (a) and RER regions of clusters 2 and 3 (c). RER regions were subject to hierarchical clustering and cell lines were ordered by their overall level of expression of each RER cluster. Box plots showing mean expression (mean z score of genes with significant TCSs) of RER regions from cluster 1 (b) and clusters 2 and 3 (d) in ER+ (gray) and ER− (white) breast cancer cell lines (*p < 0.05, ***p < 0.001)
Fig. 5Refinement and analysis of the 16p11.2 RER region in breast cancer cell lines. a Black bars indicate genes along 16p11.2, oriented from centromere (top) to telomere (bottom), which have significant TCSs at varying window (2n + 1) size with n from 1 to 10. Gene expression data are from tumor cell lines [20]. Gene names are listed to the right, as are the position of fluorescence in situ hybridization (FISH) probes that were used to examine the four RER subregions. b Box plots show the distribution of normalized FISH interprobe distances (d /r ) [31, 32] measured across the four subregions of the 16p11.2 RER region in MCF7 and MDAMB231 breast cancer cell lines. n = 45–60 nuclei. The significance of differences between datasets was assessed by Wilcox test (Table S3 in Additional file 1). c Unsupervised cluster analysis of gene expression z scores for subregion 2 in 48 breast cancer cell lines (red ER−, blue ER+) [20]. Cell line names are indicated along the bottom of the heat map. Red/green z scores equate to increased/decreased gene expression, respectively. Genes are ordered by their position on the chromosome and listed to the right. The yellow boxes indicate cell lines examined by FISH. d Example FISH images using probe pairs (red and green) that delineate subregion 2 (as in (a)) in ER+ cell lines MCF7 and LY2 (upper panels), and ER− cell lines MDAMB231 and MDAMB468 (lower panels). DNA is stained with DAPI (blue). Scale bar = 5 μm. The boxplots to the right show the distribution of normalized FISH interprobe distances (d /r ) across subregion 2 in the four cell lines. n = 45–60 nuclei. The significance of differences between datasets was assessed by Wilcox test (Table S3 in Additional file 1)
Fig. 6Chromatin compaction at subregion 2 of the 16p11.2 RER region in breast cancer cell lines, in normal breast tissue and in primary breast tumors. a Box plots comparing the distribution of normalized FISH interprobe distances (d /r ) measured across subregion 2 of the 16p11.2 RER region in a normal breast cell line (HMLE) and in ER+ (MCF7, LY2, MDAMB361) and ER− (MDAMB231 and MDAMB468) breast cancer cell lines. n = 45–60 cells. The significance of differences between datasets was assessed by Wilcox test (Table S3 in Additional file 1). b Box plots showing the distribution of normalized FISH interprobe distances (d /r ) measured across subregion 2 of the 16p11.2 RER region in normal breast tissue and in ER+ and ER− tumor tissues. n = 250–300 alleles. Distances in the ER+ tumor were significantly greater than in normal tissue (p < 0.0001) or in the ER− tumor (p = 0.004). Differences between normal and ER− tumor tissue were not significant (p = 0.24). c Example FISH images using probe pairs (red and green) that delineate subregion 2 in normal breast tissue and in ER+ and ER− tumor tissue. DNA is stained with DAPI (blue). Scale bar = 5 μm
Fig. 7The effect of estrogen on chromatin compaction and nuclear organization at subregion 2 of the 16p11.2 RER region. a Map of the 16p11.2 RER subregion 2 showing the location of ER binding sites (red) in MCF7 cells (from [38]). Below, the location of genes in the region is shown from the UCSC Genome Browser NCBI36/hg18 assembly of the human genome. b, c Box plots comparing the distribution of normalized FISH interprobe distances (d /r ) measured across subregion 2 of the 16p11.2 RER region in ER+ MCF7 (b) and ER− MDAMB231 (c) breast cancer cell lines. Data are shown for cells grown in normal serum, in media stripped of hormone for 3 days (−E2), and after addition of 100 nm estrogen for 24 hours (+E2). n = 60 cells for each sample. d, e The percentage of FISH hybridization signals for subregion 2 of the 16p11.2 RER region found in each of five shells of equal area eroded from the edge of the nucleus (shell 1) through to the nuclear center (shell 5), in MCF7 (d) and MDAMB231 cells (e) grown in normal serum (grey shaded bars), hormone stripped media (−E2, white) and after addition of 100 nm estrogen for 24 hours (+E2, black)
Fig. 8RER regions and TADs do not overlap. Diagrams of the RER regions at 6q23 (top) and 16p11.2 (bottom), showing the extent of the two RER regions and the location of TADs in the T47D breast cancer line (purple), IMR90 fibroblasts (red) and human ESCs (blue). TAD data are from [43, 44]
Fosmids probes used for FISH
| Chromosomal region | Whitehead probe name | Other probe name | Start (bp) | End (bp) | Midpoint (bp) | Separation between probe pair midpoints (kb) |
|---|---|---|---|---|---|---|
| 16p11.2 RER (sub region 1) | W12-1584N4 | G248P86075G2 | 29,664,946 | 29,703,636 | 29,684,291 | 344 |
| W12-1754H9 | G248P8656D5 | 30,010,045 | 30,046,946 | 30,028,496 | ||
| 16p11.2 RER (sub region 2) | W12-1754H9 | G248P8656D5 | 30,010,045 | 30,046,946 | 30,028,496 | 373 |
| W12-906G10 | G248P8190D5 | 30,379,727 | 30,424,227 | 30,401,977 | ||
| 16p11.2 RER (sub region 3) | W12-906G10 | G248P8190D5 | 30,379,727 | 30,424,227 | 30,401,977 | 405 |
| W12-497E18 | G248P8178C9 | 30,788,769 | 30,824,965 | 30,806,867 | ||
| 16p11.2 RER (sub region 4) | W12-497E18 | G248P8178C9 | 30,788,769 | 30,824,965 | 30,806,867 | 380 |
| W12-2222M4 | G248P87014G2 | 31,166,763 | 31,207,277 | 31,187,020 | ||
| 16p11.2 (nonRER control) | W12-3081O2 | G248P89434H1 | 28,483,429 | 28,519,288 | 28,501,359 | 418 |
| W12-2889N9 | G248P89117G5 | 28,899,528 | 28,938,582 | 28,919,055 | ||
| 11p15.4 (nonRER control) | W12-528M6 | G248P8086G3 | 4,961,240 | 4,999,789 | 4,980,515 | 493 |
| W12-2033J5 | G248P85537E3 | 5,453,348 | 5,494,460 | 5,473,904 |
Probe names are from the Whitehead Fosmid database [58]. Alternative probe names can be used to view fosmids on the UCSC genome browser. All genome locations (base pairs) are from the hg19 assembly of the human genome