| Literature DB >> 30988298 |
Arie B Brinkman1, Serena Nik-Zainal2,3, Femke Simmer4,5, F Germán Rodríguez-González6, Marcel Smid6, Ludmil B Alexandrov2,7,8, Adam Butler2, Sancha Martin2, Helen Davies2, Dominik Glodzik2, Xueqing Zou2, Manasa Ramakrishna2, Johan Staaf9, Markus Ringnér9, Anieta Sieuwerts6, Anthony Ferrari10, Sandro Morganella11, Thomas Fleischer12, Vessela Kristensen12,13,14, Marta Gut15, Marc J van de Vijver16, Anne-Lise Børresen-Dale12,13, Andrea L Richardson17,18, Gilles Thomas10, Ivo G Gut15, John W M Martens6, John A Foekens6, Michael R Stratton2, Hendrik G Stunnenberg19.
Abstract
Global loss of DNA methylation and CpG island (CGI) hypermethylation are key epigenomic aberrations in cancer. Global loss manifests itself in partially methylated domains (PMDs) which extend up to megabases. However, the distribution of PMDs within and between tumor types, and their effects on key functional genomic elements including CGIs are poorly defined. We comprehensively show that loss of methylation in PMDs occurs in a large fraction of the genome and represents the prime source of DNA methylation variation. PMDs are hypervariable in methylation level, size and distribution, and display elevated mutation rates. They impose intermediate DNA methylation levels incognizant of functional genomic elements including CGIs, underpinning a CGI methylator phenotype (CIMP). Repression effects on tumor suppressor genes are negligible as they are generally excluded from PMDs. The genomic distribution of PMDs reports tissue-of-origin and may represent tissue-specific silent regions which tolerate instability at the epigenetic, transcriptomic and genetic level.Entities:
Mesh:
Year: 2019 PMID: 30988298 PMCID: PMC6465362 DOI: 10.1038/s41467-019-09828-0
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Fig. 1Visualization of inter-tumor variation at genome-wide scale. a Genome-wide and b chromosome-wide maps of WGBS DNA methylation profiles from 30 breast tumor samples. Mean methylation is displayed in consecutive tiles of 10 kb (see Methods section). Ordering of tumor samples is according clustering of the tiled profiles. c WGBS DNA methylation visualization at megabase-scale. Pink coloring indicates common methylation loss (PMDs), although tumor-specific PMD borders vary. A scale bar (100 kb) is shown at the top of each panel. CpG islands are indicated in green. d Principal component analysis of WGBS DNA methylation profiles (see Methods section). Each tumor sample is represented with its estrogen-receptor (ER) status (point shape) and mean PMD methylation (point color)
Fig. 2Characterization of breast cancer PMDs. a Fraction of the genome covered by PMDs. Each dot represents one tumor sample, the boxplot summarizes this distribution. b Fraction of the genome covered by PMDs that are common between breast tumors. PMD frequency: the number of tumors in which a genomic region or gene is a PMD. c Breast cancer PMDs are not distributed randomly over the genome. The genome was dissected into 30-kb tiles, PMD frequency (number of boundaries) was calculated for each tile. The same analysis was done after shuffling the PMDs of each tumor sample. d Average profiles of LaminB[23], repliSeq (DNA replication timing, ENCODE), 3D chromatin interaction loops (HiC[27], and CTCF (ENCODE) over PMD borders. If available, data from the breast cancer cell line (MCF7) and mammary epithelial cells (HMEC) was used, otherwise data from fibroblasts (IMR90, Tig3) was used. e Gene distribution inside PMDs (top, as a fraction of all annotated genes; bottom, as gene coding density). f Gene expression inside PMDs. Gene expression (top) and standard deviation (bottom) for the 25 overlapping cases of our WGBS and the transcriptome cohorts[17] was plotted as a function of PMD frequency. g Somatic mutations inside PMDs. Substitutions, insertions, deletions, and rearrangements were calculated for the 25 overlapping cases of our WGBS and the breast tumor full genomes cohorts[15], and plotted as a function of PMD frequency. h Distribution of DNA methylation over functional genomic elements, inside and outside PMDs. CpGs were classified according PMD status and genomic elements, and the distribution of DNA methylation within each element was plotted. All boxplots in this figure represent the median and 25th and 75th percentiles, whiskers 1.5 times the interquartile range, outliers are not shown
Fig. 3CpG island hypermethylation inside PMDs. a Representative 2.1-Mb genomic region. Red bars, PMDs for each tumor; below, CGI methylation per tumor (same ordering). Green bars, CGIs. b CGI methylation as the fraction of all CGIs (x-axis). Horizontal bars represent individual tumors. c Methylation over CGIs inside/outside PMDs, averaged over all 30 tumors. Black/red lines, median; gray/pink area, 1st and 3rd quartiles. d CGI counts inside and outside of breast cancer PMDs. “in”, CGIs inside PMDs in at least one tumor sample. e Variation of CGI methylation (standard deviation) as a function of PMD frequency. f Regression analysis of B-CIMP (y-axis) as a function of the fraction CGIs inside PMDs (x-axis). B-CIMP: the genome-wide fraction of hypermethylated CGIs (>30% methylation). g Summary of regression analyses as in (F), for additional cancer types. n, the number of samples for each type. For abbreviations of cancer type names, see Fig. 4b. h Expression change of CGI-promoter genes inside vs. outside of PMDs, as a function of PMD frequency. i Gene expression as a function of PMD frequency in TCGA breast cancer data. PMD frequency was derived from our own methylation data. j Expression change of CGI-promoter genes as a function of PMD frequency in matched breast cancer tumor/normal pairs (TCGA). PMD frequency was derived from our own methylation data. k Tumor-suppressor genes (TSGs) are excluded from PMDs. PMD frequency was determined for each TSG and the resulting distribution was plotted. Main plot, relative distribution; inset, absolute gene count. “Non-TSGs”, genes not annotated as TSGs; “TSGs all cancers”, genes annotated as TSGs regardless of cancer type; “TSGs breast cancer”, genes annotated as TSG in breast cancer; “Nik-Zainal breast cancer driver mutations”, genes with driver mutations in breast cancer[15]. l Expression of X-linked genes when inside or outside PMDs. Genes were grouped according X-inactivation status (E, escape; S, subject to XCI; VE, variably escaping; PAR, pseudoautosomal region)[41]. All boxplots in this figure represent the median and 25th and 75th percentiles, whiskers 1.5 times the interquartile range, outliers are not shown
Fig. 4PMD methylation in normal tissues and tumors of various tissues. a Mean PMD methylation of normal tissues and tumors of various tissue types. Each dot represents one sample. b Hierarchical clustering of tumor samples based on genomic distribution of their PMDs. For breast tumors (this study) the ER status is indicated