| Literature DB >> 32661416 |
Shuang G Zhao1,2,3,4,5, William S Chen3,4,6, Haolong Li3,4, Adam Foye4,7, Meng Zhang3,4, Martin Sjöström3,4, Rahul Aggarwal4,7, Denise Playdle4,7, Arnold Liao8, Joshi J Alumkal9,10, Rajdeep Das3,4, Jonathan Chou3,4,7, Junjie T Hua11, Travis J Barnard3,4, Adina M Bailey4,7, Eric D Chow12,13, Marc D Perry3,4,7, Ha X Dang14,15,16, Rendong Yang17, Ruhollah Moussavi-Baygi3,4, Li Zhang4,7, Mohammed Alshalalfa3, S Laura Chang3, Kathleen E Houlahan18,19,20, Yu-Jia Shiah18, Tomasz M Beer9,21, George Thomas9,22, Kim N Chi23,24, Martin Gleave23, Amina Zoubeidi23, Robert E Reiter25, Matthew B Rettig25,26, Owen Witte27, M Yvonne Kim28, Lawrence Fong4,7, Daniel E Spratt1, Todd M Morgan2,5,29, Rohit Bose4,7,30,31, Franklin W Huang4,7, Hui Li3,4, Lisa Chesner3,4, Tanushree Shenoy4,7, Hani Goodarzi12,30, Irfan A Asangani32, Shahneen Sandhu33, Joshua M Lang34, Nupam P Mahajan16,35, Primo N Lara36,37, Christopher P Evans37,38, Phillip Febbo8, Serafim Batzoglou8, Karen E Knudsen39, Housheng H He11,19, Jiaoti Huang40, Wilbert Zwart41, Joseph F Costello28, Jianhua Luo42, Scott A Tomlins5, Alexander W Wyatt23, Scott M Dehm43,44, Alan Ashworth4,7, Luke A Gilbert4,30, Paul C Boutros19,20,25, Kyle Farh8, Arul M Chinnaiyan2,5,29,45,46,47, Christopher A Maher14,15,16,48, Eric J Small4,7, David A Quigley4,30,49, Felix Y Feng50,51,52,53.
Abstract
Although DNA methylation is a key regulator of gene expression, the comprehensive methylation landscape of metastatic cancer has never been defined. Through whole-genome bisulfite sequencing paired with deep whole-genome and transcriptome sequencing of 100 castration-resistant prostate metastases, we discovered alterations affecting driver genes that were detectable only with integrated whole-genome approaches. Notably, we observed that 22% of tumors exhibited a novel epigenomic subtype associated with hypermethylation and somatic mutations in TET2, DNMT3B, IDH1 and BRAF. We also identified intergenic regions where methylation is associated with RNA expression of the oncogenic driver genes AR, MYC and ERG. Finally, we showed that differential methylation during progression preferentially occurs at somatic mutational hotspots and putative regulatory regions. This study is a large integrated study of whole-genome, whole-methylome and whole-transcriptome sequencing in metastatic cancer that provides a comprehensive overview of the important regulatory role of methylation in metastatic castration-resistant prostate cancer.Entities:
Mesh:
Year: 2020 PMID: 32661416 PMCID: PMC7454228 DOI: 10.1038/s41588-020-0648-8
Source DB: PubMed Journal: Nat Genet ISSN: 1061-4036 Impact factor: 38.330
Figure 1:CpG Methylator Phenotype (CMP)
a, Sample-level summary of hypo-methylated region (HMR) frequency and somatic alterations in 100 independent mCRPC samples. Bar plots show HMR counts within genomic features (HMR count), counts of HMRs overlapping with CpG islands/shores/shelves (CpG overlap), percent of the genome with DNA copy number alterations (CNA %), somatic mutations per megabase (Mutations / Mb), and counts of structural variants (SV count). CMP samples labeled in blue. TFBS, transcription factor binding site. b, Hierarchical clustering of the 10% most variable recurrent HMRs in 100 mCRPC samples. Blue dendrogram denotes CMP samples. c, HMR count per sample in thousands in non-CMP (N=78) and CMP (N=22). Significance was assessed with two-sided Wilcoxon test. d, Percent of CpGs methylated at loci harboring recurrent HMRs in Non-CMP (N=78) and CMP (N=22), plotted and assessed as in (c). e, rHMRs located in CpG islands, shores, and shelves, count per sample in thousands, plotted and assessed as in (c). f, rHMRs located in open seas, count per sample in thousands, plotted and assessed as in (c). Boxplots show the median, first, and third quartiles, and outliers are shown if outside 1.5x the inter-quartile range.
Figure 2:DNA methylation valleys (DMVs)
a, Top: Sample-level log2 odds ratio calculated from the number of DMVs which overlap H3K4me3 vs. H3K27me3 sites. Lower values favor H3K27me3, higher values favor H3K4me3. Bottom: Sample-level count of DMVs in order matching top panel. b, Mean percent methylation across the AR locus for benign prostate (N=4), localized prostate cancer (N=5), mCRPC adenocarcinoma (N=95), and t-SCNC samples (N=5). Vertical black lines show the location of the previously identified AR enhancer[17]. The vertical green and red lines show the TSS and transcriptional terminator of the androgen receptor, respectively.
Figure 3:Methylation associated with prostate cancer-specific genes
a, Variability in gene expression levels versus the correlation between gene expression and methylation. Expression variability was calculated as standard deviation (Log2(TPM+1), and correlation calculated at the most significant promoter/gene body eHMR for each gene. Y-axis box-plot shows gene expression variability for prostate cancer-specific genes versus all other genes. X-axis box-plot shows correlation of methylation with gene expression of prostate cancer-specific genes versus all other genes. Significance was assessed with two-sided Wilcoxon test, N=169 vs. 51502. Boxplots show the median, first, and third quartiles, and outliers are shown if outside 1.5x the inter-quartile range. b, Sample-level gene expression levels compared to the presence of DNA alterations and methylation at the most significant promoter/gene body eHMR. Alterations predicted to be activating (SLC45A3, SPON2, TDRD1, SCHLAP1) or inactivating (TMEFF2, PCAT14) are shown[17]). Significance of methylation levels was assessed by ANOVA comparing a model predicting gene expression from DNA alterations alone to a second model with methylation as an added factor. N=100 independent mCRPC samples. CN, copy number.
Figure 4:Methylation association with the androgen response pathway
a, Percentage of genes in MSigDB Hallmark pathways for which methylation predicted expression independently from DNA alterations in a linear model. An asterisk indicates significant enrichment (two-sided FDR ≤ 0.05) relative to the set of all housekeeping genes. Significance was assessed with a two-sided Fisher’s exact test. N=100 independent mCRPC samples.
b, Sample-level gene expression levels compared to the presence of DNA alterations and methylation at the most significant promoter/gene body eHMR. Alterations predicted to be activating (KLK3, FOLH1) or inactivating (NKX3–1) are shown[17]). Significance of methylation levels was assessed by ANOVA comparing a model predicting gene expression from DNA alterations alone to a second model with methylation as an added factor, N=100 independent mCRPC samples.
c, HMRs, correlation between methylation in at loci harboring recurrent HMRs and AR expression, ChIP-seq peaks (H3K27ac[33], AR[36], ERG[38], FOXA1[37], HOXB13[37]), and ChIA-PET interactions (AR and ERG)[39] at the AR locus. Stars denote HMRs at which methylation was associated with AR expression (eHMRs), colored black for previously reported AR upstream enhancer, blue for the AR promoter, gold for new putative AR regulatory regions. Significance was assessed with a two-sided Spearman’s correlation test, N=100 independent mCRPC samples. “Primary” in the ChIP-seq tracks indicates localized primary prostate cancer.
Figure 5:Methylation association with TMPRSS2-ERG and MYC
a, HMRs, correlation between methylation in loci harboring recurrent HMRs and ERG expression, and ChIP-seq peaks (H3K27ac[34], AR[36], ERG[38], FOXA1[37], HOXB13[37]) at the TMPRSS2 locus. Significance was assessed with two-sided Spearman’s correlation, N=100 independent mCRPC samples. TMPRSS2 isoform 204 was not shown as its TSS was ~20Kbp upstream of the other 5 protein coding isoforms.
b, Observed ERG expression in TMPRSS2-ERG fusion positive mCRPC and ERG expression predicted in those tumors using two linear models: one including AR expression and AR mutations and another including AR expression, AR mutations, and methylation at the TMPRSS2 promoter and upstream locus. Significance was assessed by a two-sided ANOVA (N=41 independent fusion positive samples).
c, HMRs, correlation between methylation in recurrent HMRs and MYC expression, ChIP-seq peaks (H3K27ac[34]), and ChIA-PET interactions (AR and ERG)[39] at the MYC-PVT1 locus. Significance was assessed with two-sided Spearman’s correlation, N=100 independent mCRPC samples. “Primary” in the ChIP-seq tracks indicates localized primary prostate cancer.
d, Observed MYC expression and MYC expression predicted in those tumors using two linear models: one including MYC copy number alone and another including MYC copy number and methylation at the MYC-PVT1 locus. Significance was assessed by a two-sided ANOVA (N=100 independent mCRPC samples).
Figure 6:Genome-wide analysis of differential methylation
a, Differentially methylated regions (DMRs) and mutation frequency in mCRPC. Ideogram shows, for each chromosome, from left to right: DMRs comparing primary prostate cancer (N=5) to benign prostate (N=4), DMRs comparing mCRPC (adenocarcinoma, N=95) to primary prostate cancer (N=5), and mutational frequency in 1Mbp windows in the mCRPC samples (excluding two hyper-mutated samples[17]). Maximum bar height in mutation frequency represents an average mutational frequency ≥10 mutations per Mb per sample.
b, Differential methylation (comparing mCRPC (adenocarcinoma) to benign prostate) compared to mutational frequency (excluding 2 hyper-mutated samples[17]), N=98. Each point represents a fixed 1Mbp window of the genome, and all points collectively represent all 1 Mb windows across the genome excluding centromeres and telomeres.
c, Average differential methylation values across all sites identified from publicly available ChIP-seq data (AR[36], ERG[38], FOXA1[37], HOXB13[37], H3K27ac[35]). For each ChIP-seq peak, a 20Kbp window centered on midpoint of the peak (x=0) was assessed for differential methylation between mCRPC adenocarcinoma vs. benign prostate samples.