| Literature DB >> 30097071 |
Jianzhong Su1,2,3, Yung-Hsin Huang4,5, Xiaodong Cui6, Xinyu Wang7, Xiaotian Zhang8, Yong Lei4, Jianfeng Xu6, Xueqiu Lin6, Kaifu Chen6, Jie Lv6, Margaret A Goodell9,10, Wei Li11.
Abstract
BACKGROUND: Cancers have long been recognized to be not only genetically but also epigenetically distinct from their tissues of origin. Although genetic alterations underlying oncogene upregulation have been well studied, to what extent epigenetic mechanisms, such as DNA methylation, can also induce oncogene expression remains unknown.Entities:
Keywords: DNA methylation; Gene-body; Homeobox oncogene; Hypermethylation; Methylation editing; Pan-cancer analysis; Transcription; Whole-genome bisulfite sequencing
Mesh:
Substances:
Year: 2018 PMID: 30097071 PMCID: PMC6085761 DOI: 10.1186/s13059-018-1492-3
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Fig. 1Human reference UMRs. a The statistical framework for the identification of conserved reference UMRs using WGBS data from 30 normal tissues and 35 solid tumors (Online Methods). b Cumulative distribution of UMR width for normal and tumor samples. Reference canyons (length > 3.5 kb) account for about 6% of all reference UMRs. The remaining short UMRs are regarded as control cUMRs. c Percentage of reference canyons/cUMRs covered by CpG islands (downloaded from UCSC). For each reference UMR, the percentage of UMR covered by the CpG islands is defined as the length of the UMR covered by CpG islands (either partially or entirely) divided by the total length of the UMR. Random represents 46,384 randomly selected regions for human genome that have the same length distribution, but without overlapping with reference UMRs. d Percentage of reference canyons/cUMRs covered by DNase I hypersensitivity clusters (DNaseI cluster) 125 cell types [23], transcription factor binding site clusters (TFBS cluster) of 161 TFs in 91 cell types [24], and enhancer clusters of H3K27ac peak regions in 88 human cell types [25]
Fig. 2Pan-cancer differentially methylated UMRs. a The statistical framework for the identification of pan-cancer hypermethylated or hypomethylated UMRs. Red (blue) lines represent significant methylation increase (decrease) (p value < 0.001corrected by BH) in each tumor type and in pan-cancer. Tumors types include bladder urothelial carcinoma (BLCA), breast invasive carcinoma (BRCA), glioblastoma multiforme (GBM), lung adenocarcinoma (LUAD), lung squamous cell carcinoma (LUSC), stomach adenocarcinoma (STAD), and uterine corpus endometrial carcinoma (UCEC). b Pan-cancer differential UMRs found in individual tumor type. c Percentage of human reference canyons/cUMRs that are either hypermethylated or hypomethylated in pan-cancer. d DNA methylation genome browser tracks of a pan-cancer hyper-methylated canyon around homeobox gene HOXB13
Fig. 3Canyon hypermethylation is associated with increased expression of homeobox oncogenes. a DAVID functional annotation of pan-cancer hypermethylated, hypomethylated, and conserved canyons/cUMRs. The p values were adjusted by the BH method. b Oncogene or tumor suppressor [31] gene enrichment levels in the pan-cancer hypermethylated/hypomethylated canyon/cUMRs. P values were computed by Fisher’s exact test. c Venn diagram showing the overlap between pan-cancer canyon hypermethylated homeobox genes (m-homeobox genes) and oncogenes. P value was computed by Fisher’s exact test. d Boxplots showing mean gene expression distribution of five categories gene sets of pan-cancer hypermethylated/hypomethylated canyons/cUMRs associated genes and all genes in the uterine tumor (427) and normal (22) samples. P values were calculated by Wilcoxon signed-rank test. Boxplots represent the interquartile range (25–75%), with the median; whiskers correspond to 1.5 times the interquartile range. e Mean gene expression distribution of the m-homeobox genes and the other homeobox genes between uterine tumors (427) and normal (22) samples. The average expression of each gene was first computed across 22 normal samples or 427 tumor samples. The boxplot shows the average expression of all genes
Fig. 4Hypermethylation of gene-body but not promoter within canyon is associated with increased gene expression. a DNA methylation of upregulated and downregulated pan-cancer hypermethylated canyon genes between a uterine tumor (Data ID: TCGA-AX-A1CI-01A) and its matched normal sample (TCGA-AX-A1CI-11A). b Increased methylation in gene-body and low methylation around TSS for HOXB13. c Gene expression and DNA methylation in HOXB13 promoter (chr17: 46,806,000-46,807,000) and gene-body (chr17:46,802,200-46,805,999) between uterine normal (22) and tumor (427) samples. P values were calculated by Wilcoxon signed-rank test. d Correlation between HOXB13 expression and methylation level at gene promoter (left) and gene-body (right) in uterine normal (22) and tumor (427) samples. The rho and p value were computed by Spearman’s rank test. e Locus-specific spearman’s rank correlation of gene expression change and DNA methylation change (see “Methods”) between a uterine tumor and its matched normal sample in Fig. 4a for pan-cancer hypermethylated canyon genes (left), pan-cancer hypermethylated cUMR genes (middle), and all differentially expressed genes (right)
Fig. 5Gene-body canyon hypermethylation by dCas9-SunTag-DNMT3A can directly increase oncogene DLX1 expression. a Schematic graph for dCas9-SunTag-DNMT3A DNA methylation editing system. De-activated Cas9 (dCas9) was fused to SunTag epitopes and single-chain variable fragment (scFv) was fused to GFP and DNMT3A to methylate the gene-body canyon and activate gene expression. b Genome browser tracks of gene-body hypermethylated canyons around homeobox gene DLX1 across 30 normal and 35 tumor samples. c CpG DNA methylation level dramatically increased at the gene-body of DLX1 after adding guide RNA DLX1 (gray bar) with induction of dCas9SunTag and scFvDNMT3A, yet the methylation level in gene promoter was not affected. CpG DNA methylation level was calculated based on two biological replicates. Error bars represent mean ± s.e.m. of biological replicates. d qPCR shows significant increase of gene expression of DLX1 with induction of dCas9SunTag and scFvDNMT3A compared to the same cells without induction. P value was computed by two-sided Student’s t-test