| Literature DB >> 34001209 |
Chongze Yuan1,2,3, Haojie Chen4, Shiqi Tu4, Hsin-Yi Huang5, Yunjian Pan1,2,3, Xiuqi Gui4, Muyu Kuang1,2,3, Xuxia Shen3,6, Qiang Zheng3,6, Yang Zhang1,2,3, Chao Cheng1,2,3, Hui Hong1,2,3, Xiaoting Tao1,2,3, Yizhou Peng1,2,3, Xingxin Yao1,2,3, Feilong Meng7, Hongbin Ji8,9, Zhen Shao10, Yihua Sun11,12,13.
Abstract
BACKGROUND: Lung adenocarcinoma (LUAD) is a highly malignant and heterogeneous tumor that involves various oncogenic genetic alterations. Epigenetic processes play important roles in lung cancer development. However, the variation in enhancer and super-enhancer landscapes of LUAD patients remains largely unknown. To provide an in-depth understanding of the epigenomic heterogeneity of LUAD, we investigate the H3K27ac histone modification profiles of tumors and adjacent normal lung tissues from 42 LUAD patients and explore the role of epigenetic alterations in LUAD progression.Entities:
Keywords: Classification model; Core regulators; Epigenome; Lung adenocarcinoma; Super-enhancers
Mesh:
Substances:
Year: 2021 PMID: 34001209 PMCID: PMC8127276 DOI: 10.1186/s13059-021-02376-1
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Fig. 1Distinct H3K27ac profiles in LUAD and normal lung tissues. a Unsupervised hierarchical clustering of H3K27ac profiles for tumor and normal lung tissues of LUAD patients based on pairwise Pearson correlation coefficients (PCCs). b An MA plot of differential H3K27ac-modified sites between tumor and normal tissues, “M” represented log 2 fold change and “A” represented average log 2 signal intensities, sites with |M value| ≥ 1 and adjust p-value ≤ 0.05 defined as differential sites. c A heatmap of the H3K27ac signal in differential sites identified in b. The H3K27ac signal is represented as row-normalized z-scores. d Differential H3K27ac enrichment in super-enhancers (SE) between tumor and normal tissues. Each row represents an SE with a different enrichment between two tissues. SE scores are represented as row-normalized z-scores. Important differential SE-associated genes shown in the right. e Ranked plot for tumor-specific SE-associated TFs. IHC staining validated TFs are indicated with lines. f IHC staining results of 4 tumor samples showed tumor-specific SE-associated SOX9 were highly expressed in tumor. g The functional enrichment of tumor-specific (left) and normal-specific (right) SE-associated genes. h Track plots of the H3K27ac signal distribution in tumor (top) and normal (bottom) samples across the SOX9 (tumor-specific super-enhancer associated genes), CAV1-CAV2 (normal-specific super-enhancer associated genes), and MET (other super-enhancer associated genes) loci. “SOX9-SE” represented this super-enhancer associated with SOX9. “CAV2&CAV1-SE” represented this super-enhancer associated with CAV2 and CAV1. “MET-SE” represented this super-enhancer associated with MET. Heatmap of log2 fold change indicates the H3K27ac signal differences between tumor and normal tissues. i An example of super-enhancer hijacking. Number of junction reads from RNA-seq supported EML4 and ALK gene fusion showed in the top left panel. Model of super-enhancer hijacking through chromosome translocation showed in the top right panel. Track plots of the H3K27ac signal distribution and gene expression in fusion and non-fusion samples across the EML4 and ALK loci (bottom)
Fig. 2Epigenetic heterogeneity differentiates LUAD patient clinical outcomes. a The hyper-variable peaks (HVPs) identified based on the global trend of means and variances. The dots are colored according to the significance of the variance test performed by MAnorm2. Variable peaks with p-value less to 0.01 were defined as tumor hyper-variable peaks. b Venn diagram showed overlap between tumor hyper-variable peaks (hyper-variable peaks identified in tumor samples), peaks upregulated in tumor samples compared to normal samples (previous identified tumor-specific peaks), and normal hyper-variable peaks (hyper-variable peaks identified in normal samples). c The first 5 significant principle components and their correlation with lymph node invasion, gender, and smoking history; asterisk represented significant association (p-value of ANOVA less to 0.05). d Unsupervised hierarchical clustering using PC1 from a PCA on hyper-variable peaks identifying two subgroups, group I and group II. The associations between clinical characteristics and subgroups showed in the bottom, the p-values of rank-sum test were indicated to show the significance of associations. e Survival analysis of the two subgroups: relapse-free survival, RFS (top) and over-all survival, OS (bottom), and p-value of log rank test showed in the plot
Fig. 3Transcriptomic and epigenetic alterations uncovered pathological pathways. a A volcano plot of the gene expression changes between group I (GI) and group II (GII). Genes with adjusted p-value less to 0.05 were defined as DEGs. b Functional enrichment of GII-specific (left) and GI-specific (right) genes. c A heatmap of the H3K27ac signals in group-specific peaks. The data were represented as row-normalized z-scores; each row represented a group-specific peaks, and each column represented a LUAD sample. d Genes ranked based on the correlation of gene expression and PC1 in hyper-variable peaks in tumor samples. The purple and orange bars (bottom) indicate GII-specific and GI-specific distal enhancers or SEs linked genes, respectively. e The convergence of GII-specific distal enhancers on cell cycle genes. f Comparison of ssGSEA-score of cell cycle pathway genes and embryonic stem cell core genes between GI and GII, t-test was performed between two different groups. g DEGs between GI and GII were used to group TCGA samples into GI-like, GII-like, and intermediate groups, and the K-M plot of patients’ survival in three groups, p-value determined by log rank test. ***p < 0.001. h The K-M plot of patients’ survival in GI-like and GII-like LUAD patients in stage I and stage II-IV, p-value determined by log rank test. *p < 0.05. i The distribution of GI-like, GII-like and intermediate samples across different tumor stages in the TCGA samples. j Top 30 bias somatic coding mutations in GI-like and GII-like LUAD patients. The middle panel showed somatic mutation by individuals (column) and by genes (row). The histogram on the top showed the number of mutations in each sample. The histogram on the right showed the differences in mutation frequency between GI-like and GII-like LUAD patients. Genes sorted by the p-value of Fisher-exact test
Fig. 4Co-expression networks for LUAD subgroups and group-specific core regulators. a Co-expression network (bipartite network) constructed in GI (orange) and GII (purple) based on gene expression correlations between differentially expressed transcription factors or epigenetic regulators and their target genes. b Co-expression network of differentially expressed regulators. Core regulators in GI (6 core regulators, left panel) and GII (18 core regulators, right panel), core regulators formed a highly connected clique in the regulator co-expression networks. c Core regulators(red) and other regulators(grey) with their degrees calculated from the bipartite networks in a, p-value determined by rank-sum test. d Upstream differential epigenetic elements participate in regulation in group-specific core regulator, including differential distal enhancers, differential proximal peaks, and differential super-enhancers
Fig. 5In vivo and in vitro validation of potential function of identified TSGs. a A schematic diagram illustrated how core regulators and epigenetic regulatory elements (including H3K27ac-marked distal enhancers, promoters, and super-enhancers) regulated well-known TSGs. Bar plot in the right indicated that there were more active TSGs in GI core regulator target genes than GII core regulator target genes. The p-value of Fisher-exact test showed in the plot. b The strategy used for selecting TSG candidates. Three super-enhancer-associated GI core regulators co-regulated genes were selected as TSG candidates. c Gene expression level of CLU in GI-like, GII-like, and normal samples in TCGA-LUAD cohort. The p-value of t-test showed in the plot. ns, not significant. d Track plots revealed CLU gene expression was regulated by GI upregulated distal enhancers. The CLU gene expression in GI and GII samples was shown on the right. e Real-time PCR quantification and Western blot detection of CLU in CRL-5803 (upper panel) and PC9 (lower panel) cells with or without CLU ectopic expression. f Cell proliferation assay in CRL-5803 (upper panel) and PC9 (lower panel) cells with or without CLU overexpression. g Soft agar colony formation assay in CRL-5803 (upper panel) and PC9 (lower panel) cells with or without CLU overexpression. h Real-time PCR quantification and Western blot detection of CLU in CRL-5872 cells with or without CLU knockdown. i Cell proliferation assay in CRL-5872 cells with or without CLU knockdown. j Soft agar colony formation assay in CRL-5872 cells with or without CLU knockdown. k Representative photos of HE and IHC staining of CLU and Ki-67 in GI and GII samples. l Statistical analyses of CLU and Ki-67 IHC scores in group I and group II samples. Data were shown as mean with S.E.M. Statistical analyses was calculated by two-tailed, unpaired t-test. m The TF binding motifs on the enhancer region. Grey boxes indicate TFs; red arrows, predicted sgRNAs; and yellow arrows, PCR primers for testing knockout efficiency. n Sequencing of the PCR products by reverse (R) primers to validate dual gRNA knockout efficiency. o CLU expression of the indicated sgRNAs by real-time PCR quantification. p Cell proliferation analysis of the indicated sgRNAs