| Literature DB >> 32859263 |
Paul J Hop1,2, René Luijk1, Lucia Daxinger3, Maarten van Iterson1, Koen F Dekkers1, Rick Jansen4, Joyce B J van Meurs5, Peter A C 't Hoen6, M Arfan Ikram7, Marleen M J van Greevenbroek8,9, Dorret I Boomsma10, P Eline Slagboom1, Jan H Veldink2, Erik W van Zwet11, Bastiaan T Heijmans12.
Abstract
BACKGROUND: DNA methylation is a key epigenetic modification in human development and disease, yet there is limited understanding of its highly coordinated regulation. Here, we identify 818 genes that affect DNA methylation patterns in blood using large-scale population genomics data.Entities:
Keywords: Causal inference; Chromatin; DNA methylation; Epigenetic regulation; Functional genomics; Genetic instrumental variable; Pleiotropy; Transcription factor
Mesh:
Substances:
Year: 2020 PMID: 32859263 PMCID: PMC7453518 DOI: 10.1186/s13059-020-02114-z
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Fig. 1Flowchart showing the successive steps leading to the identification of 818 genes that affect DNA methylation in trans
Fig. 2A considerable fraction of the identified genes (N = 308) affected multiple target CpGs in trans. a Each dot represents a gene with trans DNA methylation effects. The x-axis shows the number of affected target CpGs with decreased methylation levels upon increased gene expression, and the y-axis shows the number of affected target CpGs with increased methylation levels upon increased gene expression. The figure in the right upper corner is a zoomed-in version in which only genes that affect less than 25 CpG sites in either direction are displayed. b Bars represent the number of genes with either 1, 2, 3–5, or more than 5 target CpGs. The percentage of genes that are annotated as transcription factors increases with the number of target CpGs
Fig. 3a Enrichment (odds ratio) for transcription factors among identified genes with either 1, 2, 3–5, or more than 5 target CpGs. Error bars represent 95% confidence intervals. b Transcription factor binding site enrichment; each dot represents a transcription factor, with on the x-axis the logarithm of the number of target CpGs for that transcription factor (at a gene-level significance level; P < 1.2 × 10−7), and on the y-axis the odds ratio for the enrichment of the target CpGs in its experimentally determined binding sites (ChIP-seq). The size of the dots represents the significance (FDR), and TFs for which the target CpGs were significantly enriched in its binding sites are colored in blue
Fig. 4a Network for transcription factor NFKB1 and its inhibitor NFKBIE. Gray circles indicate target CpGs, and arrows represent directed associations (i.e., association between GI and DNA methylation levels). Blue lines indicate a positive association between gene expression and DNA methylation levels; red lines indicate a negative association between gene expression levels and DNA methylation levels. b NLRC5 (chromosome 16) was associated with decreased DNA methylation levels at multiple (N = 43) CpG sites in the classical and extended MHC region (chromosome 6). Red lines indicate a negative association between NLRC5 expression levels and DNA methylation levels. The numbers displayed in the lines indicate how many target CpGs the line represents. Gene labels are displayed if one or more target CpGs were associated with the expression of these genes. Blue gene symbols refer to genes negatively correlated with target CpG methylation (implying upregulation by NLRC5), and vice versa for red labels. Asterisks indicate that the GI corresponding to NLRC5 was also (positively) associated with this gene
Fig. 5a CDCA7 (located on chromosome 2) and CDCA7L (located on chromosome 7) both affect genome-wide DNA methylation levels. Blue lines indicate a positive association between CDCA7 expression and trans DNA methylation levels. Green lines indicate a positive association between CDCA7L expression levels and trans DNA methylation levels. b, c Over- or underrepresentation of target CpGs in predicted chromatin states for b CDCA7 and c CDCA7L. Blue bars represent enrichment of CpGs that are significant at a genome-wide significance level (P < 1.4 × 10−11), and gray bars represent enrichment of CpGs that are significant at a gene-level significance level (P < 1.2 × 10−7). BivFlnk, flanking bivalent TSS/enhancer; Enh, enhancer; EnhBiv, bivalent enhancer; EnhG, genic enhancer; Het, heterochromatin; Quies, quiescent; ReprPC, repressed polycomb; ReprPCWk, weak repressed polycomb; TssA, active TSS; TssAFlnk, flanking active TSS; TssBiv, bivalent/poised TSS; Tx, strong transcription; TxFlnk, weak transcription; ZNF/Rpts, ZNF genes and repeats