| Literature DB >> 28855971 |
Daniel A Newkirk1,2, Yen-Yun Chen1,3, Richard Chien1,4, Weihua Zeng1,5, Jacob Biesinger2,6, Ebony Flowers1,7,8, Shimako Kawauchi9, Rosaysela Santos9, Anne L Calof9, Arthur D Lander10, Xiaohui Xie2, Kyoko Yokomori1.
Abstract
BACKGROUND: Cornelia de Lange syndrome (CdLS) is a multisystem developmental disorder frequently associated with heterozygous loss-of-function mutations of Nipped-B-like (NIPBL), the human homolog of Drosophila Nipped-B. NIPBL loads cohesin onto chromatin. Cohesin mediates sister chromatid cohesion important for mitosis but is also increasingly recognized as a regulator of gene expression. In CdLS patient cells and animal models, expression changes of multiple genes with little or no sister chromatid cohesion defect suggests that disruption of gene regulation underlies this disorder. However, the effect of NIPBL haploinsufficiency on cohesin binding, and how this relates to the clinical presentation of CdLS, has not been fully investigated. Nipbl haploinsufficiency causes CdLS-like phenotype in mice. We examined genome-wide cohesin binding and its relationship to gene expression using mouse embryonic fibroblasts (MEFs) from Nipbl+/- mice that recapitulate the CdLS phenotype.Entities:
Keywords: Adipogenesis; CdLS; Chromatin immunoprecipitation (ChIP); Chromatin interaction; Chromatin regulation; Cohesin; Gene regulation; Haploinsufficiency; Nipbl
Mesh:
Substances:
Year: 2017 PMID: 28855971 PMCID: PMC5574093 DOI: 10.1186/s13148-017-0391-x
Source DB: PubMed Journal: Clin Epigenetics ISSN: 1868-7075 Impact factor: 6.551
The list of PCR primers
| Unique regions ChIP primers | |
| pax2-F | CTGGCACTGACATCTTGTGG |
| pax2-R | TGGGACCTGTAGTCCTGACC |
| anapc13-F | TCCTAAGCCGTCCTGTAGTCC |
| anapc13-R | GGGTGTCCATCATCTGAGTCC |
| alox8-F | GTATGAGGTGGGCCTGAGTG |
| alox8-R | AAGCCCTGCCTAAATGTGTG |
| ebf1-F | AACTGAGCCTTAGGGGAAGC |
| ebf1-R | TCAGGGTTCAATCTCCAAGG |
| cebpb-F | AGAGTTCTGCTTCCCAGGAGT |
| cebpb-R | GGAAACAGATCGTTCCTCCA |
| fez1-F | GAGGGTGGGACGTATTTCAGT |
| fez1-R | CAGCCTTCTTTCCCTCACAA |
| pcdhb22-F | GCAGTAATGCCAGCAATGG |
| pcdhb22-R | TCCAGTTGGTTGGGTTTCAT |
| RT-qPCR primers | |
| Rnh1-F (Housing keeping gene) | TCCAGTGTGAGCAGCTGAG |
| Rnh1-R (Housing keeping gene) | TGCAGGCACTGAAGCACCA |
| Nipbl-F | AGTCCATATGCCCCACAGAG |
| Nipbl-R | ACCGGCAACAATAGGACTTG |
| Rad21-F | AGCCAAGAGGAAGAGGAAGC |
| Rad21-R | AGCCAGGTCCAGAGTCGTAA |
| Cebpb-F | GCGGGGTTGTTGATGTTT |
| Cebpb-R | ATGCTCGAAACGGAAAAGG |
| Cebpd-F | ACAGGTGGGCAGTGGAGTAA |
| Cebpd-R | GTGGCACTGTCACCCATACA |
| Ebf1-F | GCGAGAATCTCCTTCAAGACTTC |
| Ebf1-R | ACCTACTTGCCTTTGTGGGTT |
| Il6-F | TAGTCCTTCCTACCCCAATTTCC |
| Il6-R | TTGGTCCTTAGCCACTCCTTC |
| Avpr1a-F | TGGTGGCCGTGCTGGGTAATAG |
| Avpr1a-R | GCGGAAGCGGTAGGTGATGTC |
| Lpar1-F | ATTTCACAGCCCCAGTTCAC |
| Lpar1-R | CACCAGCTTGCTCACTGTGT |
| Adm-F | TATCAGAGCATCGCCACAGA |
| Adm-R | TTAGCGCCCACTTATTCCAC |
| Cebpb 3C primers | |
| cebpb-promoter | ACTCCGAATCCTCCATCCTT |
| cebpb-region-b | CCTGCCCTGTATCAAAGCAT |
| cebpb-region-a | CTGCCCAAATCAGTGAGGTT |
| cebpb-region-c | CCTCTGTGAGGTCTGGTCGT |
| cebpb-promoter-R | GGTGGCTGCGTTAGACAGTA |
| cebpb-region-a-R | GTTGTATCCCAAGCCAGCTC |
| cebpb-region-b-R | CTCCCCACTCTGTTCAGGAC |
| cebpb-region-c-R | TAACAGCAGGGATGGGTTCT |
Fig. 5Correlation of cohesin binding and gene expression changes in mutant MEFs. a KS test indicating the degree of cohesin binding to genes changing expression in Nipbl+/− MEFs. X-axis represents all 13,587 genes from the microarray data [19] ranked by absolute fold expression changes from biggest on the left to the smallest on the right in the left panel. Fold changes are shown in different colors as indicated on the side. In the middle panel, gene expression changes were ranked from negative to positive with the color scale shown on the side. Both color scales apply to the rest of the figure. The Y-axis is the running enrichment score for cohesin binding (see the "Methods" section for details). Distribution of cohesin-bound genes among 13,587 genes examined is shown as a beanplot [62] at the top, and the number of cohesin-bound genes and p values are shown underneath. The schematic diagram showing the definition of the gene regions, promoter (2.5 kb upstream and 0.5 kb downstream of TSS), gene body, and downstream (2.5 kb downstream and 0.5 kb upstream of TTS) regions is shown on the right. b Similar KS test analysis as in a, in which cohesin binding to the promoter, gene body, and downstream regions are analyzed separately. c Genes are ranked by expression changes from positive on the left to negative on the right. Fold changes are shown by different colors as indicated on the right. CTCF binding to promoter regions (GSE22562) [40] was analyzed for a comparison. d Lack of correlation between the mutant expression changes and randomly chosen genes are shown on the right as a negative control
Fig. 1Global decrease of cohesin binding to chromatin in Nipbl heterozygous mutant MEFs. a Cohesin-binding sites identified by ChIP-sequencing using antibody specific for Rad21 in control wild type and Nipbl+/− MEFs. Peak calling was done using AREM [50]. The p value and FDR are shown. b Heatmap comparison of Rad21 ChIP-seq data with those of SMC1, SMC3, SA1, and SA2. Rad21 peaks in the wild type MEFs are ranked by strongest to weakest and compared to the ChIP-seq data of SMC1, SMC3, SA1, and SA2 in MEFs (GSE32320) [67] in the corresponding regions. The normalized (reads per million) tag densities in a 4-kb window around each Rad21 peak are plotted, with peaks sorted from the highest number of tags in the wild type MEFs to the lowest. c Histogram of cohesin peak widths in wild type and mutant MEFs, indicating the number of peaks in a given size range. The segmentation of the histogram is at 100 bp intervals. The median value is indicated with a vertical black line and labeled. d Scatter plot of histone H3 ChIP-seq tag counts in wild type and mutant MEFs in 500 bp bins across the mouse genome. The values are plotted in log reads per million (RPM). e Histogram showing the distribution of total peaks called. A comparable number of reads to the Nipbl+/− mutant dataset (i.e., 4,740,463) were sub-sampled from the wild type dataset, and peaks called using only the sub-sampled reads. This process was performed 1000 times to produce the histogram above. Mean values with standard deviations are shown. f Heatmap analysis of cohesin binding in wild type (WT) MEFs and corresponding peak signals in Nipbl+/− MEFs. The normalized (reads per million) tag densities in a 4-kb window around each peak are plotted, with peaks sorted from the highest number of tags in the wild type to the lowest. Peaks are separated into two categories, those that are found only in wild type (“WT only”) and those that overlap between wild type and Nipbl+/− (“common”). Preimmune IgG ChIP-seq signals in the corresponding regions are also shown as a control. The color scale indicates the number of tags in a given region. g Histogram of the ratio between normalized (reads per million total reads) wild type and mutant reads in peaks common to both. Positive values indicate more wild type tags. The black line indicates the mean ratio between wild type and mutant tag counts
Nipbl and Rad21 depletion levels in mutant and siRNA-treated MEFs
| Gene | Nipbl+/− mutant | Nipbl siRNA | Rad21 siRNA |
|---|---|---|---|
| Nipbl | 0.68 ± 0.003 | 0.68 ± 0.001 | 1.04 ± 0.051 |
| Rad21 | 0.94 ± 0.021 | 0.99 ± 0.021 | 0.26 ± 0.018 |
| CTCF | 0.95 ± 0.050 | 0.96 ± 0.066 | 0.84 ± 0.074 |
Fig. 6Cohesin-binding signals at specific gene regions. a Cohesin-binding site distribution in cohesin target genes as defined in Table 1. Cohesin binding to the promoter (P), gene body (B), and/or downstream region (D) are indicated for each cohesin target gene in red (upregulated) and blue (downregulated) boxes. b Signal intensity profiles of Rad21 ChIP-seq at specific gene regions in wild type and Nipbl mutant MEFs. Preimmune IgG ChIP-seq signals are shown as a negative control. Experimentally determined CTCF-binding peaks in MEFs [40] are also indicated. Examples of genes that are bound by cohesin and changed expression in Nipbl+/− MEFs (top) and those genes that did not change expression (bottom) are shown. No cohesin-binding peaks were found at the Srp14 gene region
Fig. 3Nipbl reduction decreases cohesin binding. a Manual ChIP-q-PCR of cohesin-binding sites at unique gene regions and repeat regions using anti-Rad21 antibody (top left) compared to histone H3 (bottom) in Nipbl+/− mutant and wild type MEFs. Representative examples of Nipbl ChIP are also shown (top, right). “Plus sign” indicates CTCF binding, and “asterisk” indicates the presence of motif. PCR signals were normalized with preimmune IgG (pre-IgG) and input. *p < 0.05. b Western blot analysis of control, Nipbl, or Rad21 siRNA-treated cells is shown using antibodies indicated. Depletion efficiency and specificity of Nipbl siRNA were also examined by RT-q-PCR (Table 2). Nipbl protein depletion was estimated to be ~ 80% (siNipbl-1) and 60% (siNipbl-2) according to densitometirc measurement (lanes 2 and 3, respectively). Comparable ChIP results were obtained by the two Nipbl siRNAs (data not shown). c Similar manual ChIP-q-PCR analysis as in a in control and Nipbl siRNA (siNipbl-1)-treated MEFs
Fig. 2Most of cohesin-binding sites contain CTCF motifs. a De novo motif search of cohesin-binding sites using MEME. The CTCF motifs identified at the cohesin-binding sites in WT and mutant MEFs are compared to the CTCF motif obtained from CTCF ChIP-seq data in MEFs (GSE22562) [40]. E values are 5.5e−1528 (cohesin-binding sites in WT MEFs), 6.6e−1493 (cohesin-binding sites in Nipbl MEFs), and 2.6e−1946 (CTCF-binding sites in MEFs), respectively. b Overlap of cohesin binding sites with CTCF binding sites. The number in the parenthesis in overlapping regions between cohesin and CTCF binding represents the number of CTCF-binding peaks. c Presence of CTCF motifs in cohesin only and cohesin/CTCF-binding sites. Shaded area represents binding sites containing CTCF motifs defined in a (FDR 4.7%). d The CTCF motif score distribution for all cohesin peaks that overlap with a CTCF peak (top) and that do not overlap with a CTCF peak (bottom). Note that the X axis is discontinuous and scores less than 200 are placed in the single bin in each figure. For peaks that contained multiple CTCF motifs, we report the maximum score for the peak. The score threshold (900 with FDR 4.7%) is marked in each figure. e Heatmap comparison of cohesin ChIP-seq tags in WT MEFs and Nipbl mutant MEFs with CTCF ChIP-seq tags at the corresponding regions in wild type MEFs [40] as indicated at the top. The normalized (reads per million total reads) tag densities in a 4-kb window (± 2 kb around the center of all the cohesin peaks) are plotted, with peaks sorted by the number of cohesin tags (highest at the top) in WT MEFs. Tag density scale from 0 to 20 is shown. f Percentages of CTCF binding in cohesin-binding sites common or unique to WT MEFs
Fig. 4Cohesin-binding site distribution in the genome in MEFs. a Percentage distribution of cohesin peaks in genomic regions. “Promoter” and “Downstream” is defined as 2500 bp upstream of the transcription start site (TSS) and 500 bp downstream of the TSS, and “Downstream” represents 500 bp upstream of transcription termination site (TTS) and 2500 bp downstream of TTS. The 3′ and 5′ untranslated regions (UTRs) are defined as those annotated by the UCSC genome browser minus the 500 bp interior at either the TSS or TTS. When a peak overlaps with multiple regions, it is assigned to one region with the order of precedence of promoter, 5′ UTR, Intron, Exon, 3’UTR, downstream, and intergenic. b Enrichment of cohesin peaks across genomic regions as compared to randomly sampled genomic sequence. A comparable number of peaks (25,407 and 16,528 peaks in wild type and mutant MEFs, respectively), with the same length as the input set, were randomly chosen 1000 times and the average used as a baseline to determine enrichment in each genomic region category
Gene expression changes and cohesin-binding status
| Total | Cohesin binding | |||||
|---|---|---|---|---|---|---|
| Gene region | Promoter | Gene body | Downstream | None | ||
| Total | 218 | 115 | 61 | 83 | 20 | 103 |
| Upregulated | 62 | 30 | 14 | 22 | 6 | 32 |
| Downregulated | 156 | 85 | 47 | 61 | 14 | 71 |
(Fold change > 1.2; p value < 0.05)
Ontology analysis of cohesin target genes
| Biological process |
| Enrichment | Gene number | Expected number | Genes |
|---|---|---|---|---|---|
| Altered gene expression in Nipbl+/− MEFs associated with cohesin binding to the promoters | |||||
| Development | 2.96E−04 | 2.38 | 18 | 7.55 | Avpr1a, Dner, Fgf7, Thbd, Hoxa5,Hoxb5, Cebpa, Cebpb, Rcan2, Lama2, Ebf1, Klf4, Hunk, Tgfb3, Irx5, Odz4, Ptpre, Lpp |
| Metabolism | 2.90E−03 | 1.50 | 33 | 22 | Dner, Acvr2a, Hoxa5, Hoxb5, Trib2, Satb1, Cebpa, Cebpb, Gstm2, Amacr, Cd55, Dhrs3, Grk5, Ell2, Serpinb1a, Cyp1b1, Chst1, Hsd3b7, Aldh1a7, Npr3, Man2a1, Klf4, Hunk, Prkd1,Prdx5, Ercc1, Irx5, Odz4, Sox11, Ptpre, Ccrn4l, Rgnef, Bcl11b |
| Cell communication | 2.96E−03 | 1.82 | 21 | 11.53 | Dner, Acvr2a, Trib2, Cd55, Grk5, Hunk, Odz4, Ptpre, Rgnef, Avpr1a, Fgf7, Thbd, Fam43a, Rcan2, Socs3, Lama2, Cxcr7, Tpcn1, Rerg, Tgfb3, Lpp |
| Immune system | 6.44E−03 | 2.06 | 14 | 6.81 | Dner, Cd55, Hunk, Ptpre, Thbd, Lama2, Cxcr7, Cebpa, Cebpb, Gstm2, Klf4, Prdx5, Fcgrt, Cd302 |
| Altered gene expression in Nipbl+/− MEFs associated with cohesin binding to the gene regions | |||||
| Immune system | 6.60E−06 | 2.34 | 30 | 12.83 | Klf4, Dner, Thbd, Cd55, Lama2, Cd302, Cxcr7, Hunk, Cebpa, Cebpb, Gstm2, Fcgrt, Prdx5, Fmod, Crlf1, Prelp, Svep1, Plac8, Heph, Swap70, Mxra8, Sdc2, Colec12, Pcolce2, Flt4, Gbp1, Hck, Dusp14, Cd109, Ptpre |
| Cell adhesion | 1.33E−05 | 3.05 | 19 | 6.22 | Dner, Cd55, Lama2, Fmod, Prelp, Svep1, Plac8, Heph, Mxra8, Sdc2, Colec12, Pcolce2, Flt4, Hck, Ptpre, Rerg, Vcan, Odz4, Rgnef |
| Cell communication | 1.65E−05 | 1.89 | 41 | 21.72 | Dner, Cd55, Lama2, Fmod, Prelp, Svep1, Heph, Sdc2, Colec12, Pcolce2, Flt4, Hck, Ptpre, Rerg, Vcan, Odz4, Rgnef, Thbd, Cxcr7, Hunk, Crlf1, Dusp14, Cd109, Rcan2, Socs3, Fam43a, Trib2, Grk5, Tpcn1, Avpr1a, Fgf7, Acvr2a, Figf, Myh3, Tob1, Acvrl1, Moxd1, Tgfb3, Lpp, Wnt4 |
| Development | 4.81E−05 | 2.11 | 30 | 14.22 | Dner, Lama2, Fmod, Prelp, Heph, Sdc2, Colec12, Pcolce2, Flt4, Ebf1, Hck, Ptpre, Vcan, Odz4, Thbd, Hunk, Crlf1, Rcan2, Socs3, Avpr1a, Fgf7, Figf, Myh3, Tgfb3, Lpp, Klf4, Cebpa, Cebpb, Hoxa5, Hoxb5, Irx5 |
| Metabolism | 1.91E−03 | 1.38 | 57 | 41.44 | Dner, Heph, Pcolce2, Flt4, Hck, Ptpre, Odz4, Hunk, Klf4, Cebpa, Cebpb, Hoxa5, Hoxb5, Irx5, Cd55, Svep1, Rgnef, Dusp14, Cd109, Trib2, Grk5, Acvr2a, Acvrl1, Moxd1, Prdx5, Swap70, Satb1, Amacr, Dhrs3, Ell2, Npr3, Man2a1, Prkd1, Cyp1b1, Serpinb1a, Chst1, Hsd3b7, Aldh1a7, H6pd, Serpine2, Cyp7b1, P4ha2, Larp6, Mrps11, Aox1, Hdac5, Cpxm1, Eno2, Sox11, Prkcdbp, Ccrn4l, Ercc1, Pqlc3, Bcl11b |
Biological processes enriched in cohesin target genes with cohesin binding at either promoters or gene regions. “Gene number” is the number of cohesin target genes that belong to a specific category; “Expected number” is the expected gene numbers that belong to a specific category at random
Fig. 7Cohesin plays a direct role in adipogenesis gene regulation. a RT-q-PCR analysis of gene expression changes in Nipbl+/− mutant MEFs and MEFs treated with siRNA against Nipbl and Rad21 (*p < 0.05, **p < 0.01). Cohesin-binding status is also shown. P: promoter, B: gene body, and D: downstream as in Fig. 5 with the exception of IL6. For IL6, the cohesin-binding site in the downstream region is 3 kb away from TSS. b A schematic diagram of genes involved in the adipogenesis pathway. Genes that changed expression in Nipbl+/− mutant MEFs are circled, and those bound by cohesin and examined in a are shown with shaded circles
Fig. 8Enrichment of H3K4me3 at the promoters of cohesin-bound genes. a Density of histone modifications within 10 kb of cohesin peaks found in the promoter or downstream regions. Histone methylation data was downloaded from NCBI (GEO: GSE26657). Tags within a 10-kb window around cohesin peaks located in a promoter region were counted and normalized to the total number of tags (reads per million) and used to generate a density plot. b Expression status of cohesin target genes. Genes are ranked by their expression status (shown as a z-score) in wild type MEFs (lane 2), and those genes with cohesin binding at the promoter regions are indicated by yellow lines (lane 1). The expression status of the corresponding genes in Nipbl mutant cells is also shown (lane 3), and the cohesin target genes (Table 2) (either upregulated (lane 4) or downregulated (lane 5) in mutant cells) are indicated by black lines. Genes in the adipogenesis pathway are indicated with arrows on the right. Five clusters (I through V) of 200 cohesin-bound genes each in wild type MEFs according to the expression levels are indicated on the left, which were used for the analysis in c and d. c The numbers of cohesin target genes containing histone marks in the promoter were tallied for the categories I through V from b. As a control, the cohesin-free gene directly below each cohesin target gene was also tallied and plotted. H3K4me3, H3K9me3, H3K27me3, bivalent (H3K4me3 and H3K27me3), and the promoters with none of these marks (“None”) are indicated. There is almost no signal of H3K9me3 in these categories. d Enrichment plot of H3K4me3, H3K27me3, and bivalent (H3K4me3 and K27me3) in promoters of cohesin-bound genes versus cohesin-free genes in the five expression categories as in c is shown
Fig. 9The long distance interaction involving the Cebpβ promoter is decreased in Nipbl+/− MEFs. a Comparison of Rad21-binding peaks in wild type (WT) and Nipbl+/− mutant MEFs with SMC1 and SMC3, CTCF, and Mediator subunit 12 (Med12) [40] (GSE22562), pol II (GSE22302), H3K4me3 (GSE26657), and H3K4me1 (GSE31039) in WT MEFs in the genomic region surrounding the Cebpβ gene. The positions of primers for the 3C analysis (a, b, c and the promoter as the bait) are indicated. These regions were chosen based on the overlapping peaks of cohesin and CTCF, and/or cohesin, pol II and Med12 with H3K4me1/me3. The interaction observed by 3C in (b) is shown in a solid line and other interactions examined but weak are shown in dotted lines at the top. b The 3C analysis of Cebpβ promoter interactions with regions a, b, and c (as indicated in a). The chromatin interactions between WT and Nipbl mutant MEFs (top panel) and between control and Nipbl siRNA-treated MEFs (bottom) were quantified and normalized as described in the "Methods" section. *p value < 0.01. **p value < 0.05