| Literature DB >> 35542046 |
Yahan Li1, Frimpong Boadu2, Max R Highsmith2, Darren E Hagen3, Jianlin Cheng2, Rocío Melissa Rivera1.
Abstract
Large offspring syndrome (LOS) and Beckwith-Wiedemann syndrome are similar epigenetic congenital overgrowth conditions in ruminants and humans, respectively. We have reported global loss-of-imprinting, methylome epimutations, and gene misregulation in LOS. However, less than 4% of gene misregulation can be explained with short range (<20kb) alterations in DNA methylation. Therefore, we hypothesized that methylome epimutations in LOS affect chromosome architecture which results in misregulation of genes located at distances >20kb in cis and in trans (other chromosomes). Our analyses focused on two imprinted domains that frequently reveal misregulation in these syndromes, namely KvDMR1 and IGF2R. Using bovine fetal fibroblasts, we identified CTCF binding at IGF2R imprinting control region but not KvDMR1, and allele-specific chromosome architecture of these domains in controls. In LOS, analyses identified erroneous long-range contacts and clustering tendency in the direction of expression of misregulated genes. In conclusion, altered chromosome architecture is associated with LOS.Entities:
Keywords: Biological sciences; Genetics; molecular biology
Year: 2022 PMID: 35542046 PMCID: PMC9079005 DOI: 10.1016/j.isci.2022.104269
Source DB: PubMed Journal: iScience ISSN: 2589-0042
Figure 1Differentially methylated regions identified within IGF2R ICR and KvDMR1 in LOS when compared with controls
(A–C) Data are represented as box plots with dots indicating individual samples. Y-axis shows average CpG DNA methylation level (not allelic).
Figure 2Validation of CTCF binding by chromatin immunoprecipitation (ChIP)
(A) PCR amplifications of ChIP products and input genomic DNA of predicted CTCF binding sites within the IGF2R ICR and KvDMR1 and within a region of IGF2R with no predicted CTCF binding site, namely intron 3. PCR amplicons were visualized on a 7% acrylamide gels. 2% input = input genomic DNA after micrococcal nuclease digestion without ChIP; Histone H3 ChIP = positive control; Rabbit IgG ChIP = negative control (for unspecific binding).
(B) Band intensity ratio between CTCF ChIP and 2% input DNA from (A) indicating increased presence of CTCF at IGF2R ICR in LOS and no binding at KvDMR1. Data are represented as mean ± SD. P-values were from t-test.
(C) Allele-specific binding of CTCF at IGF2R ICR shown by Sanger sequencing. Peaks show the intensity of florescence signal for each nucleotide. The nucleotide enclosed in a box denotes a SNP between the maternal (C, blue) and paternal (T, red) alleles.
(D) Increased maternal allele binding of CTCF in LOS samples. Maternal allele ratio of CTCF ChIP and 2% input DNA, and corresponding CTCF/2% input ratio calculated from (C). The high ratio of maternal allele in the 2% input could be caused by micrococcal nuclease digestion during the ChIP procedure, as its efficiency is known to be affected by the status of chromatin compression (Grewal and Elgin, 2002). Data are represented as mean ± SD. P-value was from t-test.
Figure 34C identified allele-specific cis and trans contacts with IGF2R ICR
Shown are data for the IGF2R_MseI assay.
(A) Comparison of cis contacts between the paternal and maternal alleles in controls. Track ‘4C CPM’ shows the mean normalized count of reads aligned to the genome indicating physical contacts with the bait. Track ‘Peaks’show regions with statistically significant contacts with the bait identified by fourSig software within a group. Track ‘Gain’ (red line) and ‘Loss’ (blue line) indicate regions with statistically significant difference in contacts with the bait regions identified by DESeq2 between alleles. Track ‘CTCF’ shows predicted CTCF binding sites on the sense (gold line) or antisense (black line) strand. The gene annotation is at the bottom of the figure. Mb = megabases. CPM = counts per million reads. M = maternal allele. P = paternal allele.
(B and C) Comparison of allele-specific cis contacts between control, LOS, and DC. Shown are the comparison of LOS and DC groups vs controls. Track ‘Gain’ (red line) and ‘Loss’ (blue line) indicate regions with statistically significant difference in contacts with the bait regions identified by DESeq2 between groups. Track ‘DMR’ shows non-allelic differentially methylated regions identified between the LOS and the control group with the red line indicating increased and blue line indicating decreased methylation levels. All other track information as in (A).
(D and E) Comparison of contacts in far-cis and trans between parental alleles in controls. (D) far-cis contacts (chromosome 9) and (E) trans contacts (interchromosomal) in controls. Circos plots showing DESeq2-identified statistically different contacts with the bait in the paternal vs the maternal allele. Red line indicates increased contacts and blue line indicates decreased contacts.
Figure 44C identified allele-specific cis and trans contacts with KvDMR1
(A) Comparison of cis contacts between the paternal and maternal alleles in controls. Track ‘4C CPM’ shows the mean normalized count of reads aligned to the genome indicating physical contacts with the bait. Track ‘Peaks’show regions with statistically significant contacts with the bait identified by fourSig software within a group. Track ‘Gain’ (red line) and ‘Loss’ (blue line) indicate regions with statistically significant difference in contacts with the bait regions identified by DESeq2 between alleles. Track ‘CTCF’ shows predicted CTCF binding sites on the sense (gold line) or antisense (black line) strand. The gene annotation is at the bottom of the figure. Mb = megabases. CPM = counts per million reads. M = maternal allele. P = paternal allele.
(B and C) Comparison of allele-specific cis contacts between control, LOS, and DC. Shown are the comparison of LOS and DC groups vs controls. Track ‘Gain’ (red line) and ‘Loss’ (blue line) indicate regions with statistically significant difference in contacts with the bait regions identified by DESeq2 between groups. Track ‘DMR’ shows non-allelic differentially methylated regions identified between the LOS and the control group with the red line indicating increased and blue line indicating decreased methylation levels. All other track information as in (A).
(D and E) Comparison of contacts in far-cis and trans between parental alleles in controls. (D) far-cis contacts (chromosome 29) and (E) trans contacts (interchromosomal) in controls. Circos plots showing DESeq2-identified statistically different contacts with the bait in the paternal vs the maternal allele. Red line indicates increased contacts and blue line indicates decreased contacts.
Figure 5Distribution of altered IGF2R ICR far-cis and trans contact across various genomic contexts
Shown are data for the IGF2R_MseI assay. (A, D, G, and I) Circos plots showing DESeq2-identified statistically different contacts with the bait in LOS vs. controls. Red line indicates increased contacts and blue line indicates decreased contacts. (A and D) Paternal and (G and I) maternal allele-specific comparisons. (A and G) far-cis contacts (chromosome 9) and (D and I) trans contacts (interchromosomal). P = paternal allele. M = maternal allele. (B-C, E-F, and H) Figures show the total number of altered far-cis (B-C and H) or trans (E-F) contacts identified and the number and percent of increased (B, E, and H) and decreased (C and F) contacts over each genomic context. For example, ∼55% of increased far-cis contacts in the paternal allele overlap with repetitive sequences (n = 17) and genes (n = 17). In addition, the figures include the number and percent of altered contacts that overlap differentially methylated regions (DMR) and within 100kb of differentially expressed genes (DEG) reported in this work. Analyses were only conducted for conditions with greater than five altered contacts.
Figure 6Location based clustering tendency of differentially expressed genes indicates global alteration of chromosome architecture in LOS
Shown are the genomic locations of differentially methylated regions (DMRs), differentially expressed genes (DEG) and predicted CTCF binding sites that overlap LOS DMRs. In addition, gene density and log10 transformed gene expression level per million bases are shown. The vertical location indicates level of misregulation for DMR and DEG, and sense (external) or antisense (internal) strands of CTCF binding sites. Mb = megabases. Note: track three (expression level) shows that for the most part the level of expression of genes globally is similar between LOS and controls.
| REAGENT or RESOURCE | SOURCE | IDENTIFIER |
|---|---|---|
| CTCF | Cell Signaling Technology | Cat#3418; RRID: |
| Histone H3 | Cell Signaling Technology | Cat#4620; RRID: |
| Normal Rabbit IgG | Cell Signaling Technology | Cat#2729; RRID: |
| DH10B Competent Cells | Thermo Scientific | EC0113 |
| Bovine fetal tissues | This study | N/A |
| DMEM | Gibco | 11885084 |
| Fetal bovine serum | Atlanta Biologicals | S11150H |
| Antibiotic-antimycotic | Gibco | 15240062 |
| HEPES | Sigma-Aldrich | H4034 |
| 0.05% trypsin-EDTA | Gibco | 25300054 |
| DMSO | Sigma-Aldrich | D2650 |
| Decitabine | Sigma-Aldrich | A3656 |
| Phenol:Chloroform:Isoamyl Alcohol | Sigma-Aldrich | P3803 |
| TRIzol™ Reagent | Invitrogen | 15596026 |
| RQ1 RNase-Free DNase | Promega | M6101 |
| random hexamers | Promega | C1181 |
| formaldehyde | Electron Microscopy Sciences | 157-4 |
| T4 DNA Ligase | New England Biolabs | M0202L |
| Proteinase K | Fisher | BP1700 |
| RNase A | Roche | 10109142001 |
| NlaIII | New England Biolabs | R0125S |
| Tsp45I | New England Biolabs | R0583S |
| MseI | New England Biolabs | R0525S |
| BsrI | New England Biolabs | R0527S |
| EZ DNA Methylation-Direct™ Kit | ZYMO RESEARCH | D5021 |
| SuperScript® IV Reverse Transcriptase | Invitrogen | 18090010 |
| GoTaq® Flexi DNA Polymerase | Promega | M8295 |
| Wizard® SV Gel and PCR Clean-Up System | Promega | A9282 |
| CloneJET PCR Cloning Kit | Thermo Scientific | K1231 |
| Qubit dsDNA HS Assay Kit | Invitrogen | Q32851 |
| QIAquick PCR Purification Kit | QIAGEN | 28104 |
| Platinum Taq DNA Polymerase High Fidelity | Invitrogen | 11304-011 |
| AxyPrep MAG PCR Clean-Up Kit | Axygen | MAG-PCR-CL-5 |
| NEBNext Library Quant Kit for Illumina | New England Biolabs | E7630S |
| SimpleChIP Enzymatic Chromatin IP Kit | Cell Signaling Technology | 9003S |
| RNA-seq, WGBS, 4C-seq, and DNA-seq | This study | GEO: |
| Bovine fetal fibroblast primary cells | This study | N/A |
| GE_KvDMR1_F1 | Integrated DNA Technologies | 5’-AATCCGATCGCAAGGGT |
| GE_KvDMR1_R1 | Integrated DNA Technologies | 5’-GCTTCTCGGTGAGGAGAG |
| GE_IGF2R_ICR_F | Integrated DNA Technologies | 5’-GGGGGAGGGTCTTTAAGGTTG |
| GE_IGF2R_ICR_R | Integrated DNA Technologies | 5’-TGGCTTTCAGGCTCCATAGAA |
| BI_KvDMR1_F | Integrated DNA Technologies | 5’-GTGAGGAGTATGGTATTGAGG |
| BI_KvDMR1_R | Integrated DNA Technologies | 5’-CCCCTACAAACTATCCAATCAACT |
| 4C_KvDMR1_F | Integrated DNA Technologies | 5’- TACACGACGCTCTTCCGATCT/CT |
| 4C_KvDMR1_R | Integrated DNA Technologies | 5’- CAGACGTGTGCTCTTCCGATCT/ |
| 4C_IGF2R_ICR_F | Integrated DNA Technologies | 5’- TACACGACGCTCTTCCGATCT/ |
| 4C_IGF2R_ICR_R | Integrated DNA Technologies | 5’-CAGACGTGTGCTCTTCCGATCT/ |
| GE_KvDMR1_F2 | Integrated DNA Technologies | 5’-GCACACCGCTTTCCACACC |
| GE_KvDMR1_R2 | Integrated DNA Technologies | 5’-GCACTGAGGTGACTGCGG |
| GE_IGF2R_INT3_F | Integrated DNA Technologies | 5’-CTCTGGAGGGTTTCAGCGTC |
| GE_IGF2R_INT3_R | Integrated DNA Technologies | 5’-AGGGAATACGCTTTCCCACG |
| 4Cker | Open source | |
| BBMap | Open source | |
| bedtools | Open source | |
| bismark | Open source | |
| BisSNP | Open source | |
| bowtie2 | Open source | |
| bwa-mem2 | Open source | |
| circular | Open source | |
| CTCFBSDB2.0 | Open source | |
| DAVID Bioinformatics Resources | LHRI | |
| DESeq2 | Open source | |
| edgeR | Open source | |
| fourSig | Open source | |
| GATK | Open source | |
| ggplot2 | Open source | |
| HISAT2 | Open source | |
| HTSeq | Open source | |
| hummingbird | Open source | |
| ImageJ | NIH | |
| Integrative Genomics Viewer | Open source | |
| JASPAR2020 | Open source | |
| picard | Open source | |
| Samtools | Open source | |
| StringTie | Open source | |
| Sushi | Open source | |
| TFBSTools | Open source | |
| Trimmomatic | Open source | |
| Custom code | This Study | Zenodo: |