| Literature DB >> 36012336 |
Clarissa Boschiero1, Yahui Gao1,2, Ransom L Baldwin1, Li Ma2, Cong-Jun Li1, George E Liu1.
Abstract
The weaning transition in calves is characterized by major structural changes such as an increase in the rumen capacity and surface area due to diet changes. Studies evaluating rumen development in calves are vital to identify genetic mechanisms affected by weaning. This study aimed to provide a genome-wide characterization of CTCF-binding sites and differentially CTCF-binding sites (DCBS) in rumen tissue during the weaning transition of four Holstein calves to uncover regulatory elements in rumen epithelial tissue using ChIP-seq. Our study generated 67,280 CTCF peaks for the before weaning (BW) and 39,891 for after weaning (AW). Then, 7401 DCBS were identified for the AW vs. BW comparison representing 0.15% of the cattle genome, comprising ~54% of induced DCBS and ~46% of repressed DCBS. Most of the induced and repressed DCBS were in distal intergenic regions, showing a potential role as insulators. Gene ontology enrichment revealed many shared GO terms for the induced and the repressed DCBS, mainly related to cellular migration, proliferation, growth, differentiation, cellular adhesion, digestive tract morphogenesis, and response to TGFβ. In addition, shared KEGG pathways were obtained for adherens junction and focal adhesion. Interestingly, other relevant KEGG pathways were observed for the induced DCBS like gastric acid secretion, salivary secretion, bacterial invasion of epithelial cells, apelin signaling, and mucin-type O-glycan biosynthesis. IPA analysis further revealed pathways with potential roles in rumen development during weaning, including TGFβ, Integrin-linked kinase, and Integrin signaling. When DCBS were further integrated with RNA-seq data, 36 putative target genes were identified for the repressed DCBS, including KRT84, COL9A2, MATN3, TSPAN1, and AJM1. This study successfully identified DCBS in cattle rumen tissue after weaning on a genome-wide scale and revealed several candidate target genes that may have a role in rumen development, such as TGFβ, integrins, keratins, and SMADs. The information generated in this preliminary study provides new insights into bovine genome regulation and chromatin landscape.Entities:
Keywords: CTCF; ChIP-seq; cattle; epithelial tissue; regulatory elements; rumen development; weaning
Mesh:
Substances:
Year: 2022 PMID: 36012336 PMCID: PMC9408924 DOI: 10.3390/ijms23169070
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 6.208
CTCF ChIP sequence read statistics showing the number of reads, number and percentages of reads mapped, mitochondrial reads, duplicate reads and reads with mapping quality < 10, and the number of clean reads used for peak calling.
| Condition | N of Reads | Mapped Reads | Mapped Reads% | MT Reads | MT Reads% 1 | Duplicate Reads | Duplicate Reads% 1 | MAPQ < 10 Reads | MAPQ < 10 Reads% 1 | Clean Reads 2 |
|---|---|---|---|---|---|---|---|---|---|---|
| BW | 42,921,125 | 41,327,290 | 96.29 | 38,772 | 0.09 | 6,695,091 | 16.20 | 6,316,659 | 15.28 | 27,814,748 |
| AW | 44,702,240 | 42,063,278 | 94.10 | 72,080 | 0.16 | 4,140,259 | 9.84 | 5,655,767 | 13.45 | 31,627,368 |
| Total | 87,623,365 | 83,390,568 | - | 110,852 | - | 10,835,350 | - | 11,972,426 | - | 59,442,116 |
| Average | 43,811,683 | 41,695,284 | 95.19 | 55,426 | 0.13 | 5,417,675 | 13.02 | 5,986,213 | 14.37 | 29,721,058 |
BW: before weaning. AW: after weaning. MT: mitochondrion. 1 Percentages were calculated considering the total number of mapped reads. 2 Reads uniquely mapped, with MAPQ > 10, no duplicate reads or reads located on MT chromosome.
Peak calling metrics showing the total number of clean reads used to call peaks and calculate the fraction of reads in peaks (FRiP), number of CTCF peaks (FDR < 0.05), number of assigned reads in peaks, FRiP, an average of peak lengths, and the proportion of peaks near TSS (±3 kb, %).
| Condition | Clean Reads 1 | Clean Reads Used for FRiP 2 | CTCF Peaks 2 | N of Assigned Reads in Peaks 2 | FRiP 3 | Average Peak Length | Peaks Near TSS (±3 kb, %) |
|---|---|---|---|---|---|---|---|
| BW | 27,814,748 | 27,066,621 | 67,280 | 6,193,989 | 0.22 | 393 | 17.56 |
| AW | 31,627,368 | 30,864,883 | 39,891 | 4,401,267 | 0.14 | 498 | 20.61 |
| Total | 59,442,116 | 57,931,504 | 107,171 | 10,595,256 | - | - | - |
| Average | 29,721,058 | 28,965,752 | 53,586 | 5,297,628 | 0.18 | 446 | 19.09 |
BW: before weaning. AW: after weaning. 1 Reads uniquely mapped, with MAPQ > 10, no duplicate reads or reads located on MT chromosome. 2 Reads located on chromosomes X and unplaced were not included. 3 Fraction of reads in peaks.
Number of differentially CTCF-binding sites (DCBS) after weaning showing the different steps of filtration, including the number of induced and repressed sites.
| Filtration Steps of the After-Weaning DCBS | N | % |
|---|---|---|
| Initially detected DCBS ( | 26,871 | 100 |
| Filtered DCBS (FDR < 0.01 and |log2FC| > 1) | 8249 | 30.70 |
| Filtered DCBS that overlapped with peaks | 7041 | 26.20 |
| Total of filtered DCBS | 7041 | 100 |
| Induced sites with log2FC ≥ 1 | 3812 | 54.14 |
| Repressed sites with log2FC ≤ −1 | 3229 | 45.86 |
Figure 1(A) Annotation of induced and repressed differentially CTCF-binding sites (DCBS) after weaning. (B) Distribution of the induced and repressed DCBS after weaning relative to TSS.
Figure 2GO enrichment results for the induced (A) and repressed (B) differentially CTCF-binding sites after weaning. The top 30 hits of GO terms are shown in this figure. Biological processes are ranked according to the fold enrichment values. Bubble colors represent the p-value of the False Discovery Rate (FDR). The most significant processes are highlighted in red, and the less significant processes are highlighted in blue according to log10 (FDR) values. Bubble sizes indicate the number of genes. Blue arrows indicate GO terms present in both induced and repressed sites.
Figure 3GO enrichment analysis for the induced (A) and repressed (B) differentially CTCF-binding sites after weaning. The top 30 hits of GO terms are shown in this figure. Cellular components are ranked according to the fold enrichment values. Bubble colors represent the p-value of the false discovery rate (FDR). The most significant processes are highlighted in red, and the less significant processes are highlighted in blue according to log10 (FDR) values. Bubble sizes indicate the number of genes. Blue arrows indicate GO terms present in both induced and repressed sites.
Figure 4KEGG gastric acid secretion pathway obtained from the induced differentially CTCF-binding sites (DCBS). Genes from the induced DCBS are highlighted in red.
Figure 5KEGG TGF-β signaling pathway obtained from the repressed differentially CTCF-binding sites (DCBS). Genes from the repressed DCBS are highlighted in red.