| Literature DB >> 35110662 |
Katharina L M Ruff1, Ronja Hollstein1, Julia Fazaal1, Frederic Thieme1, Jan Gehlen2, Elisabeth Mangold1, Michael Knapp3, Julia Welzenbach1, Kerstin U Ludwig4.
Abstract
Non-syndromic cleft lip with/without cleft palate (nsCL/P) is a frequent congenital malformation with multifactorial etiology. While recent genome-wide association studies (GWAS) have identified several nsCL/P risk loci, the functional effects of the associated non-coding variants are largely unknown. Furthermore, additional risk loci remain undetected due to lack of power. As genetic variants might alter binding of transcription factors (TF), we here hypothesized that the integration of data from TF binding sites, expression analyses and nsCL/P GWAS might help to (i) identify functionally relevant variants at GWAS loci, and (ii) highlight novel risk variants that have been previously undetected. Analysing the craniofacial TF TFAP2A in human embryonic palatal mesenchyme (HEPM) cells, we identified 2845 TFAP2A ChIP-seq peaks, several of which were located near nsCL/P candidate genes (e.g. MSX1 and SPRY2). Comparison with independent data suggest that 802 of them might be specific to craniofacial development, and genes near these peaks are enriched in processes relevant to nsCL/P. Integration with nsCL/P GWAS data, however, did not show robust evidence for co-localization of common nsCL/P risk variants with TFAP2A ChIP-seq peaks. This data set represents a new resource for the analyses of craniofacial processes, and similar approaches with additional cell lines and TFs could be applied to generate further insights into nsCL/P etiology.Entities:
Mesh:
Substances:
Year: 2022 PMID: 35110662 PMCID: PMC8810875 DOI: 10.1038/s41598-022-05876-7
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
Figure 1Expression analysis in human embryonal palatal mesenchyme (HEPM) cells. Density plot of 3′mRNA sequence reads, provided at log10 scale for the average of three replicates. In total, 14,508 genes were observed with ≥ 5 reads, and 3,623 genes were in the upper quartile (≥ 327 reads) of all genes with ≥ 5 reads. Out of 22 candidate transcription factors for craniofacial development, 11 were expressed in HEPM.
Figure 2TFAP2A high confidence peaks (hc-peaks) in human embryonal palatal mesenchyme (HEPM) cells (a) Venn diagram showing the total number of TFAP2A ChIP-seq peaks across two replicates (replicate 1 n = 5820; replicate 2 n = 3989). The overlap of 2845 regions was denoted as „hc-peaks” for the subsequent analyses. Plotted with BioVenn© (2020). (b) Identification of the most abundant binding motif within hc-peaks (present in n = 1535 hc-peaks) with MEME-ChIP. (c) Comparative analysis using Tomtom and JASPAR core 2018 assembly identified the significant alignment of the enriched motif with three distinct TFAP2A binding motifs.
Figure 3Intersection of TFAP2A hc-peaks overlapping with TFAP2A ChIP-seq peaks in HeLa S3 and/or chromatin marks in craniofacial tissue. Venn diagram displaying the intersection of 2,713 hc-peaks overlapping with TFAP2A ChIP-seq peaks in HeLa S3/ DNase1 hypersensitivity regions/ H3K27ac chromatin marks in CS15 craniofacial tissue. Hc-peaks that did not overlap with any of these are not displayed in the diagram (n = 132). A total of 802 hc-peaks overlapped with DNAse1 hypersensitivity regions and H3K27ac marks in CS15 craniofacial tissue, but did not overlap with a TFAP2A ChIP-seq peak in HeLa S3 cells, thus indicating craniofacial-specific hc-peaks. Overlaps were defined as described in the Methods. Plotted with BioVenn© (2020).
Candidate genes for nsCL/P located near TFAP2A binding sites in HEPM cells.
| Hc-peak information | NsCL/P candidate genea | |||||
|---|---|---|---|---|---|---|
| Chromosome | Start | End | Gene symbol | Distance peak to TSS (bp) | Reference (PubMed-ID) | Expression in HEPM cells (average no. of 3′RNA-seq reads)b |
| 1 | 94,787,207 | 94,787,445 | − 84,205 | Beaty et al. 2010 (PMID: 20,436,469) | 1436 | |
| 1 | 94,791,145 | 94,791,437 | − 88,170 | |||
| 3 | 99,844,546 | 99,844,795 | − 11,314 | Beaty et al.2013 (PMID: 23,512,105) | 523 | |
| 3 | 99,878,493 | 99,878,721 | 45,250 | |||
| 3 | 189,281,805 | 189,282,039 | − 67,294 | Leslie et al. 2017 (PMID: 28,054,174) | Not expressed | |
| 3 | 189,655,227 | 189,655,963 | 306,379 | |||
| 4 | 4,860,996 | 4,861,363 | − 213 | Yu et al. 2017 (PMID: 28,232,668) | 537 | |
| 13 | 80,205,476 | 80,205,754 | 708,179 | Ludwig et al. 2012 (PMID: 22,863,734) | 125 | |
| 13 | 80,604,734 | 80,605,110 | 308,872 | |||
| 13 | 80,788,600 | 80,788,933 | 125,027 | |||
| 13 | 80,915,560 | 80,915,801 | − 1887 | |||
| 15 | 32,962,254 | 32,962,778 | − 47,659 | Ludwig et al. 2016 (PMID: 26,968,009) | 5471 | |
| 15 | 62,898,345 | 62,898,659 | − 436,382 | Ludwig et al. 2012 (PMID: 22,863,734) | 7613 | |
| 15 | 63,233,187 | 63,233,480 | − 101,550 | |||
| 15 | 74,838,564 | 74,838,823 | 5176 | Ludwig et al. 2017 (PMID: 28,087,736) | 4 | |
| 16 | 4,166,662 | 4,166,962 | − 626 | Sun et al. 2015 (PMID: 25,775,280) | 126 | |
| 17 | 54,240,701 | 54,240,761 | − 430,329 | Mangold et al. 2010 (PMID: 20,023,658); Leslie et al. 2015 (PMID:25,704,602) | 60 | |
| 17 | 54,672,130 | 54,672,266 | 1138 | |||
aNsCL/P candidate genes as putative downstream target genes of hc-peaks assigned with GREAT (association rules described in methods).
bAverage number of 3′-mRNA-seq reads (Supplementary Table S1). Methods provided in the text.
Figure 4Gene Ontology (GO) analysis of 802 craniofacial-specific TFAP2A high confidence peaks (hc-peaks). Bar charts displaying -log10 binomial p-values of enriched mouse single knockout phenotypes with false discovery rate (FDR) < 0.05, human phenotypes with FDR < 0.05 and GO biological processes with FDR < 0.05. Terms relating to orofacial clefting are highlighted in color. Genes associated with these terms in assigned to TFAP2A hc-peaks, based on their proximity, as described in the Methods, are displayed in the boxes.