| Literature DB >> 33268355 |
Noboru J Sakabe1, Ivy Aneas1, Nicholas Knoblauch1, Debora R Sobreira1, Nicole Clark2, Cristina Paz1, Cynthia Horth1, Ryan Ziffra1, Harjot Kaur1, Xiao Liu1, Rebecca Anderson1, Jean Morrison1, Virginia C Cheung3, Chad Grotegut4, Timothy E Reddy5, Bo Jacobsson6,7, Mikko Hallman8, Kari Teramo9, Amy Murtha10, John Kessler3, William Grobman11, Ge Zhang12, Louis J Muglia13, Sarosh Rana13, Vincent J Lynch1, Gregory E Crawford2, Carole Ober14,13, Xin He14, Marcelo A Nóbrega14.
Abstract
While a genetic component of preterm birth (PTB) has long been recognized and recently mapped by genome-wide association studies (GWASs), the molecular determinants underlying PTB remain elusive. This stems in part from an incomplete availability of functional genomic annotations in human cell types relevant to pregnancy and PTB. We generated transcriptome (RNA-seq), epigenome (ChIP-seq of H3K27ac, H3K4me1, and H3K4me3 histone modifications), open chromatin (ATAC-seq), and chromatin interaction (promoter capture Hi-C) annotations of cultured primary decidua-derived mesenchymal stromal/stem cells and in vitro differentiated decidual stromal cells and developed a computational framework to integrate these functional annotations with results from a GWAS of gestational duration in 56,384 women. Using these resources, we uncovered additional loci associated with gestational duration and target genes of associated loci. Our strategy illustrates how functional annotations in pregnancy-relevant cell types aid in the experimental follow-up of GWAS for PTB and, likely, other pregnancy-related conditions.Entities:
Mesh:
Substances:
Year: 2020 PMID: 33268355 PMCID: PMC7710387 DOI: 10.1126/sciadv.abc8696
Source DB: PubMed Journal: Sci Adv ISSN: 2375-2548 Impact factor: 14.136
Fig. 1Schematic of RNA-seq, ATAC-seq, ChIP-seq, and pcHi-C maps centered on the prolactin (PRL) gene, as an example.
Each histone modification and RNA-seq track shows read counts per base pair for each experiment. The pcHi-C signal track shows the number of reads per MboI restriction fragment. Arcs in the pcHi-C interactions track show significant interactions between the promoter of the PRL gene and putative distal regulatory elements identified with pcHi-C. Pooled data (three replicates) for one cell line are shown for untreated cells (MSCs, in green) and decidualized cells (DSCs, in purple). pcHi-C data were generated in a fourth cell line that was decidualized.
Fig. 2Differential histone modification and ATAC-seq peaks are associated with differential expression and enriched for transcription factors with roles in decidualization.
(A) Plot showing the overlap between the different histone modifications and ATAC-seq maps (intersection between annotations). Peaks were assigned to 100-bp bins to avoid ambiguity in overlap due to different peak borders. Black circles indicate overlap with other annotations; light gray circles indicate that the annotation does not overlap others. (B) Each data point shows the ratio between the number of increased/decreased differential peaks nearby genes that increase expression after decidualization (blue, positive log ratios; upper half of the figure) or decrease expression after decidualization (orange, negative log ratios; lower half of the figure). Genes that were more highly expressed in decidualized cells were flanked by a higher number of ChIP-seq and ATAC-seq peaks that displayed increased read counts in decidualized samples compared to peaks that displayed decreased read counts (top inset). Genes that were down-regulated in decidualized cells showed the opposite trend (bottom inset). All enrichments: P < 10−25. (C) DNA binding motifs of transcription factors relevant in decidualization are enriched in peaks that change following decidualization treatment. Motifs are color-coded by similarity. (D) Colocalization of PGR, FOSL2, FOXO1, GATA2, and NR2F2 with ATAC-seq and ChIP-seq peaks. Transcription factor binding sites co-occur with ATAC-seq and ChIP-seq peaks in both untreated (green) and decidualized (purple) cells more often than with random peaks. Enrichment of the co-occurrences of PGR, FOXO1, GATA2, and NR2F2 are higher when co-occurring with peaks that have increased read counts (navy blue) and lower with peaks that have decreased read counts (orange) in decidualized compared to untreated cells. Enrichment of co-occurrences with peak sets was calculated as the fold difference between the number of transcription factor peaks overlapping with ATAC-seq/ChIP-seq peaks and with a random set of peaks (see Materials and Methods).
Fig. 3pcHi-C connects predicted regulatory elements to their putative target genes.
(A) Randomly assigning a gene to a peak (see Materials and Methods) resulted in fewer peaks that matched the direction of change with that of differentially expressed genes than when using pcHi-C interactions or the nearest gene to pair peaks to genes. (B) The FOXO1 gene is more highly expressed in decidualized samples (fourfold increase, P = 7 × 10−22) and its promoter physically interacts (red arcs) with distal regulatory elements (yellow highlights) that show increased activation in decidualized samples. The nearest expressed gene to these differential peaks is COG6.
Fig. 4GWAS analysis pipeline and heritability enrichment in functional annotations.
(A) Computational pipeline for analyzing GWAS of gestation duration. Yellow boxes (input data): GWAS summary statistics and functional annotations from endometrial stromal cells (in both untreated and decidualized cells). Green boxes: Stages of statistical analysis (see Materials and Methods). (B) Stratified LDSC heritability analysis of GWAS of gestational duration using functional annotations. Left: Fold enrichment of heritability in each annotation. Dashed line shows values at 1, i.e., no enrichment. Center: Proportion of heritability explained by each annotation. Right: Proportion of SNPs across the genome that fall within an annotation. For each annotation, enrichment (left) is the ratio of h2 proportion (center) divided by the SNP proportion (right). Error bars represent 95% confidence intervals.
Fig. 5Fine-mapping GWAS loci of gestational duration.
(A) PIPs of SNPs using uniform vs. functional priors in SuSiE (each dot is an SNP). The functional prior of an SNP is based on SNP annotations and is estimated using TORUS. (B) Summary of fine-mapping statistics of all 10 regions. X axis: The size (number of SNPs) of credible set. Y axis: The maximum PIP in a region. We label each region by its top SNP (by PIP) and the likely causal gene, according to Table 1 or the nearest gene of the top SNP. (C) Likely causal variants near HAND2 and their functional annotations. The top panel shows the significance of SNP association in the GWAS and the middle panel shows fine-mapping results (PIPs) in the region. The vertical yellow bar highlights the two SNPs with high PIPs. These SNPs are located in a region annotated with ATAC-seq, H3K27ac, H3K4me1, and H3K4me3 peaks (bottom). This putative enhancer also had increased ATAC-seq, H3K27ac, and H3K4me1 levels in decidualized samples and interacts with the HAND2 promoter (red arc).
Most probable SNPs identified from computational fine mapping of regions associated with gestational duration.
Functional annotations are based on data from endometrial stromal cells. We list an annotation if the SNP is located in a sequence with that annotation in either untreated or decidualized condition. Functional prior is the prior probability of an SNP being a causal variant. For an SNP without any functional annotation, its prior probability is 3.6 × 10−6. We list the pcHi-C annotation if the SNP is within 1 kb of a region involved in a pcHi-C interaction. We call a gene the target of an SNP if (i) the SNP is located in the promoter (< 1 kb of transcription start site) of that gene or (ii) the promoter of that gene has a pcHi-C interaction with a region within 1 kb of the SNP. In the case of rs147843771 at the FOXL2 locus, the target was defined by literature evidence (). The number of credible SNPs at each region is shown in Fig. 5B. SNPs in bold are discussed in the text. FOXL2 (), forkhead box L2; GATA2, GATA-binding protein 2; HAND2, heart and neural crest derivatives expressed 2; KCNAB1, potassium voltage-gated channel subfamily A member regulatory beta subunit 1; WNT4, Wnt family member 4.
| rs147843771 | chr3:138843356 | 3.8 × 10−8 | 8.3 × 10−5 | 0.74 | K4me1 | |
| rs17315501 | chr3:139029676 | 1.7 × 10−7 | 9.9 × 10−5 | 0.21 | K4me1, K4me3, | |
| rs2946164 | chr5:157884706 | 3.0 × 10−26 | 8.3 × 10−5 | 0.72 | K4me1 | |
| chr4:174728703 | 3.9 × 10−7 | 5.1 × 10−4 | 0.38 | K4me1, K27ac, | ||
| chr4:174729014 | 4.5 × 10−7 | 5.1 × 10−4 | 0.33 | K4me1, K27ac, | ||
| rs13387174 | chr2:74206685 | 4.7 × 10−7 | 3.8 × 10−4 | 0.35 | pcHi-C, K4me1, K27ac | |
| rs13390332 | chr2:74207357 | 2.0 × 10−7 | 9.9 × 10−5 | 0.18 | K4me1, K27ac | |
| rs4677884 | chr3:123062970 | 4.1 × 10−9 | 8.3 × 10−5 | 0.34 | K4me1, ATAC | |
| rs56318008 | chr1:22470407 | 2.3 × 10−12 | 4.3 × 10−4 | 0.3 | K4me1, K4me3, | |
| rs55938609 | chr1:22470451 | 2.3 × 10−12 | 4.3 × 10−4 | 0.3 | K4me1, K4me3, | |
| rs3820282 | chr1:22468215 | 6.4 × 10−13 | 1.1 × 10−4 | 0.27 | K4me1, K4me3, | |
| rs4679761 | chr3:155868039 | 5.0 × 10−9 | 9.9 × 10−5 | 0.24 | K4me1, K27ac | |
| rs9882088 | chr3:155867092 | 5.5 × 10−9 | 8.3 × 10−5 | 0.19 | K4me1 | |
| chr3:127889287 | 5.4 × 10−12 | 3.2 × 10−4 | 0.18 | K4me1, pcHi-C | ||
| chr3:127878416 | 2.0 × 10−12 | 8.3 × 10−5 | 0.12 | K4me1, K27ac, | ||
| chr3:127895986 | 1.2 × 10−11 | 3.8 × 10−4 | 0.10 | K4me1, pcHi-C, |