| Literature DB >> 33863903 |
Louella Vasquez1, Klaudia Walter1, Alice L Mann1, Stephen Watt1, Kousik Kundu1,2, Lu Chen1,2,3, Ying Sims1, Simone Ecker4, Frances Burden5,6, Samantha Farrow5,6, Ben Farr1, Valentina Iotchkova1,7,8, Heather Elding1, Daniel Mead1, Manuel Tardaguila1, Hannes Ponstingl1, David Richardson7, Avik Datta7, Paul Flicek7, Laura Clarke7, Kate Downes5,6, Tomi Pastinen9, Peter Fraser10,11, Mattia Frontini5,6,12,13, Biola-Maria Javierre14,15, Mikhail Spivakov16,17,18, Nicole Soranzo19,20.
Abstract
Neutrophils play fundamental roles in innate immune response, shape adaptive immunity, and are a potentially causal cell type underpinning genetic associations with immune system traits and diseases. Here, we profile the binding of myeloid master regulator PU.1 in primary neutrophils across nearly a hundred volunteers. We show that variants associated with differential PU.1 binding underlie genetically-driven differences in cell count and susceptibility to autoimmune and inflammatory diseases. We integrate these results with other multi-individual genomic readouts, revealing coordinated effects of PU.1 binding variants on the local chromatin state, enhancer-promoter contacts and downstream gene expression, and providing a functional interpretation for 27 genes underlying immune traits. Collectively, these results demonstrate the functional role of PU.1 and its target enhancers in neutrophil transcriptional control and immune disease susceptibility.Entities:
Mesh:
Substances:
Year: 2021 PMID: 33863903 PMCID: PMC8052402 DOI: 10.1038/s41467-021-22548-8
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Fig. 1Properties PU.1 transcription factor QTLs.
a Summary of molecular traits generated as part of this study. b Density plot displaying the distance between sentinel SNPs and their associated PU.1 peaks. The bimodal distribution (grey) can be further subdivided into proximal (solid navy, <2.5 kb) and distal SNP effects (dotted, >2.5 kb). c Boxplot of absolute PU.1 tfQTL effect sizes (beta). Proximal PU.1 tfQTLs exhibit larger effect sizes compared to distal tfQTLs (Welch two-sided t-test). Box plots show the medians (centre lines) and the 25th and 75th percentiles (box edges), with whiskers extending to 1.5 times the interquartile range. n = the number of PU.1 tfQTL in each category. d Proportion of significant tfQTL SNPs with significant allele-specific (A/S) binding. Peaks without suitable heterozygous SNPs were not tested (grey).
Fig. 2Effect of PU.1 SNPs on second transcription factor binding.
a Mean effect size (95% confidence intervals) for association of proximal sentinel PU.1 SNPs with the nearest C/EBPβ (light blue) and CTCF (red) binding site. The effect size decreases with distance for C/EBPβ (linear model p < 2.2 × 10−16) but not for CTCF (linear model p = 0.113). Beneath: bar chart of number of peaks included in each distance bin. b Genome browser shot of an illustrative example of tfQTL, where SNP rs8057431 (dashed line) alters a PU.1 motif and is associated with a disruption in binding of both PU.1 and C/EBPβ. With signal box plots created from all individuals segregated by genotype. Box plots show the medians (centre lines) and the 25th and 75th percentiles (box edges), with whiskers extending to 1.5 times the interquartile range. p values were obtained by fitting linear mixed models implemented in LIMIX. n = the number of independent donors. c Density plot displaying distance of lead proximal PU.1 SNP to the nearest shared association (p < 10−5) C/EBPβ (light blue) or CTCF (red). d Transcription factor binding intensity for PU.1 (y axis Log RPMs) at shared PU.1 and C/EBPβ tfQTL from five matched individuals in two cell types Neutrophils (left panel) and Monocytes (right panel). Binding sites were segregated by donor genotype (x-axis). tfQTL are categorised as either proximal or distal (33 distal and 43 proximal) and linear models were fitted separately for proximal and distal sites.
Fig. 3Neutrophil PU.1 tfQTLs and their association with chromatin state.
a Bar plot displaying the number of shared associations (r2 ≥ 0.8) between tfQTL and histone QTL. Bar plot is split into two categories, those cases where the beta between phenotypes are positively correlated, i.e., an allele leads to a gain (an increase in signal) in both phenotypes. Or negatively correlated, where an allele associates with a gain in one phenotype and a reduction in the second phenotype. The Pearson correlation between the betas is shown beneath. b Boxplot of absolute beta for H3K27ac neutrophil QTL (no significance threshold) for proximal lead PU.1 SNPs, differentiating H3K27ac regions that are or are not marked by C/EBPβ and/or CTCF binding. Box plots show the medians (centre lines) and the 25th and 75th percentiles (box edges), with whiskers extending to 1.5 times the interquartile range. n = the number of PU.1 tfQTL that intersect H3K27ac hQTL. p value displayed above is a Welch two-sided t-test. c Density distribution plot of log distance between lead PU.1 tfQTL SNPs and shared (r2 ≥ 0.8) histone QTL in neutrophils.
Fig. 4tfQTLs perturb gene expression through altered chromatin state.
a Enrichment of significant tfQTLs (PU.1 and CTCF; Fisher’s exact test p < 1 × 10−5) in PIRs of both neutrophils and monocytes. The bars represent the mean and the error bars the 95% confidence interval. b Heat map showing enrichment of transcription factor or histone modified regions intersecting PIRs, whereby PIRs were ranked into four bins based on the gene expression of the connected baited genes in neutrophils. c Density plot of gene expression QTL Beta value for neutrophil PU.1 SNPs within PIRs (navy) versus distance-matched significant SNPs not in PIRs (grey) (two-sided Fisher’s exact test p < 2 × 10−16), and distribution of beta values for SNPs within <25 kb of transcription start site (purple). The SNPs that are not in PIRs are also significant PU.1 tfQTLs and eQTLs (linear model p < 1 × 10−5 cut off). d CHiCAGO scores for the PIR at tfQTL, segregated by donor genotype for rs519989. n the number of individual donors. e Signal box plots with donors separated by genotype for rs519989 for five molecular traits, f boxplots displaying RNA level for LRRC8C gene segregated by donor genotype for SNP rs519989. n = the number of individual donors. Box plots show the medians (centre lines) and the 25th and 75th percentiles (box edges), with whiskers extending to 1.5 times the interquartile range. p values were obtained by fitting linear mixed models implemented in LIMIX. g Genome browser view of region around LRRC8C gene, QTL regions for each molecular trait are highlighted. Dashed line depicts the location of rs519989.
Fig. 5tfQTLs influence cellular phenotype and disease.
a Circos plots displaying fold-enrichment of GWAS loci for neutrophil count and monocyte count within neutrophil regions marked by TF binding, modified histones and PU.1 and C/EBPβ specific binding sites in monocytes. Radial grid lines for published GWAS p values, asterisk denotes significance of enrichment for annotation tested at each GWAS p value cut off. b Bar plot displaying the number of GWAS loci that colocalise with PU.1 tfQTL for both full blood count phenotypes (left) and autoimmune disease (right). c Example of colocalised signal for sentinel SNP rs791357 tfQTL with a shared association for both neutrophil and monocyte count traits. Manhattan plots showing −log10(P) value for shared SNPs from published GWAS for neutrophil and monocyte counts obtained by fitting linear models, and PU.1 tfQTL (navy) obtained from LIMIX. With genome browser visualisation of locus for CTCF, C/EBPβ and PU.1 binding shown beneath. The top associated peak is highlighted by the shaded area. In addition, PCHi-C data show that this region is highly connected to the enhancer region in both neutrophils (blue) and monocytes (green). d Boxplot for TF and RNA signal segregated by donor genotype. PU.1 (navy), CEBPβ (light blue), CPEB4 gene expression neutrophil (navy) and CPEB4 gene expression monocyte (olive green). Box plots show the medians (centre lines) and the twenty-fifth and seventy-fifth percentiles (box edges), with whiskers extending to 1.5 times the interquartile range. p values were obtained by fitting linear mixed models implemented in LIMIX. n = the number of individual donors.