| Literature DB >> 32144282 |
John P Ray1, Carl G de Boer1,2, Charles P Fulco1,3, Caleb A Lareau1,4, Masahiro Kanai1,5,6, Jacob C Ulirsch1,4, Ryan Tewhey1,7, Leif S Ludwig1, Steven K Reilly1,7, Drew T Bergman1, Jesse M Engreitz1,8, Robbyn Issner1, Hilary K Finucane1,5, Eric S Lander1,3,9, Aviv Regev10,11,12,13, Nir Hacohen14,15.
Abstract
Genome-wide association studies have associated thousands of genetic variants with complex traits and diseases, but pinpointing the causal variant(s) among those in tight linkage disequilibrium with each associated variant remains a major challenge. Here, we use seven experimental assays to characterize all common variants at the multiple disease-associated TNFAIP3 locus in five disease-relevant immune cell lines, based on a set of features related to regulatory potential. Trait/disease-associated variants are enriched among SNPs prioritized based on either: (1) residing within CRISPRi-sensitive regulatory regions, or (2) localizing in a chromatin accessible region while displaying allele-specific reporter activity. Of the 15 trait/disease-associated haplotypes at TNFAIP3, 9 have at least one variant meeting one or both of these criteria, 5 of which are further supported by genetic fine-mapping. Our work provides a comprehensive strategy to characterize genetic variation at important disease-associated loci, and aids in the effort to identify trait causal genetic variants.Entities:
Mesh:
Substances:
Year: 2020 PMID: 32144282 PMCID: PMC7060350 DOI: 10.1038/s41467-020-15022-4
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Fig. 1Disease variants in the complex autoimmune-associated TNFAIP3 locus.
a TNFAIP3 encodes the A20 protein, which forms part of a negative feedback loop to dampen NF-kB-mediated immune activation. b HiC plots for the lymphoblastoid B cell line GM12878, with color intensity proportional to the interaction frequency between genomic coordinates (x-axis). Boxes indicate the 300 kb high-interaction domain and the 605 kb region used in this study. c, d Genetics of the TNFAIP3 locus. The positions (shared x-axis indicated above c) of variants with respect to the TNFAIP3 gene and a lncRNA (LOC100130476). c GWAS tag SNPs (red) and SNPs in tight LD (greyscale boxes indicating LD to tag SNP) for many immune-related phenotypes (y-axis). d GWAS haplotypes defined by combining all SNPs in tight LD (r2 > 0.8) to GWAS tag SNPs for European (CEU; top) and East Asian (CHB/JPT; bottom) populations. Colors are used to help identify shared haplotypes between CEU and CHB/JPT populations. e Autoimmune GWAS signals are enriched in open chromatin of immune cells. Heritability enrichment (color) of disease-associated SNPs in DHS of various tissues (x-axis) for seven autoimmune diseases (y-axis), according to LD-score regression. Also see Supplementary Data 1, 2.
Fig. 2Seven approaches for characterizing non-coding genetic variants.
Genomic assays (left), the coverage of all common genetic variants in the 605 kb locus (middle), and whether the assay is specific to genomic regions or variants (right), grouped into observational (top) and perturbational assays (bottom). (1) HiChIP can be used to identify active chromatin regions (H3K27ac labeled) that interact with the TNFAIP3 promoter. (2) DHS and ATAC-seq can be used to identify regions of accessible chromatin. (3) Variants predicted to alter TF binding can be identified using motif analysis in combination with evidence of TF binding by ChIP-seq[42]. Also see Supplementary Data 3–9. (4 and 5) Pooled CRISPRi and CRISPRa screens can determine regulatory potential of each region by repressing (CRISPRi) or artificially inducing (CRISPRa) each targeted region. (6 and 7) MPRA (with lentiviral or transfection delivery strategies) can be used to test for allele-specific reporter expression.
Fig. 3Comparison of GWAS enrichment across methods.
a, b Values for n, n, and n for all methods, considering (a) all variants, and (b) only variants in open chromatin. c, d Pseudo-precision (y axes) and pseudo-recall (x axes) for GWAS enrichment for each assay (colors), with diamonds depicting the actual assay performance (as in Supplementary Fig. 9a), and the lines depicting the 95% CI of each assay’s null distribution (as in Supplementary Fig. 9b). Empirical one-sided P-values derived from the genomic-shifts null are indicated next to each assay label. P-values are not corrected for multiple hypothesis testing. c Each assay evaluated individually for all tested variants and d considering only SNPs in blood cell accessible chromatin. The relationship between pseudo-precision and pseudo-recall is linear in the null (pseudo-precision = (n/n) × pseudo-recall) because both are proportional to n and n and n are constant.
Fig. 4Prioritizing variants on disease-associated haplotypes.
A summary of assay results and genetics data for all SNPs on each disease-associated haplotype. Each subpanel represents a different haplotype, with associated traits and the haplotype number are indicated on the top. For each SNP (x axes), the total number of assay hits is shown in the bar graph (top) with SNPs that are hits in CRISPRi or T-MPRA hits in accessible chromatin marked with an asterisk. Results from each assay are shown in the middle, with hits in red, and SNPs that are assayed but were not hits in gray for each of the seven assays (y-axis). The vertical black bars above accessible chromatin SNP status indicate SNPs that were in accessible chromatin in our tested cell lines. Fine-mapped immune-related traits from UK Biobank (UKBB), including SNPs in the 95% credible set (CS—blue) and those that have a posterior inclusion probability > 10% (PIP10—green) are second from the bottom. The population-specific SNPs contained within each disease-associated haplotype are indicated (bottom) with orange for European (CEU) and purple for East Asian (CHB/JPT). Also see Supplementary Data 3.
Disease-associated variants positive for CRISPRi or chromatin accessibility with T-MPRA.
| SNP ID | Associated trait | Tehranchi asATAC | Tehranchi asChIP | Fine mapped UKBB 95% CS (SuSiE) | Other evidence | Haplotype | Hit in assays |
|---|---|---|---|---|---|---|---|
| rs200820567 | Allergy, ISD, RA, SLE, eosinophil counts, IgA deficiency, Sjogren’s | x | Eosinophil counts (PIP = 0.03); Allergy (PIP = 0.04) | Fine mapped in Adrianto et al. (SLE) | 2 | T-MPRA + accessible chromatin | |
| rs148314165 | Allergy, ISD, RA, SLE, eosinophil counts, IgA deficiency, Sjogren’s | Fine mapped in Adrianto et al. (SLE) | 2 | T-MPRA + accessible chromatin | |||
| rs112497003 | Allergy, ISD, RA, SLE, eosinophil counts, IgA deficiency, Sjogren’s | Eosinophil counts (PIP = 0.01) | 2 | T-MPRA + accessible chromatin | |||
| rs111883038 | Allergy, ISD, RA, SLE, eosinophil counts, IgA deficiency, Sjogren’s | Eosinophil counts (PIP = 0.01) | 2 | L-MPRA + accessible chromatin; T-MPRA + accessible chromatin | |||
| Celiac, IBD, RA, Asthma, IgA deficiency, Sjogren’s, ISD, T1D, primary biliary cirrhosis | x | NF-kB, JunD | Combined Autoimmune (PIP = 0.13) | Fine mapped in Huang et al. (UC, PIP = 0.06); Farh et al. (RA, PIP = 0.11; Celiac, PIP = 0.19; UC, PIP = 0.23); Westra et al. (RA, PIP = 0.10) | 6 | T-MPRA + accessible chromatin | |
| rs643177 | ISD, psoriasis | Pou2f1 | Fine mapped in Farh et al. (Psoriasis, PIP = 0.15) | 9 | L-MPRA + accessible chromatin; T-MPRA + accessible chromatin | ||
| rs59086769 | Urine metabolites | 5 | T-MPRA + accessible chromatin | ||||
| Celiac | x | NF-kB, PU.1 | 14 | T-MPRA + accessible chromatin | |||
| rs11758213 | MS | x | JunD | Fine mapped in Huang et al. (UC, PIP = 0.075) | 15 | T-MPRA + accessible chromatin | |
| MS | 15 | T-MPRA + accessible chromatin | |||||
| rs12201430 | Blood metabolites | x | 4 | T-MPRA + accessible chromatin | |||
| rs12192746 | Blood metabolites | 4 | L-MPRA + accessible chromatin; T-MPRA + accessible chromatin | ||||
| rs34654849 | MS, IgA deficiency, RA | 1 | T-MPRA + accessible chromatin | ||||
| rs73558137 | MS, IgA deficiency, RA | 1 | T-MPRA + accessible chromatin | ||||
| Allergy, ISD, RA, SLE, eosinophil counts, IgA deficiency, Sjogren’s | BJAB asATAC and fine mapped in Farh et al. (SLE, PIP = 0.09) | 2 | CRISPRi | ||||
| Allergy, ISD, RA, SLE, eosinophil counts, IgA deficiency, Sjogren’s | 2, 3 | CRISPRi | |||||
| rs10499197 | Allergy, ISD, RA, SLE, eosinophil counts, IgA deficiency, Sjogren’s | 2 | T-MPRA + accessible chromatin; CRISPRi | ||||
| rs58905141 | Allergy, ISD, RA, SLE, Eosinophil Counts, IgA deficiency, Sjogren’s | Eosinophil counts (PIP = 0.02) | 2 | L-MPRA + accessible chromatin; CRISPRi | |||
| rs559766217 | ISD, Psoriasis | 9 | CRISPRi |
Variants that are positive for either chromatin accessibility with T-MPRA or CRISPRi are listed with their associated trait, and whether they were also positive in Tehranchi et al. as having allele-specific ATAC (asATAC) or asChIP-seq for TFs in LCLs. Our fine-mapping data using UKBB traits for the 95% credible set variants are included, and other fine-mapping data or evidence for SNP functionality is listed in Other Evidence. The haplotype for the SNP is listed in Haplotype.