| Literature DB >> 34127860 |
Catherine C Robertson1,2, Jamie R J Inshaw3, Suna Onengut-Gumuscu1,4, Wei-Min Chen1,4, David Flores Santa Cruz3, Hanzhi Yang1, Antony J Cutler3, Daniel J M Crouch3, Emily Farber1, S Louis Bridges5,6, Jeffrey C Edberg7, Robert P Kimberly7, Jane H Buckner8, Panos Deloukas9,10, Jasmin Divers11, Dana Dabelea12, Jean M Lawrence13, Santica Marcovina14,15, Amy S Shah16, Carla J Greenbaum17,18, Mark A Atkinson19, Peter K Gregersen20, Jorge R Oksenberg21, Flemming Pociot22,23,24, Marian J Rewers25, Andrea K Steck25, David B Dunger26,27, Linda S Wicker3, Patrick Concannon19,28, John A Todd29, Stephen S Rich1,4.
Abstract
We report the largest and most diverse genetic study of type 1 diabetes (T1D) to date (61,427 participants), yielding 78 genome-wide-significant (P < 5 × 10-8) regions, including 36 that are new. We define credible sets of T1D-associated variants and show that they are enriched in immune-cell accessible chromatin, particularly CD4+ effector T cells. Using chromatin-accessibility profiling of CD4+ T cells from 115 individuals, we map chromatin-accessibility quantitative trait loci and identify five regions where T1D risk variants co-localize with chromatin-accessibility quantitative trait loci. We highlight rs72928038 in BACH2 as a candidate causal T1D variant leading to decreased enhancer accessibility and BACH2 expression in T cells. Finally, we prioritize potential drug targets by integrating genetic evidence, functional genomic maps and immune protein-protein interactions, identifying 12 genes implicated in T1D that have been targeted in clinical trials for autoimmune diseases. These findings provide an expanded genomic landscape for T1D.Entities:
Mesh:
Year: 2021 PMID: 34127860 PMCID: PMC8273124 DOI: 10.1038/s41588-021-00880-5
Source DB: PubMed Journal: Nat Genet ISSN: 1061-4036 Impact factor: 41.307
Regions of association with T1D, identified to genome-wide significance (P < 5 × 10-8) for the first time.
Of these 36 regions, 13 had a lead variant that was in strong linkage disequilibrium (r 2 > 0.95 in 1000 Genomes Project European population) with variants that are associated with at least one related trait.
| Chr | Position (bp) | Lead variant rsID | A1 | A2 | Putative candidate gene | AFEUR (A2) | OR |
| Traits with shared association |
|---|---|---|---|---|---|---|---|---|---|
| 1 | 63643100 | rs2269241 | T | C |
| 0.196 | 1.111 | 4.67 × 10-12 | |
| 1 | 92358141 | rs34090353 | G | C |
| 0.361 | 1.078 | 1.10 × 10-8 | |
| 1 | 119895261 | rs2641348 | A | G |
| 0.107 | 1.113 | 1.61 × 10-8 | Crohn’s disease, T2D |
| 1 | 154465420 | rs2229238 | T | C |
| 0.813 | 0.896 | 1.38 × 10-12 | |
| 1 | 172746562 | rs78037977 | A | G |
| 0.124 | 0.884 | 2.41 × 10-9 | Asthma, vitiligo, allergic sensitization |
| 1 | 192570207 | rs2816313 | G | A |
| 0.719 | 1.090 | 4.57 × 10-9 | |
| 1 | 212796238 | rs11120029 | G | T |
| 0.147 | 1.102 | 1.82 × 10-8 | |
| 2 | 12512805 | rs10169963 | C | T |
| 0.580 | 1.074 | 2.78 × 10-8 | |
| 2 | 100147438 | rs12712067 | G | T |
| 0.358 | 0.925 | 4.12 × 10-9 | |
| 2 | 191105394 | rs7582694 | C | G |
| 0.773 | 0.916 | 2.83 × 10-9 | SLE, hypothyroidism, celiac disease, RA |
| 2 | 241468331 | rs10933559 | A | G |
| 0.208 | 1.109 | 2.39 × 10-11 | |
| 4 | 973543 | rs113881148 | C | A |
| 0.626 | 1.082 | 5.72 × 10-9 | Body fat percentage |
| 4 | 38602849 | rs337637 | G | A |
| 0.364 | 0.919 | 2.57 × 10-10 | White blood cell count |
| 5 | 40521603 | rs1876142 | G | T |
| 0.658 | 0.905 | 2.18 × 10-14 | |
| 5 | 56146422 | rs10213692 | T | C |
| 0.241 | 0.912 | 2.85 × 10-9 | RA, Crohn’s disease, MS |
| 6 | 424915 | rs9405661 | C | A |
| 0.514 | 1.080 | 2.26 × 10-9 | |
| 6 | 137682468 | rs12665429 | T | C |
| 0.370 | 0.907 | 1.36 × 10-13 | |
| 6 | 159049210 | rs212408 | G | T |
| 0.638 | 1.112 | 1.42 × 10-15 | MS, Crohn’s disease, eczema |
| 7 | 20557306 | rs17143056 | A | G |
| 0.183 | 0.909 | 2.44 × 10-8 | |
| 7 | 28102567 | rs10245867 | G | T |
| 0.331 | 0.928 | 3.15 × 10-8 | Eczema, hay fever, MS, SLE, monocyte percentage |
| 8 | 11877675 | rs2250903 | G | T |
| 0.283 | 0.905 | 1.35 × 10-10 | |
| 9 | 99823263 | rs1405209 | T | C |
| 0.375 | 1.075 | 3.45 × 10-8 | |
| 10 | 33137219 | rs722988 | T | C |
| 0.367 | 1.108 | 3.21 × 10-15 | |
| 11 | 35267496 | rs11033048 | C | T |
| 0.366 | 1.091 | 1.53 × 10-10 | Vitiligo |
| 11 | 60961822 | rs79538630 | G | T |
| 0.035 | 1.213 | 1.14 × 10-9 | |
| 11 | 61828092 | rs968567 | C | T |
| 0.177 | 0.903 | 8.42 × 10-9 | RA, neutrophil percentage |
| 11 | 64367826 | rs645078 | A | C |
| 0.385 | 0.925 | 3.34 × 10-9 | |
| 11 | 128734337 | rs605093 | G | T |
| 0.470 | 1.077 | 4.25 × 10-9 | |
| 12 | 8942630 | rs1805731 | T | C |
| 0.389 | 1.073 | 4.16 × 10-8 | Eosinophil count |
| 12 | 53077434 | rs7313065 | C | A |
| 0.162 | 1.101 | 3.28 × 10-9 | |
| 13 | 42343795 | rs74537115 | C | T |
| 0.141 | 1.109 | 5.41 × 10-9 | |
| 14 | 68286876 | rs911263 | C | T |
| 0.710 | 1.083 | 1.69 × 10-8 | PBC, SLE, RA |
| 16 | 20331769 | rs4238595 | T | C |
| 0.687 | 0.912 | 2.43 × 10-11 | |
| 17 | 45996523 | rs1052553 | A | G |
| 0.232 | 0.879 | 1.65 × 10-15 | Parkinson’s disease |
| 17 | 47956725 | rs2597169 | A | G |
| 0.348 | 1.081 | 3.35 × 10-9 | |
| 21 | 44204668 | rs56178904 | C | T |
| 0.187 | 0.898 | 6.48 × 10-11 |
Genome build 38
Closest gene or gene with mechanistic support from the literature.
Additive odds ratio for the addition of an A2 allele.
Related traits (https://genetics.opentargets.org) where the lead variant is in strong LD (r 2 > 0.95 in 1000 Genomes Project European population) with T1D lead variant.
RA, rheumatoid arthritis; T2D, type 2 diabetes; SLE, systemic lupus erythematosus; MS, multiple sclerosis; IBD, inflammatory bowel disease; PBC, primary biliary cholangitis; AF, allele frequency; OR, odds ratio.
Figure 1Fine-mapping T1D regions using a Bayesian stochastic search algorithm.
a, Number of variants in GUESSFM-prioritized groups with group posterior probability > 0.5. Candidate gene names and lead variants for each group are shown on the y-axis. b, Manhattan plot of the UBASH3A region from the EUR case-control analysis, highlighting the lead variant from the univariable analysis, rs11203203:G>A (grey), and the three variants prioritized using GUESSFM, rs9984852:T>C (blue), rs13048049:G>A (red) and rs7276555:T>C (green). c, Comparison of model AIC in the UBASH3A region for models fit using EUR cases and controls only, comparing combinations of alleles prioritized either in univariable (grey) or GUESSFM analyses (red, green and blue). d, Analysis of haplotypes associated with T1D in the UBASH3A region. The most common haplotype (H1: T-G-G-T for rs7276555-rs13048049-rs11203203-rs9984852) is presented on the far left; alternative haplotypes (H2-H6) are shown with white squares highlighting the differentiating alleles (C, A, A, or C, respectively). The frequency and effect estimates for association with T1D relative to the baseline haplotype (H1) are shown above the grid (the point and error bars represent the log odds ratio and 95% confidence interval of the log odds ratio, respectively); for example, the log odds ratio for T1D risk for haplotype H3 (T-G-A-T) relative to the baseline haplotype (H1) is close to zero and the 95% confidence interval crosses zero. Haplotype analyses were performed based on n = 33,601 unrelated EUR individuals (13,458 T1D cases and 20,143 controls).
Figure 2Fine-mapping of the chromosome 4p15.2 region.
a, European (EUR, top panel) and African (AFR, middle panel) ancestry group association z-score statistics; posterior probabilities (bottom panel) from multi-ethnic fine-mapping of EUR and AFR using PAINTOR; z-scores are colored by linkage disequilibrium (LD) to the lead PAINTOR-prioritized variant. b, Overlay of T1D-credible variants with open chromatin ATAC-seq peaks in immune cells, with variants prioritized by PAINTOR (posterior probability > 0.1) highlighted with blue dashed lines. Normalized ATAC-seq read count shown for effector CD4+ T cells, B cells, and CD8+ T cells, under stimulated and non-stimulated conditions.
Extended Data Figure 1
Extended Data Figure 2T1D-associations colocalizing with caQTLs in CD4+ T cells.
Five regions show colocalization between T1D and a caQTL with a colocalization posterior probability > 0.8. In all of these regions, at least one T1D credible variant overlaps the caQTL peak itself. In four regions, the T1D association also colocalizes with an eQTL for expression of one or more genes in whole blood.
| T1D lead variant | BetaT1D
| Peak | T1D-credible variants in peak | caQTL lead variant | BetacaQTL
|
| PP | Whole blood |
|---|---|---|---|---|---|---|---|---|
| rs71624119 (chr5:56144903:G:A) | -0.099 | chr5:56147972-56149111 | rs7731626 | rs7731626 (chr5:56148856:G:A) | -0.5 | 2.4 × 10-9 | 0.97 |
|
| rs72928038 (chr6:90267049:G:A) | 0.172 | chr6:90266766-90267747 | rs72928038 | rs72928038 (chr6:90267049:G:A) | -1.0 | 3.9 × 10-16 | 1.00 |
|
| rs2027299 (chr6:126364681:G:C) | 0.147 | chr6:126339725-126340580 | rs9388486 | rs1361262 (chr6:126380821:T:C) | -0.4 | 2.0 × 10-16 | 0.87 |
|
| rs61555617[ | 0.257 | chr12:56041256-56042638 | rs705704 | rs705704 (chr12:56041628:G:A) | -0.2 | 1.1 × 10-15 | 0.97 |
|
| rs4900384 (chr14:98032614:A:G) | 0.118 | chr14:98018322-98019163 | rs11628807 | rs11628807 (chr14:98018774:T:G) | 0.7 | 1.8 × 10-21 | 0.95 | - |
T1D lead variant is the most associated variant in the credible set, as defined by fine mapping (Supplementary Table 11); caQTL lead variant is the most associated variant with chromatin accessibility at the peak of interest. Variants are provided as rsid (chromosome:hg38_position:reference:alternative).
BetaT1D refers to the effect size for the alternative allele of the T1D lead variant; BetacaQTL refers to the effect size for the alternative allele of the caQTL lead variant.
Whole blood cis-eQTL statistics from eQTLGen for the T1D lead variant and colocalization with the T1D association.
rs61555617 is referred to as rs796916887 in supplementary tables
cis-eQTL statistics for rs61555617 are missing in eQTLGen; the reported GDF11 cis-eQTL z-score is for the highly correlated variant rs705704.
PP, posterior probability of colocalization between the QTL (eQTL or caQTL) and the T1D association (referred to in coloc documentation as “PP.H4.abf”); caQTL, chromatin accessibility quantitative trait locus; eQTL, expression quantitative trait locus.
Figure 3Functional annotation of T1D-associated variants in the BACH2 region.
a-c, Position of T1D credible variants (rs72928038:G>A and rs6908626:G>T) relative to introns and exons of BACH2 (a), chromHMM tracks across diverse immune cell types from the BLUEPRINT consortium (red, active promoter; orange, distal active promoter; dark green, transcription; light green, genic enhancer; yellow, enhancer; white, quiescent; light grey, Polycomb repressed; dark grey, repressed; blue, heterochromatin) (b), and interactions with the BACH2 promoter in published PCHi-C data from naïve CD4+ T cells[43] (grey squares indicate boundaries of target (left) and bait (right)) (c). Chromatin coordinates and scale are identical and aligned in figures a-c. d, Accessibility of regions overlapping rs72928038:G>A and rs6908626:G>T by genotype; peak accessibility is quantified as normalized transposase cut frequency (Online Methods); center line, median; box limits, upper and lower quartiles; whiskers, 1.5x interquartile range (n = 115 individuals). e, Allele-specific accessibility of chromatin within heterozygous individuals at rs72928038:G>A (n = 14 heterozygous individuals) and rs6908626:G>T (n = 15 heterozygous individuals). f, Chromatin accessibility profiles in the region overlapping rs72928038:G>A across resting and activated CD4+ and CD8+ T cells (published data[38]). Height of tracks represent transposase cut frequency; all tracks are plotted using the same vertical scale. g, LocusCompare plots showing colocalization between T1D association, the caQTL for chr6:90266766-90267715 (left), and the eQTL for BACH2 (right).