| Literature DB >> 27182969 |
Buhm Han1,2,3,4, Jennie G Pouget1,5,6,7, Kamil Slowikowski1,3,4,8, Eli Stahl9, Cue Hyunkyu Lee10, Dorothee Diogo1,3,4, Xinli Hu1,3,4,11, Yu Rang Park10,12, Eunji Kim10,13, Peter K Gregersen14, Solbritt Rantapää Dahlqvist15, Jane Worthington16,17, Javier Martin18, Steve Eyre16,17, Lars Klareskog19, Tom Huizinga20, Wei-Min Chen21, Suna Onengut-Gumuscu21, Stephen S Rich21, Naomi R Wray22, Soumya Raychaudhuri1,3,4,19,23.
Abstract
There is growing evidence of shared risk alleles for complex traits (pleiotropy), including autoimmune and neuropsychiatric diseases. This might be due to sharing among all individuals (whole-group pleiotropy) or a subset of individuals in a genetically heterogeneous cohort (subgroup heterogeneity). Here we describe the use of a well-powered statistic, BUHMBOX, to distinguish between those two situations using genotype data. We observed a shared genetic basis for 11 autoimmune diseases and type 1 diabetes (T1D; P < 1 × 10(-4)) and for 11 autoimmune diseases and rheumatoid arthritis (RA; P < 1 × 10(-3)). This sharing was not explained by subgroup heterogeneity (corrected PBUHMBOX > 0.2; 6,670 T1D cases and 7,279 RA cases). Genetic sharing between seronegative and seropostive RA (P < 1 × 10(-9)) had significant evidence of subgroup heterogeneity, suggesting a subgroup of seropositive-like cases within seronegative cases (PBUHMBOX = 0.008; 2,406 seronegative RA cases). We also observed a shared genetic basis for major depressive disorder (MDD) and schizophrenia (P < 1 × 10(-4)) that was not explained by subgroup heterogeneity (PBUHMBOX = 0.28; 9,238 MDD cases).Entities:
Mesh:
Substances:
Year: 2016 PMID: 27182969 PMCID: PMC4925284 DOI: 10.1038/ng.3572
Source DB: PubMed Journal: Nat Genet ISSN: 1061-4036 Impact factor: 38.330
Figure 1Overview of BUHMBOX
(a) Under the scenario of subgroup heterogeneity, risk alleles of disease B (DB)-associated loci will be enriched in a subgroup of disease A (DA) cases, producing positive correlations between DB risk allele dosages from independent loci. (b) Under the scenario where there is no heterogeneity and DA and DB share alleles due to pleiotropy (i.e. whole-group pleiotropy), DB risk alleles will be uniformly distributed and have no correlations. Red boxes: risk alleles; white boxes: non-risk alleles.
Figure 2Power gain by weighting SNPs by allele frequency and effect size
We compared the statistical power of BUHMBOX with a weighting scheme that optimally weights correlations between SNPs (weighted) to an alternative approach that weights correlations uniformly (unweighted; equation (12) in Supplementary Note). We simulated 1,000 case individuals and assumed 50 risk loci, whose OR and RAFs were sampled from the GWAS catalog. Colored bands denote 95% confidence intervals of power estimates.
Figure 3BUHMBOX power analysis
Power of BUHMBOX for detecting heterogeneity as a function of the number of risk loci, number of case samples, and the proportion of samples that actually have different phenotype (heterogeneity proportion, π). We assume that we have the same number of controls as cases. White lines denote 20, 40, 60, and 80% power. (a) Power as a function of number of case individuals and heterogeneity proportion, when the number of risk loci is fixed at 50. (b) Power as a function of number of risk loci and heterogeneity proportion, when the case sample size is fixed at 2,000.
Summary of genetic overlap using GRS and BUHMBOX
Only the traits that have significant GRS p-values in positive directions are shown. Significant GRS p-value indicates evidence of shared genetic structure; significant BUHMBOX p-value indicates evidence of heterogeneity. See Supplementary Table 4 for the full results for all traits tested.
| Cohort data | Test trait | #SNP | GRS p-value | GRS Beta (95% CI) | BUHMBOX p-value | BUHMBOX power at |
|---|---|---|---|---|---|---|
| T1D | AA | 10 | 1.4 × 10−120 | 0.76 (0.69 – 0.82) | 0.83 | 0.15 |
| ATD | 7 | 1.4 × 10−31 | 0.48 (0.40 – 0.56) | 0.30 | 0.05 | |
| CEL | 38 | 2.2 × 10−35 | 0.32 (0.27 – 0.38) | 0.16 | 0.50 | |
| CRO | 119 | 2.4 × 10−05 | 0.08 (0.04 – 0.11) | 0.54 | 0.99 | |
| JIA | 22 | 3.6 × 10−151 | 0.44 (0.40 – 0.47) | 0.37 | 0.96 | |
| PBC | 19 | 1.1 × 10−12 | 0.16 (0.11 – 0.20) | 0.18 | 0.82 | |
| PSC | 12 | 4.1 × 10−26 | 0.38 (0.31 – 0.45) | 0.91 | 0.08 | |
| RA | 68 | 6.6 × 10−89 | 0.55 (0.49 – 0.60) | 0.45 | 0.40 | |
| SJO | 7 | 3.9 × 10−146 | 0.53 (0.49 – 0.57) | 0.84 | 0.66 | |
| SLE | 16 | 1.1 × 10−83 | 0.44 (0.39 – 0.48) | 0.79 | 0.91 | |
| VIT | 12 | 2.5 × 10−90 | 0.59 (0.53 – 0.65) | 0.14 | 0.33 | |
| RA | AA | 10 | 1.5 × 10−22 | 0.28 (0.22 – 0.34) | 0.71 | 0.23 |
| AS | 24 | 6.1 × 10−04 | 0.10 (0.04 – 0.15) | 0.19 | 0.20 | |
| ATD | 7 | 3.9 × 10−20 | 0.34 (0.27 – 0.41) | 0.57 | 0.08 | |
| CEL | 38 | 6.4 × 10−20 | 0.21 (0.17 – 0.26) | 0.57 | 0.63 | |
| JIA | 22 | 8.9 × 10−125 | 0.36 (0.33 – 0.39) | 0.61 | 0.99 | |
| PBC | 19 | 1.5 × 10−13 | 0.15 (0.11 – 0.19) | 0.83 | 0.90 | |
| PSC | 12 | 6.2 × 10−14 | 0.24 (0.18 – 0.31) | 0.46 | 0.12 | |
| SLE | 16 | 4.3 × 10−06 | 0.10 (0.05 – 0.14) | 0.34 | 0.96 | |
| SSC | 5 | 9.6 × 10−10 | 0.22 (0.15 – 0.29) | 0.08 | 0.09 | |
| T1D | 53 | 9.6 × 10−207 | 0.43 (0.40 – 0.46) | 0.29 | 1.00 | |
| VIT | 12 | 1.8 × 10−11 | 0.18 (0.12 – 0.23) | 0.02 | 0.41 | |
| Seroneg.RA | Seropos.RA | 14 | 1.1 × 10−10 | 0.30 (0.21 – 0.39) | 0.008 | 0.26 |
| MDD | SCZ | 90 | 1.5 × 10−5 | 0.17 (0.09 – 0.24) | 0.28 | 0.53 |
AA, Alopecia areata; AS, Ankylosing spondylitis; ATD, Autoimmune thyroid disease; CEL, celiac disease; CRO, Crohn’s disease; JIA, juvenile idiopathic arthritis; MS, multiple sclerosis; PBC, primary biliary cirrhosis; PSC, primary sclerosing cholangitis; SJO, Sjögren’s syndrome; SLE, systemic lupus erythematosus; SSC, Systemic sclerosis; UC, ulcerative colitis; VIT: Vitiligo; MDD, major depressive disorder; SCZ, schizophrenia; Seroneg., seronegative; Seropos., seropositive.
Figure 4Genetic sharing between autoimmune diseases and psychiatric disorders
In (a) and (b), we show only the diseases that have significantly positive GRS p-values out of the 17 tested. Y-axis denotes the expected heterogeneity proportion (π) to explain observed genetic sharing. Vertical bars indicate 95% confidence intervals. Heterogeneity proportion estimates are based on GRS analysis, assuming no pleiotropy for (a) T1D, (b) RA, (c) seronegative RA, and (d) MDD.
Figure 5Statistical power of BUHMBOX to detect heterogeneity
We calculated power by performing 1,000 simulations with corresponding sample size, number of risk alleles, risk allele frequencies, and odds ratios. To calculate power for (c) and (d), we used a significance threshold of 0.05. For (a) and (b), the threshold was adjusted using the Bonferroni correction accounting for 11 tests in T1D and RA, respectively.
Figure 6BUHMBOX results
We show only diseases with significantly positive GRS p-values (for complete results for all traits tested, see Supplementary Table 4). Significant GRS p-values indicate evidence of shared genetic structure; significant BUHMBOX p-value indicates evidence of heterogeneity. Point size represents the number of DB-associated SNPs included in the analysis. Dashed vertical lines denote the Bonferroni-adjusted significance threshold for the BUHMBOX test statistic. Arrow indicates significant BUHMBOX test statistic.