| Literature DB >> 30289880 |
Guanghao Qi1, Nilanjan Chatterjee1,2.
Abstract
Genome-wide association studies have shown that pleiotropy is a common phenomenon that can potentially be exploited for enhanced detection of susceptibility loci. We propose heritability informed power optimization (HIPO) for conducting powerful pleiotropic analysis using summary-level association statistics. We find optimal linear combinations of association coefficients across traits that are expected to maximize non-centrality parameter for the underlying test statistics, taking into account estimates of heritability, sample size variations and overlaps across the traits. Simulation studies show that the proposed method has correct type I error, robust to population stratification and leads to desired genome-wide enrichment of association signals. Application of the proposed method to publicly available data for three groups of genetically related traits, lipids (N = 188,577), psychiatric diseases (Ncase = 33,332, Ncontrol = 27,888) and social science traits (N ranging between 161,460 to 298,420 across individual traits) increased the number of genome-wide significant loci by 12%, 200% and 50%, respectively, compared to those found by analysis of individual traits. Evidence of replication is present for many of these loci in subsequent larger studies for individual traits. HIPO can potentially be extended to high-dimensional phenotypes as a way of dimension reduction to maximize power for subsequent genetic association testing.Entities:
Mesh:
Year: 2018 PMID: 30289880 PMCID: PMC6192650 DOI: 10.1371/journal.pgen.1007549
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
Type I error rates for HIPO observed in datasets simulated under covariance structure estimated from studies of blood lipids.
See S1 Table 1a and 1b for detailed settings. Summary-level association statistics are simulated for 4 traits using genetic and phenotypic covariance matrices estimated from Global Lipids Genetics Consortium (GLGC) data, with and without population stratification. The results for HIPO-D1 and the most heritable trait are listed. Reported are the average of genome-wide type I error rates across 100 simulations, under significance thresholds p<0.05, p<0.01 and p<0.001.
| p-value threshold | 0.1 | 0.2 | 0.35 | 0.5 | 0.1 | 0.2 | 0.35 | 0.5 | |
|---|---|---|---|---|---|---|---|---|---|
| N | |||||||||
| p<0.05 | 0.051 | 0.051 | 0.05 | 0.05 | 0.051 | 0.051 | 0.05 | 0.05 | |
| p<0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | |
| p<0.001 | 0.0011 | 0.001 | 0.001 | 0.001 | 0.0011 | 0.001 | 0.001 | 0.001 | |
| p<0.05 | 0.05 | 0.05 | 0.05 | 0.05 | 0.05 | 0.05 | 0.05 | 0.05 | |
| p<0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | |
| p<0.001 | 0.001 | 0.001 | 0.001 | 0.001 | 0.001 | 0.001 | 0.001 | 0.001 | |
| p<0.05 | 0.05 | 0.05 | 0.051 | 0.05 | 0.05 | 0.05 | 0.05 | 0.051 | |
| p<0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | |
| p<0.001 | 0.001 | 0.001 | 0.001 | 0.001 | 0.001 | 0.001 | 0.001 | 0.001 | |
| p<0.05 | 0.05 | 0.051 | 0.05 | 0.052 | 0.05 | 0.051 | 0.051 | 0.052 | |
| p<0.01 | 0.01 | 0.01 | 0.01 | 0.011 | 0.01 | 0.01 | 0.01 | 0.011 | |
| p<0.001 | 0.001 | 0.0011 | 0.001 | 0.0011 | 0.001 | 0.001 | 0.0011 | 0.0011 | |
| p<0.05 | 0.051 | 0.051 | 0.05 | 0.051 | 0.051 | 0.051 | 0.05 | 0.051 | |
| p<0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | |
| p<0.001 | 0.0011 | 0.0011 | 0.0011 | 0.0011 | 0.0011 | 0.0011 | 0.001 | 0.0011 | |
| p<0.05 | 0.05 | 0.05 | 0.051 | 0.051 | 0.051 | 0.051 | 0.051 | 0.051 | |
| p<0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | |
| p<0.001 | 0.0011 | 0.0011 | 0.0011 | 0.0011 | 0.0011 | 0.0011 | 0.0011 | 0.0011 | |
| p<0.05 | 0.051 | 0.051 | 0.051 | 0.051 | 0.052 | 0.051 | 0.051 | 0.051 | |
| p<0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.011 | 0.011 | 0.011 | 0.011 | |
| p<0.001 | 0.0011 | 0.0011 | 0.0011 | 0.0011 | 0.0011 | 0.0011 | 0.0011 | 0.0011 | |
| p<0.05 | 0.05 | 0.051 | 0.051 | 0.051 | 0.055 | 0.055 | 0.055 | 0.055 | |
| p<0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.012 | 0.012 | 0.012 | 0.012 | |
| p<0.001 | 0.0011 | 0.0011 | 0.0011 | 0.0011 | 0.0013 | 0.0013 | 0.0013 | 0.0013 | |
is the largest heritability among the individual traits.
Number of truly associated independent loci discovered by HIPO, MTAG and individual trait analysis observed in datasets simulated under the covariance structure estimated from studies of blood lipids.
See 1a-1d in S1 Table for detailed settings. We report the average number of truly associated loci identified by all the individual traits/HIPO components/MTAG estimates across 100 simulations, under significance threshold p < 5 × 10−8 and LD pruning threshold r2 < 0.1 and different loci required to be >0.5Mb apart.
| 0.1 | 0.2 | 0.35 | 0.5 | 0.1 | 0.2 | 0.35 | 0.5 | ||
|---|---|---|---|---|---|---|---|---|---|
| N | |||||||||
| Individual traits | 0 | 1 | 1 | 3 | 0 | 0 | 1 | 3 | |
| HIPO | 0 | 0 | 1 | 2 | 0 | 0 | 1 | 2 | |
| MTAG | 0 | 1 | 1 | 4 | 0 | 0 | 1 | 4 | |
| HIPO new | 0 | 0 | 1 | 2 | 0 | 0 | 1 | 1 | |
| MTAG new | 0 | 0 | 1 | 2 | 0 | 0 | 1 | 2 | |
| Individual traits | 3 | 25 | 134 | 341 | 3 | 25 | 136 | 344 | |
| HIPO | 2 | 24 | 129 | 317 | 2 | 23 | 128 | 318 | |
| MTAG | 4 | 32 | 165 | 393 | 3 | 32 | 165 | 394 | |
| HIPO new | 2 | 12 | 52 | 98 | 1 | 12 | 51 | 101 | |
| MTAG new | 2 | 13 | 51 | 90 | 1 | 12 | 50 | 90 | |
| Individual traits | 25 | 193 | 722 | 1299 | 26 | 201 | 727 | 1314 | |
| HIPO | 23 | 185 | 679 | 1232 | 24 | 186 | 669 | 1229 | |
| MTAG | 32 | 233 | 799 | 1382 | 33 | 234 | 791 | 1381 | |
| HIPO new | 12 | 68 | 167 | 206 | 12 | 66 | 154 | 197 | |
| MTAG new | 12 | 65 | 139 | 157 | 13 | 62 | 130 | 148 | |
| Individual traits | 1303 | 2536 | 3173 | 3490 | 1352 | 2557 | 3186 | 3491 | |
| HIPO | 1240 | 2464 | 3132 | 3471 | 1204 | 2425 | 3119 | 3454 | |
| MTAG | 1394 | 2568 | 3175 | 3488 | 1358 | 2531 | 3164 | 3474 | |
| HIPO new | 202 | 144 | 81 | 53 | 171 | 120 | 70 | 46 | |
| MTAG new | 162 | 88 | 33 | 17 | 121 | 63 | 26 | 12 | |
| Individual traits | 0 | 0 | 1 | 3 | 0 | 0 | 1 | 1 | |
| HIPO | 0 | 0 | 1 | 3 | 0 | 0 | 0 | 1 | |
| MTAG | 0 | 0 | 1 | 4 | 0 | 0 | 1 | 1 | |
| HIPO new | 0 | 0 | 1 | 2 | 0 | 0 | 0 | 1 | |
| MTAG new | 0 | 0 | 1 | 2 | 0 | 0 | 0 | 1 | |
| Individual traits | 3 | 26 | 135 | 348 | 1 | 8 | 49 | 136 | |
| HIPO | 2 | 21 | 122 | 312 | 1 | 8 | 50 | 139 | |
| MTAG | 4 | 33 | 163 | 403 | 1 | 11 | 67 | 179 | |
| HIPO new | 2 | 13 | 60 | 126 | 1 | 5 | 26 | 61 | |
| MTAG new | 2 | 12 | 49 | 96 | 1 | 6 | 28 | 64 | |
| Individual traits | 26 | 196 | 730 | 1336 | 8 | 73 | 334 | 719 | |
| HIPO | 23 | 178 | 651 | 1219 | 8 | 73 | 332 | 715 | |
| MTAG | 33 | 239 | 802 | 1412 | 12 | 98 | 413 | 845 | |
| HIPO new | 14 | 84 | 195 | 256 | 5 | 36 | 122 | 199 | |
| MTAG new | 13 | 68 | 136 | 153 | 6 | 38 | 118 | 184 | |
| Individual traits | 1338 | 2601 | 3269 | 3588 | 725 | 1954 | 2799 | 3193 | |
| HIPO | 1225 | 2523 | 3243 | 3583 | 721 | 1938 | 2772 | 3181 | |
| MTAG | 1419 | 2632 | 3274 | 3589 | 854 | 2078 | 2846 | 3219 | |
| HIPO new | 254 | 198 | 116 | 75 | 194 | 231 | 143 | 94 | |
| MTAG new | 155 | 82 | 31 | 15 | 186 | 189 | 91 | 54 | |
is the largest heritability among the individual traits.
Fig 1QQ plots for individual traits and underlying HIPO components across blood lipids, psychiatric diseases, social science traits.
Blood lipid traits include HDL, LDL, triglycerides (TG) and total cholesterol (TC). Psychiatric diseases include autism spectrum disorder (ASD), ADHD, bipolar disorder (BIP), major depressive disorder (MDD) and schizophrenia (SCZ). Meta-analysis QQ plot is also included for psychiatric diseases (in green). Social science traits include depressive symptoms (DS), neuroticism (NEU) and subjective well-being (SWB). Genomic control factors and average χ2 statistics are shown in the legend.
Novel loci discovered at genome-wide significance level (p < 5 × 10−8) by the first and second HIPO components of blood lipid traits.
| SNP | CHR | MBP | Nearest Gene (Distance) | ||||||
|---|---|---|---|---|---|---|---|---|---|
| 2 | 3.6 | 2.87e-03 (+) | 1.15e-04 (+) | 8.58e-04 (-) | 2.14e-06 (+) | 1.13e-09 | 7.82e-04 | ||
| 2 | 65.3 | 8.72e-02 (+) | 6.35e-06 (+) | 1.89e-06 (-) | 4.66e-01 (+) | 1.33e-08 | 7.92e-01 | ||
| 3 | 123.1 | 9.78e-01 (+) | 3.88e-06 (-) | 2.44e-03 (+) | 6.75e-02 (-) | 4.30e-08 | 5.52e-01 | ||
| 4 | 157.7 | 6.44e-01 (-) | 3.20e-07 (+) | 3.18e-04 (-) | 5.89e-01 (+) | 2.27e-08 | 5.44e-01 | ||
| 4 | 102.7 | 3.11e-01 (-) | 9.19e-08 (-) | 1.27e-01 (+) | 9.85e-04 (-) | 3.03e-08 | 2.43e-02 | ||
| 5 | 173.4 | 7.00e-01 (-) | 6.13e-04 (-) | 7.72e-07 (+) | 2.67e-01 (-) | 3.69e-08 | 7.80e-01 | ||
| 8 | 36.8 | 7.66e-01 (+) | 3.79e-06 (-) | 1.85e-06 (+) | 7.16e-01 (-) | 3.91e-09 | 3.32e-01 | ||
| 10 | 115.8 | 4.32e-01 (-) | 1.09e-06 (-) | 1.61e-03 (+) | 1.07e-01 (-) | 4.21e-08 | 5.74e-01 | ||
| 11 | 13.4 | 2.53e-02 (+) | 1.52e-07 (+) | 5.73e-07 (-) | 1.87e-02 (+) | 3.88e-12 | 3.16e-01 | ||
| 11 | 68.6 | 3.69e-01 (-) | 2.05e-07 (-) | 9.49e-06 (+) | 4.01e-02 (-) | 3.35e-11 | 5.30e-01 | ||
| 11 | 27.6 | 2.46e-01 (-) | 3.50e-05 (-) | 3.44e-06 (+) | 7.19e-02 (-) | 2.56e-09 | 6.44e-01 | ||
| 11 | 110.0 | 9.06e-03 (+) | 9.84e-07 (+) | 1.77e-01 (-) | 2.70e-06 (+) | 2.58e-08 | 2.23e-04 | ||
| 12 | 122.2 | 7.23e-01 (-) | 1.45e-06 (+) | 3.98e-07 (-) | 5.43e-01 (+) | 2.84e-10 | 3.86e-01 | ||
| 12 | 113.2 | 9.47e-03 (+) | 3.61e-07 (+) | 3.18e-02 (-) | 5.00e-05 (+) | 4.39e-09 | 2.51e-03 | ||
| 12 | 26.5 | 8.65e-02 (-) | 2.93e-07 (-) | 1.59e-03 (+) | 5.74e-02 (-) | 1.75e-08 | 3.28e-01 | ||
| 15 | 41.8 | 5.17e-01 (+) | 2.26e-07 (-) | 4.30e-05 (+) | 7.10e-01 (-) | 4.25e-09 | 3.75e-01 | ||
| 17 | 46.2 | 1.99e-02 (+) | 1.71e-07 (+) | 2.23e-04 (-) | 9.51e-03 (+) | 4.30e-10 | 1.32e-01 | ||
| 19 | 32.9 | 9.34e-01 (-) | 5.58e-08 (+) | 4.83e-03 (-) | 9.85e-02 (+) | 7.71e-09 | 6.28e-01 | ||
| 19 | 47.6 | 1.47e-01 (-) | 9.99e-07 (+) | 3.20e-07 (-) | 2.70e-01 (-) | 8.87e-09 | 1.65e-02 | ||
| 20 | 33.2 | 1.30e-01 (+) | 5.73e-07 (+) | 6.33e-05 (-) | 6.33e-02 (+) | 1.27e-09 | 4.76e-01 | ||
| 3 | 142.7 | 5.70e-05 (-) | 6.85e-01 (+) | 3.77e-04 (-) | 2.87e-07 (-) | 5.41e-01 | 2.99e-08 | ||
| 4 | 69.6 | 2.62e-05 (+) | 1.75e-01 (+) | 4.23e-04 (+) | 5.68e-08 (+) | 5.62e-01 | 8.10e-09 | ||
| 6 | 52.4 | 7.52e-07 (-) | 3.08e-01 (+) | 4.14e-02 (-) | 6.66e-08 (-) | 9.39e-01 | 2.13e-08 | ||
| 11 | 78.0 | 2.79e-06 (-) | 9.75e-01 (-) | 5.47e-03 (-) | 2.48e-07 (-) | 9.57e-01 | 3.47e-08 |
Independent SNPs were identified through LD-pruning with r2 threshold of 0.1 and pruned SNPs were assumed to represent independent loci if they are >0.5Mb apart. Loci are considered novel if they are not identified at genome-wide significance level through analysis of individual traits. For each lead SNP, p-values for association are shown for HIPO components and for individual traits. The directions of association (+/-) are also shown for each of the individual traits. TG: triglycerides; TC: total cholesterol. HIPO-D1 and HIPO-D2: 1st and 2nd HIPO components. The weights for the first and second HIPO components are: .
# in the last two columns indicates that this locus passes the Bonferroni corrected significance threshold (11 loci by HIPO-D1 and 1 by HIPO-D2).
Novel loci discovered at genome-wide significance level (p < 5 × 10−8) by HIPO-D1 for social science traits.
| SNP | CHR | MBP | Nearest Gene (Distance) | ||||
|---|---|---|---|---|---|---|---|
| 1 | 21.3 | 6.33e-05 (-) | 6.33e-05 (-) | 6.33e-05 (+) | 1.38e-08 | ||
| 4 | 141.0 | 1.47e-05 (-) | 6.33e-05 (-) | 6.33e-05 (+) | 8.02e-09 | ||
| 5 | 164.6 | 1.47e-05 (-) | 5.73e-07 (-) | 1.96e-02 (+) | 1.90e-08 | ||
| 6 | 70.6 | 1.15e-03 (-) | 5.73e-07 (-) | 1.77e-04 (+) | 1.88e-09 | ||
| 6 | 120.3 | 8.58e-04 (+) | 2.14e-05 (+) | 1.47e-05 (-) | 5.94e-09 | ||
| 6 | 11.7 | 1.10e-01 (+) | 2.03e-06 (+) | 6.33e-05 (-) | 3.61e-08 | ||
| 7 | 114.0 | 1.77e-04 (+) | 3.06e-06 (+) | 8.58e-04 (-) | 1.04e-08 | ||
| 7 | 4.2 | 1.15e-03 (-) | 1.47e-05 (-) | 6.33e-05 (+) | 1.28e-08 | ||
| 13 | 99.1 | 6.33e-05 (-) | 1.52e-07 (-) | 8.58e-04 (+) | 6.50e-10 | ||
| 16 | 13.0 | 2.70e-03 (-) | 2.46e-04 (-) | 3.06e-06 (+) | 3.73e-08 | ||
| 18 | 52.0 | 5.96e-03 (-) | 2.14e-05 (-) | 9.64e-08 (+) | 8.17e-10 | ||
| 18 | 52.9 | 8.58e-04 (+) | 9.64e-08 (+) | 2.70e-03 (-) | 3.92e-09 |
Independent SNPs are identified through LD-pruning with threshold of 0.1 and pruned SNPs are assumed to represent independent loci if they are >0.5Mb apart. Loci are considered novel if they were not identified at genome-wide significance level through analysis individual traits. For each lead SNP, p-values for association are shown for HIPO components and for individual traits. The directions of association (+/-) are also shown for each of the individual traits. DS: depressive symptoms; NEU: neuroticism; SWB: subjective well-being. HIPO-D1: 1st HIPO component. Weights for HIPO-D1: . NA in the Nearest Gene column means there is no gene within 200kb of the SNP. SNPs marked by * indicates underlying loci show evidence of replication in the larger data set used in the MTAG paper (see Table 5).
# in the last column indicates that this locus passes Bonferroni corrected significance threshold (7 loci in total).
Evidence of replication of novel loci identified by HIPO analysis for social science traits in subsequent larger studies of DS and SWB.
| Lead SNP in Novel Loci | Proxy SNP Reported in MTAG Study | D’ | Individual Trait p-value in SSGAC | Individual Trait p-value in MTAG Study |
|---|---|---|---|---|
| rs1877075 | 0.78 | 2.00e-06 | 1.10e-06 | |
| rs10045971 | 0.99 | 4.51e-02 | 1.17e-09 | |
| rs4723416 | 0.91 | 1.59e-03 | 1.17e-06 | |
| rs4772087 | 1.00 | 2.42e-03 | 1.04e-06 | |
| rs11876620 | 0.82 | 1.58e-04 | 4.45e-08 | |
| rs12125335 | 1.00 | NA | 7.09e-08 | |
| rs769664 | 0.79 | 3.18e-03 | 4.59e-07 | |
| rs10947543 | 0.93 | NA | 3.10e-08 |
Reported are P-values for proxy SNPs (D’ > 0.75) for individual trait associations in SSGAC data and the more recent MTAG study. Novel loci are identified through analysis of SSGAC which include studies of DS and SWB with sample sizes Neff = 161,460 and N = 298,420, respectively. The MTAG study includes an expanded set of sample with Neff = 354,862 and N = 388,538 for DS and SWB, respectively. DS: depressive symptoms; NEU: neuroticism; SWB: subjective well-being. NA indicates that the proxy SNP is not present in the SSGAC data.