| Literature DB >> 34145383 |
Fernando M Aguate1, Ana I Vazquez2, Tony R Merriman3, Gustavo de Los Campos4,5.
Abstract
Pleiotropy (i.e., genes with effects on multiple traits) leads to genetic correlations between traits and contributes to the development of many syndromes. Identifying variants with pleiotropic effects on multiple health-related traits can improve the biological understanding of gene action and disease etiology, and can help to advance disease-risk prediction. Sequential testing is a powerful approach for mapping genes with pleiotropic effects. However, the existing methods and the available software do not scale to analyses involving millions of SNPs and large datasets. This has limited the adoption of sequential testing for pleiotropy mapping at large scale. In this study, we present a sequential test and software that can be used to test pleiotropy in large systems of traits with biobank-sized data. Using simulations, we show that the methods implemented in the software are powerful and have adequate type-I error rate control. To demonstrate the use of the methods and software, we present a whole-genome scan in search of loci with pleiotropic effects on seven traits related to metabolic syndrome (MetS) using UK-Biobank data (n~300 K distantly related white European participants). We found abundant pleiotropy and report 170, 44, and 18 genomic regions harboring SNPs with pleiotropic effects in at least two, three, and four of the seven traits, respectively. We validate our results using previous studies documented in the GWAS-catalog and using data from GTEx. Our results confirm previously reported loci and lead to several novel discoveries that link MetS-related traits through plausible biological pathways.Entities:
Mesh:
Year: 2021 PMID: 34145383 PMCID: PMC8633382 DOI: 10.1038/s41431-021-00911-z
Source DB: PubMed Journal: Eur J Hum Genet ISSN: 1018-4813 Impact factor: 4.246
Type I error rate (in -log10 scale) of sLRT and pleiotest by effects-scenario (Ha1 or Ha2), error correlation (Cor), and significance level (α).
| Cor = 0.2 | Cor = 0.8 | |||||||
|---|---|---|---|---|---|---|---|---|
| Ha1 | Ha2 | Ha1 | Ha2 | |||||
| -log10(α) | sLRT | pleioR | sLRT | pleioR | sLRT | pleioR | sLRT | pleioR |
| 8 | 7.65 | 7.73 | 8.23 | 8.40 | 8.02 | 8.11 | 8.24 | 8.24 |
| [7.41,7.94] | [7.46,8.05] | [7.76,8.91] | [7.85,9.32] | [7.65,8.50] | [7.70,8.68] | [7.78,8.93] | [7.78,8.93] | |
| 7 | 6.95 | 7.00 | 7.19 | 7.24 | 6.94 | 7.03 | 7.06 | 7.10 |
| [6.84,7.07] | [6.88,7.12] | [7.04,7.35] | [7.08,7.42] | [6.83,7.06] | [6.91,7.17] | [6.93,7.19] | [6.96,7.24] | |
| 6 | 5.96 | 6.02 | 6.08 | 6.14 | 5.95 | 6.03 | 5.99 | 6.04 |
| [5.92,5.99] | [5.98,6.06] | [6.04,6.13] | [6.10,6.19] | [5.91,5.99] | [5.99,6.07] | [5.95,6.03] | [6.00,6.08] | |
| 5 | 4.98 | 5.02 | 5.05 | 5.09 | 4.97 | 5.02 | 4.97 | 5.01 |
| [4.96,4.99] | [5.01,5.03] | [5.04,5.06] | [5.08,5.10] | [4.96,4.98] | [5.00,5.03] | [4.96,4.99] | [5.00,5.02] | |
| 4 | 3.98 | 4.01 | 4.02 | 4.05 | 3.98 | 4.01 | 3.99 | 4.01 |
| [3.98,3.98] | [4.01,4.02] | [4.02,4.03] | [4.05,4.05] | [3.98,3.99] | [4.01,4.02] | [3.98,3.99] | [4.01,4.01] | |
| 3 | 2.99 | 3.01 | 3.00 | 3.02 | 2.99 | 3.01 | 2.99 | 3.01 |
| [2.99,2.99] | [3.01,3.01] | [3.00,3.00] | [3.02,3.02] | [2.99,2.99] | [3.01,3.01] | [2.99,2.99] | [3.00,3.01] | |
| 2 | 1.99 | 2.00 | 2.00 | 2.00 | 1.99 | 2.00 | 2.00 | 2.00 |
| [1.99,1.99] | [2.00,2.00] | [2.00,2.00] | [2.00,2.00] | [1.99,1.99] | [2.00,2.00] | [2.00,2.00] | [2.00,2.00] | |
Results are based on 500 million Monte Carlo (MC) simulations with sample size 3000; 95% confident intervals between square brackets.
Fig. 1Power to detect pleiotropy of sLRT (thin solid line) and pleiotest (thick dashed line).
Each plot corresponds to a different sample size and effect-size ratio (effect ratio = 1: ; effect ratio = 0.5: ).
Fig. 2Total computational time (in seconds) to process 1000 variants with balanced and unbalanced data, and an increasing number of traits.
Colors indicate sample size from 10,000 to 300,000.
Number of non-overlapping genomic regions (# of SNPs) with a significant effect on at least two traits.
| BMI | SBP | URA | GLU | LDL | TRI | CRE | Total | |
|---|---|---|---|---|---|---|---|---|
| BMI | 16 (44) | 22 (268) | 7 (96) | 17 (83) | 30 (334) | 14 (56) | 80 (881) | |
| SBP | 16 (44) | 11 (25) | 2 (4) | 5 (22) | 9 (121) | 3 (18) | 33 (234) | |
| URA | 22 (268) | 11 (25) | 4 (43) | 23 (170) | 27 (602) | 45 (155) | 99 (1263) | |
| GLU | 7 (96) | 2 (4) | 4 (43) | 6 (17) | 9 (165) | 2 (7) | 21 (332) | |
| LDL | 17 (83) | 5 (22) | 23 (170) | 6 (17) | 32 (648) | 6 (25) | 63 (965) | |
| TRI | 30 (334) | 9 (121) | 27 (602) | 9 (165) | 32 (648) | 14 (83) | 88 (1953) | |
| CRE | 14 (56) | 3 (18) | 45 (155) | 2 (7) | 6 (25) | 14 (83) | 70 (344) |
Fig. 3Ideogram of regions harboring SNPs with effects in at least four traits.
The symbols by the region indicate the trait. Traits included in the analyses were: body mass index (1), systolic blood pressure (2), serum urate (3), glucose level (4), low-density lipoproteins (5), triglycerides (6), and creatinine (7). Blue arrows denote novel associations.
Regions harboring SNPs with four-trait pleiotropic significant effects (p value < 1e−8).
| Chromosome | GWAS Catalogb | SNP w/smallest | Genesc | Traits | ||||||
|---|---|---|---|---|---|---|---|---|---|---|
| [position]a | BMI | SBP | URA | GLU | LDL | TRI | CRE | |||
| 2 | Yes | rs1260326 (56.312) | GCKR C2orf16 | × | × | × | × | |||
| [26.8–28.6] | ZNF512 | |||||||||
| 2 | Yes | rs1128249 (13.556) | COBLL1 | × | × | × | × | |||
| [164.2–165.2] | ||||||||||
| 3 | No | rs62260779 (8.429) | MAP4 | × | × | × | × | |||
| [47.5–48.5] | ||||||||||
| 3 | No | rs2624847 (8.631) | SEMA3F-AS1 | × | × | × | × | |||
| [49.6–50.6] | ||||||||||
| 4 | No | rs1229984 (10.626) | ADH1B | × | × | × | × | |||
| [98.8–99.8] | ||||||||||
| 4 | Yes | rs13107325 (9.390) | SLC39A8 | × | × | × | × | |||
| [101.8–102.8] | ||||||||||
| 5 | Nod | rs4865796 (10.417) | ARL15 | × | × | × | × | |||
| [53.5–54.5] | ||||||||||
| 6 | Yes | rs1264377 (13.211) | DDR1 | × | × | × | × | |||
| [27.3–33.2] | NOTCH4 | |||||||||
| 8 | Yes | rs898137 (8.854) | LOC157273 | × | × | × | × | |||
| [8.6–10] | ||||||||||
| 8 | No | rs13280813 (8.611) | BLK | × | × | × | × | |||
| [11.1–12] | ||||||||||
| 8 | Yes | rs2001945 (11.847) | TRIB1 | × | × | × | × | |||
| [125–126] | ||||||||||
| 11 | Yes | rs174547 (10.284) | FADS1 | × | × | × | × | |||
| [61.3–62.3] | FADS2 | |||||||||
| 12 | Yes | rs653178 (9.673) | ATXN2 | × | × | × | × | |||
| [111–112.6] | ||||||||||
| 15 | No | rs11856835 (10.799) | SEMA7A | × | × | × | × | |||
| [73.9–74.9] | ||||||||||
| 16 | Yes | rs1421085 (10.767) | FTO | × | × | × | × | |||
| [53.3–54.3] | ||||||||||
| 19 | Nod | rs58542926 (9.965) | TM6SF2 | × | × | × | × | |||
| [18.8–19.8] | ||||||||||
| 19 | Yes | rs4420638 (10.508) | APOC1 | × | × | × | × | |||
| [44.4–45.4] | TOMM40 | |||||||||
| 20 | Yes | rs8121509 (9.408) | OPRL1 | × | × | × | × | |||
| [63.6–64.3] | ||||||||||
BMI body mass index, SBP systolic blood pressure, URA serum urate, GLU glucose level, LDL low-density lipoprotein, TRI triglycerides, CRE creatinine.
aPosition in mega base-pairs (Mbp).
bWhether at least one SNP in the region has been reported for MetS in the GWAS catalog.
cGene corresponding to the SNP with the smallest p value.
dReported elsewhere.
Number and percentage of discoveries in our study (p value < 1e−8) that have been previously reported, by number of traits simultaneously affected (variants with pleiotropic effects in at least 2, 3, or 4 traits).
| Number of traits simultaneously affected | Number of discoveries in this study | Overlap with other studiesa | ||
|---|---|---|---|---|
| GWAS catalog | Avery et al. (2011) | Kraja et al. (2014) | ||
| Non-overlapping genomic regions | ||||
| 2 | 170 | 51 (30%) | 14 (8%) | 13 (8%) |
| 3 | 44 | 22 (50%) | 6 (14%) | 8 (18%) |
| 4 | 18 | 11 (61%) | 6 (33%) | 8 (44%) |
| SNPs | ||||
| 2 | 2982 | 1677 (56%) | 305 (10%) | 1081 (36%) |
| 3 | 871 | 497 (57%) | 95 (11%) | 329 (38%) |
| 4 | 246 | 202 (82%) | 54 (22%) | 130 (53%) |
aSNPs reported in other studies that were within a 1-Mbp of a discovery in our study were considered overlapping.