| Literature DB >> 28749953 |
Thomas W Winkler1, Anne E Justice2, L Adrienne Cupples3,4, Florian Kronenberg5, Zoltán Kutalik6,7, Iris M Heid1.
Abstract
Genome-wide association meta-analyses (GWAMAs) conducted separately by two strata have identified differences in genetic effects between strata, such as sex-differences for body fat distribution. However, there are several approaches to identify such differences and an uncertainty which approach to use. Assuming the availability of stratified GWAMA results, we compare various approaches to identify between-strata differences in genetic effects. We evaluate type I error and power via simulations and analytical comparisons for different scenarios of strata designs and for different types of between-strata differences. For strata of equal size, we find that the genome-wide test for difference without any filtering is the best approach to detect stratum-specific genetic effects with opposite directions, while filtering for overall association followed by the difference test is best to identify effects that are predominant in one stratum. When there is no a priori hypothesis on the type of difference, a combination of both approaches can be recommended. Some approaches violate type I error control when conducted in the same data set. For strata of unequal size, the best approach depends on whether the genetic effect is predominant in the larger or in the smaller stratum. Based on real data from GIANT (>175 000 individuals), we exemplify the impact of the approaches on the detection of sex-differences for body fat distribution (identifying up to 10 loci). Our recommendations provide tangible guidelines for future GWAMAs that aim at identifying between-strata differences. A better understanding of such effects will help pinpoint the underlying mechanisms.Entities:
Mesh:
Year: 2017 PMID: 28749953 PMCID: PMC5531538 DOI: 10.1371/journal.pone.0181038
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Simulation-based Type I error for the seven stratified GWAMA approaches to detect GxS.
Shown is the type I error at a 5% significance level derived from simulated data as the proportion of variants with nominally significant difference test (P<0.05) relative to the number of variants tested for difference (1,000,000 in the difference test without filtering, number of filtered variants in the approaches with filtering). The simulation results are based on a balanced strata design (n = 100,000, n = 100,000; split in half for two-stage approaches), variants with MAF = 0.05 or 0.30, and phenotypes simulated under the null hypothesis of no GxS, i.e. no difference between stratum-specific effects (H0: β1 = β2 = β). We present the results for β = 0 and β ≠ 0. For the second setting, we set β as the minimum effect size detectable at 80% power for the given MAF and the given sample size for the difference test (n = 200 000 for one-stage approaches, β = 0.029, 0.014 for MAF = 0.05, MAF = 0.30, respectively; n = 100,000 for the two-stage approaches, β = 0.041, 0.019 for MAF = 0.05, MAF = 0.30, respectively). Marked in bold are violated type 1 error rates.
| Approach | MAF | #variants in difference test | #variants with | Type I | #variants in difference test | #variants with | Type I error [%] |
|---|---|---|---|---|---|---|---|
| 0.05 | 1 000 000 | 49 882 | 4.99 | 1 000 000 | 49 652 | 4.97 | |
| 0.30 | 1 000 000 | 49 949 | 4.99 | 1 000 000 | 50 207 | 5.02 | |
| 0.05 | 50 032 | 2 454 | 4.90 | 323 857 | 16 143 | 4.98 | |
| 0.30 | 49 879 | 2 497 | 5.01 | 324 431 | 16 323 | 5.03 | |
| 0.05 | 49 018 | 20 956 | 76 496 | 21 732 | |||
| 0.30 | 49 057 | 20 912 | 76 415 | 22 094 | |||
| 0.05 | 49 809 | 24 732 | 235 152 | 20 762 | |||
| 0.30 | 49 667 | 24 784 | 235 383 | 21 076 | |||
| 0.05 | 49 812 | 2 475 | 4.97 | 16 346 | 801 | 4.90 | |
| 0.30 | 49 726 | 2 548 | 5.12 | 16 291 | 801 | 4.92 | |
| 0.05 | 49 249 | 2 475 | 5.03 | 3 780 | 189 | 5.00 | |
| 0.30 | 49 306 | 2 459 | 4.99 | 3 786 | 196 | 5.18 | |
| 0.05 | 49 948 | 2 470 | 4.95 | 11 812 | 562 | 4.76 | |
| 0.30 | 49 976 | 2 504 | 5.01 | 11 749 | 601 | 5.12 | |
a Number of independent variants tested for difference.
b Number of variants with nominally significant difference (P < 0.05); MAF = minor allele frequency.
Application to real sex-stratified GWAMA data for WHRadjBMI from the GIANT GENDER project.
Shown are the 10 identified loci with GxSex by each approach (‘x’ indicating that the locus was identified by the respective approach) at a Bonferroni-corrected significance level, based on the GIANT data for WHRadjBMI (up to 77,000 men and 98,000 women) [12]. Detailed association results are provided in for the one-stage approaches and in for the two-stage approaches.
| One-stage approaches | Two-stage approaches | ||||
|---|---|---|---|---|---|
| Locus | Type | [Diff5e-8] | |||
| Qualitative | x | - | - | - | |
| Pure | x | x | x | x | |
| Pure | x | x | x | x | |
| Pure | - | x | - | - | |
| Pure | x | x | - | - | |
| Pure | - | x | - | - | |
| Pure | - | x | - | - | |
| Quantitative | - | x | x | x | |
| Quantitative | - | x | - | - | |
| Quantitative | x | x | x | x | |