| Literature DB >> 28911207 |
Reedik Mägi1, Momoko Horikoshi2,3, Tamar Sofer4, Anubha Mahajan2, Hidetoshi Kitajima2, Nora Franceschini5, Mark I McCarthy2,6,7, Andrew P Morris1,2,8,9.
Abstract
Trans-ethnic meta-analysis of genome-wide association studies (GWAS) across diverse populations can increase power to detect complex trait loci when the underlying causal variants are shared between ancestry groups. However, heterogeneity in allelic effects between GWAS at these loci can occur that is correlated with ancestry. Here, a novel approach is presented to detect SNP association and quantify the extent of heterogeneity in allelic effects that is correlated with ancestry. We employ trans-ethnic meta-regression to model allelic effects as a function of axes of genetic variation, derived from a matrix of mean pairwise allele frequency differences between GWAS, and implemented in the MR-MEGA software. Through detailed simulations, we demonstrate increased power to detect association for MR-MEGA over fixed- and random-effects meta-analysis across a range of scenarios of heterogeneity in allelic effects between ethnic groups. We also demonstrate improved fine-mapping resolution, in loci containing a single causal variant, compared to these meta-analysis approaches and PAINTOR, and equivalent performance to MANTRA at reduced computational cost. Application of MR-MEGA to trans-ethnic GWAS of kidney function in 71,461 individuals indicates stronger signals of association than fixed-effects meta-analysis when heterogeneity in allelic effects is correlated with ancestry. Application of MR-MEGA to fine-mapping four type 2 diabetes susceptibility loci in 22,086 cases and 42,539 controls highlights: (i) strong evidence for heterogeneity in allelic effects that is correlated with ancestry only at the index SNP for the association signal at the CDKAL1 locus; and (ii) 99% credible sets with six or fewer variants for five distinct association signals.Entities:
Mesh:
Year: 2017 PMID: 28911207 PMCID: PMC5755684 DOI: 10.1093/hmg/ddx280
Source DB: PubMed Journal: Hum Mol Genet ISSN: 0964-6906 Impact factor: 6.150
Figure 1.Power to detect association, at genome-wide significance (P < 5 × 10−8), using alternative approaches to aggregate GWAS across diverse populations: fixed-effects meta-analysis; random-effects (RE2) meta-analysis; and meta-regression including axes of genetic variation as covariates as implemented in MR-MEGA. Power is presented as a function of the allelic odds-ratio for each of five scenarios for heterogeneity in effects between populations, described in Supplementary Material,
Figure 2.Metrics of fine-mapping resolution, with ‘perfect data’, across alternative approaches to aggregate GWAS across diverse populations: fixed-effects meta-analysis; random-effects meta-analysis; meta-regression including axes of genetic variation as covariates as implemented in MR-MEGA; MANTRA; and PAINTOR. Two metrics are presented: (i) the median number of variants in the 99% credible set on a log10-scale; and (ii) the mean posterior probability ascribed to the causal variant. Metrics are presented for each of five scenarios for heterogeneity in effects between populations, described in Supplementary Material, Table S1. In each scenario, the odds ratio has been fixed to obtain approximately 80% power to detect association at genome-wide significance (P < 5 × 10−8) in the meta-regression analysis.
Coverage of the causal variant by the 99% credible set across 500 simulations of each scenario with ‘perfect’ data for five fine-mapping approaches: (i) fixed-effects meta-analysis; (ii) random-effects meta-analysis; (iii) meta-regression accounting for heterogeneity in allelic effects implemented in MR-MEGA; (iv) MANTRA; and (v) PAINTOR
| Fine-mapping method | Heterogeneity scenario | ||||
|---|---|---|---|---|---|
| Homogeneous | African-specific | Eurasian | Native American | Non-ancestral | |
| Fixed-effects | 0.660 | 0.944 | |||
| Random-effects | 0.966 | ||||
| Meta-regression | |||||
| MANTRA | 0.918 | 0.878 | |||
| PAINTOR | 0.692 | 0.772 | 0.972 | 0.916 | 0.922 |
Coverage rates highlighted in bold are consistent with 99% (based on 500 simulations of each of the five scenarios).
Figure 3.Metrics of fine-mapping resolution, with imputed data, across alternative approaches to aggregate GWAS across diverse populations: fixed-effects meta-analysis; random-effects meta-analysis; meta-regression including axes of genetic variation as covariates as implemented in MR-MEGA; MANTRA; and PAINTOR. Two metrics are presented: (i) the median number of SNPs in the 99% credible set on a log10-scale; and (ii) the mean posterior probability ascribed to the causal variant. Metrics are presented for each of five scenarios for heterogeneity in effects between populations, described in Supplementary Material, Table S1. In each scenario, the odds ratio has been fixed to obtain approximately 80% power to detect association at genome-wide significance (P < 5 × 10−8) in the meta-regression analysis.
Loci attaining genome-wide significant evidence of association (P < 5 × 10−8) with eGFR in MR-MEGA meta-regression of 71,461 individuals
| Locus | Lead SNP | Chr | Position(bp, b37) | Alleles | Fixed-effects meta-analysis | MR-MEGA meta-regression | ||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Effect | Other | Beta | SE | |||||||||
| rs35716097 | 5 | 176,806,636 | T | C | −1.092 | 0.128 | 3.5 × 10−17 | 0.13 | 3.0 × 10−17 | 0.016 | 0.66 | |
| rs28394165 | 4 | 77,394,018 | C | T | −0.949 | 0.117 | 1.0 × 10−15 | 0.0028 | 1.8 × 10−15 | 0.041 | 0.036 | |
| rs62435145 | 7 | 1,286,567 | T | G | −1.097 | 0.138 | 4.0 × 10−15 | 0.16 | 8.3 × 10−15 | 0.042 | 0.58 | |
| rs77924615 | 16 | 20,392,332 | G | A | −1.184 | 0.147 | 1.9 × 10−15 | 0.010 | 9.7 × 10−15 | 0.10 | 0.017 | |
| rs9895661 | 17 | 59,456,589 | C | T | −0.990 | 0.132 | 1.7 × 10−13 | 0.18 | 6.5 × 10−13 | 0.085 | 0.38 | |
| rs1260326 | 2 | 27,730,940 | C | T | −0.867 | 0.115 | 9.0 × 10−14 | 0.072 | 2.0 × 10−12 | 0.54 | 0.041 | |
| rs690428 | 15 | 53,950,578 | A | C | −0.688 | 0.115 | 4.1 × 10−9 | 7.8 × 10−5 | 8.7 × 10−11 | 0.00053 | 0.0081 | |
| rs715 | 2 | 211,543,055 | C | T | −0.880 | 0.128 | 1.1 × 10−11 | 0.21 | 1.3 × 10−10 | 0.31 | 0.21 | |
| rs2486288 | 15 | 45,712,339 | C | T | −0.875 | 0.126 | 7.5 × 10−12 | 0.73 | 1.8 × 10−10 | 0.65 | 0.63 | |
| rs11884776 | 2 | 73,746,923 | C | T | −0.929 | 0.141 | 7.6 × 10−11 | 0.15 | 3.3 × 10−10 | 0.59 | 0.035 | |
| rs57989581 | 2 | 170,194,459 | C | A | −1.961 | 0.315 | 8.6 × 10−10 | 0.19 | 7.7 × 10−10 | 0.025 | 0.70 | |
| rs4744712 | 9 | 71,434,707 | A | C | −0.756 | 0.112 | 2.8 × 10−11 | 0.90 | 8.1 × 10−10 | 0.80 | 0.80 | |
| rs10265221 | 7 | 151,414,329 | C | T | −0.952 | 0.146 | 1.2 × 10−10 | 0.24 | 1.9 × 10−9 | 0.44 | 0.19 | |
| chr5:39404526:D | 5 | 39,404,526 | D | R | −0.822 | 0.126 | 1.2 × 10−10 | 0.79 | 2.2 × 10−9 | 0.56 | 0.74 | |
| rs316009 | 6 | 160,675,764 | C | T | −1.190 | 0.193 | 1.2 × 10−9 | 0.48 | 1.4 × 10−8 | 0.36 | 0.49 | |
| rs881858 | 6 | 43,806,609 | A | G | −0.777 | 0.127 | 1.6 × 10−9 | 0.0019 | 2.1 × 10−8 | 0.40 | 0.00092 | |
Chr: chromosome. SE: standard error. p: Cochran’s Q P-value. pHET-ANC: P-value for heterogeneity correlated with ancestry. pHET-RES: P-value for residual heterogeneity.
Index SNPs for distinct T2D association signals at four susceptibility loci on the basis of aggregation of summary statistics from 18 GWAS (22,086 cases and 42,539 controls) from diverse populations using: (i) MR-MEGA meta-regression accounting for ancestry with three axes of genetic variation as covariates; and (ii) reported results from fixed-effects meta-analysis
| Locus | Index SNP | Chr | Position | Alleles | MR-MEGA meta-regression | Fixed-effects meta-analysis | |||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Risk | Other | OR (95% CI) | |||||||||
| rs11705729 | 3 | 185,507,299 | T | C | 2.1 × 10−19 | 0.50 | 0.44 | 1.14 (1.11–1.17) | 1.3 × 10−21 | 0.49 | |
| rs9368222 | 6 | 20,686,996 | A | C | 5.1 × 10−31 | 0.00042 | 0.23 | 1.17 (1.14–1.21) | 4.1 × 10−30 | 0.0058 | |
| rs10965246 | 9 | 22,132,698 | T | C | 4.8 × 10−37 | 0.62 | 0.0012 | 1.31 (1.26–1.36) | 8.4 × 10−40 | 0.0029 | |
| rs10757282 | 9 | 22,133,984 | C | T | 3.9 × 10−11 | 0.16 | 0.24 | 1.12 (1.09–1.16) | 2.0 × 10−12 | 0.17 | |
| rs231353 | 11 | 2,709,019 | G | A | 2.7 × 10−9 | 0.92 | 0.64 | 1.11 (1.07–1.14) | 1.7 × 10−11 | 0.79 | |
| rs233448 | 11 | 2,840,424 | C | T | 3.9 × 10−10 | 0.34 | 0.17 | 1.12 (1.09–1.16) | 9.5 × 10−12 | 0.18 | |
| rs2237897 | 11 | 2,858,546 | C | T | 2.9 × 10−10 | 0.33 | 0.36 | 1.19 (1.14–1.26) | 7.7 × 10−12 | 0.35 | |
Chr: chromosome. OR: odds-ratio. CI: confidence interval. pHET-ANC: P-value for heterogeneity correlated with ancestry. pHET-RES: P-value for residual heterogeneity. p: Cochran’s Q statistic P-value.
Properties of 99% credible sets of variants underlying distinct T2D association signals at four susceptibility loci on the basis of aggregation of association summary statistics from 18 GWAS (22,086 cases and 42,539 controls) from diverse populations using: (i) MR-MEGA meta-regression accounting for ancestry with three axes of genetic variation as covariates; and (ii) MANTRA
| Locus | Index SNP | MR-MEGA meta-regression | MANTRA | ||||
|---|---|---|---|---|---|---|---|
| SNPs | Distance (bp) | Interval (bp) | SNPs | Distance (bp) | Interval (bp) | ||
| rs11705729 | 40 | 39,163 | 185,495,320–185,534,483 | 36 | 31,027 | 185,503,456–185,534,482 | |
| rs9368222 | 6 | 12,330 | 20,675,792–20,688,121 | 5 | 12,330 | 20,675,792–20,688,121 | |
| rs10965246 | 6 | 1,556 | 22,132,698–22,134,253 | 5 | 1,371 | 22,132,698–22,134,068 | |
| rs10757282 | 6 | 4,041 | 22,133,645–22,137,685 | 7 | 4,435 | 22,133,251–22,137,685 | |
| rs231353 | 4 | 38,477 | 2,691,471–2,729,947 | 3 | 17,549 | 2,691,471–2,709,019 | |
| rs233448 | 13 | 20,175 | 2,837,723–2,857,897 | 11 | 20,273 | 2,837,625–2,857,897 | |
| rs2237897 | 4 | 342 | 2,858,295–2,858,636 | 3 | 197 | 2,858,440–2,858,636 | |