| Literature DB >> 22971100 |
Christian Meesters1, Markus Leber, Christine Herold, Marina Angisch, Manuel Mattheisen, Dmitriy Drichel, André Lacour, Tim Becker.
Abstract
BACKGROUND: Meta-analysis (MA) is widely used to pool genome-wide association studies (GWASes) in order to a) increase the power to detect strong or weak genotype effects or b) as a result verification method. As a consequence of differing SNP panels among genotyping chips, imputation is the method of choice within GWAS consortia to avoid losing too many SNPs in a MA. YAMAS (Yet Another Meta Analysis Software), however, enables cross-GWAS conclusions prior to finished and polished imputation runs, which eventually are time-consuming.Entities:
Mesh:
Year: 2012 PMID: 22971100 PMCID: PMC3472171 DOI: 10.1186/1471-2105-13-231
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Running time estimates for different number of studies
| 6 | 2m5s | 92m5s |
| 12 | 3m42s | 94m53s |
| 24 | 6m56s | 100m4s |
aNumber of studies examined by meta-analysis.
bNaive approach, real time in minutes (m) and seconds (s).
cProxy algorithm, real time in minutes (m) and seconds (s).
Empirical levels for different nominal alpha levels (0.01 and 0.05) using different reference data sets
| MA-pointwise | - | fixed | 0.047 | 0.009 |
| | - | random | 0.037 | 0.006 |
| MA-proxy | ideal | fixed | 0.046 | 0.009 |
| | | random | 0.036 | 0.007 |
| | incomplete | fixed | 0.046 | 0.009 |
| | | random | 0.036 | 0.006 |
| | mismatched | fixed | 0.047 | 0.009 |
| | | random | 0.036 | 0.007 |
| MA-impute | ideal | fixed | 0.042 | 0.007 |
| | | random | 0.035 | 0.006 |
| | incomplete | fixed | 0.041 | 0.007 |
| | | random | 0.034 | 0.006 |
| | mismatched | fixed | 0.039 | 0.008 |
| random | 0.032 | 0.006 |
aEnumeration using either the naive approach, proxy algorithm or imputation strategy.
bReference data set used for proxy algorithm and imputation.
cCalculation of the allelic effects by the fixed effect or random effect model.
dEmpirical levels for nominal α of 0.05 (average over all SNPs from chr 22 for 100 permutation replicates).
eEmpirical levels for nominal α of 0.01.
Figure 1MA with “ideal” reference panel. Power levels are plotted over different nominal α levels (on the x-axis, with a negative logarithmic scale).
Figure 2MA with “incomplete” reference panel. Power levels are plotted over different nominal α levels (on the x-axis, with a negative logarithmic scale).
Figure 3MA with “mismatched” reference panel. Power levels are plotted over different nominal α levels (on the x-axis, with a negative logarithmic scale).
Type II Diabetes dbGaP studies
| 1 | Health Research Vanderbilt Ua | | Illumina | 499,350 | 607 | |
| 2 | Health Research Vanderbilt Ua | | Illumina | 919,602 | 1,384 | |
| 3 | Health Research Northwestern Ub | | Illumina | 495,588 | 1,239 | |
| 4 | Health Research Northwestern Ub | | Illumina | 908,692 | 267 | |
| 5 | GENEVA Diabetes Study | NHSc | Affymetrix | 764,678 | 3,435 | |
| 6 | GENEVA Diabetes Study | HPFSd | Affymetrix | 787,213 | 2,606 |
aProject Health Research - Vanderbilt University, Northwestern NUgene Project: Type 2 Diabetes, National Human Genome Research Institute (NHGRI).
bProject Health Research - Northwestern University, Northwestern NUgene Project: Type 2 Diabetes, National Human Genome Research Institute (NHGRI).
cNurses health study.
dHealth Professionals Follow-up Study.
Comparison of point-wise and proxy MA of dBGaP GWAS for known type II diabetes genes
| 20581827 | BCL11A | rs243021 | 3.0E-15 | rs243021 | 3.3E-03 | rs11697597 | 3.3E-03 | 3.2E-03 | 1 | 1.03 |
| 20818381 | C2CD4A,C2CD4B | rs7172432 | 9.0E-14 | rs335302 | 1.2E-03 | rs7172432 | 1.2E-03 | 1.98E-04 | 1 | 6.04 |
| 20581827 | CDKAL1 | rs10440833 | 2.0E-22 | rs12336110 | 2.1E-03 | rs6950237 | 2.1E-03 | 1.0E-03 | 1 | 2.10 |
| 19401414 | CDKN2A, CDKN2B | rs2383208 | 2.0E-29 | rs2383208 | 2.2E-03 | rs2383208 | 8.4E-04 | 5.2E-04 | 2.6 | 1.60 |
| 20581827 | CENTD2 | rs1552224 | 1.0E-22 | rs1552224 | 1.1E-01 | rs1552224 | 3.4E-02 | 4.8E-03 | 3.33 | 7.08 |
| 17463249 | FTO | rs8050136 | 7.0E-14 | rs8050136 | 8.0E-03 | rs8050136 | 6.7E-04 | 9.4E-04 | 11.9 | 0.71 |
| 20581827 | HHEX,IDE | rs5015480 | 1.0E-15 | rs5015480 | 9.6E-03 | rs5015480 | 2.0E-03 | 4.9E-04 | 4.8 | 4.10 |
| 20581827 | HMGA2 | rs1531343 | 4.0E-09 | rs12741948 | 3.3E-02 | rs1122590 | 1.4E-02 | 9.1E-05 | 2.43 | 149.8 |
| 17463249 | IGF2BP2 | rs4402960 | 9.0E-16 | rs4402960 | 2.3E-03 | rs4402960 | 7.2E-04 | 1.2E-04 | 3.21 | 6.12 |
| 20581827 | IRS1 | rs7578326 | 5.0E-20 | rs7578326 | 4.2E-02 | rs7578326 | 1.7E-03 | 8.8E-04 | 24.65 | 1.95 |
| 18372903 | JAZF1 | rs864745 | 5.0E-14 | rs864745 | 1.6E-03 | rs864745 | 1.9E-04 | 1.1E-04 | 8.47 | 1.78 |
| 17463249 | KCNJ11 | rs5215 | 5.0E-11 | rs5215 | 8.1E-02 | rs4646410 | 3.1E-03 | 9.6E-04 | 26.16 | 3.24 |
| 18711367 | KCNQ1 | rs2237892 | 2.0E-42 | rs2237892 | 2.1E-02 | rs2237892 | 2.7E-04 | 3.5E-04 | 78.57 | 0.77 |
| 19734900 | LOC64673, IRS1 | rs2943641 | 9.0E-12 | rs2943641 | 7.8E-02 | rs2943641 | 1.7E-03 | 8.8E-04 | 45.94 | 1.95 |
| 20418489 | RBMS1, ITGB6 | rs7593730 | 4.0E-08 | rs7593730 | 1.8E-05 | rs7593730 | 3.4E-06 | 3.6E-06 | 5.15 | 0.94 |
| 20581827 | SLC30A8 | rs3802177 | 1.0E-08 | rs2466295 | 2.4E-02 | rs2466295 | 1.0E-02 | 5.3E-05 | 2.35 | 193.6 |
| 20862305 | SPRY2 | rs1359790 | 6.0E-09 | rs17249026 | 4.5E-02 | rs17249026 | 4.5E-02 | 2.1E-03 | 1 | 21.6 |
| 19734900 | TCF7L2 | rs7903146 | 1.0E-30 | rs7903146 | 3.2E-19 | rs7903146 | 1.5E-22 | 4.4E-23 | 2126.7 | 3.40 |
| 18372903 | THADA | rs7578597 | 1.0E-09 | rs2236705 | 1.5E-02 | rs7578597 | 6.7E-03 | 2.4E-03 | 2.18 | 2.79 |
| 18372903 | TSPAN8,LGR5 | rs7961581 | 1.0E-09 | rs4581087 | 1.1E-02 | rs4581087 | 1.2E-02 | 2.4E-03 | 0.70 | 4.83 |
| 19734900 | WFS1, PPP2R2C | rs4689388 | 1.0E-08 | rs4689388 | 5.5E-03 | rs4689388 | 1.6E-03 | 7.7E-04 | 3.44 | 2.08 |
| 20581827 | ZFAND6 | rs11634397 | 2.0E-09 | rs11634397 | 2.6E-02 | rs11634397 | 2.6E-02 | 1.0E-02 | 1 | 2.46 |
aEach gene region is listed only once, even if listed several times in the GWAS catalogue.
bMost significant SNP according to GWAS catalog [1].
cp-value according to GWAS catalog.
dMost significant SNP with naïve MA on intersection of marker panels of 6 dbGaP GWAS described before.
ep-value refering to SNP from previous column.
fMost significant SNP with proxy MA on 6 dbGaP GWAS.
gp-value refering to SNP from previous column.
hp-value of the correslonding SNP calculated by imputation/snptest.
iImprovement with proxy algorithm: quotient of columns “p-Pointwise” and “p-Proxy”.
jImprovement with imputation: quotient of columns “p-Proxy” and “p-Impute”.
Proxy-Analysis of rs7903146 (TCF7L2)
| 1 | rs7903146 | 10 | 114758349 | A | G | 0.39 | 0.09 | 1.5×10−05 | - |
| 2 | rs7903146 | 10 | 114758349 | A | G | 0.68 | 0.20 | 7.0×10−04 | - |
| 3 | rs7903146 | 10 | 114758349 | A | G | 0.40 | 0.13 | 3.0×10−03 | - |
| 4 | rs7903146 | 10 | 114758349 | A | G | 0.44 | 0.08 | 1.6×10−07 | - |
| 5 | rs4506565 | 10 | 114756041 | T | A | 0.20 | 0.05 | 2.0×10−04 | 0.945 |
| 6 | rs4506565 | 10 | 114756041 | T | A | 0.30 | 0.06 | 3.8×10−04 | 0.945 |
| Meta-Analysis | rs7903146 | 10 | 114758349 | A | G | 0.31 | 0.03 | 1.5×10−22 | - |
aEnumeration according to Table 1.
bEffect allele: the allele beta is given for.
cOther allele.
dEffect estimate according to logistic regression.
eStandard error.
fp-value.
gbetween SNP and proxy-SNP according to reference data (1,000 Genomes).
Figure 4Proxy meta-analysis schematic example. Schematic example of a meta-analysis with proxy markers. For simplicity we consider only two studies with four markers each (1-4). Common MA is applied on markers 1 and 4 (as they are present in both marker sets), yet when YAMAS hits marker 3, which is missing in the second study (3 – gray box), it selects marker 2 in study 2 as its proxy marker, based on the r2 indicator. Dashed arrows indicate non-chosen potential proxy markers. The case of the missing marker 2 in study 1 is omitted for better readability.