| Literature DB >> 26501144 |
Albert Rosenberger1, Stefanie Friedrichs1, Christopher I Amos2, Paul Brennan3, Gordon Fehringer4, Joachim Heinrich5, Rayjean J Hung4, Thomas Muley6, Martina Müller-Nurasyid7, Angela Risch8, Heike Bickeböller1.
Abstract
INTRODUCTION: Gene-set analysis (GSA) methods are used as complementary approaches to genome-wide association studies (GWASs). The single marker association estimates of a predefined set of genes are either contrasted with those of all remaining genes or with a null non-associated background. To pool the p-values from several GSAs, it is important to take into account the concordance of the observed patterns resulting from single marker association point estimates across any given gene set. Here we propose an enhanced version of Fisher's inverse χ2-method META-GSA, however weighting each study to account for imperfect correlation between association patterns. SIMULATION AND POWER: We investigated the performance of META-GSA by simulating GWASs with 500 cases and 500 controls at 100 diallelic markers in 20 different scenarios, simulating different relative risks between 1 and 1.5 in gene sets of 10 genes. Wilcoxon's rank sum test was applied as GSA for each study. We found that META-GSA has greater power to discover truly associated gene sets than simple pooling of the p-values, by e.g. 59% versus 37%, when the true relative risk for 5 of 10 genes was assume to be 1.5. Under the null hypothesis of no difference in the true association pattern between the gene set of interest and the set of remaining genes, the results of both approaches are almost uncorrelated. We recommend not relying on p-values alone when combining the results of independent GSAs. APPLICATION: We applied META-GSA to pool the results of four case-control GWASs of lung cancer risk (Central European Study and Toronto/Lunenfeld-Tanenbaum Research Institute Study; German Lung Cancer Study and MD Anderson Cancer Center Study), which had already been analyzed separately with four different GSA methods (EASE; SLAT, mSUMSTAT and GenGen). This application revealed the pathway GO0015291 "transmembrane transporter activity" as significantly enriched with associated genes (GSA-method: EASE, p = 0.0315 corrected for multiple testing). Similar results were found for GO0015464 "acetylcholine receptor activity" but only when not corrected for multiple testing (all GSA-methods applied; p ≈ 0.02).Entities:
Mesh:
Substances:
Year: 2015 PMID: 26501144 PMCID: PMC4621033 DOI: 10.1371/journal.pone.0140179
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1META-GSA: a tool for meta-analysis based on GSAs for GWASs.
Genome-wide association study (GWAS); gene-set analysis (GSA); meta-analysis of GSAs (META-GSA); gene set of interest (GS, containing genes 3,5,…); complementary gene set (GS’, containing genes 1, 2, 4,…); gene-level statistic of gene g and study s (Γ ); “enrichment score” (ES) as test statistic for GSA in study s.
Notation.
|
| no. of studies | ||
|
| study | ||
|
| no. of markers | ||
|
| marker | ||
|
| measure of linkage disequilibrium | between marker | and |
|
| no. of genes | ||
|
| gene | ||
|
| no. of markers assigned to gene g | ||
|
| association measure (odds ratio) | for marker | of study |
|
| direction of | for marker | of study |
|
| p-value of testing e.g. H0: θm,s = 1 | for marker | |
|
| p-value of a gene | for gene g | of study |
|
| direction of a gene | for gene g | of study |
|
| directed reverse p-value (PDR) | for gene | of study |
|
| no. of gene sets | ||
|
| gene set of interest | ||
|
| gene set complimentary to | ||
|
| Gene-level statistic of GSA | for gene | of study |
|
| test statistic of GSA | for gene set | of study |
|
| p-value of GSA | for gene set | of study |
|
| test statistic of META-GSA | for gene set | across studies |
|
| p-value of META-GSA (nominal) | for gene set | across studies |
|
| p-value of META-GSA (adjusted for multiple testing) | for gene set | across studies |
| There are four tests mentioned further: | |||
|
| testing significance of simple p-pooling | unconditional combination of | |
|
| testing concordance of association patterns | ||
| pooledGWAS-GSA | first performing a random effects meta-analysis for each marker and then perform a single GSA | ||
| META-GSA | conditional combination of SPP and the direction test | ||
Power of META-GSA, pooledGWAS-GSA and SPP across all studies, based on 100 genes with 1 marker each.
| patterns of true marker-phenotype associations | |||||||
|---|---|---|---|---|---|---|---|
|
|
|
|
| SPP | pooledGWAS-GSA | META-GSA | |
|
| |||||||
|
| no gene is associated at all | 10x1 | 90x1 | 10 | 5.6% | 3.7% | 4.4% |
|
| all genes are associated | 10x1.2 | 90x1.2 | 10 | 5.2% | 5.5% | 7.0% |
|
| all genes are associated | 10x1.5 | 90x1.5 | 10 | 6.0% | 5.6% | 4.8% |
|
| |||||||
|
| ½ the genes in | 5x1 5x1.1 | 90x1 | 10 | 5.4% | 6.7% | 9.2% |
|
| 5x1 5x1.2 | 90x1 | 10 | 9.4% | 9.5% | 26.4% | |
|
| 5x1 5x1.3 | 90x1 | 10 | 21.0% | 20.7% | 48.8% | |
|
| 5x1 5x1.4 | 90x1 | 10 | 29.6% | 31.3% | 58.0% | |
|
| 5x1 5x1.5 | 90x1 | 10 | 36.8% | 56.4% | 58.8% | |
|
| |||||||
|
| 5x1 5x1.5 | 90x1 | 2 | 16.0% | 12.7% | 24.4% | |
|
| 5x1 5x1.5 | 90x1 | 3 | 16.8% | 15.9% | 31.2% | |
|
| 5x1 5x1.5 | 90x1 | 4 | 27.2% | 21.3% | 41.8% | |
|
| 5x1 5x1.5 | 90x1 | 5 | 27.8% | 25.0% | 45.0% | |
|
| 5x1 5x1.5 | 90x1 | 6 | 28.4% | 30.2% | 47.4% | |
|
| 5x1 5x1.5 | 90x1 | 7 | 31.8% | 38.2% | 50.4% | |
|
| 5x1 5x1.5 | 90x1 | 8 | 34.4% | 40.9% | 54.4% | |
|
| 5x1 5x1.5 | 90x1 | 9 | 35.6% | 47.2% | 55.2% | |
|
| |||||||
|
| HA: assoc. genes in | 5x1 4x1.2 1x1.5 | 90x1 | 10 | 17.0% | 16.1% | 35.6% |
|
| HA: GS dominates | 5x1 4x1.2 1x1.5 | 63x1 18x1.2 9x1.5 | 10 | 11.0% | 11.4% | 5.8% |
|
| HA: GS is dominated | 5x1 4x1.2 1x1.5 | 41x1 32x1.2 17x1.5 | 10 | 8.4% | 1.7% | 1.6% |
|
| H0: same prop. of genes are associated in | 5x1 4x1.2 1x1.5 | 45x1 36x1.2 9x1.5 | 10 | 4.4% | 4.7% | 3.0% |
Given a true type I error of 5%, the observed type I error may range from 3% to 7% (95% random dispersion interval for 500 simulations). Given a true power of 50%, the observed power may range from 45% to 54% (95% random dispersion interval for 500 simulations).
§ Truly associated genes are more frequent in GS’ than in GS.
Comparing the number of significant findings of META-GSA with SPP in selected scenarios.
| patterns of true marker-phenotype associations | META-GSA | SPP | |||
|---|---|---|---|---|---|
| Scenario No. |
|
|
|
| |
|
| |||||
| 1 | 10x1 | 90x1 |
| 4 | 16 |
|
| 16 | 464 | |||
| 2 | 10x1.2 | 90x1.2 |
| 6 | 24 |
|
| 18 | 448 | |||
| 20 | 10x1.5 | 90x1.5 |
| 13 | 2 |
|
| 34 | 451 | |||
|
| |||||
| 4 | 5x1 5x1.1 | 90x1 |
| 7 | 39 |
|
| 20 | 434 | |||
| 5 | 5x1 5x1.2 | 90x1 |
| 38 | 94 |
|
| 9 | 359 | |||
| 6 | 5x1 5x1.3 | 90x1 |
| 103 | 141 |
|
| 2 | 254 | |||
| 7 | 5x1 5x1.4 | 90x1 |
| 148 | 142 |
|
| — | 210 | |||
| 8 | 5x1 5x1.5 | 90x1 |
| 184 | 110 |
|
| — | 206 | |||
| 17 | 5x1 4x1.2 1x1.5 | 90x1 |
| 77 | 101 |
|
| 8 | 314 | |||
| 18 | 5x1 4x1.2 1x1.5 | 63x1 18x1.2 9x1.5 |
| 27 | 2 |
|
| 28 | 443 | |||
| 19 | 5x1 4x1.2 1x1.5 | 45x1 36x1.2 9x1.5 |
| 7 | 1 |
|
| 35 | 457 | |||
* Applying Kendall’s correlation coefficient
Fig 2Correlation of p and p in scenario no. 1.
The numbers of simulations out of a total of 500 are depicted. Gene sets are classified as significant (p ≤ 0.05) or not significant using SPP and META-GSA.
Fig 3Correlation of p and p in scenario no. 8.
The numbers of simulations out of a total of 500 are depicted. Gene sets are classified as significant (p ≤ 0.05) or not significant using SPP and META-GSA.
META-GSA vs. SPP: Number of nominally significant gene sets by GSA methods.
|
|
| |||
|---|---|---|---|---|
|
|
|
|
| ( |
|
| 46 | 22 | 27 | 32 |
|
| 46 | 24 | 29 | 28 |
|
| 46 | 23 | 28 | 89 |
|
| 47 | 23 | 30 | 72 |
Rank correlation of p-values comparing GSA methods (best SNP approach).
|
|
| |||||
|---|---|---|---|---|---|---|
|
|
|
|
|
|
| |
|
| 0.93 | 0.88 | 0.93 | 0.54 | 0.40 | 0.55 |
|
| 0.90 | 0.95 | 0.52 | 0.73 | ||
|
| 0.90 | 0.70 | ||||
|
| ||||||
| MA step: | simulate marker association |
| GWAS step: | perform GWAS |
| GSA step: | perform GSA |
| META-GSA step: | perform META-GSA and SPP |