| Literature DB >> 24336170 |
Dajiang J Liu1, Gina M Peloso2, Xiaowei Zhan1, Oddgeir L Holmen3, Matthew Zawistowski4, Shuang Feng4, Majid Nikpay5, Paul L Auer6, Anuj Goel7, He Zhang8, Ulrike Peters9, Martin Farrall7, Marju Orho-Melander10, Charles Kooperberg11, Ruth McPherson5, Hugh Watkins7, Cristen J Willer8, Kristian Hveem12, Olle Melander10, Sekar Kathiresan13, Gonçalo R Abecasis1.
Abstract
The majority of reported complex disease associations for common genetic variants have been identified through meta-analysis, a powerful approach that enables the use of large sample sizes while protecting against common artifacts due to population structure and repeated small-sample analyses sharing individual-level data. As the focus of genetic association studies shifts to rare variants, genes and other functional units are becoming the focus of analysis. Here we propose and evaluate new approaches for performing meta-analysis of rare variant association tests, including burden tests, weighted burden tests, variable-threshold tests and tests that allow variants with opposite effects to be grouped together. We show that our approach retains useful features from single-variant meta-analysis approaches and demonstrate its use in a study of blood lipid levels in ∼18,500 individuals genotyped with exome arrays.Entities:
Mesh:
Substances:
Year: 2013 PMID: 24336170 PMCID: PMC3939031 DOI: 10.1038/ng.2852
Source DB: PubMed Journal: Nat Genet ISSN: 1061-4036 Impact factor: 38.330
Figure 1Power comparison for our approach, Fisher's method and the minimal p-value approach. Three phenotype models were simulated: (1) half of low frequency variants with MAF < 0.5% are causal, each increasing expected trait values by 1/4 standard deviation; (2) half of all variants are causal, irrespective of frequency, and increase trait values by 1/4 standard deviation; (3) 50% of the variants are casual, irrespective of frequency, and 80% of these increase expected trait values by 1/4 standard deviation, while the remaining 20% decrease trait values by the same amount. A number of 2-100 samples of size 1000 were simulated for each model, with each sample drawn from a randomly chosen population. Meta-analysis was performed using our approach or using Fisher's method and the minimal p-value approach to combine burden test, SKAT and variable threshold (VT) test statistics for variants with MAF<5%. The power was evaluated at the significance threshold of α=2.5×10-6 using 10,000 replicates. Panel A displays the power for three meta-analysis methods using simple burden test under model (1). Panel B displays the results for three meta-analysis methods using VT under model (1). Panel C displays the results for three meta-analysis methods using SKAT under model (1). Panel D displays the results for three meta-analysis methods using simple burden test under model (2). Panel E displays the results for three meta-analysis methods using VT under model (2). Panel F displays the results for three meta-analysis methods using SKAT under model (2). Panel G displays the results for three meta-analysis methods using simple burden test under model (3). Panel H displays the results for three meta-analysis methods using VT under model (3). Panel I displays the results for three meta-analysis methods using SKAT under model (3). Note that differences between our approach and these alternatives become more marked when more studies are meta-analyzed.
Results for meta-analysis of gene-level rare variant association test. Associations that attain exome-wide significance (p < 3.1×10-6) are displayed. Five gene-level association tests were used to analyze the data: simple burden tests with 1% or 5% cutoff (Burden-1 and Burden-5), SKAT tests with 1% or 5% cutoff (SKAT-1 and SKAT-5) and variable threshold (VT) tests that analyze variants with MAF<5%. Significant p-values for each test are displayed in bold font. For the associations that are significant, estimates of average genetic effect are also shown. The loci where one or more gene-based association signal exceeds the top single variant association signal are labeled with an asterisk.
| Gene | Gene Position | Burden-1 | Burden-5 | SKAT-1 | SKAT-5 | VT | MAF Cutoff | Direction of Single Variant Association Statistics | Estimates of Genetic Average Effect (s.d units) for Rare Variants under Different MAF Thresholds | ||
|---|---|---|---|---|---|---|---|---|---|---|---|
|
| |||||||||||
| 0.01 | 0.05 | VT | |||||||||
| chr15:58.7Mb | 1.4×10-2 | 3.7×10-3 | -++++--+- | 0.5 | 0.1 | 0.5 | |||||
| chr8:19.8Mb | 9.7×10-1 | 3.5×10-1 | 2.5×10-2 | (-)-(-)+-++ | - | -0.3 | -0.3 | ||||
| chr19:8.4Mb | 2.2×10-2 | 2.2×10-2 | 2.6×10-2 | (+)--++-+++ | - | 0.3 | 0.3 | ||||
| chr18:47.1Mb | 2.2×10-5 | 2.1×10-5 | 1.3×10-2 | -++----(+)+ | - | 0.4 | 0.4 | ||||
| chr20:43.0Mb | 7.5×10-1 | 6.8×10-1 | 4.1×10-2 | (-)--+-+ | - | -0.1 | -0.1 | ||||
| chr17:41.9Mb | 4.9×10-1 | 5.2×10-1 | 1.0×10-5 | 3.3×10-2 | (-)+-(+) | - | -0.1 | - | |||
|
| |||||||||||
| chr1:55.5Mb | 1.8×10-2 | 8.1×10-2 | 1.3×10-2 | (-)--(-)--+-++- | - | -0.3 | -0.5 | ||||
| chr19:45.3Mb | 1.7×10-1 | 1.5×10-1 | 3.0×10-5 | 3.6×10-2 | (-)+++(-)+-+++---+(-)+--+--++ | - | -0.1 | -0.1 | |||
| chr19:45.3Mb | 9.4×10-1 | 4.4×10-1 | 1.5×10-4 | 4.4×10-2 | -(-)--+-(-)(+) | - | -0.1 | -0.1 | |||
| chr19:45.2Mb | 6.1×10-2 | 4.8×10-2 | 6.3×10-2 | 4.9×10-2 | (-)++--+ | - | -0.1 | -0.1 | |||
| chr19:11.2Mb | 1.8×10-3 | 4.7×10-5 | 3.8×10-2 | 2.5×10-1 | 5.2×10-4 | +++++++++-++++--+ | - | - | 0.8 | ||
|
| |||||||||||
| chr19:8.4Mb | 2.6×10-2 | 3.7×10-2 | 2.6×10-2 | (-)+---+--- | - | -0.3 | -0.2 | ||||
| chr8:19.8Mb | 6.8×10-1 | 2.6×10-1 | 2.5×10-2 | (+)+(+)--+- | - | 0.2 | 0.2 | ||||
Gene position is defined based upon hg19, GRCh37 Genome Reference Consortium Human Reference 37
Direction of single site statistics for variants with MAF<5%. Variants within parenthesis have frequency >1%.
The loci with one or more gene-level association signal exceeding the top single variant signal.