| Literature DB >> 22373113 |
Hae-Won Uh1, Roula Tsonaka, Jeanine J Houwing-Duistermaat.
Abstract
Analyzing sequencing data is difficult because of the low frequency of rare variants, which may result in low power to detect associations. We consider pathway analysis to detect multiple common and rare variants jointly and to investigate whether analysis at the pathway level provides an alternative strategy for identifying susceptibility genes. Available pathway analysis methods for data from genome-wide association studies might not be efficient because these methods are designed to detect common variants. Here, we investigate the performance of several existing pathway analysis methods for sequencing data. In particular, we consider the global test, which does not consider linkage disequilibrium between the variants in a gene. We improve the performance of the global test by assigning larger weights to rare variants, as proposed in the weighted-sum approach. Our conclusion is that straightforward application of pathway analysis is not satisfactory; hence, when common and rare variants are jointly analyzed, larger weights should be assigned to rare variants.Entities:
Year: 2011 PMID: 22373113 PMCID: PMC3287932 DOI: 10.1186/1753-6561-5-S9-S90
Source DB: PubMed Journal: BMC Proc ISSN: 1753-6561
Type I error rate and empirical power of tests for 125 SNPs in the VEGF pathway
| Method | Type I error rate | Empirical power |
|---|---|---|
| Global test | 0.044 | 0.41 |
| GRASS | 0.047 | 0.43 |
| Empirical Bayes | 0.055 | 0.05 |
| Weighted global test | 0.041 | 0.65 |
Empirical power of tests for genes in the VEGF pathway
| Gene | Number of SNPs | Empirical power | ||||
|---|---|---|---|---|---|---|
| Total | Common | Rare | CMC | Weighted-sum | Weighted global test | |
| 18 (5) | 3 (1) | 15 (4) | 0.31 | 0.28 | 0.12 | |
| 10 (2) | 3 (0) | 7 (2) | 0.27 | 0.10 | 0.14 | |
| 35 (11) | 10 (3) | 25 (8) | 0.80 | 0.83 | 0.84 | |
| 10 (2) | 2 (0) | 8 (2) | 0.03 | 0.02 | 0.02 | |
| 8 (4) | 1 (1) | 7 (3) | 0.03 | 0.02 | 0.03 | |
| 21 (3) | 6 (0) | 15 (3) | 0.13 | 0.04 | 0.15 | |
| 16 (10) | 2 (2) | 14 (8) | 0.16 | 0.09 | 0.08 | |
| 6 (1) | 1 (0) | 5 (1) | 0.07 | 0.05 | 0.08 | |
| 1 (1) | 0 (0) | 1 (1) | ||||
The number of functional SNPs is shown in parentheses.
Figure 1Linkage disequilibrium of 125 SNPs using The low r2 ≈ 0 values (white cells) show weak correlations between the SNPs. The white cells indicate absence of LD (r2 = 0), shades of grey intermediate degree of LD (0
Figure 2Empirical Bayes estimates of individuals per gene. The EB estimates were obtained using the EB approach in a gene-level analysis and summarize the amount of information contained by each gene. Higher EB estimates indicate more information for the gene. The green circles depict individuals with rare variants.