| Literature DB >> 23044548 |
Heide Fier1, Sungho Won, Dmitry Prokopenko, Taofik AlChawa, Kerstin U Ludwig, Rolf Fimmers, Edwin K Silverman, Marcello Pagano, Elisabeth Mangold, Christoph Lange.
Abstract
MOTIVATION: For the analysis of rare variants in sequence data, numerous approaches have been suggested. Fixed and flexible threshold approaches collapse the rare variant information of a genomic region into a test statistic with reduced dimensionality. Alternatively, the rare variant information can be combined in statistical frameworks that are based on suitable regression models, machine learning, etc. Although the existing approaches provide powerful tests that can incorporate information on allele frequencies and prior biological knowledge, differences in the spatial clustering of rare variants between cases and controls cannot be incorporated. Based on the assumption that deleterious variants and protective variants cluster or occur in different parts of the genomic region of interest, we propose a testing strategy for rare variants that builds on spatial cluster methodology and that guides the identification of the biological relevant segments of the region. Our approach does not require any assumption about the directions of the genetic effects.Entities:
Mesh:
Year: 2012 PMID: 23044548 PMCID: PMC3516147 DOI: 10.1093/bioinformatics/bts568
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Evaluation of type 1 error (500/500 cases/controls and 750/750 cases/controls, 30 rare variants)
| MAF | Number of cases/controls | DBM | |
|---|---|---|---|
| 0.01 | 0.05 | 500/500 | 0.051 |
| 0.01 | 0.01 | 500/500 | 0.008 |
| 0.005 | 0.05 | 500/500 | 0.050 |
| 0.005 | 0.01 | 500/500 | 0.015 |
| 0.01 | 0.05 | 750/750 | 0.047 |
| 0.01 | 0.01 | 750/750 | 0.008 |
| 0.005 | 0.05 | 750/750 | 0.050 |
| 0.005 | 0.01 | 750/750 | 0.011 |
Tested at α = 0.05 or α = 0.01, 1000 replicates.
Power estimates of outlined approaches in a non-clustered scenario
| MAF | 0.01 | 0.005 | 0.01 | 0.005 | 0.01 | 0.005 | 0.01 | 0.005 |
|---|---|---|---|---|---|---|---|---|
| Number of cases/controls | 500/500 | 500/500 | 750/750 | 750/750 | 500/500 | 500/500 | 750/750 | 750/750 |
| Number of variants | 30 | 30 | 30 | 30 | 30 | 30 | 30 | 30 |
| Number of risk variants/number of protective variants | 10/0 | 10/0 | 10/0 | 10/0 | 7/3 | 7/3 | 7/3 | 7/3 |
| CMC | 0.222 | 0.148 | 0.280 | 0.176 | 0.094 | 0.054 | 0.108 | 0.070 |
| Price | 0.232 | 0.152 | 0.274 | 0.160 | 0.132 | 0.102 | 0.160 | 0.140 |
| MB | 0.248 | 0.176 | 0.330 | 0.194 | 0.104 | 0.066 | 0.128 | 0.100 |
| RB | 0.372 | 0.250 | 0.446 | 0.288 | 0.166 | 0.114 | 0.226 | 0.168 |
| SKAT | 0.254 | 0.176 | 0.346 | 0.176 | 0.128 | 0.074 | 0.162 | 0.114 |
| C-Alpha | 0.172 | 0.112 | 0.244 | 0.134 | 0.130 | 0.104 | 0.210 | 0.126 |
| DBM | 0.332 | 0.218 | 0.342 | 0.238 | 0.132 | 0.122 | 0.152 | 0.162 |
Tested at α = 0.05, 500 replicates.
Power estimates of outlined approaches in a clustered scenario
| MAF | 0.01 | 0.005 | 0.01 | 0.005 | 0.01 | 0.005 | 0.01 | 0.005 |
|---|---|---|---|---|---|---|---|---|
| Numberof cases/controls | 500/500 | 500/500 | 750/750 | 750/750 | 500/500 | 500/500 | 750/750 | 750/750 |
| Number of variants | 30 | 30 | 30 | 30 | 30 | 30 | 30 | 30 |
| Number of risk variants/number of protective variants | 10/0 | 10/0 | 10/0 | 10/0 | 7/3 | 7/3 | 7/3 | 7/3 |
| CMC | 0.210 | 0.122 | 0.308 | 0.170 | 0.094 | 0.082 | 0.128 | 0.070 |
| Price | 0.242 | 0.166 | 0.290 | 0.196 | 0.162 | 0.090 | 0.162 | 0.128 |
| MB | 0.234 | 0.142 | 0.342 | 0.182 | 0.112 | 0.096 | 0.148 | 0.084 |
| RB | 0.326 | 0.214 | 0.488 | 0.292 | 0.186 | 0.142 | 0.236 | 0.170 |
| SKAT | 0.230 | 0.148 | 0.332 | 0.194 | 0.130 | 0.102 | 0.162 | 0.112 |
| C-Alpha | 0.190 | 0.096 | 0.218 | 0.136 | 0.156 | 0.084 | 0.206 | 0.114 |
| DBM | 0.392 | 0.294 | 0.494 | 0.352 | 0.310 | 0.208 | 0.324 | 0.224 |
Tested at α = 0.05, 500 replicates.
Fig. 1.Spatial distribution of rare variants in the sample. Two rare variants (one rare variant in cases and another in controls) with outlying positions are not shown in the figure
P-values of the compared methods for testing the association of 15q13.3 with NSCL/P
| MAF | CMC | Price | MB | RB | SKAT | C-Alpha | DBM |
|---|---|---|---|---|---|---|---|
| 0.01 | 0.783 | 0.279 | 0.562 | 0.256 | 0.062 | 0.071 | 0.006 |
| 0.05 | 0.695 | 0.281 | 0.556 | 0.331 | 0.152 | 0.914 | 0.011 |
P-values are based on 1000 permutations.