| Literature DB >> 27980630 |
Alden Green1, Kaitlyn Cook2, Kelsey Grinde3, Alessandra Valcarcel4, Nathan Tintle5.
Abstract
Current rare-variant, gene-based tests of association often suffer from a lack of statistical power to detect genotype-phenotype associations as a result of a lack of prior knowledge of genetic disease models combined with limited observations of extremely rare causal variants in population-based samples. The use of pedigree data, in which rare variants are often more highly concentrated than in population-based data, has been proposed as 1 possible method for enhancing power. Methods for combining multiple gene-based tests of association into a single summary p value are a robust approach to different genetic architectures when little a priori knowledge is available about the underlying genetic disease model. To date, however, little consideration has been given to combining gene-based tests of association for the analysis of pedigree data. We propose a flexible framework for combining any number of family-based rare-variant tests of association into a single summary statistic and for assessing the significance of that statistic. We show that this approach maintains type I error and improves the robustness, to different genetic architectures, of the statistical power of family- and gene-based rare-variant tests through application to simulated phenotype data from Genetic Analysis Workshop 19.Entities:
Year: 2016 PMID: 27980630 PMCID: PMC5133502 DOI: 10.1186/s12919-016-0024-y
Source DB: PubMed Journal: BMC Proc ISSN: 1753-6561
Type I error rate for 10 genes selected to be representative of all 30 genes explored
| Gene |
|
|
|
|
|
|
|
| Combined |
|---|---|---|---|---|---|---|---|---|---|
| High power | |||||||||
|
| 0.055 | 0.05 | 0.015 | 0.03 | 0.065 | 0.04 | 0.04 | 0.025 | 0.03 |
|
| 0.055 | 0.085 | 0.07 | 0.07 | 0.08 | 0.075 | 0.075 | 0.07 | 0.055 |
|
| 0.06 | 0.05 | 0.055 | 0.09 | 0.085 | 0.09 | 0.055 | 0.06 | 0.07 |
|
| 0.09 | 0.135 | 0.025 | 0.055 | 0.08 | 0.18 | 0.055 | 0.075 | 0.055 |
| Low power | |||||||||
|
| 0.07 | 0.045 | 0.115 | 0.065 | 0.075 | 0.065 | 0.065 | 0.045 | 0.04 |
|
| 0.04 | 0.095 | 0.045 | 0.055 | 0.04 | 0.08 | 0.035 | 0.055 | 0.035 |
|
| 0.005 | 0.025 | 0.055 | 0.015 | 0.01 | 0.045 | 0.01 | 0.035 | 0.03 |
|
| 0.045 | 0.045 | 0.025 | 0.06 | 0.045 | 0.035 | 0.04 | 0.04 | 0.03 |
| Other representative genes | |||||||||
|
| 0.05 | 0.035 | 0.055 | 0.045 | 0.055 | 0.055 | 0.05 | 0.045 | 0.035 |
|
| 0.065 | 0.035 | 0.045 | 0.02 | 0.06 | 0.055 | 0.055 | 0.04 | 0.03 |
| Mean Across All 30 Genes (SD) | 0.067 (0.03) | 0.074 (0.05) | 0.048 (0.02) | 0.059 (0.05) | 0.07 (0.03) | 0.084 (0.05) | 0.058 (0.03) | 0.065 (0.05) | 0.058 (0.04) |
Type I error rates are reported for all 9 tests across 10 different genes: the 4 genes with highest overall power, the 4 genes with lowest overall power, and 2 genes deemed representative of the remaining genes explored. The mean type I error rate for each test statistic across all 30 genes is reported at the bottom of the table. Averaged across all 30 genes, the type I error rate was nominally conserved at a 0.05 level
Power for 10 genes selected to be representative of all 30 genes explored
| Gene |
|
|
|
|
|
|
|
| Combined |
|---|---|---|---|---|---|---|---|---|---|
| High power | |||||||||
|
| 0.425 | 0.315 | 0.07 | 0.075 | 0.885 | 0.585 | 0.785 | 0.435 | 0.58 |
|
| 0.525 | 0.315 | 0.13 | 0.055 | 0.53 | 0.365 | 0.455 | 0.365 | 0.325 |
|
| 0.995 | 0.995 | 0.225 | 0.99 | 0.995 | 0.995 | 0.995 | 0.995 | 0.995 |
|
| 0.995 | 0.82 | 0.54 | 0.17 | 0.99 | 0.98 | 0.985 | 0.78 | 0.945 |
| Low power | |||||||||
|
| 0.025 | 0.015 | 0.005 | 0.01 | 0.025 | 0.03 | 0.025 | 0.035 | 0.025 |
|
| 0.04 | 0.025 | 0.04 | 0.005 | 0.035 | 0.045 | 0.045 | 0.015 | 0.025 |
|
| 0.02 | 0.0 | 0.025 | 0.035 | 0.01 | 0.005 | 0.015 | 0.025 | 0.01 |
|
| 0.025 | 0.005 | 0.05 | 0.04 | 0.005 | 0.005 | 0.02 | 0.015 | 0.01 |
| Other representative genes | |||||||||
|
| 0.32 | 0.03 | 0.05 | 0.015 | 0.30 | 0.10 | 0.13 | 0.035 | 0.11 |
|
| 0.09 | 0.325 | 0.10 | 0.21 | 0.08 | 0.295 | 0.07 | 0.325 | 0.19 |
| Mean Across All 30 Genes (SD) | 0.18 (0.25) | 0.18 (0.23) | 0.08 (0.11) | 0.11 (0.18) | 0.22 (0.30) | 0.20 (0.25) | 0.17 (0.27) | 0.17 (0.23) | 0.17 (0.25) |
Statistical power is reported for all 9 tests across 10 different genes: the 4 genes with highest overall power, the 4 genes with lowest overall power, and 2 genes deemed representative of the remaining genes explored. The mean power for each test statistic across all 30 genes is reported at the bottom of the table
Fig. 1Empirical power of all 9 family-based rare-variant tests of association for 2 genes of interest, PDCD6IP and ZBTB38. When performing family-based rare-variant tests of association on gene PDCD61P, -weighted tests (Q2, Q1, Q3, Q4) saw higher empirical power than -weighted tests (Q2*, Q1*, Q4*, Q3*); for gene ZBTB38, this trend was reversed. The combined test (black) maintained consistent power between the 2 genes, and was less susceptible to the change in genetic architecture