| Literature DB >> 25519384 |
Cong Li1, Can Yang2, Mengjie Chen1, Xiaowei Chen1, Lin Hou2, Hongyu Zhao2.
Abstract
High-throughput sequencing technology allows researchers to test associations between phenotypes and all the variants identified throughout the genome, and is especially useful for analyzing rare variants. However, the statistical power to identify phenotype-associated rare variants is very low with typical genome-wide association studies because of their low allele frequencies among unrelated individuals. In contrast, a family-based design may have more power because rare variants are more likely to be enriched in families than among unrelated individuals. Regardless, an analysis of family-based association studies needs to account appropriately for relatedness between family members. We analyzed the observed quantitative trait systolic blood pressure as well as the simulated Q1 data in the Genetic Analysis Workshop 18 data set using 4 tests: (a) a single-variant test, (b) a collapsing test, (c) a single-variant test where familial relatedness was accounted for, and (d) a collapsing test where familial relatedness was accounted for. We then compared the results of the 4 methods and observed that adjusting for familial relatedness could appropriately control the false-positive rate while maintaining reasonable power to detect several strongly associated variants/genes.Entities:
Year: 2014 PMID: 25519384 PMCID: PMC4143885 DOI: 10.1186/1753-6561-8-S1-S39
Source DB: PubMed Journal: BMC Proc ISSN: 1753-6561
The power and false-positive rate (FPR) of the 4 methods in simulated data
| Gene | UNI | UNI-ADJ | SKAT | SKAT-ADJ | |||||
|---|---|---|---|---|---|---|---|---|---|
| # of NS variants | Power | FPR | Power | FPR | Power | FPR | Power | FPR | |
| 8 | 1 | 0.12 | 1 | 0.04 | 1 | 0.13 | 1 | 0.03 | |
| 20 | 0.9 | 0.225 | 0.74 | 0.055 | 0.895 | 0.38 | 0.825 | 0.04 | |
| 6 | 0.915 | 0.125 | 0.895 | 0.05 | 0.22 | 0.085 | 0.73 | 0.04 | |
| 4 | 0.66 | 0.105 | 0.3 | 0.045 | 0.33 | 0.135 | 0.305 | 0.07 | |
| 13 | 0.44 | 0.14 | 0.165 | 0.03 | 0.08 | 0.17 | 0.515 | 0.035 | |
| 9 | 0.070 | 0.155 | 0.035 | 0.030 | 0.150 | 0.075 | 0.050 | 0.075 | |
| 6 | 0.265 | 0.155 | 0.130 | 0.070 | 0.030 | 0.065 | 0.385 | 0.065 | |
A significance level of 0.05 was used to calculate both power and FPR.
Figure 1Q-Q plots for the 4 methods in real data. The Q-Q plots are based on −log10 p-values of the 7781 genes with at least 1 nonsynonymous mutation in the real data. Red curves represent the observed p values and the black curves represent the expected p values under the null model.