Yiwei Zhang, Wei Pan.
Abstract
To avoid inflated type I error and reduced power in genetic association studies, it is necessary to adjust properly for population stratification and known/unknown subject relatedness. It would be interesting to compare the performance of a principal component-based approach with a linear mixed model. Furthermore, with the availability of genome-wide sequencing data, the question of whether it is preferable to use common variants or rare variants for such an adjustment remains largely unknown. In this paper, we use the Genetic Analysis Workshop 18 data to empirically investigate these issues. We consider both a quantitative trait and a binary trait.
Year: 2014 PMID: 25519386 PMCID: PMC4143729 DOI: 10.1186/1753-6561-8-S1-S42
Source DB: PubMed Journal: BMC Proc ISSN: 1753-6561
Figure 1. Q-Q plots of p values without considering the correlation among samples
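As a rough illustration of what a Q-Q plot of p values compares, the sketch below pairs each sorted observed p value with its expected quantile under a uniform null, both on the -log10 scale. The function name `qq_points` and the plotting position `(rank - 0.5) / n` are assumptions made for this example, not details taken from the paper.

```python
import math


def qq_points(p_values):
    """Return (expected, observed) pairs of -log10 p values for a Q-Q plot.

    Assumes every p value lies in (0, 1]. The expected quantile for the
    rank-th smallest p value is (rank - 0.5) / n, a common plotting position.
    """
    n = len(p_values)
    points = []
    for rank, p in enumerate(sorted(p_values), start=1):
        expected_p = (rank - 0.5) / n
        points.append((-math.log10(expected_p), -math.log10(p)))
    return points


# Hypothetical p values, for illustration only.
for expected, observed in qq_points([0.5, 0.01, 0.1, 0.9]):
    print(f"expected={expected:.3f} observed={observed:.3f}")
```

Points that rise systematically above the diagonal suggest inflated test statistics, which is what an unadjusted analysis of correlated samples would be expected to show.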
Summary statistics of p values for SBP1 by PCA.V, PCA.IBS, and EMMAX
| Method | Min. | 1st Qu. | Median | Mean | 3rd Qu. | Max. | % ( | λ |
|---|---|---|---|---|---|---|---|---|
| PCA.V | 1.106e-05 | 0.232 | 0.486 | 0.491 | 0.750 | 1.000 | 0.053 | 1.068 |
| PCA.IBS | 5.022e-06 | 0.235 | 0.491 | 0.493 | 0.749 | 1.000 | 0.054 | 1.041 |
| EMMAX | 1.42e-05 | 0.254 | 0.516 | 0.508 | 0.758 | 1.000 | 0.043 | 0.974 |
The similarity matrix is based on CVs.
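The λ column is not defined in this excerpt. Assuming it denotes the genomic control inflation factor (the median 1-df chi-square statistic implied by the p values, divided by the null median of about 0.455), it could be computed as in the sketch below. The truncated "% (" column is assumed here to be the share of p values below a significance threshold; the default `alpha=0.05` and all function names are hypothetical.

```python
from statistics import NormalDist, median

_STD_NORMAL = NormalDist()
# Median of a chi-square distribution with 1 degree of freedom (about 0.4549).
CHI2_1DF_MEDIAN = _STD_NORMAL.inv_cdf(0.75) ** 2


def p_to_chi2(p):
    """Convert a two-sided p value in (0, 1] to a 1-df chi-square statistic."""
    z = _STD_NORMAL.inv_cdf(p / 2.0)  # lower-tail quantile; its sign is irrelevant
    return z * z


def inflation_factor(p_values):
    """Median observed chi-square statistic over the null median."""
    return median(p_to_chi2(p) for p in p_values) / CHI2_1DF_MEDIAN


def fraction_below(p_values, alpha=0.05):
    """Share of p values below alpha (alpha is an assumed threshold)."""
    return sum(p < alpha for p in p_values) / len(p_values)


# Evenly spaced p values mimic a well-calibrated test.
uniform_p = [(i - 0.5) / 1001 for i in range(1, 1002)]
print(inflation_factor(uniform_p), fraction_below(uniform_p))
```

Under this reading, a value of λ above 1 indicates inflated test statistics and a value below 1 indicates a conservative test.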
Figure 2. Comparison of PCA.V, PCA.IBS, and EMMAX in testing association between SBP1 and each of 6228 SNPs
Summary statistics of p values for HTN1 by PCA.V, PCA.IBS, and EMMAX
| Method | Min. | 1st Qu. | Median | Mean | 3rd Qu. | Max. | % ( | λ |
|---|---|---|---|---|---|---|---|---|
| PCA.V | 1.457e-04 | 0.239 | 0.489 | 0.494 | 0.748 | 1.000 | 0.055 | 1.054 |
| PCA.IBS | 7.044e-05 | 0.239 | 0.492 | 0.493 | 0.746 | 1.000 | 0.056 | 1.039 |
| EMMAX | 2.831e-04 | 0.259 | 0.510 | 0.507 | 0.761 | 1.000 | 0.048 | 0.977 |
The similarity matrix is based on CVs.
Figure 3. Comparison of PCA.V, PCA.IBS, and EMMAX in testing association between HTN1 and each of 6228 SNPs
Results of the association tests by PCA
| % ( | % ( | % ( | λ | λ | λ |
|---|---|---|---|---|---|
| PCA.V | PCA.IBS | EMMAX | PCA.V | PCA.IBS | EMMAX |
| 0.068 | 0.052 | 0.052 | 1.121 | 1.050 | 1.054 |
| 0.062 | 0.049 | 0.049 | 1.080 | 1.000 | 0.980 |
The similarity matrix is based on RVs.
Figure 4. Scree plots for the top 400 PCs of a similarity matrix based on (a) CVs or (b) RVs. The black line is for the covariance matrix and the red (gray) line is for the IBS matrix.
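The caption names two subject-by-subject similarity matrices, one covariance-based and one based on identity by state (IBS). The sketch below shows one common way to build each from genotypes coded as minor-allele counts (0, 1, or 2). The coding, the per-SNP centering, the IBS score `2 - |a - b|`, and the function names are assumptions for illustration; the paper's exact definitions may differ.

```python
def ibs_similarity(genotypes):
    """Average IBS sharing between every pair of subjects, scaled to [0, 1].

    genotypes: one list per subject holding minor-allele counts (0, 1, or 2).
    """
    n, m = len(genotypes), len(genotypes[0])
    sim = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i, n):
            shared = sum(2 - abs(a - b) for a, b in zip(genotypes[i], genotypes[j]))
            sim[i][j] = sim[j][i] = shared / (2.0 * m)
    return sim


def covariance_similarity(genotypes):
    """Subject-by-subject covariance after centering each SNP at its mean."""
    n, m = len(genotypes), len(genotypes[0])
    snp_means = [sum(row[k] for row in genotypes) / n for k in range(m)]
    centered = [[row[k] - snp_means[k] for k in range(m)] for row in genotypes]
    cov = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i, n):
            value = sum(a * b for a, b in zip(centered[i], centered[j])) / m
            cov[i][j] = cov[j][i] = value
    return cov


# Three hypothetical subjects typed at three SNPs.
toy = [[0, 1, 2], [0, 1, 2], [2, 1, 0]]
print(ibs_similarity(toy))
print(covariance_similarity(toy))
```

A scree plot would then display the sorted eigenvalues of either matrix, which a symmetric eigenvalue routine such as `numpy.linalg.eigvalsh` can supply.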