| Literature DB >> 20233392 |
Yurii S Aulchenko1, Maksim V Struchalin, Cornelia M van Duijn.
Abstract
BACKGROUND: Over the last few years, genome-wide association (GWA) studies became a tool of choice for the identification of loci associated with complex traits. Currently, imputed single nucleotide polymorphisms (SNP) data are frequently used in GWA analyzes. Correct analysis of imputed data calls for the implementation of specific methods which take genotype imputation uncertainty into account.Entities:
Mesh:
Year: 2010 PMID: 20233392 PMCID: PMC2846909 DOI: 10.1186/1471-2105-11-134
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Mean values of the test statistics (Wald for Linear, score for mmscore), genomic control λ (median test statistic over 0.455), and type 1 error at different α for different models.
| α | |||||
|---|---|---|---|---|---|
| Model | Mean( | 0.05 | 0.01 | 0.001 | |
| Linear | 1.206 | 1.224 | 0.073 | 0.018 | 0.0027 |
| Linear, robust | 1.210 | 1.228 | 0.073 | 0.018 | 0.0028 |
| Linear, mmscore | 0.984 | 1.007 | 0.047 | 0.009 | 0.0011 |
Tests were performed using a trait dependent on two covariates and with (adjusted) heritability of 30%. Only SNPs with estimated minor allele frequency greater than 0.01 (n = 212, 691) used. Linear: standard linear model; Linear, robust: linear models using with standard errors; Linear, mmscore: two-step approximation to mixed model, fixed effects included in step 1 of analysis.
Time for analysis of chromosome 2 imputed data (220,833 SNPs).
| Model | Option | No. people | CPU time |
|---|---|---|---|
| Linear | - | 500 | 0 m 43 s |
| 1000 | 1 m 23 s | ||
| 1500 | 2 m 10 s | ||
| Linear | 500 | 0 m 50 s | |
| 1000 | 1 m 43 s | ||
| 1500 | 2 m 35 s | ||
| Linear | 500 | 16 m 18 s | |
| 1000 | 92 m 45 s | ||
| 1500 | 231 m 49 s | ||
| Logistic | - | 500 | 3 m 20 s |
| 1000 | 6 m 38 s | ||
| 1500 | 10 m 8 s | ||
| Logistic | 500 | 3 m 25 s | |
| 1000 | 6 m 53 s | ||
| 1500 | 10 m 29 s | ||
| Cox PH | - | 500 | 2 m 18 s |
| 1000 | 4 m 30 s | ||
| 1500 | 6 m 43 s | ||
In all analyzes, 2 covariates were included in the model.