| Literature DB >> 18466524 |
Robert P Igo1, Douglas Londono, Katherine Miller, Antonio R Parrado, Shannon R E Quade, Moumita Sinha, Sulgi Kim, Sungho Won, Jing Li, Katrina A B Goddard.
Abstract
Clustering of related haplotypes in haplotype-based association mapping has the potential to improve power by reducing the degrees of freedom without sacrificing important information about the underlying genetic structure. We have modified a generalized linear model approach for association analysis by incorporating a density-based clustering algorithm to reduce the number of coefficients in the model. Using the GAW 15 Problem 3 simulated data, we show that our novel method can substantially enhance power to detect association with the binary rheumatoid arthritis (RA) phenotype at the HLA-DRB1 locus on chromosome 6. In contrast, clustering did not appreciably improve performance at locus D, perhaps a consequence of a rare susceptibility allele and of the overwhelming effect of HLA-DRB1/locus C, 5 cM distal. Optimization of parameters governing the clustering algorithm identified a set of parameters that delivered nearly ideal performance in a variety of situations. The cluster-based score test was valid over a wide range of haplotype diversity, and was robust to severe departures from Hardy-Weinberg equilibrium encountered near HLA-DRB1 in RA case-control samples.Entities:
Year: 2007 PMID: 18466524 PMCID: PMC2367537 DOI: 10.1186/1753-6561-1-s1-s27
Source DB: PubMed Journal: BMC Proc ISSN: 1753-6561
Figure 1Deviation from HWE in the neighborhood ofHLA-DRB1. The χ2 test for HWE was applied to samples from 100 replicates; the percentage of replicates in which p < 0.001 is shown for 170 consecutive markers near HLA-DRB1 (arrow).
Figure 2Comparison of TDTEX and FBAT results nearHLA-DRB1. Median negative log10(p) values from TDTEX (top) and FBAT (bottom) over 100 replicates are plotted for 150 consecutive markers near HLA-DRB1 (arrow).
Power of haplotype- and cluster-based association analyses at the HLA-DRB1 locus
| Power | |||
| α = 0.05 | α = 0.01 | Mean d.f. | |
| Haplotypes | 0.672 | 0.379 | 9.36 |
| Clusters | 0.812 | 0.593 | 3.65 |
| Haplotypes | 0.654 | 0.356 | 10.07 |
| Clusters | 0.822 | 0.597 | 3.94 |
Sex and number of DRB1*04 alleles were included as covariates in all analyses.
Proportion of 1000 data sets of 150 each cases and controls giving p-values below nominal α.
Mean degrees of freedom from the score test for association.
SNPs 3435–3440.
SNPs 3433, 3435–3441.
Power of association analyses at Locus D
| Power, no covariates | Power, adjusted | ||||
| α = 0.05 | α = 0.01 | α = 0.05 | α = 0.01 | Mean d.f. | |
| Haplotypes | 0.240 | 0.088 | 0.059 | 0.009 | 9.04 |
| Clusters, ε = 0.4 | 0.272 | 0.104 | -- | -- | 4.48 |
| Clusters, ε = 0.5 | 0.271 | 0.098 | 0.079 | 0.022 | 4.43 |
| Haplotypes | 0.448 | 0.206 | 0.112 | 0.025 | 14.37 |
| Clusters, ε = 0.7 | 0.470 | 0.239 | -- | -- | 10.26 |
| Clusters, ε = 0.5 | 0.448 | 0.236 | 0.135 | 0.019 | 10.26 |
Power was measured over 800 data sets of 500 each cases and controls.
Sex and number of DRB1*04 alleles were included as covariates.
See Table 1. Values shown are from analyses without covariates; mean d.f. from adjusted analyses were nearly identical.
SNPs 3914–3919.
Value of ε maximizing power for score test without covariates.
SNPs 3913–3920.
Type I error of score tests, as a function of haplotype diversity
| Haplotype analyses | Cluster analyses | ||||||
| Diversity | Haplotypes | α = 0.05 | α = 0.01 | d.f. | α = 0.05 | α = 0.01 | d.f. |
| Low | 4.2 | 0.031 | 0.001 | 3.17 | 0.060 | 0.012 | 1.21 |
| Medium | 10 | 0.022 | 0.003 | 8.70 | 0.035 | 0.003 | 4.02 |
| High | 17 | 0.015 | 0.001 | 15.40 | 0.035 | 0.002 | 7.12 |
| Very high | 28 | 0.010 | 0.002 | 23.02 | 0.040 | 0.007 | 10.06 |
| Low | 5.2 | 0.027 | 0.006 | 4.18 | 0.042 | 0.007 | 1.00 |
| Medium | 15 | 0.030 | 0.003 | 11.53 | 0.047 | 0.010 | 4.81 |
| High | 25 | 0.011 | 0.000 | 21.28 | 0.038 | 0.005 | 8.82 |
| Very high | 48 | 0.002 | 0.000 | 40.15 | 0.025 | 0.001 | 23.18 |
Mean haplotype diversity over samples of 200 each RA cases and controls taken from 50 different Problem 3 replicates.
Degrees of freedom from the score test averaged over 1000 samples of 150 each cases and controls analyzed with adjustment for sex and DRB1*04 alleles.