| Literature DB >> 15601530 |
Xiaohua Zhang1, Kathryn Roeder, Garrick Wallstrom, Bernie Devlin.
Abstract
In the search for genetic determinants of complex disease, two approaches to association analysis are most often employed, testing single loci or testing a small group of loci jointly via haplotypes for their relationship to disease status. It is still debatable which of these approaches is more favourable, and under what conditions. The former has the advantage of simplicity but suffers severely when alleles at the tested loci are not in linkage disequilibrium (LD) with liability alleles; the latter should capture more of the signal encoded in LD, but is far from simple. The complexity of haplotype analysis could be especially troublesome for association scans over large genomic regions, which, in fact, is becoming the standard design. For these reasons, the authors have been evaluating statistical methods that bridge the gap between single-locus and haplotype-based tests. In this article, they present one such method, which uses non-parametric regression techniques embodied by Bayesian adaptive regression splines (BARS). For a set of markers falling within a common genomic region and a corresponding set of single-locus association statistics, the BARS procedure integrates these results into a single test by examining the class of smooth curves consistent with the data. The non-parametric BARS procedure generally finds no signal when no liability allele exists in the tested region (ie it achieves the specified size of the test) and it is sensitive enough to pick up signals when a liability allele is present. The BARS procedure provides a robust and potentially powerful alternative to classical tests of association, diminishes the multiple testing problem inherent in those tests and can be applied to a wide range of data types, including genotype frequencies estimated from pooled samples.Entities:
Mesh:
Substances:
Year: 2003 PMID: 15601530 PMCID: PMC3525002 DOI: 10.1186/1479-7364-1-1-20
Source DB: PubMed Journal: Hum Genomics ISSN: 1473-9542 Impact factor: 4.639
Figure 1Non-parametric regression curves fitted to the hereditary haemochromatosis data from Feder [28]using as a measure of LD. Lines are obtained by using cubic polynomial regression (dashed), smoothing spline regression (dotted) and the BARS method (solid).
Figure 2[29] The arrow indicates the location of the mutation, and the corresponding line segment indicates the confidence interval obtained for the mode.
Figure 3Simulated linkage disequilibrium values measured using . The curves depict five of the many obtained via the MCMC algorithm.
The coverage of a liability locus by the 95% confidence interval using Gin 200 simulated datasets
| Smoothing splines | Bars | ||
|---|---|---|---|
| Coverage | 0.540 | 0.940 | 0.950 |
| SD | 0.123 | 0.338 | 0.363 |
Note: SD is the standard deviation of the location of the mode.
Sample frequencies of alleles obtained at a particular locus in a case-control study
| Cases | Controls | Marginal | |
|---|---|---|---|
| Allele 1 | |||
| Allele 2 | |||
| ⋮ | ⋮ | ⋮ | |
| Allele k | |||
| Marginal |
Note: k may differ across loci.