| Literature DB >> 23762362 |
Allan F McRae1, Melinda M Richter, Penelope A Lind.
Abstract
While genome-wide association studies (GWAS) have been successful in identifying a large number of variants associated with disease, the challenge of locating the underlying causal loci remains. Sequencing of case and control DNA pools provides an inexpensive method for assessing all variation in a genomic region surrounding a significant GWAS result. However, individual variants need to be ranked in terms of the strength of their association to disease in order to prioritise follow-up by individual genotyping. A simple method for testing for case-control association in sequence data from DNA pools is presented that allows the partitioning of the variance in allele frequency estimates into components due to the sampling of chromosomes from the pool during sequencing, sampling individuals from the population and unequal contribution from individuals during pool construction. The utility of this method is demonstrated on a sequence from the alcohol dehydrogenase (ADH) gene cluster on a case-control sample for heavy alcohol consumption.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23762362 PMCID: PMC3676437 DOI: 10.1371/journal.pone.0065410
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1QQ-plot of -log10(p-value) from the test statistics using the three different estimates of the variance of the allele frequency in the sample pool: accounting for sample size in the pool (solid – variance #1), accounting for samples size and read depth (dashed – variance #2) and additionally accounting for variation in amounts of DNA in pool construction (dotted – variance #3).
The 95% confidence interval for the expected distribution with no association and independent variants is shaded. The p-values for the four most significant variants are indicated with crosses.