Literature DB >> 16624925

Two-stage designs in case-control association analysis.

Yijun Zuo1, Guohua Zou, Hongyu Zhao.   

Abstract

DNA pooling is a cost-effective approach for collecting information on marker allele frequency in genetic studies. It is often suggested as a screening tool to identify a subset of candidate markers from a very large number of markers to be followed up by more accurate and informative individual genotyping. In this article, we investigate several statistical properties and design issues related to this two-stage design, including the selection of the candidate markers for second-stage analysis, statistical power of this design, and the probability that truly disease-associated markers are ranked among the top after second-stage analysis. We have derived analytical results on the proportion of markers to be selected for second-stage analysis. For example, to detect disease-associated markers with an allele frequency difference of 0.05 between the cases and controls through an initial sample of 1000 cases and 1000 controls, our results suggest that when the measurement errors are small (0.005), approximately 3% of the markers should be selected. For the statistical power to identify disease-associated markers, we find that the measurement errors associated with DNA pooling have little effect on its power. This is in contrast to the one-stage pooling scheme where measurement errors may have large effect on statistical power. As for the probability that the disease-associated markers are ranked among the top in the second stage, we show that there is a high probability that at least one disease-associated marker is ranked among the top when the allele frequency differences between the cases and controls are not <0.05 for reasonably large sample sizes, even though the errors associated with DNA pooling in the first stage are not small. Therefore, the two-stage design with DNA pooling as a screening tool offers an efficient strategy in genomewide association studies, even when the measurement errors associated with DNA pooling are nonnegligible. For any disease model, we find that all the statistical results essentially depend on the population allele frequency and the allele frequency differences between the cases and controls at the disease-associated markers. The general conclusions hold whether the second stage uses an entirely independent sample or includes both the samples used in the first stage and an independent set of samples.

Mesh:

Substances:

Year:  2006        PMID: 16624925      PMCID: PMC1526674          DOI: 10.1534/genetics.105.042648

Source DB:  PubMed          Journal:  Genetics        ISSN: 0016-6731            Impact factor:   4.562


  16 in total

1.  Family-based association tests for different family structures using pooled DNA.

Authors:  Guohua Zou; Hongyu Zhao
Journal:  Ann Hum Genet       Date:  2005-07       Impact factor: 1.670

Review 2.  The relative power of family-based and case-control designs for linkage disequilibrium studies of complex human diseases I. DNA pooling.

Authors:  N Risch; J Teng
Journal:  Genome Res       Date:  1998-12       Impact factor: 9.043

3.  Association mapping of disease loci, by use of a pooled DNA genomic screen.

Authors:  L F Barcellos; W Klitz; L L Field; R Tobias; A M Bowcock; R Wilson; M P Nelson; J Nagatomi; G Thomson
Journal:  Am J Hum Genet       Date:  1997-09       Impact factor: 11.025

4.  The future of genetic studies of complex human diseases.

Authors:  N Risch; K Merikangas
Journal:  Science       Date:  1996-09-13       Impact factor: 47.728

5.  Two-stage global search designs for linkage analysis using pairs of affected relatives.

Authors:  R C Elston; X Guo; L V Williams
Journal:  Genet Epidemiol       Date:  1996       Impact factor: 2.135

6.  Association testing by DNA pooling: an effective initial screen.

Authors:  Aruna Bansal; Dirk van den Boom; Stefan Kammerer; Christiane Honisch; Gail Adam; Charles R Cantor; Patrick Kleyn; Andi Braun
Journal:  Proc Natl Acad Sci U S A       Date:  2002-12-10       Impact factor: 11.205

Review 7.  Searching for genetic determinants in the new millennium.

Authors:  N J Risch
Journal:  Nature       Date:  2000-06-15       Impact factor: 49.962

8.  Two-stage designs for gene-disease association studies with sample size constraints.

Authors:  Jaya M Satagopan; E S Venkatraman; Colin B Begg
Journal:  Biometrics       Date:  2004-09       Impact factor: 2.571

9.  High-throughput development and characterization of a genomewide collection of gene-based single nucleotide polymorphism markers by chip-based matrix-assisted laser desorption/ionization time-of-flight mass spectrometry.

Authors:  K H Buetow; M Edmonson; R MacDonald; R Clifford; P Yip; J Kelley; D P Little; R Strausberg; H Koester; C R Cantor; A Braun
Journal:  Proc Natl Acad Sci U S A       Date:  2001-01-02       Impact factor: 11.205

10.  Optimal two-stage genotyping in population-based association studies.

Authors:  Jaya M Satagopan; Robert C Elston
Journal:  Genet Epidemiol       Date:  2003-09       Impact factor: 2.135

View more
  22 in total

1.  Identification of the genetic basis for complex disorders by use of pooling-based genomewide single-nucleotide-polymorphism association studies.

Authors:  John V Pearson; Matthew J Huentelman; Rebecca F Halperin; Waibhav D Tembe; Stacey Melquist; Nils Homer; Marcel Brun; Szabolcs Szelinger; Keith D Coon; Victoria L Zismann; Jennifer A Webster; Thomas Beach; Sigrid B Sando; Jan O Aasly; Reinhard Heun; Frank Jessen; Heike Kolsch; Magdalini Tsolaki; Makrina Daniilidou; Eric M Reiman; Andreas Papassotiropoulos; Michael L Hutton; Dietrich A Stephan; David W Craig
Journal:  Am J Hum Genet       Date:  2006-12-06       Impact factor: 11.025

2.  Optimal two-stage design for case-control association analysis incorporating genotyping errors.

Authors:  Y Zuo; G Zou; J Wang; H Zhao; H Liang
Journal:  Ann Hum Genet       Date:  2008-01-23       Impact factor: 1.670

3.  Entropy-based joint analysis for two-stage genome-wide association studies.

Authors:  Guolian Kang; Yijun Zuo
Journal:  J Hum Genet       Date:  2007-08-09       Impact factor: 3.172

4.  Optimal 2-stage design with given power in association studies.

Authors:  Jiexun Wang; Hua Liang; Guohua Zou
Journal:  Biostatistics       Date:  2008-12-03       Impact factor: 5.899

5.  Multimarker analysis and imputation of multiple platform pooling-based genome-wide association studies.

Authors:  Nils Homer; Waibhav D Tembe; Szabolcs Szelinger; Margot Redman; Dietrich A Stephan; John V Pearson; Stanley F Nelson; David Craig
Journal:  Bioinformatics       Date:  2008-07-10       Impact factor: 6.937

6.  Optimal DNA pooling-based two-stage designs in case-control association studies.

Authors:  Yihong Zhao; Shuang Wang
Journal:  Hum Hered       Date:  2008-10-17       Impact factor: 0.444

7.  The efficacy of detecting variants with small effects on the Affymetrix 6.0 platform using pooled DNA.

Authors:  Charleston W K Chiang; Zofia K Z Gajdos; Joshua M Korn; Johannah L Butler; Rachel Hackett; Candace Guiducci; Thutrang T Nguyen; Rainford Wilks; Terrence Forrester; Katherine D Henderson; Loic Le Marchand; Brian E Henderson; Christopher A Haiman; Richard S Cooper; Helen N Lyon; Xiaofeng Zhu; Colin A McKenzie; Mark R Palmert; Joel N Hirschhorn
Journal:  Hum Genet       Date:  2011-03-22       Impact factor: 4.132

8.  Design of association studies with pooled or un-pooled next-generation sequencing data.

Authors:  Su Yeon Kim; Yingrui Li; Yiran Guo; Ruiqiang Li; Johan Holmkvist; Torben Hansen; Oluf Pedersen; Jun Wang; Rasmus Nielsen
Journal:  Genet Epidemiol       Date:  2010-07       Impact factor: 2.135

9.  Robust joint analysis allowing for model uncertainty in two-stage genetic association studies.

Authors:  Dongdong Pan; Qizhai Li; Ningning Jiang; Aiyi Liu; Kai Yu
Journal:  BMC Bioinformatics       Date:  2011-01-07       Impact factor: 3.169

10.  Validation of pooled genotyping on the Affymetrix 500 k and SNP6.0 genotyping platforms using the polynomial-based probe-specific correction.

Authors:  Ramani Anantharaman; Fook Tim Chew
Journal:  BMC Genet       Date:  2009-12-14       Impact factor: 2.797

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.