Literature DB >> 17410553

Optimal selection of markers for validation or replication from genome-wide association studies.

Celia M T Greenwood1, Jagadish Rangrej, Lei Sun.   

Abstract

With reductions in genotyping costs and the fast pace of improvements in genotyping technology, it is not uncommon for the individuals in a single study to undergo genotyping using several different platforms, where each platform may contain different numbers of markers selected via different criteria. For example, a set of cases and controls may be genotyped at markers in a small set of carefully selected candidate genes, and shortly thereafter, the same cases and controls may be used for a genome-wide single nucleotide polymorphism (SNP) association study. After such initial investigations, often, a subset of "interesting" markers is selected for validation or replication. Specifically, by validation, we refer to the investigation of associations between the selected subset of markers and the disease in independent data. However, it is not obvious how to choose the best set of markers for this validation. There may be a prior expectation that some sets of genotyping data are more likely to contain real associations. For example, it may be more likely for markers in plausible candidate genes to show disease associations than markers in a genome-wide scan. Hence, it would be desirable to select proportionally more markers from the candidate gene set. When a fixed number of markers are selected for validation, we propose an approach for identifying an optimal marker-selection configuration by basing the approach on minimizing the stratified false discovery rate. We illustrate this approach using a case-control study of colorectal cancer from Ontario, Canada, and we show that this approach leads to substantial reductions in the estimated false discovery rates in the Ontario dataset for the selected markers, as well as reductions in the expected false discovery rates for the proposed validation dataset. Copyright 2007 Wiley-Liss, Inc.

Entities:  

Mesh:

Substances:

Year:  2007        PMID: 17410553     DOI: 10.1002/gepi.20220

Source DB:  PubMed          Journal:  Genet Epidemiol        ISSN: 0741-0395            Impact factor:   2.135


  6 in total

1.  SNP characteristics predict replication success in association studies.

Authors:  Ivan P Gorlov; Jason H Moore; Bo Peng; Jennifer L Jin; Olga Y Gorlova; Christopher I Amos
Journal:  Hum Genet       Date:  2014-10-02       Impact factor: 4.132

2.  Were genome-wide linkage studies a waste of time? Exploiting candidate regions within genome-wide association studies.

Authors:  Yun J Yoo; Shelley B Bull; Andrew D Paterson; Daryl Waggott; Lei Sun
Journal:  Genet Epidemiol       Date:  2010-02       Impact factor: 2.135

3.  The performance of a new local false discovery rate method on tests of association between coronary artery disease (CAD) and genome-wide genetic variants.

Authors:  Shuyan Mei; Ali Karimnezhad; Marie Forest; David R Bickel; Celia M T Greenwood
Journal:  PLoS One       Date:  2017-09-20       Impact factor: 3.240

Review 4.  Find the Needle in the Haystack, Then Find It Again: Replication and Validation in the 'Omics Era.

Authors:  Wei Perng; Stella Aslibekyan
Journal:  Metabolites       Date:  2020-07-12

5.  The multiplicity problem in linkage analysis of gene expression data - the power of differentiating cis- and trans-acting regulators.

Authors:  Baisong Huang; Jagadish Rangrej; Andrew D Paterson; Lei Sun
Journal:  BMC Proc       Date:  2007-12-18

6.  Exploring the potential benefits of stratified false discovery rates for region-based testing of association with rare genetic variation.

Authors:  Changjiang Xu; Antonio Ciampi; Celia M T Greenwood
Journal:  Front Genet       Date:  2014-01-29       Impact factor: 4.599

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.