Literature DB >> 21451739

IMPROVING POPULATION-SPECIFIC ALLELE FREQUENCY ESTIMATES BY ADAPTING SUPPLEMENTAL DATA: AN EMPIRICAL BAYES APPROACH.

Marc Coram1, Hua Tang.   

Abstract

Estimation of the allele frequency at genetic markers is a key ingredient in biological and biomedical research, such as studies of human genetic variation or of the genetic etiology of heritable traits. As genetic data becomes increasingly available, investigators face a dilemma: when should data from other studies and population subgroups be pooled with the primary data? Pooling additional samples will generally reduce the variance of the frequency estimates; however, used inappropriately, pooled estimates can be severely biased due to population stratification. Because of this potential bias, most investigators avoid pooling, even for samples with the same ethnic background and residing on the same continent. Here, we propose an empirical Bayes approach for estimating allele frequencies of single nucleotide polymorphisms. This procedure adaptively incorporates genotypes from related samples, so that more similar samples have a greater influence on the estimates. In every example we have considered, our estimator achieves a mean squared error (MSE) that is smaller than either pooling or not, and sometimes substantially improves over both extremes. The bias introduced is small, as is shown by a simulation study that is carefully matched to a real data example. Our method is particularly useful when small groups of individuals are genotyped at a large number of markers, a situation we are likely to encounter in a genome-wide association study.

Entities:  

Year:  2007        PMID: 21451739      PMCID: PMC3065192          DOI: 10.1214/07-aoas121

Source DB:  PubMed          Journal:  Ann Appl Stat        ISSN: 1932-6157            Impact factor:   2.083


  26 in total

1.  Empirical Bayes procedure for estimating genetic distance between populations and effective population size.

Authors:  S Kitada; T Hayashi; H Kishino
Journal:  Genetics       Date:  2000-12       Impact factor: 4.562

2.  Genomic control for association studies.

Authors:  B Devlin; K Roeder
Journal:  Biometrics       Date:  1999-12       Impact factor: 2.571

3.  Genetic structure of human populations.

Authors:  Noah A Rosenberg; Jonathan K Pritchard; James L Weber; Howard M Cann; Kenneth K Kidd; Lev A Zhivotovsky; Marcus W Feldman
Journal:  Science       Date:  2002-12-20       Impact factor: 47.728

4.  The genetical structure of populations.

Authors:  S WRIGHT
Journal:  Ann Eugen       Date:  1951-03

5.  Support from the relationship of genetic and geographic distance in human populations for a serial founder effect originating in Africa.

Authors:  Sohini Ramachandran; Omkar Deshpande; Charles C Roseman; Noah A Rosenberg; Marcus W Feldman; L Luca Cavalli-Sforza
Journal:  Proc Natl Acad Sci U S A       Date:  2005-10-21       Impact factor: 11.205

6.  A haplotype map of the human genome.

Authors: 
Journal:  Nature       Date:  2005-10-27       Impact factor: 49.962

Review 7.  Genome-wide association studies for common diseases and complex traits.

Authors:  Joel N Hirschhorn; Mark J Daly
Journal:  Nat Rev Genet       Date:  2005-02       Impact factor: 53.242

8.  Whole-genome patterns of common DNA variation in three human populations.

Authors:  David A Hinds; Laura L Stuve; Geoffrey B Nilsen; Eran Halperin; Eleazar Eskin; Dennis G Ballinger; Kelly A Frazer; David R Cox
Journal:  Science       Date:  2005-02-18       Impact factor: 47.728

Review 9.  Positive natural selection in the human lineage.

Authors:  P C Sabeti; S F Schaffner; B Fry; J Lohmueller; P Varilly; O Shamovsky; A Palma; T S Mikkelsen; D Altshuler; E S Lander
Journal:  Science       Date:  2006-06-16       Impact factor: 47.728

Review 10.  Genetic dissection of complex traits.

Authors:  E S Lander; N J Schork
Journal:  Science       Date:  1994-09-30       Impact factor: 47.728

View more
  4 in total

1.  Integrating genetic and gene expression evidence into genome-wide association analysis of gene sets.

Authors:  Qing Xiong; Nicola Ancona; Elizabeth R Hauser; Sayan Mukherjee; Terrence S Furey
Journal:  Genome Res       Date:  2011-09-22       Impact factor: 9.043

2.  Estimating the number of unseen variants in the human genome.

Authors:  Iuliana Ionita-Laza; Christoph Lange; Nan M Laird
Journal:  Proc Natl Acad Sci U S A       Date:  2009-03-10       Impact factor: 11.205

3.  BETASEQ: a powerful novel method to control type-I error inflation in partially sequenced data for rare variant association testing.

Authors:  Song Yan; Yun Li
Journal:  Bioinformatics       Date:  2013-12-12       Impact factor: 6.937

4.  Effective sample size: Quick estimation of the effect of related samples in genetic case-control association analyses.

Authors:  Yaning Yang; Elaine F Remmers; Chukwuma B Ogunwole; Daniel L Kastner; Peter K Gregersen; Wentian Li
Journal:  Comput Biol Chem       Date:  2011-01-22       Impact factor: 2.877

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.