Literature DB >> 16251459

Ascertainment bias in studies of human genome-wide polymorphism.

Andrew G Clark1, Melissa J Hubisz, Carlos D Bustamante, Scott H Williamson, Rasmus Nielsen.   

Abstract

Large-scale SNP genotyping studies rely on an initial assessment of nucleotide variation to identify sites in the DNA sequence that harbor variation among individuals. This "SNP discovery" sample may be quite variable in size and composition, and it has been well established that properties of the SNPs that are found are influenced by the discovery sampling effort. The International HapMap project relied on nearly any piece of information available to identify SNPs-including BAC end sequences, shotgun reads, and differences between public and private sequences-and even made use of chimpanzee data to confirm human sequence differences. In addition, the ascertainment criteria shifted from using only SNPs that had been validated in population samples, to double-hit SNPs, to finally accepting SNPs that were singletons in small discovery samples. In contrast, Perlegen's primary discovery was a resequencing-by-hybridization effort using the 24 people of diverse origin in the Polymorphism Discovery Resource. Here we take these two data sets and contrast two basic summary statistics, heterozygosity and F(ST), as well as the site frequency spectra, for 500-kb windows spanning the genome. The magnitude of disparity between these samples in these measures of variability indicates that population genetic analysis on the raw genotype data is ill advised. Given the knowledge of the discovery samples, we perform an ascertainment correction and show how the post-correction data are more consistent across these studies. However, discrepancies persist, suggesting that the heterogeneity in the SNP discovery process of the HapMap project resulted in a data set resistant to complete ascertainment correction. Ascertainment bias will likely erode the power of tests of association between SNPs and complex disorders, but the effect will likely be small, and perhaps more importantly, it is unlikely that the bias will introduce false-positive inferences.

Entities:  

Mesh:

Year:  2005        PMID: 16251459      PMCID: PMC1310637          DOI: 10.1101/gr.4107905

Source DB:  PubMed          Journal:  Genome Res        ISSN: 1088-9051            Impact factor:   9.043


  20 in total

1.  Use of unlinked genetic markers to detect population stratification in association studies.

Authors:  J K Pritchard; N A Rosenberg
Journal:  Am J Hum Genet       Date:  1999-07       Impact factor: 11.025

2.  Prospects for whole-genome linkage disequilibrium mapping of common disease genes.

Authors:  L Kruglyak
Journal:  Nat Genet       Date:  1999-06       Impact factor: 38.330

3.  The effect of single nucleotide polymorphism identification strategies on estimates of linkage disequilibrium.

Authors:  Joshua M Akey; Kun Zhang; Momiao Xiong; Li Jin
Journal:  Mol Biol Evol       Date:  2003-02       Impact factor: 16.240

4.  Correcting for ascertainment biases when analyzing SNP data: applications to the estimation of linkage disequilibrium.

Authors:  Rasmus Nielsen; James Signorovitch
Journal:  Theor Popul Biol       Date:  2003-05       Impact factor: 1.570

5.  Reconstituting the frequency spectrum of ascertained single-nucleotide polymorphism data.

Authors:  Rasmus Nielsen; Melissa J Hubisz; Andrew G Clark
Journal:  Genetics       Date:  2004-09-15       Impact factor: 4.562

6.  Haplotype diversity across 100 candidate genes for inflammation, lipid metabolism, and blood pressure regulation in two populations.

Authors:  Dana C Crawford; Christopher S Carlson; Mark J Rieder; Dana P Carrington; Qian Yi; Joshua D Smith; Michael A Eberle; Leonid Kruglyak; Deborah A Nickerson
Journal:  Am J Hum Genet       Date:  2004-03-10       Impact factor: 11.025

7.  Whole-genome patterns of common DNA variation in three human populations.

Authors:  David A Hinds; Laura L Stuve; Geoffrey B Nilsen; Eran Halperin; Eleazar Eskin; Dennis G Ballinger; Kelly A Frazer; David R Cox
Journal:  Science       Date:  2005-02-18       Impact factor: 47.728

8.  Simultaneous inference of selection and population growth from patterns of variation in the human genome.

Authors:  Scott H Williamson; Ryan Hernandez; Adi Fledel-Alon; Lan Zhu; Rasmus Nielsen; Carlos D Bustamante
Journal:  Proc Natl Acad Sci U S A       Date:  2005-05-19       Impact factor: 11.205

9.  Natural selection on protein-coding genes in the human genome.

Authors:  Carlos D Bustamante; Adi Fledel-Alon; Scott Williamson; Rasmus Nielsen; Melissa Todd Hubisz; Stephen Glanowski; David M Tanenbaum; Thomas J White; John J Sninsky; Ryan D Hernandez; Daniel Civello; Mark D Adams; Michele Cargill; Andrew G Clark
Journal:  Nature       Date:  2005-10-20       Impact factor: 49.962

10.  The pattern of polymorphism in Arabidopsis thaliana.

Authors:  Magnus Nordborg; Tina T Hu; Yoko Ishino; Jinal Jhaveri; Christopher Toomajian; Honggang Zheng; Erica Bakker; Peter Calabrese; Jean Gladstone; Rana Goyal; Mattias Jakobsson; Sung Kim; Yuri Morozov; Badri Padhukasahasram; Vincent Plagnol; Noah A Rosenberg; Chitiksha Shah; Jeffrey D Wall; Jue Wang; Keyan Zhao; Theodore Kalbfleisch; Vincent Schulz; Martin Kreitman; Joy Bergelson
Journal:  PLoS Biol       Date:  2005-05-24       Impact factor: 8.029

View more
  210 in total

1.  Human population dispersal "Out of Africa" estimated from linkage disequilibrium and allele frequencies of SNPs.

Authors:  Brian P McEvoy; Joseph E Powell; Michael E Goddard; Peter M Visscher
Journal:  Genome Res       Date:  2011-04-25       Impact factor: 9.043

2.  2b-RAD: a simple and flexible method for genome-wide genotyping.

Authors:  Shi Wang; Eli Meyer; John K McKay; Mikhail V Matz
Journal:  Nat Methods       Date:  2012-05-20       Impact factor: 28.547

3.  The genetic basis of evolutionary change in gene expression levels.

Authors:  J J Emerson; Wen-Hsiung Li
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2010-08-27       Impact factor: 6.237

4.  Inferring genetic ancestry: opportunities, challenges, and implications.

Authors:  Charmaine D Royal; John Novembre; Stephanie M Fullerton; David B Goldstein; Jeffrey C Long; Michael J Bamshad; Andrew G Clark
Journal:  Am J Hum Genet       Date:  2010-05-14       Impact factor: 11.025

5.  Ascertainment biases in SNP chips affect measures of population divergence.

Authors:  Anders Albrechtsen; Finn Cilius Nielsen; Rasmus Nielsen
Journal:  Mol Biol Evol       Date:  2010-06-17       Impact factor: 16.240

6.  Whole-genome sequencing and comprehensive variant analysis of a Japanese individual using massively parallel sequencing.

Authors:  Akihiro Fujimoto; Hidewaki Nakagawa; Naoya Hosono; Kaoru Nakano; Tetsuo Abe; Keith A Boroevich; Masao Nagasaki; Rui Yamaguchi; Tetsuo Shibuya; Michiaki Kubo; Satoru Miyano; Yusuke Nakamura; Tatsuhiko Tsunoda
Journal:  Nat Genet       Date:  2010-10-24       Impact factor: 38.330

Review 7.  Population genetic studies in the genomic sequencing era.

Authors:  Hua Chen
Journal:  Dongwuxue Yanjiu       Date:  2015-07-18

8.  The Mouse Universal Genotyping Array: From Substrains to Subspecies.

Authors:  Andrew P Morgan; Chen-Ping Fu; Chia-Yu Kao; Catherine E Welsh; John P Didion; Liran Yadgary; Leeanna Hyacinth; Martin T Ferris; Timothy A Bell; Darla R Miller; Paola Giusti-Rodriguez; Randal J Nonneman; Kevin D Cook; Jason K Whitmire; Lisa E Gralinski; Mark Keller; Alan D Attie; Gary A Churchill; Petko Petkov; Patrick F Sullivan; Jennifer R Brennan; Leonard McMillan; Fernando Pardo-Manuel de Villena
Journal:  G3 (Bethesda)       Date:  2015-12-18       Impact factor: 3.154

9.  Tests for stochastic ordering under biased sampling.

Authors:  Hsin-Wen Chang; Hammou El Barmi; Ian W McKeague
Journal:  J Nonparametr Stat       Date:  2016-10-05       Impact factor: 1.231

10.  High-throughput sequencing reveals inbreeding depression in a natural population.

Authors:  Joseph I Hoffman; Fraser Simpson; Patrice David; Jolianne M Rijks; Thijs Kuiken; Michael A S Thorne; Robert C Lacy; Kanchon K Dasmahapatra
Journal:  Proc Natl Acad Sci U S A       Date:  2014-02-28       Impact factor: 11.205

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.