
SNP ascertainment bias in population genetic analyses: why it is important, and how to correct it.

Joseph Lachance, Sarah A Tishkoff.

Abstract

Whole genome sequencing and SNP genotyping arrays can paint strikingly different pictures of demographic history and natural selection. This is because genotyping arrays contain biased sets of pre-ascertained SNPs. In this short review, we use comparisons between high-coverage whole genome sequences of African hunter-gatherers and data from genotyping arrays to highlight how SNP ascertainment bias distorts population genetic inferences. Sample sizes and the populations in which SNPs are discovered affect the characteristics of observed variants. We find that SNPs on genotyping arrays tend to be older and present in multiple populations. In addition, genotyping arrays cause allele frequency distributions to be shifted towards intermediate frequency alleles, and estimates of linkage disequilibrium are modified. Since population genetic analyses depend on allele frequencies, it is imperative that researchers are aware of the effects of SNP ascertainment bias. With this in mind, we describe multiple ways to correct for SNP ascertainment bias.
© 2013 WILEY Periodicals, Inc.
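The abstract's claim that ascertainment shifts allele frequency distributions towards intermediate frequencies can be illustrated with a small calculation. The sketch below is not the authors' method. It assumes a standard neutral site frequency spectrum (proportional to 1/i) and a deliberately simple discovery scheme in which a SNP reaches the array only if it is polymorphic in a random subset of d of the n sampled chromosomes. All function names are hypothetical. The last function shows one generic style of correction, reweighting each frequency class by the inverse of its discovery probability, which only works when the discovery scheme is known.

```python
from math import comb


def neutral_sfs(n):
    """Expected neutral SFS for n chromosomes (assumption: standard neutral model).

    Entry i-1 is the proportion of SNPs with i derived copies, i = 1..n-1.
    """
    weights = [1.0 / i for i in range(1, n)]
    total = sum(weights)
    return [w / total for w in weights]


def prob_discovered(i, n, d):
    """Probability that a SNP with i derived copies among n chromosomes is
    polymorphic in a random discovery panel of d chromosomes (2 <= d <= n),
    drawn without replacement from the same n chromosomes."""
    all_ancestral = comb(n - i, d) / comb(n, d)
    all_derived = comb(i, d) / comb(n, d)
    return 1.0 - all_ancestral - all_derived


def ascertained_sfs(n, d):
    """SFS observed after keeping only SNPs found in the discovery panel."""
    true_sfs = neutral_sfs(n)
    kept = [true_sfs[i - 1] * prob_discovered(i, n, d) for i in range(1, n)]
    total = sum(kept)
    return [k / total for k in kept]


def corrected_sfs(observed, n, d):
    """Undo the bias by dividing each class by its discovery probability.

    Assumes the discovery panel size d is known, which is often not the
    case for commercial arrays.
    """
    rescaled = [observed[i - 1] / prob_discovered(i, n, d) for i in range(1, n)]
    total = sum(rescaled)
    return [r / total for r in rescaled]


if __name__ == "__main__":
    n, d = 20, 4  # illustrative sample and discovery panel sizes
    true_sfs = neutral_sfs(n)
    biased = ascertained_sfs(n, d)
    fixed = corrected_sfs(biased, n, d)
    for i in (1, 5, 10):
        print(i, round(true_sfs[i - 1], 4), round(biased[i - 1], 4),
              round(fixed[i - 1], 4))
```

With a small discovery panel, rare variants are unlikely to be polymorphic in the panel, so singletons are underrepresented and mid-frequency classes are overrepresented in the ascertained spectrum. Real arrays usually involve discovery in different populations and undocumented selection criteria, so this reweighting should be read as a toy model of the idea rather than a ready-made correction.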

Keywords:  African hunter-gatherers; SNP ascertainment bias; human genetics; population genetics; whole genome sequencing

Year: 2013    PMID: 23836388    PMCID: PMC3849385    DOI: 10.1002/bies.201300014

Source DB: PubMed    Journal: Bioessays    ISSN: 0265-9247    Impact factor: 4.345


