Literature DB >> 21868606

Combining markers into haplotypes can improve population structure inference.

Lucie M Gattepaille1, Mattias Jakobsson.   

Abstract

High-throughput genotyping and sequencing technologies can generate dense sets of genetic markers for large numbers of individuals. For most species, these data will contain many markers in linkage disequilibrium (LD). To utilize such data for population structure inference, we investigate the use of haplotypes constructed by combining the alleles at single-nucleotide polymorphisms (SNPs). We introduce a statistic derived from information theory, the gain of informativeness for assignment (GIA), which quantifies the additional information for assigning individuals to populations using haplotype data compared to using individual loci separately. Using a two-loci-two-allele model, we demonstrate that combining markers in linkage equilibrium into haplotypes always leads to nonpositive GIA, suggesting that combining the two markers is not advantageous for ancestry inference. However, for loci in LD, GIA is often positive, suggesting that assignment can be improved by combining markers into haplotypes. Using GIA as a criterion for combining markers into haplotypes, we demonstrate for simulated data a significant improvement of assigning individuals to candidate populations. For the many cases that we investigate, incorrect assignment was reduced between 26% and 97% using haplotype data. For empirical data from French and German individuals, the incorrectly assigned individuals can, for example, be decreased by 73% using haplotypes. Our results can be useful for challenging population structure and assignment problems, in particular for studies where large-scale population-genomic data are available.

Mesh:

Substances:

Year:  2011        PMID: 21868606      PMCID: PMC3249356          DOI: 10.1534/genetics.111.131136

Source DB:  PubMed          Journal:  Genetics        ISSN: 0016-6731            Impact factor:   4.562


  45 in total

1.  Inference of population structure using multilocus genotype data.

Authors:  J K Pritchard; M Stephens; P Donnelly
Journal:  Genetics       Date:  2000-06       Impact factor: 4.562

2.  Bayesian analysis of genetic differentiation between populations.

Authors:  Jukka Corander; Patrik Waldmann; Mikko J Sillanpää
Journal:  Genetics       Date:  2003-01       Impact factor: 4.562

3.  Inference of population structure using multilocus genotype data: linked loci and correlated allele frequencies.

Authors:  Daniel Falush; Matthew Stephens; Jonathan K Pritchard
Journal:  Genetics       Date:  2003-08       Impact factor: 4.562

4.  Genetic structure of human populations.

Authors:  Noah A Rosenberg; Jonathan K Pritchard; James L Weber; Howard M Cann; Kenneth K Kidd; Lev A Zhivotovsky; Marcus W Feldman
Journal:  Science       Date:  2002-12-20       Impact factor: 47.728

5.  Bayesian clustering using hidden Markov random fields in spatial population genetics.

Authors:  Olivier François; Sophie Ancelet; Gilles Guillot
Journal:  Genetics       Date:  2006-08-03       Impact factor: 4.562

6.  Impact of landscape management on the genetic structure of red squirrel populations.

Authors:  M L Hale; P W Lurz; M D Shirley; S Rushton; R M Fuller; K Wolff
Journal:  Science       Date:  2001-09-21       Impact factor: 47.728

Review 7.  The application of molecular genetic approaches to the study of human evolution.

Authors:  L Luca Cavalli-Sforza; Marcus W Feldman
Journal:  Nat Genet       Date:  2003-03       Impact factor: 38.330

8.  Assigning African elephant DNA to geographic region of origin: applications to the ivory trade.

Authors:  Samuel K Wasser; Andrew M Shedlock; Kenine Comstock; Elaine A Ostrander; Benezeth Mutayoba; Matthew Stephens
Journal:  Proc Natl Acad Sci U S A       Date:  2004-09-30       Impact factor: 11.205

9.  The genetic structure and history of Africans and African Americans.

Authors:  Sarah A Tishkoff; Floyd A Reed; Françoise R Friedlaender; Christopher Ehret; Alessia Ranciaro; Alain Froment; Jibril B Hirbo; Agnes A Awomoyi; Jean-Marie Bodo; Ogobara Doumbo; Muntaser Ibrahim; Abdalla T Juma; Maritha J Kotze; Godfrey Lema; Jason H Moore; Holly Mortensen; Thomas B Nyambo; Sabah A Omar; Kweli Powell; Gideon S Pretorius; Michael W Smith; Mahamadou A Thera; Charles Wambebe; James L Weber; Scott M Williams
Journal:  Science       Date:  2009-04-30       Impact factor: 47.728

10.  Sex-specific genetic structure and social organization in Central Asia: insights from a multi-locus study.

Authors:  Laure Ségurel; Begoña Martínez-Cruz; Lluis Quintana-Murci; Patricia Balaresque; Myriam Georges; Tatiana Hegay; Almaz Aldashev; Firuza Nasyrova; Mark A Jobling; Evelyne Heyer; Renaud Vitalis
Journal:  PLoS Genet       Date:  2008-09-26       Impact factor: 5.917

View more
  15 in total

1.  The Relationship Between Haplotype-Based F ST and Haplotype Length.

Authors:  Rohan S Mehta; Alison F Feder; Simina M Boca; Noah A Rosenberg
Journal:  Genetics       Date:  2019-07-08       Impact factor: 4.562

2.  Inferring biogeographic ancestry with compound markers of slow and fast evolving polymorphisms.

Authors:  Amandine Moriot; Carla Santos; Ana Freire-Aradas; Christopher Phillips; Diana Hall
Journal:  Eur J Hum Genet       Date:  2018-07-11       Impact factor: 4.246

Review 3.  Recent advances in the study of fine-scale population structure in humans.

Authors:  John Novembre; Benjamin M Peter
Journal:  Curr Opin Genet Dev       Date:  2016-09-20       Impact factor: 5.578

4.  Haplotype structure in commercial maize breeding programs in relation to key founder lines.

Authors:  Stephanie M Coffman; Matthew B Hufford; Carson M Andorf; Thomas Lübberstedt
Journal:  Theor Appl Genet       Date:  2019-11-20       Impact factor: 5.699

5.  Inference of population structure using dense haplotype data.

Authors:  Daniel John Lawson; Garrett Hellenthal; Simon Myers; Daniel Falush
Journal:  PLoS Genet       Date:  2012-01-26       Impact factor: 5.917

6.  Two genomic regions contribute disproportionately to geographic differentiation in wild barley.

Authors:  Zhou Fang; Ana M Gonzales; Michael T Clegg; Kevin P Smith; Gary J Muehlbauer; Brian J Steffenson; Peter L Morrell
Journal:  G3 (Bethesda)       Date:  2014-04-22       Impact factor: 3.154

7.  HaploPOP: a software that improves population assignment by combining markers into haplotypes.

Authors:  Nicolas Duforet-Frebourg; Lucie M Gattepaille; Michael G B Blum; Mattias Jakobsson
Journal:  BMC Bioinformatics       Date:  2015-07-31       Impact factor: 3.169

8.  The fine-scale genetic structure and evolution of the Japanese population.

Authors:  Fumihiko Takeuchi; Tomohiro Katsuya; Ryosuke Kimura; Toru Nabika; Minoru Isomura; Takayoshi Ohkubo; Yasuharu Tabara; Ken Yamamoto; Mitsuhiro Yokota; Xuanyao Liu; Woei-Yuh Saw; Dolikun Mamatyusupu; Wenjun Yang; Shuhua Xu; Yik-Ying Teo; Norihiro Kato
Journal:  PLoS One       Date:  2017-11-01       Impact factor: 3.240

9.  Genome-wide diversity in the levant reveals recent structuring by culture.

Authors:  Marc Haber; Dominique Gauguier; Sonia Youhanna; Nick Patterson; Priya Moorjani; Laura R Botigué; Daniel E Platt; Elizabeth Matisoo-Smith; David F Soria-Hernanz; R Spencer Wells; Jaume Bertranpetit; Chris Tyler-Smith; David Comas; Pierre A Zalloua
Journal:  PLoS Genet       Date:  2013-02-28       Impact factor: 5.917

10.  Nonstationary patterns of isolation-by-distance: inferring measures of local genetic differentiation with Bayesian kriging.

Authors:  Nicolas Duforet-Frebourg; Michael G B Blum
Journal:  Evolution       Date:  2014-01-26       Impact factor: 3.694

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.