Algorithms for selecting informative marker panels for population assignment.

Noah A Rosenberg

Abstract

Given a set of potential source populations, genotypes of an individual of unknown origin at a collection of markers can be used to predict the correct source population of the individual. For improved efficiency, informative markers can be chosen from a larger set of markers to maximize the accuracy of this prediction. However, selecting the loci that are individually most informative does not necessarily produce the optimal panel. Here, using genotypes from eight species (carp, cat, chicken, dog, fly, grayling, human, and maize), this univariate accumulation procedure is compared to new multivariate "greedy" and "maximin" algorithms for choosing marker panels. The procedures generally suggest similar panels, although the greedy method often recommends inclusion of loci that are not chosen by the other algorithms. In seven of the eight species, when applied to five or more markers, all methods achieve at least 94% assignment accuracy on simulated individuals, with one species (dog) producing this level of accuracy with only three markers, and the eighth species (human) requiring approximately 13-16 markers. The new algorithms produce substantial improvements over use of randomly selected markers; where differences among the methods are noticeable, the greedy algorithm leads to slightly higher probabilities of correct assignment. Although none of the approaches necessarily chooses the panel with optimal performance, the algorithms all likely select panels with performance near enough to the maximum that they all are suitable for practical use.
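
To illustrate why ranking loci one at a time can miss a better panel, the following is a minimal Python sketch, not the paper's implementation. It assumes haploid genotypes, independent loci, known allele frequencies, and equal prior weight on each source population, and it scores a panel by the expected accuracy of maximum-likelihood assignment. The function names, the scoring rule, and the toy frequencies are all hypothetical choices made for this example; the "maximin" algorithm is not sketched here.

```python
from itertools import product

def panel_accuracy(freqs, panel):
    """Expected probability of correct assignment for a panel of loci.

    ASSUMPTIONS (illustrative, not taken from the paper): haploid data,
    independent loci, known frequencies, uniform priors over populations.
    freqs[locus][pop] is a list of allele frequencies at that locus.
    """
    n_pops = len(freqs[0])
    total = 0.0
    # Enumerate every multilocus genotype over the chosen loci.
    for alleles in product(*(range(len(freqs[l][0])) for l in panel)):
        best = 0.0
        for k in range(n_pops):
            p = 1.0
            for l, a in zip(panel, alleles):
                p *= freqs[l][k][a]
            best = max(best, p)  # the most likely population wins the genotype
        total += best
    return total / n_pops

def univariate_panel(freqs, size):
    """Rank loci by their single-locus score and keep the top `size`."""
    ranked = sorted(range(len(freqs)),
                    key=lambda l: panel_accuracy(freqs, [l]),
                    reverse=True)
    return ranked[:size]

def greedy_panel(freqs, size):
    """Forward selection: repeatedly add the locus that most improves the panel."""
    panel, remaining = [], set(range(len(freqs)))
    while len(panel) < size and remaining:
        best = max(sorted(remaining),
                   key=lambda l: panel_accuracy(freqs, panel + [l]))
        panel.append(best)
        remaining.remove(best)
    return panel

# Hypothetical toy data: 3 populations, 3 biallelic loci.
# Loci 0 and 1 are redundant (both separate population 0 from the rest);
# locus 2 is weaker alone but separates populations 1 and 2.
toy_freqs = [
    [[1.0, 0.0], [0.0, 1.0], [0.0, 1.0]],
    [[1.0, 0.0], [0.0, 1.0], [0.0, 1.0]],
    [[0.5, 0.5], [0.9, 0.1], [0.1, 0.9]],
]

uni = univariate_panel(toy_freqs, 2)
gre = greedy_panel(toy_freqs, 2)
print(uni, panel_accuracy(toy_freqs, uni))
print(gre, panel_accuracy(toy_freqs, gre))
```

On this toy input the univariate ranking keeps the two redundant loci, while the forward selection replaces one of them with the complementary locus and scores higher. The genotype enumeration grows exponentially with panel size, so the sketch is only practical for small panels.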

Year: 2005    PMID: 16305328    DOI: 10.1089/cmb.2005.12.1183

Source DB: PubMed    Journal: J Comput Biol    ISSN: 1066-5277    Impact factor: 1.479

