
Inferring Population Structure and Admixture Proportions in Low-Depth NGS Data.

Jonas Meisner, Anders Albrechtsen.

Abstract

We present two methods for inferring population structure and admixture proportions in low-depth next-generation sequencing (NGS) data. Inference of population structure is essential in both population genetics and association studies, and is often performed using principal component analysis (PCA) or clustering-based approaches. NGS methods provide large amounts of genetic data, but the data carry statistical uncertainty, especially at low sequencing depth. Models can account for this uncertainty by working directly on the genotype likelihoods of the unobserved genotypes. We propose a method for inferring population structure through PCA that uses an iterative heuristic to estimate individual allele frequencies, and we demonstrate improved accuracy in samples with low and variable sequencing depth for both simulated and real datasets. We also use the estimated individual allele frequencies in a fast non-negative matrix factorization method to estimate admixture proportions. Both methods have been implemented in the PCAngsd framework available at http://www.popgen.dk/software/.
Copyright © 2018 by the Genetics Society of America.
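
The iterative PCA idea described in the abstract can be illustrated with a minimal sketch. This is not the PCAngsd implementation: the function names, the fixed iteration count, the clipping threshold, and the uncorrected covariance estimate are all assumptions made for illustration, and the population allele frequencies are taken as given rather than estimated from the data. The sketch assumes genotype likelihoods are supplied as an array of shape (individuals, sites, 3) and uses a Hardy-Weinberg prior to obtain posterior expected genotypes.

```python
import numpy as np

def posterior_dosage(gl, freq):
    """Posterior expected genotype (0..2) for each individual and site.

    gl   : (n, m, 3) genotype likelihoods P(data | G = 0, 1, 2)
    freq : (m,) or (n, m) allele frequencies used in a Hardy-Weinberg prior
    """
    f = np.broadcast_to(freq, gl.shape[:2])
    prior = np.stack([(1 - f) ** 2, 2 * f * (1 - f), f ** 2], axis=-1)
    post = gl * prior
    post /= post.sum(axis=-1, keepdims=True)
    return post[..., 1] + 2 * post[..., 2]

def iterative_pca(gl, pop_freq, n_pcs, n_iter=20, eps=1e-4):
    """Hypothetical helper: alternate between a low-rank reconstruction of
    individual allele frequencies and updated posterior genotypes."""
    m = gl.shape[1]
    # Start from posteriors based on population allele frequencies.
    dosage = posterior_dosage(gl, pop_freq)
    for _ in range(n_iter):
        centered = dosage - 2 * pop_freq
        u, s, vt = np.linalg.svd(centered, full_matrices=False)
        # Rank-n_pcs reconstruction of the centered expected genotypes.
        low_rank = (u[:, :n_pcs] * s[:n_pcs]) @ vt[:n_pcs]
        # Individual allele frequencies, kept strictly inside (0, 1).
        ind_freq = np.clip((low_rank + 2 * pop_freq) / 2, eps, 1 - eps)
        # Re-estimate posteriors with individual-specific priors.
        dosage = posterior_dosage(gl, ind_freq)
    # Simple standardized covariance (no diagonal correction applied here).
    z = (dosage - 2 * pop_freq) / np.sqrt(2 * pop_freq * (1 - pop_freq))
    cov = z @ z.T / m
    return cov, ind_freq
```

Principal components would then be the leading eigenvectors of `cov`, for example via `np.linalg.eigh(cov)`. The returned `ind_freq` matrix is the kind of input the abstract describes for the downstream matrix factorization step, which is not sketched here.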

Keywords:  PCA; Population structure; admixture; ancestry; genotype likelihoods; low depth; next-generation sequencing


Year:  2018        PMID: 30131346      PMCID: PMC6216594          DOI: 10.1534/genetics.118.301336

Source DB:  PubMed          Journal:  Genetics        ISSN: 0016-6731            Impact factor:   4.562


Cited by: 83 in total

1.  Population genetics of wild Macaca fascicularis with low-coverage shotgun sequencing of museum specimens.

Authors:  Lu Yao; Kelsey Witt; Hongjie Li; Jonathan Rice; Nelson R Salinas; Robert D Martin; Emilia Huerta-Sánchez; Ripan S Malhi
Journal:  Am J Phys Anthropol       Date:  2020-07-08       Impact factor: 2.868

2.  Gene exchange between two divergent species of the fungal human pathogen, Coccidioides.

Authors:  Colin S Maxwell; Kathleen Mattox; David A Turissini; Marcus M Teixeira; Bridget M Barker; Daniel R Matute
Journal:  Evolution       Date:  2018-12-04       Impact factor: 3.694

3.  Unlocking the potential of a validated single nucleotide polymorphism array for genomic monitoring of trade in cheetahs (Acinonyx jubatus).

Authors:  Michelle Magliolo; Stefan Prost; Pablo Orozco-terWengel; Pamela Burger; Anna S Kropff; Antoinette Kotze; J Paul Grobler; Desire Lee Dalton
Journal:  Mol Biol Rep       Date:  2020-12-04       Impact factor: 2.316

4.  Hybridization boosts dispersal of two contrasted ecotypes in a grass species.

Authors:  Emma V Curran; Matilda S Scott; Jill K Olofsson; Florence Nyirenda; Graciela Sotelo; Matheus E Bianconi; Sophie Manzi; Guillaume Besnard; Lara Pereira; Pascal-Antoine Christin
Journal:  Proc Biol Sci       Date:  2022-01-26       Impact factor: 5.349

5.  Demographic decline and lineage-specific adaptations characterize New Zealand kiwi.

Authors:  Jordan B Bemmels; Else K Mikkelsen; Oliver Haddrath; Rogan M Colbourne; Hugh A Robertson; Jason T Weir
Journal:  Proc Biol Sci       Date:  2021-12-15       Impact factor: 5.349

6.  Divergence, gene flow, and the origin of leapfrog geographic distributions: The history of colour pattern variation in Phyllobates poison-dart frogs.

Authors:  Roberto Márquez; Tyler P Linderoth; Daniel Mejía-Vargas; Rasmus Nielsen; Adolfo Amézquita; Marcus R Kronforst
Journal:  Mol Ecol       Date:  2020-09-07       Impact factor: 6.185

7.  Reef environments shape microbial partners in a highly connected coral population.

Authors:  N G Kriefall; M R Kanke; G V Aglyamova; S W Davies
Journal:  Proc Biol Sci       Date:  2022-01-19       Impact factor: 5.349

8.  Biases in Demographic Modeling Affect Our Understanding of Recent Divergence.

Authors:  Paolo Momigliano; Ann-Britt Florin; Juha Merilä
Journal:  Mol Biol Evol       Date:  2021-06-25       Impact factor: 16.240

9.  Genetic load has potential in large populations but is realized in small inbred populations.

Authors:  Samarth Mathur; J Andrew DeWoody
Journal:  Evol Appl       Date:  2021-04-10       Impact factor: 5.183

10.  Herded and hunted goat genomes from the dawn of domestication in the Zagros Mountains.

Authors:  Kevin G Daly; Valeria Mattiangeli; Andrew J Hare; Hossein Davoudi; Homa Fathi; Sanaz Beizaee Doost; Sarieh Amiri; Roya Khazaeli; Delphine Decruyenaere; Jebrael Nokandeh; Tobias Richter; Hojjat Darabi; Peder Mortensen; Alexis Pantos; Lisa Yeomans; Pernille Bangsgaard; Marjan Mashkour; Melinda A Zeder; Daniel G Bradley
Journal:  Proc Natl Acad Sci U S A       Date:  2021-06-22       Impact factor: 11.205

