Literature DB >> 28334108

Fast admixture analysis and population tree estimation for SNP and NGS data.

Jade Yu Cheng1,2,3, Thomas Mailund1, Rasmus Nielsen2,3.   

Abstract

MOTIVATION: Structure methods are highly used population genetic methods for classifying individuals in a sample fractionally into discrete ancestry components. CONTRIBUTION: We introduce a new optimization algorithm for the classical STRUCTURE model in a maximum likelihood framework. Using analyses of real data we show that the new method finds solutions with higher likelihoods than the state-of-the-art method in the same computational time. The optimization algorithm is also applicable to models based on genotype likelihoods, that can account for the uncertainty in genotype-calling associated with Next Generation Sequencing (NGS) data. We also present a new method for estimating population trees from ancestry components using a Gaussian approximation. Using coalescence simulations of diverging populations, we explore the adequacy of the STRUCTURE-style models and the Gaussian assumption for identifying ancestry components correctly and for inferring the correct tree. In most cases, ancestry components are inferred correctly, although sample sizes and times since admixture can influence the results. We show that the popular Gaussian approximation tends to perform poorly under extreme divergence scenarios e.g. with very long branch lengths, but the topologies of the population trees are accurately inferred in all scenarios explored. The new methods are implemented together with appropriate visualization tools in the software package Ohana.
AVAILABILITY AND IMPLEMENTATION: Ohana is publicly available at https://github.com/jade-cheng/ohana . In addition to source code and installation instructions, we also provide example work-flows in the project wiki site. CONTACT: jade.cheng@birc.au.dk. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author (2017). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com

Entities:  

Mesh:

Year:  2017        PMID: 28334108      PMCID: PMC6543773          DOI: 10.1093/bioinformatics/btx098

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  29 in total

1.  Inference of population structure using multilocus genotype data.

Authors:  J K Pritchard; M Stephens; P Donnelly
Journal:  Genetics       Date:  2000-06       Impact factor: 4.562

2.  ANALYSIS OF HUMAN EVOLUTION UNDER RANDOM GENETIC DRIFT.

Authors:  L L CAVALLI-SFORZA; I BARRAI; A W EDWARDS
Journal:  Cold Spring Harb Symp Quant Biol       Date:  1964

3.  A haplotype map of the human genome.

Authors: 
Journal:  Nature       Date:  2005-10-27       Impact factor: 49.962

4.  Estimation of individual admixture: analytical and study design considerations.

Authors:  Hua Tang; Jie Peng; Pei Wang; Neil J Risch
Journal:  Genet Epidemiol       Date:  2005-05       Impact factor: 2.135

5.  Approximating the coalescent with recombination.

Authors:  Gilean A T McVean; Niall J Cardin
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2005-07-29       Impact factor: 6.237

Review 6.  Modern computational approaches for analysing molecular genetic variation data.

Authors:  Paul Marjoram; Simon Tavaré
Journal:  Nat Rev Genet       Date:  2006-10       Impact factor: 53.242

7.  A fast and flexible statistical model for large-scale population genotype data: applications to inferring missing genotypes and haplotypic phase.

Authors:  Paul Scheet; Matthew Stephens
Journal:  Am J Hum Genet       Date:  2006-02-17       Impact factor: 11.025

8.  A Markov chain Monte Carlo approach for joint inference of population structure and inbreeding rates from multilocus genotype data.

Authors:  Hong Gao; Scott Williamson; Carlos D Bustamante
Journal:  Genetics       Date:  2007-05-04       Impact factor: 4.562

9.  PLINK: a tool set for whole-genome association and population-based linkage analyses.

Authors:  Shaun Purcell; Benjamin Neale; Kathe Todd-Brown; Lori Thomas; Manuel A R Ferreira; David Bender; Julian Maller; Pamela Sklar; Paul I W de Bakker; Mark J Daly; Pak C Sham
Journal:  Am J Hum Genet       Date:  2007-07-25       Impact factor: 11.025

10.  Rapid and accurate haplotype phasing and missing-data inference for whole-genome association studies by use of localized haplotype clustering.

Authors:  Sharon R Browning; Brian L Browning
Journal:  Am J Hum Genet       Date:  2007-09-21       Impact factor: 11.025

View more
  13 in total

1.  Inference of Population Structure from Time-Series Genotype Data.

Authors:  Tyler A Joseph; Itsik Pe'er
Journal:  Am J Hum Genet       Date:  2019-06-27       Impact factor: 11.025

2.  Natural Selection on Genes Related to Cardiovascular Health in High-Altitude Adapted Andeans.

Authors:  Jacob E Crawford; Ricardo Amaru; Jihyun Song; Colleen G Julian; Fernando Racimo; Jade Yu Cheng; Xiuqing Guo; Jie Yao; Bharath Ambale-Venkatesh; João A Lima; Jerome I Rotter; Josef Stehlik; Lorna G Moore; Josef T Prchal; Rasmus Nielsen
Journal:  Am J Hum Genet       Date:  2017-11-02       Impact factor: 11.025

Review 3.  Population Stratification in Genetic Association Studies.

Authors:  Jacklyn N Hellwege; Jacob M Keaton; Ayush Giri; Xiaoyi Gao; Digna R Velez Edwards; Todd L Edwards
Journal:  Curr Protoc Hum Genet       Date:  2017-10-18

4.  Portability of 245 polygenic scores when derived from the UK Biobank and applied to 9 ancestry groups from the same cohort.

Authors:  Florian Privé; Hugues Aschard; Shai Carmi; Lasse Folkersen; Clive Hoggart; Paul F O'Reilly; Bjarni J Vilhjálmsson
Journal:  Am J Hum Genet       Date:  2022-01-06       Impact factor: 11.043

5.  Parallel adaptation of rabbit populations to myxoma virus.

Authors:  Joel M Alves; Miguel Carneiro; Jade Y Cheng; Ana Lemos de Matos; Masmudur M Rahman; Liisa Loog; Paula F Campos; Nathan Wales; Anders Eriksson; Andrea Manica; Tanja Strive; Stephen C Graham; Sandra Afonso; Diana J Bell; Laura Belmont; Jonathan P Day; Susan J Fuller; Stéphane Marchandeau; William J Palmer; Guillaume Queney; Alison K Surridge; Filipe G Vieira; Grant McFadden; Rasmus Nielsen; M Thomas P Gilbert; Pedro J Esteves; Nuno Ferrand; Francis M Jiggins
Journal:  Science       Date:  2019-02-14       Impact factor: 47.728

6.  Local adaptation and archaic introgression shape global diversity at human structural variant loci.

Authors:  Stephanie M Yan; Rachel M Sherman; Dylan J Taylor; Divya R Nair; Andrew N Bortvin; Michael C Schatz; Rajiv C McCoy
Journal:  Elife       Date:  2021-09-16       Impact factor: 8.140

7.  A population genetic assessment of taxonomic species: The case of Lake Malawi cichlid fishes.

Authors:  Catarina Pinho; Vera Cardoso; Jody Hey
Journal:  Mol Ecol Resour       Date:  2019-06-06       Impact factor: 7.090

8.  Postadmixture Selection on Chileans Targets Haplotype Involved in Pigmentation, Thermogenesis and Immune Defense against Pathogens.

Authors:  Lucas Vicuña; Olga Klimenkova; Tomás Norambuena; Felipe I Martinez; Mario I Fernandez; Vladimir Shchur; Susana Eyheramendy
Journal:  Genome Biol Evol       Date:  2020-08-01       Impact factor: 3.416

9.  Putting RFMix and ADMIXTURE to the test in a complex admixed population.

Authors:  Caitlin Uren; Eileen G Hoal; Marlo Möller
Journal:  BMC Genet       Date:  2020-04-07       Impact factor: 2.797

10.  The spatiotemporal spread of human migrations during the European Holocene.

Authors:  Fernando Racimo; Jessie Woodbridge; Ralph M Fyfe; Martin Sikora; Karl-Göran Sjögren; Kristian Kristiansen; Marc Vander Linden
Journal:  Proc Natl Acad Sci U S A       Date:  2020-04-01       Impact factor: 11.205

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.