Literature DB >> 31076776

Fast hierarchical Bayesian analysis of population structure.

Gerry Tonkin-Hill1, John A Lees2, Stephen D Bentley1, Simon D W Frost3,4, Jukka Corander1,5,6.   

Abstract

We present fastbaps, a fast solution to the genetic clustering problem. Fastbaps rapidly identifies an approximate fit to a Dirichlet process mixture model (DPM) for clustering multilocus genotype data. Our efficient model-based clustering approach is able to cluster datasets 10-100 times larger than the existing model-based methods, which we demonstrate by analyzing an alignment of over 110 000 sequences of HIV-1 pol genes. We also provide a method for rapidly partitioning an existing hierarchy in order to maximize the DPM model marginal likelihood, allowing us to split phylogenetic trees into clades and subclades using a population genomic model. Extensive tests on simulated data as well as a diverse set of real bacterial and viral datasets show that fastbaps provides comparable or improved solutions to previous model-based methods, while being significantly faster. The method is made freely available under an open source MIT licence as an easy to use R package at https://github.com/gtonkinhill/fastbaps.
© The Author(s) 2019. Published by Oxford University Press on behalf of Nucleic Acids Research.

Entities:  

Mesh:

Substances:

Year:  2019        PMID: 31076776      PMCID: PMC6582336          DOI: 10.1093/nar/gkz361

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


  32 in total

1.  A model-based method for identifying species hybrids using multilocus genetic data.

Authors:  E C Anderson; E A Thompson
Journal:  Genetics       Date:  2002-03       Impact factor: 4.562

2.  Bayesian analysis of genetic differentiation between populations.

Authors:  Jukka Corander; Patrik Waldmann; Mikko J Sillanpää
Journal:  Genetics       Date:  2003-01       Impact factor: 4.562

3.  Not so different after all: a comparison of methods for detecting amino acid sites under selection.

Authors:  Sergei L Kosakovsky Pond; Simon D W Frost
Journal:  Mol Biol Evol       Date:  2005-02-09       Impact factor: 16.240

4.  Detecting the number of clusters of individuals using the software STRUCTURE: a simulation study.

Authors:  G Evanno; S Regnaut; J Goudet
Journal:  Mol Ecol       Date:  2005-07       Impact factor: 6.185

5.  Refinement of whole-genome multilocus sequence typing analysis by addressing gene paralogy.

Authors:  Ji Zhang; Jani Halkilahti; Marja-Liisa Hänninen; Mirko Rossi
Journal:  J Clin Microbiol       Date:  2015-03-18       Impact factor: 5.948

6.  R/BHC: fast Bayesian hierarchical clustering for microarray data.

Authors:  Richard S Savage; Katherine Heller; Yang Xu; Zoubin Ghahramani; William M Truman; Murray Grant; Katherine J Denby; David L Wild
Journal:  BMC Bioinformatics       Date:  2009-08-06       Impact factor: 3.169

7.  Virus genomes reveal factors that spread and sustained the Ebola epidemic.

Authors:  Gytis Dudas; Luiz Max Carvalho; Trevor Bedford; Andrew J Tatem; Guy Baele; Nuno R Faria; Daniel J Park; Jason T Ladner; Armando Arias; Danny Asogun; Filip Bielejec; Sarah L Caddy; Matthew Cotten; Jonathan D'Ambrozio; Simon Dellicour; Antonino Di Caro; Joseph W Diclaro; Sophie Duraffour; Michael J Elmore; Lawrence S Fakoli; Ousmane Faye; Merle L Gilbert; Sahr M Gevao; Stephen Gire; Adrianne Gladden-Young; Andreas Gnirke; Augustine Goba; Donald S Grant; Bart L Haagmans; Julian A Hiscox; Umaru Jah; Jeffrey R Kugelman; Di Liu; Jia Lu; Christine M Malboeuf; Suzanne Mate; David A Matthews; Christian B Matranga; Luke W Meredith; James Qu; Joshua Quick; Suzan D Pas; My V T Phan; Georgios Pollakis; Chantal B Reusken; Mariano Sanchez-Lockhart; Stephen F Schaffner; John S Schieffelin; Rachel S Sealfon; Etienne Simon-Loriere; Saskia L Smits; Kilian Stoecker; Lucy Thorne; Ekaete Alice Tobin; Mohamed A Vandi; Simon J Watson; Kendra West; Shannon Whitmer; Michael R Wiley; Sarah M Winnicki; Shirlee Wohl; Roman Wölfel; Nathan L Yozwiak; Kristian G Andersen; Sylvia O Blyden; Fatorma Bolay; Miles W Carroll; Bernice Dahn; Boubacar Diallo; Pierre Formenty; Christophe Fraser; George F Gao; Robert F Garry; Ian Goodfellow; Stephan Günther; Christian T Happi; Edward C Holmes; Brima Kargbo; Sakoba Keïta; Paul Kellam; Marion P G Koopmans; Jens H Kuhn; Nicholas J Loman; N'Faly Magassouba; Dhamari Naidoo; Stuart T Nichol; Tolbert Nyenswah; Gustavo Palacios; Oliver G Pybus; Pardis C Sabeti; Amadou Sall; Ute Ströher; Isatta Wurie; Marc A Suchard; Philippe Lemey; Andrew Rambaut
Journal:  Nature       Date:  2017-04-12       Impact factor: 49.962

8.  Benzalkonium tolerance genes and outcome in Listeria monocytogenes meningitis.

Authors:  P H C Kremer; J A Lees; M M Koopmans; B Ferwerda; A W M Arends; M M Feller; K Schipper; M Valls Seron; A van der Ende; M C Brouwer; D van de Beek; S D Bentley
Journal:  Clin Microbiol Infect       Date:  2016-12-18       Impact factor: 8.067

9.  Fast and flexible bacterial genomic epidemiology with PopPUNK.

Authors:  John A Lees; Simon R Harris; Gerry Tonkin-Hill; Rebecca A Gladstone; Stephanie W Lo; Jeffrey N Weiser; Jukka Corander; Stephen D Bentley; Nicholas J Croucher
Journal:  Genome Res       Date:  2019-01-24       Impact factor: 9.043

10.  Automated analysis of phylogenetic clusters.

Authors:  Manon Ragonnet-Cronin; Emma Hodcroft; Stéphane Hué; Esther Fearnhill; Valerie Delpech; Andrew J Leigh Brown; Samantha Lycett
Journal:  BMC Bioinformatics       Date:  2013-11-06       Impact factor: 3.169

View more
  43 in total

1.  ProkEvo: an automated, reproducible, and scalable framework for high-throughput bacterial population genomics analyses.

Authors:  Natasha Pavlovikj; Joao Carlos Gomes-Neto; Jitender S Deogun; Andrew K Benson
Journal:  PeerJ       Date:  2021-05-21       Impact factor: 2.984

2.  Genome-wide association, prediction and heritability in bacteria with application to Streptococcus pneumoniae.

Authors:  Sudaraka Mallawaarachchi; Gerry Tonkin-Hill; Nicholas J Croucher; Paul Turner; Doug Speed; Jukka Corander; David Balding
Journal:  NAR Genom Bioinform       Date:  2022-02-22

3.  Phylogenomic Comparison of Neisseria gonorrhoeae Causing Disseminated Gonococcal Infections and Uncomplicated Gonorrhea in Georgia, United States.

Authors:  John C Cartee; Sandeep J Joseph; Emily Weston; Cau D Pham; Jesse C Thomas; Karen Schlanger; Sancta B St Cyr; Monica M Farley; Ashley E Moore; Amy K Tunali; Charletta Cloud; Brian H Raphael
Journal:  Open Forum Infect Dis       Date:  2022-05-13       Impact factor: 4.423

4.  The antique genetic plight of the Mediterranean monk seal (Monachus monachus).

Authors:  Jordi Salmona; Julia Dayon; Emilie Lecompte; Alexandros A Karamanlidis; Alex Aguilar; Pablo Fernandez de Larrinoa; Rosa Pires; Giulia Mo; Aliki Panou; Sabrina Agnesi; Asunción Borrell; Erdem Danyer; Bayram Öztürk; Arda M Tonay; Anastasios K Anestis; Luis M González; Panagiotis Dendrinos; Philippe Gaubert
Journal:  Proc Biol Sci       Date:  2022-08-31       Impact factor: 5.530

5.  Vibrio cholerae O139 genomes provide a clue to why it may have failed to usher in the eighth cholera pandemic.

Authors:  Thandavarayan Ramamurthy; Agila Kumari Pragasam; Alyce Taylor-Brown; Robert C Will; Karthick Vasudevan; Bhabatosh Das; Sunil Kumar Srivastava; Goutam Chowdhury; Asish K Mukhopadhyay; Shanta Dutta; Balaji Veeraraghavan; Nicholas R Thomson; Naresh C Sharma; Gopinath Balakrish Nair; Yoshifumi Takeda; Amit Ghosh; Gordon Dougan; Ankur Mutreja
Journal:  Nat Commun       Date:  2022-07-05       Impact factor: 17.694

6.  Global population structure and genotyping framework for genomic surveillance of the major dysentery pathogen, Shigella sonnei.

Authors:  Jane Hawkey; Kalani Paranagama; Kate S Baker; Rebecca J Bengtsson; François-Xavier Weill; Nicholas R Thomson; Stephen Baker; Louise Cerdeira; Zamin Iqbal; Martin Hunt; Danielle J Ingle; Timothy J Dallman; Claire Jenkins; Deborah A Williamson; Kathryn E Holt
Journal:  Nat Commun       Date:  2021-05-11       Impact factor: 14.919

7.  Molecular Epidemiological Characteristics of Mycobacterium abscessus Complex Derived from Non-Cystic Fibrosis Patients in Japan and Taiwan.

Authors:  Mitsunori Yoshida; Jung-Yien Chien; Po-Ren Hsueh; Kozo Morimoto; Takeshi Kinjo; Akio Aono; Yoshiro Murase; Keiji Fujiwara; Yuta Morishige; Hiroaki Nagano; Ruwen Jou; Naoki Hasegawa; Manabu Ato; Yoshihiko Hoshino; Satoshi Mitarai
Journal:  Microbiol Spectr       Date:  2022-04-21

8.  Targeted surveillance strategies for efficient detection of novel antibiotic resistance variants.

Authors:  Allison L Hicks; Stephen M Kissler; Tatum D Mortimer; Kevin C Ma; George Taiaroa; Melinda Ashcroft; Deborah A Williamson; Marc Lipsitch; Yonatan H Grad
Journal:  Elife       Date:  2020-06-30       Impact factor: 8.140

9.  Phylogenomic analysis reveals persistence of gonococcal strains with reduced-susceptibility to extended-spectrum cephalosporins and mosaic penA-34.

Authors:  Jesse C Thomas; Sandeep J Joseph; John C Cartee; Cau D Pham; Matthew W Schmerer; Karen Schlanger; Sancta B St Cyr; Ellen N Kersh; Brian H Raphael
Journal:  Nat Commun       Date:  2021-06-21       Impact factor: 14.919

10.  Contrasting Genetic Footprints among Saharan Olive Populations: Potential Causes and Conservation Implications.

Authors:  Guillaume Besnard; Océane Gorrilliot; Pauline Raimondeau; Benoit Génot; Ahmed El Bakkali; Fabien Anthelme; Djamel Baali-Cherif
Journal:  Plants (Basel)       Date:  2021-06-14
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.