Literature DB >> 30165689

EPA-ng: Massively Parallel Evolutionary Placement of Genetic Sequences.

Pierre Barbera1, Alexey M Kozlov1, Lucas Czech1, Benoit Morel1, Diego Darriba1,2, Tomáš Flouri1,3, Alexandros Stamatakis1,4.   

Abstract

Next generation sequencing (NGS) technologies have led to a ubiquity of molecular sequence data. This data avalanche is particularly challenging in metagenetics, which focuses on taxonomic identification of sequences obtained from diverse microbial environments. Phylogenetic placement methods determine how these sequences fit into an evolutionary context. Previous implementations of phylogenetic placement algorithms, such as the evolutionary placement algorithm (EPA) included in RAxML, or PPLACER, are being increasingly used for this purpose. However, due to the steady progress in NGS technologies, the current implementations face substantial scalability limitations. Herein, we present EPA-NG, a complete reimplementation of the EPA that is substantially faster, offers a distributed memory parallelization, and integrates concepts from both, RAxML-EPA and PPLACER. EPA-NG can be executed on standard shared memory, as well as on distributed memory systems (e.g., computing clusters). To demonstrate the scalability of EPA-NG, we placed $1$ billion metagenetic reads from the Tara Oceans Project onto a reference tree with 3748 taxa in just under $7$ h, using 2048 cores. Our performance assessment shows that EPA-NG outperforms RAxML-EPA and PPLACER by up to a factor of $30$ in sequential execution mode, while attaining comparable parallel efficiency on shared memory systems. We further show that the distributed memory parallelization of EPA-NG scales well up to 2048 cores. EPA-NG is available under the AGPLv3 license: https://github.com/Pbdas/epa-ng.

Entities:  

Mesh:

Year:  2019        PMID: 30165689      PMCID: PMC6368480          DOI: 10.1093/sysbio/syy054

Source DB:  PubMed          Journal:  Syst Biol        ISSN: 1063-5157            Impact factor:   15.683


  11 in total

1.  Metagenomic species profiling using universal phylogenetic marker genes.

Authors:  Shinichi Sunagawa; Daniel R Mende; Georg Zeller; Fernando Izquierdo-Carrasco; Simon A Berger; Jens Roat Kultima; Luis Pedro Coelho; Manimozhiyan Arumugam; Julien Tap; Henrik Bjørn Nielsen; Simon Rasmussen; Søren Brunak; Oluf Pedersen; Francisco Guarner; Willem M de Vos; Jun Wang; Junhua Li; Joël Doré; S Dusko Ehrlich; Alexandros Stamatakis; Peer Bork
Journal:  Nat Methods       Date:  2013-10-20       Impact factor: 28.547

2.  Phylogeny-aware identification and correction of taxonomically mislabeled sequences.

Authors:  Alexey M Kozlov; Jiajie Zhang; Pelin Yilmaz; Frank Oliver Glöckner; Alexandros Stamatakis
Journal:  Nucleic Acids Res       Date:  2016-05-10       Impact factor: 16.971

3.  Aligning short reads to reference alignments and trees.

Authors:  Simon A Berger; Alexandros Stamatakis
Journal:  Bioinformatics       Date:  2011-06-02       Impact factor: 6.937

4.  Ocean plankton. Structure and function of the global ocean microbiome.

Authors:  Shinichi Sunagawa; Luis Pedro Coelho; Samuel Chaffron; Jens Roat Kultima; Karine Labadie; Guillem Salazar; Bardya Djahanschiri; Georg Zeller; Daniel R Mende; Adriana Alberti; Francisco M Cornejo-Castillo; Paul I Costea; Corinne Cruaud; Francesco d'Ovidio; Stefan Engelen; Isabel Ferrera; Josep M Gasol; Lionel Guidi; Falk Hildebrand; Florian Kokoszka; Cyrille Lepoivre; Gipsi Lima-Mendez; Julie Poulain; Bonnie T Poulos; Marta Royo-Llonch; Hugo Sarmento; Sara Vieira-Silva; Céline Dimier; Marc Picheral; Sarah Searson; Stefanie Kandels-Lewis; Chris Bowler; Colomban de Vargas; Gabriel Gorsky; Nigel Grimsley; Pascal Hingamp; Daniele Iudicone; Olivier Jaillon; Fabrice Not; Hiroyuki Ogata; Stephane Pesant; Sabrina Speich; Lars Stemmann; Matthew B Sullivan; Jean Weissenbach; Patrick Wincker; Eric Karsenti; Jeroen Raes; Silvia G Acinas; Peer Bork
Journal:  Science       Date:  2015-05-22       Impact factor: 47.728

5.  Parasites dominate hyperdiverse soil protist communities in Neotropical rainforests.

Authors:  Frédéric Mahé; Colomban de Vargas; David Bass; Lucas Czech; Alexandros Stamatakis; Enrique Lara; David Singer; Jordan Mayor; John Bunge; Sarah Sernaker; Tobias Siemensmeyer; Isabelle Trautmann; Sarah Romac; Cédric Berney; Alexey Kozlov; Edward A D Mitchell; Christophe V W Seppey; Elianne Egge; Guillaume Lentendu; Rainer Wirth; Gabriel Trueba; Micah Dunthorn
Journal:  Nat Ecol Evol       Date:  2017-03-20       Impact factor: 15.460

6.  The phylogenetic Kantorovich-Rubinstein metric for environmental sequence samples.

Authors:  Steven N Evans; Frederick A Matsen
Journal:  J R Stat Soc Series B Stat Methodol       Date:  2012-02-15       Impact factor: 4.488

7.  Performance, accuracy, and Web server for evolutionary placement of short sequence reads under maximum likelihood.

Authors:  Simon A Berger; Denis Krompass; Alexandros Stamatakis
Journal:  Syst Biol       Date:  2011-03-23       Impact factor: 15.683

8.  pplacer: linear time maximum-likelihood and Bayesian phylogenetic placement of sequences onto a fixed reference tree.

Authors:  Frederick A Matsen; Robin B Kodner; E Virginia Armbrust
Journal:  BMC Bioinformatics       Date:  2010-10-30       Impact factor: 3.169

9.  Bacterial communities in women with bacterial vaginosis: high resolution phylogenetic analyses reveal relationships of microbiota to clinical criteria.

Authors:  Sujatha Srinivasan; Noah G Hoffman; Martin T Morgan; Frederick A Matsen; Tina L Fiedler; Robert W Hall; Frederick J Ross; Connor O McCoy; Roger Bumgarner; Jeanne M Marrazzo; David N Fredricks
Journal:  PLoS One       Date:  2012-06-18       Impact factor: 3.240

10.  Topographic diversity of fungal and bacterial communities in human skin.

Authors:  Keisha Findley; Julia Oh; Joy Yang; Sean Conlan; Clayton Deming; Jennifer A Meyer; Deborah Schoenfeld; Effie Nomicos; Morgan Park; Heidi H Kong; Julia A Segre
Journal:  Nature       Date:  2013-05-22       Impact factor: 49.962

View more
  88 in total

1.  Scalable methods for analyzing and visualizing phylogenetic placement of metagenomic samples.

Authors:  Lucas Czech; Alexandros Stamatakis
Journal:  PLoS One       Date:  2019-05-28       Impact factor: 3.240

2.  Sensitive Identification of Bacterial DNA in Clinical Specimens by Broad-Range 16S rRNA Gene Enrichment.

Authors:  Sara Rassoulian Barrett; Noah G Hoffman; Christopher Rosenthal; Andrew Bryan; Desiree A Marshall; Joshua Lieberman; Alexander L Greninger; Vikas Peddu; Brad T Cookson; Stephen J Salipante
Journal:  J Clin Microbiol       Date:  2020-11-18       Impact factor: 5.948

3.  16S rRNA gene amplicon-based metagenomic analysis of bacterial communities in the rhizospheres of selected mangrove species from Mida Creek and Gazi Bay, Kenya.

Authors:  Edith M Muwawa; Chinedu C Obieze; Huxley M Makonde; Joyce M Jefwa; James H P Kahindi; Damase P Khasa
Journal:  PLoS One       Date:  2021-03-23       Impact factor: 3.240

4.  Divergence of Biocrust Active Bacterial Communities in the Negev Desert During a Hydration-Desiccation Cycle.

Authors:  Capucine Baubin; Noya Ran; Hagar Siebner; Osnat Gillor
Journal:  Microb Ecol       Date:  2022-07-05       Impact factor: 4.552

5.  Unambiguous identification of fungi: where do we stand and how accurate and precise is fungal DNA barcoding?

Authors:  Robert Lücking; M Catherine Aime; Barbara Robbertse; Andrew N Miller; Hiran A Ariyawansa; Takayuki Aoki; Gianluigi Cardinali; Pedro W Crous; Irina S Druzhinina; David M Geiser; David L Hawksworth; Kevin D Hyde; Laszlo Irinyi; Rajesh Jeewon; Peter R Johnston; Paul M Kirk; Elaine Malosso; Tom W May; Wieland Meyer; Maarja Öpik; Vincent Robert; Marc Stadler; Marco Thines; Duong Vu; Andrey M Yurkov; Ning Zhang; Conrad L Schoch
Journal:  IMA Fungus       Date:  2020-07-10       Impact factor: 3.515

6.  APPLES: Scalable Distance-Based Phylogenetic Placement with or without Alignments.

Authors:  Metin Balaban; Shahab Sarmashghi; Siavash Mirarab
Journal:  Syst Biol       Date:  2020-05-01       Impact factor: 15.683

7.  Phylogeny Estimation Given Sequence Length Heterogeneity.

Authors:  Vladimir Smirnov; Tandy Warnow
Journal:  Syst Biol       Date:  2021-02-10       Impact factor: 15.683

8.  Evolutionary origin and ecological implication of a unique nif island in free-living Bradyrhizobium lineages.

Authors:  Jinjin Tao; Sishuo Wang; Tianhua Liao; Haiwei Luo
Journal:  ISME J       Date:  2021-05-14       Impact factor: 10.302

9.  Changes in Archaeal Community and Activity by the Invasion of Spartina anglica Along Soil Depth Profiles of a Coastal Wetland.

Authors:  Jinhyun Kim; Young Mok Heo; Jeongeun Yun; Hanbyul Lee; Jae-Jin Kim; Hojeong Kang
Journal:  Microb Ecol       Date:  2021-05-18       Impact factor: 4.552

10.  Dissecting the contribution of host genetics and the microbiome in complex behaviors.

Authors:  Shelly A Buffington; Sean W Dooling; Martina Sgritta; Cecilia Noecker; Oscar D Murillo; Daniela F Felice; Peter J Turnbaugh; Mauro Costa-Mattioli
Journal:  Cell       Date:  2021-03-10       Impact factor: 41.582

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.