Literature DB >> 30903685

ASTRAL-MP: scaling ASTRAL to very large datasets using randomization and parallelization.

John Yin1, Chao Zhang2, Siavash Mirarab3.   

Abstract

MOTIVATION: Evolutionary histories can change from one part of the genome to another. The potential for discordance between the gene trees has motivated the development of summary methods that reconstruct a species tree from an input collection of gene trees. ASTRAL is a widely used summary method and has been able to scale to relatively large datasets. However, the size of genomic datasets is quickly growing. Despite its relative efficiency, the current single-threaded implementation of ASTRAL is falling behind the data growth trends is not able to analyze the largest available datasets in a reasonable time.
RESULTS: ASTRAL uses dynamic programing and is not trivially parallel. In this paper, we introduce ASTRAL-MP, the first version of ASTRAL that can exploit parallelism and also uses randomization techniques to speed up some of its steps. Importantly, ASTRAL-MP can take advantage of not just multiple CPU cores but also one or several graphics processing units (GPUs). The ASTRAL-MP code scales very well with increasing CPU cores, and its GPU version, implemented in OpenCL, can have up to 158× speedups compared to ASTRAL-III. Using GPUs and multiple cores, ASTRAL-MP is able to analyze datasets with 10 000 species or datasets with more than 100 000 genes in <2 days.
AVAILABILITY AND IMPLEMENTATION: ASTRAL-MP is available at https://github.com/smirarab/ASTRAL/tree/MP. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author(s) 2019. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Mesh:

Year:  2019        PMID: 30903685     DOI: 10.1093/bioinformatics/btz211

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  14 in total

1.  Obligate sexual reproduction of a homothallic fungus closely related to the Cryptococcus pathogenic species complex.

Authors:  Andrew Ryan Passer; Shelly Applen Clancey; Terrance Shea; Márcia David-Palma; Anna Floyd Averette; Teun Boekhout; Betina M Porcel; Minou Nowrousian; Christina A Cuomo; Sheng Sun; Joseph Heitman; Marco A Coelho
Journal:  Elife       Date:  2022-06-17       Impact factor: 8.713

2.  Assembling a Reference Phylogenomic Tree of Bacteria and Archaea by Summarizing Many Gene Phylogenies.

Authors:  Qiyun Zhu; Siavash Mirarab
Journal:  Methods Mol Biol       Date:  2022

3.  Comparative genomics of the Western Hemisphere soft tick-borne relapsing fever borreliae highlights extensive plasmid diversity.

Authors:  Alexander R Kneubehl; Aparna Krishnavajhala; Sebastián Muñoz Leal; Adam J Replogle; Luke C Kingry; Sergio E Bermúdez; Marcelo B Labruna; Job E Lopez
Journal:  BMC Genomics       Date:  2022-05-31       Impact factor: 4.547

4.  Phylogenomics of 10,575 genomes reveals evolutionary proximity between domains Bacteria and Archaea.

Authors:  Qiyun Zhu; Uyen Mai; Wayne Pfeiffer; Stefan Janssen; Francesco Asnicar; Jon G Sanders; Pedro Belda-Ferre; Gabriel A Al-Ghalith; Evguenia Kopylova; Daniel McDonald; Tomasz Kosciolek; John B Yin; Shi Huang; Nimaichand Salam; Jian-Yu Jiao; Zijun Wu; Zhenjiang Z Xu; Kalen Cantrell; Yimeng Yang; Erfan Sayyari; Maryam Rabiee; James T Morton; Sheila Podell; Dan Knights; Wen-Jun Li; Curtis Huttenhower; Nicola Segata; Larry Smarr; Siavash Mirarab; Rob Knight
Journal:  Nat Commun       Date:  2019-12-02       Impact factor: 14.919

5.  ASTRAL-Pro: Quartet-Based Species-Tree Inference despite Paralogy.

Authors:  Chao Zhang; Celine Scornavacca; Erin K Molloy; Siavash Mirarab
Journal:  Mol Biol Evol       Date:  2020-11-01       Impact factor: 16.240

6.  Large-scale fungal strain sequencing unravels the molecular diversity in mating loci maintained by long-term balancing selection.

Authors:  David Peris; Dabao Sun Lu; Vilde Bruhn Kinneberg; Ine-Susanne Methlie; Malin Stapnes Dahl; Timothy Y James; Håvard Kauserud; Inger Skrede
Journal:  PLoS Genet       Date:  2022-03-31       Impact factor: 5.917

7.  Comparative Genomics Analyses of Lifestyle Transitions at the Origin of an Invasive Fungal Pathogen in the Genus Cryphonectria.

Authors:  Lea Stauber; Simone Prospero; Daniel Croll
Journal:  mSphere       Date:  2020-10-14       Impact factor: 4.389

8.  Partitioned Gene-Tree Analyses and Gene-Based Topology Testing Help Resolve Incongruence in a Phylogenomic Study of Host-Specialist Bees (Apidae: Eucerinae).

Authors:  Felipe V Freitas; Michael G Branstetter; Terry Griswold; Eduardo A B Almeida
Journal:  Mol Biol Evol       Date:  2021-03-09       Impact factor: 16.240

9.  SINE-Based Phylogenomics Reveal Extensive Introgression and Incomplete Lineage Sorting in Myotis.

Authors:  Jennifer M Korstian; Nicole S Paulat; Roy N Platt; Richard D Stevens; David A Ray
Journal:  Genes (Basel)       Date:  2022-02-23       Impact factor: 4.096

10.  Highly contiguous assemblies of 101 drosophilid genomes.

Authors:  Bernard Y Kim; Jeremy R Wang; Daniel R Matute; Dmitri A Petrov; Danny E Miller; Olga Barmina; Emily Delaney; Ammon Thompson; Aaron A Comeault; David Peede; Emmanuel R R D'Agostino; Julianne Pelaez; Jessica M Aguilar; Diler Haji; Teruyuki Matsunaga; Ellie E Armstrong; Molly Zych; Yoshitaka Ogawa; Marina Stamenković-Radak; Mihailo Jelić; Marija Savić Veselinović; Marija Tanasković; Pavle Erić; Jian-Jun Gao; Takehiro K Katoh; Masanori J Toda; Hideaki Watabe; Masayoshi Watada; Jeremy S Davis; Leonie C Moyle; Giulia Manoli; Enrico Bertolini; Vladimír Košťál; R Scott Hawley; Aya Takahashi; Corbin D Jones; Donald K Price; Noah Whiteman; Artyom Kopp
Journal:  Elife       Date:  2021-07-19       Impact factor: 8.713

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.