Alignment- and reference-free phylogenomics with colored de Bruijn graphs.

Roland Wittler

Abstract

BACKGROUND: The increasing amount of available genome sequence data enables large-scale comparative studies. A common task is the inference of phylogenies, which is challenging if close reference sequences are not available, genome sequences are incompletely assembled, or the high number of genomes precludes multiple sequence alignment in reasonable time.
RESULTS: We present a new whole-genome based approach to infer phylogenies that is alignment- and reference-free. In contrast to other methods, it does not rely on pairwise comparisons to determine distances to infer edges in a tree. Instead, a colored de Bruijn graph is constructed, and information on common subsequences is extracted to infer phylogenetic splits.
CONCLUSIONS: The new methodology for large-scale phylogenomics shows high potential. Application to different datasets confirms the robustness of the approach. A comparison to other state-of-the-art whole-genome based methods indicates comparable or higher accuracy and efficiency.
© The Author(s) 2020.
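
The idea described under RESULTS can be illustrated with a minimal sketch. It is not the author's implementation: it assumes plain k-mers stand in for the nodes of a colored de Bruijn graph, that the "color" of a k-mer is the set of genomes containing it, and that each color set is read as one side of a split whose weight is simply the number of supporting k-mers. The function names (canonical, kmer_colors, split_weights) are hypothetical.

```python
from collections import Counter, defaultdict

# Translation table for reverse complements (assumes an ACGT-only alphabet).
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def canonical(kmer):
    """Return the lexicographically smaller of a k-mer and its reverse complement."""
    reverse_complement = kmer.translate(_COMPLEMENT)[::-1]
    return min(kmer, reverse_complement)


def kmer_colors(genomes, k):
    """Map each canonical k-mer to the set of genome names that contain it."""
    colors = defaultdict(set)
    for name, sequence in genomes.items():
        for i in range(len(sequence) - k + 1):
            colors[canonical(sequence[i:i + k])].add(name)
    return colors


def split_weights(genomes, k):
    """Count, for every split of the genome set, the k-mers that support it.

    A k-mer shared by a proper subset S of the genomes supports the split
    S | rest. No pairwise distances are computed. Using the raw k-mer count
    as the weight is an assumption made for this illustration.
    """
    all_names = frozenset(genomes)
    weights = Counter()
    for names in kmer_colors(genomes, k).values():
        side = frozenset(names)
        if side == all_names:
            continue  # present everywhere: separates nothing
        other = all_names - side
        # Store a split and its complement under one key.
        key = min(side, other, key=lambda s: (len(s), sorted(s)))
        weights[key] += 1
    return weights


if __name__ == "__main__":
    toy = {"a": "AAAC", "b": "AAAC", "c": "CCCA", "d": "CCCA"}
    for split, weight in split_weights(toy, 4).items():
        print(sorted(split), weight)
```

Selecting a compatible subset of the highest-weighted splits, which would be needed to obtain a tree, is left out of this sketch.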

Keywords:  Colored de Bruijn graphs; Phylogenetic splits; Phylogenetics; Phylogenomics

Year:  2020        PMID: 32280365      PMCID: PMC7137503          DOI: 10.1186/s13015-020-00164-3

Source DB:  PubMed          Journal:  Algorithms Mol Biol        ISSN: 1748-7188            Impact factor:   1.405


