Literature DB >> 30535314

Prot-SpaM: fast alignment-free phylogeny reconstruction based on whole-proteome sequences.

Chris-Andre Leimeister1, Jendrik Schellhorn1, Svenja Dörrer1, Michael Gerth2, Christoph Bleidorn3,4, Burkhard Morgenstern1,5.   

Abstract

Word-based or 'alignment-free' sequence comparison has become an active research area in bioinformatics. While previous word-frequency approaches calculated rough measures of sequence similarity or dissimilarity, some new alignment-free methods are able to accurately estimate phylogenetic distances between genomic sequences. One of these approaches is Filtered Spaced Word Matches. Here, we extend this approach to estimate evolutionary distances between complete or incomplete proteomes; our implementation of this approach is called Prot-SpaM. We compare the performance of Prot-SpaM to other alignment-free methods on simulated sequences and on various groups of eukaryotic and prokaryotic taxa. Prot-SpaM can be used to calculate high-quality phylogenetic trees for dozens of whole-proteome sequences in a matter of seconds or minutes and often outperforms other alignment-free approaches. The source code of our software is available through Github: https://github.com/jschellh/ProtSpaM.
© The Author(s) 2018. Published by Oxford University Press.

Entities:  

Keywords:  Kimura; Wolbachia; alignment-free; amino-acid substitutions; distance method; micro-alignment; phylogeny; protein comparison; proteome; spaced words

Mesh:

Substances:

Year:  2019        PMID: 30535314      PMCID: PMC6436989          DOI: 10.1093/gigascience/giy148

Source DB:  PubMed          Journal:  Gigascience        ISSN: 2047-217X            Impact factor:   6.524


  69 in total

1.  Amino acid substitution matrices from protein blocks.

Authors:  S Henikoff; J G Henikoff
Journal:  Proc Natl Acad Sci U S A       Date:  1992-11-15       Impact factor: 11.205

2.  Phylogenetic diversity and the greedy algorithm.

Authors:  Mike Steel
Journal:  Syst Biol       Date:  2005-08       Impact factor: 15.683

3.  Computing the distribution of a tree metric.

Authors:  David Bryant; Mike Steel
Journal:  IEEE/ACM Trans Comput Biol Bioinform       Date:  2009 Jul-Sep       Impact factor: 3.710

4.  Improved Modeling of Compositional Heterogeneity Supports Sponges as Sister to All Other Animals.

Authors:  Roberto Feuda; Martin Dohrmann; Walker Pett; Hervé Philippe; Omar Rota-Stabelli; Nicolas Lartillot; Gert Wörheide; Davide Pisani
Journal:  Curr Biol       Date:  2017-11-30       Impact factor: 10.834

5.  MAFFT multiple sequence alignment software version 7: improvements in performance and usability.

Authors:  Kazutaka Katoh; Daron M Standley
Journal:  Mol Biol Evol       Date:  2013-01-16       Impact factor: 16.240

6.  Parasitism and mutualism in Wolbachia: what the phylogenomic trees can and cannot say.

Authors:  Seth R Bordenstein; Charalampos Paraskevopoulos; Julie C Dunning Hotopp; Panagiotis Sapountzis; Nathan Lo; Claudio Bandi; Hervé Tettelin; John H Werren; Kostas Bourtzis
Journal:  Mol Biol Evol       Date:  2008-10-28       Impact factor: 16.240

7.  Interactive tree of life (iTOL) v3: an online tool for the display and annotation of phylogenetic and other trees.

Authors:  Ivica Letunic; Peer Bork
Journal:  Nucleic Acids Res       Date:  2016-04-19       Impact factor: 16.971

8.  rasbhari: Optimizing Spaced Seeds for Database Searching, Read Mapping and Alignment-Free Sequence Comparison.

Authors:  Lars Hahn; Chris-André Leimeister; Rachid Ounit; Stefano Lonardi; Burkhard Morgenstern
Journal:  PLoS Comput Biol       Date:  2016-10-19       Impact factor: 4.475

9.  Best hits of 11110110111: model-free selection and parameter-free sensitivity calculation of spaced seeds.

Authors:  Laurent Noé
Journal:  Algorithms Mol Biol       Date:  2017-02-14       Impact factor: 1.405

10.  Co-phylog: an assembly-free phylogenomic approach for closely related organisms.

Authors:  Huiguang Yi; Li Jin
Journal:  Nucleic Acids Res       Date:  2013-01-18       Impact factor: 16.971

View more
  9 in total

1.  Prot-SpaM: fast alignment-free phylogeny reconstruction based on whole-proteome sequences.

Authors:  Chris-Andre Leimeister; Jendrik Schellhorn; Svenja Dörrer; Michael Gerth; Christoph Bleidorn; Burkhard Morgenstern
Journal:  Gigascience       Date:  2019-03-01       Impact factor: 6.524

2.  Read-SpaM: assembly-free and alignment-free comparison of bacterial genomes with low sequencing coverage.

Authors:  Anna-Katharina Lau; Svenja Dörrer; Chris-André Leimeister; Christoph Bleidorn; Burkhard Morgenstern
Journal:  BMC Bioinformatics       Date:  2019-12-17       Impact factor: 3.169

3.  Benchmarking of alignment-free sequence comparison methods.

Authors:  Andrzej Zielezinski; Hani Z Girgis; Guillaume Bernard; Chris-Andre Leimeister; Kujin Tang; Thomas Dencker; Anna Katharina Lau; Sophie Röhling; Jae Jin Choi; Michael S Waterman; Matteo Comin; Sung-Hou Kim; Susana Vinga; Jonas S Almeida; Cheong Xin Chan; Benjamin T James; Fengzhu Sun; Burkhard Morgenstern; Wojciech M Karlowski
Journal:  Genome Biol       Date:  2019-07-25       Impact factor: 13.583

4.  The transcriptome of Balamuthia mandrillaris trophozoites for structure-guided drug design.

Authors:  Isabelle Q Phan; Christopher A Rice; Justin Craig; Rooksana E Noorai; Jacquelyn R McDonald; Sandhya Subramanian; Logan Tillery; Lynn K Barrett; Vijay Shankar; James C Morris; Wesley C Van Voorhis; Dennis E Kyle; Peter J Myler
Journal:  Sci Rep       Date:  2021-11-04       Impact factor: 4.379

5.  Phylogenies from unaligned proteomes using sequence environments of amino acid residues.

Authors:  Juan Carlos Aledo
Journal:  Sci Rep       Date:  2022-05-06       Impact factor: 4.996

6.  Insertions and deletions as phylogenetic signal in an alignment-free context.

Authors:  Niklas Birth; Thomas Dencker; Burkhard Morgenstern
Journal:  PLoS Comput Biol       Date:  2022-08-08       Impact factor: 4.779

7.  Evolutionary Insight into the Trypanosomatidae Using Alignment-Free Phylogenomics of the Kinetoplast.

Authors:  Alexa Kaufer; Damien Stark; John Ellis
Journal:  Pathogens       Date:  2019-09-18

8.  De novo assembly of the olive fruit fly (Bactrocera oleae) genome with linked-reads and long-read technologies minimizes gaps and provides exceptional Y chromosome assembly.

Authors:  Anthony Bayega; Haig Djambazian; Konstantina T Tsoumani; Maria-Eleni Gregoriou; Efthimia Sagri; Eleni Drosopoulou; Penelope Mavragani-Tsipidou; Kristina Giorda; George Tsiamis; Kostas Bourtzis; Spyridon Oikonomopoulos; Ken Dewar; Deanna M Church; Alexie Papanicolaou; Kostas D Mathiopoulos; Jiannis Ragoussis
Journal:  BMC Genomics       Date:  2020-03-30       Impact factor: 3.969

9.  The number of k-mer matches between two DNA sequences as a function of k and applications to estimate phylogenetic distances.

Authors:  Sophie Röhling; Alexander Linne; Jendrik Schellhorn; Morteza Hosseini; Thomas Dencker; Burkhard Morgenstern
Journal:  PLoS One       Date:  2020-02-10       Impact factor: 3.240

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.