Literature DB >> 29186450

SPECTRE: a suite of phylogenetic tools for reticulate evolution.

Sarah Bastkowski1, Daniel Mapleson1, Andreas Spillner2, Taoyang Wu3, Monika Balvociute4, Vincent Moulton3.   

Abstract

Summary: Split-networks are a generalization of phylogenetic trees that have proven to be a powerful tool in phylogenetics. Various ways have been developed for computing such networks, including split-decomposition, NeighborNet, QNet and FlatNJ. Some of these approaches are implemented in the user-friendly SplitsTree software package. However, to give the user the option to adjust and extend these approaches and to facilitate their integration into analysis pipelines, there is a need for robust, open-source implementations of associated data structures and algorithms. Here, we present SPECTRE, a readily available, open-source library of data structures written in Java, that comes complete with new implementations of several pre-published algorithms and a basic interactive graphical interface for visualizing planar split networks. SPECTRE also supports the use of longer running algorithms by providing command line interfaces, which can be executed on servers or in High Performance Computing environments. Availability and implementation: Full source code is available under the GPLv3 license at: https://github.com/maplesond/SPECTRE. SPECTRE's core library is available from Maven Central at: https://mvnrepository.com/artifact/uk.ac.uea.cmp.spectre/core. Documentation is available at: http://spectre-suite-of-phylogenetic-tools-for-reticulate-evolution.readthedocs.io/en/latest/. Contact: sarah.bastkowski@earlham.ac.uk. Supplementary information: Supplementary data are available at Bioinformatics online.

Entities:  

Mesh:

Year:  2018        PMID: 29186450      PMCID: PMC5860355          DOI: 10.1093/bioinformatics/btx740

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


1 Introduction

Split-networks are a generalization of phylogenetic trees that are commonly used to analyze reticulate evolution in organisms such as plants, bacteria and viruses (see Fig. 1 for an example). They provide a snapshot of the data and can be used to display conflicting signals. Examples of algorithms for computing such networks include split-decomposition (Bandelt and Dress, 1992), Neighbor-Net (Bryant and Moulton, 2004), QNet (Grünewald ), SuperQ (Grünewald ) and FlatNJ (Balvočiūtė ). A comprehensive overview of split-networks can be found in (Huson and Bryant, 2006). Currently, the main program available for computing split-networks is the user-friendly SplitsTree program (Huson and Bryant, 2006). In addition, various methods for computing split-networks such as some of those mentioned above have been implemented and released as stand alone applications. Implementing data structures capable of representing the mathematical structures used to describe and compute split networks is not a trivial undertaking and existing software either is closed source or have their data structures and algorithms tightly integrated with their host tool, so are not easily reusable. There are, therefore, currently few options for developers wishing to create or extend their own tools based on these concepts other than to start from scratch. Hence, there is a need for a robust and flexible open-source library that provides core data structures and algorithms to facilitate development of new tools.
Fig. 1

To illustrate some of SPECTREs functionality, we processed a dataset analyzed in (Bollyky ) consisting of different Hepatitis B viruses (HBV). There are five different genomic groups and the phylogenetic analysis led to the result that HBVDNA is a recombinant with around half the genome coming from group A and half from group D. It also concluded that HPBADW1 is a recombinant of HPBADW2 (B) and HPBADWZCG (A), but with only a small insertion from HPBADWZCG into the Genome. (a) A minimum evolution tree constructed by NetME that is compatible with the split network constructed by NeighborNet, which is shown in (b). (c) The split network constructed by FlatNJ

To illustrate some of SPECTREs functionality, we processed a dataset analyzed in (Bollyky ) consisting of different Hepatitis B viruses (HBV). There are five different genomic groups and the phylogenetic analysis led to the result that HBVDNA is a recombinant with around half the genome coming from group A and half from group D. It also concluded that HPBADW1 is a recombinant of HPBADW2 (B) and HPBADWZCG (A), but with only a small insertion from HPBADWZCG into the Genome. (a) A minimum evolution tree constructed by NetME that is compatible with the split network constructed by NeighborNet, which is shown in (b). (c) The split network constructed by FlatNJ

2 SPECTRE

Here, we present SPECTRE, a suite of tools for computing, modelling and visualizing reticulate evolution based on split-networks. SPECTRE builds in part on existing open-source implementations of some of these tools, in particular for QNet, SuperQ and FlatNJ, integrating them into a unified and extendible library. The main tools available through SPECTRE are summarized below (for more details see Section 1 of Supplementary Material): These tools are accessible to the user via graphical and command line interfaces. Apart from driving the tools, the interactive graphical interface can visualize planar split networks using the drawing algorithm in (Spillner ). The interface offers a number of basic functions for orientating the canvas (e.g. zoom, pan, flip and rotate), manipulating labels (size, color, location) and creating image files (PDF, EPS, SVG, PNG). The command line implementation enables bioinformaticians to integrate tools into pipelines. This works on desktop PCs, like SplitsTree, but is also designed so long running tools are executable on servers or high performance computing environments where displays are not available. For developers wishing to reuse code and develop their own tools, SPECTRE provides a core library containing common data structures (e.g. splits, trees, networks, distances, quartets and multiple sequence alignments), algorithms (e.g. NeighborNet) and robust file parsers to process a range of input files (e.g. NEXUS, PHYLIP, Newick, Emboss, FastA); see Section 2 of Supplementary Material for more details. The library is available directly from Maven Central, giving developers direct access to the most recent version of the library and and providing a convenient way to integrate it into the processes for building their own projects. NeighborNet rapidly constructs a circular split network from a distance matrix or a sequence alignment (Bryant and Moulton, 2004). NetMake implements variants of NeighborNet as described in (Levy and Pachter, 2011). SuperQ constructs a circular split network from a set of (partial) input trees (Grünewald ). FlatNJ constructs a flat split network from a multiple sequence alignment, weighted quartet data or location data (Balvočiūtė ). NetME produces a minimum evolution tree compatible with an existing circular split network (Bastkowski ).

3 Concluding remarks

SPECTRE provides a collection of open-source tools and resources for modelling, understanding and visualizing reticulate evolution based on split networks. We believe that our software will both enable bioinformaticians to easily test and compare methods for inferring planar split networks and help computer scientists build their own methods for inferring phylogenetic networks by reusing our existing data structures and algorithms via the open-source library. Moreover, this also provides the option to easily add such new tools to the library making them readily available to other users. Click here for additional data file.
  8 in total

1.  Neighbor-net: an agglomerative method for the construction of phylogenetic networks.

Authors:  David Bryant; Vincent Moulton
Journal:  Mol Biol Evol       Date:  2003-12-05       Impact factor: 16.240

2.  Split decomposition: a new and useful approach to phylogenetic analysis of distance data.

Authors:  H J Bandelt; A W Dress
Journal:  Mol Phylogenet Evol       Date:  1992-09       Impact factor: 4.286

3.  Application of phylogenetic networks in evolutionary studies.

Authors:  Daniel H Huson; David Bryant
Journal:  Mol Biol Evol       Date:  2005-10-12       Impact factor: 16.240

4.  QNet: an agglomerative method for the construction of phylogenetic networks from weighted quartets.

Authors:  Stefan Grünewald; Kristoffer Forslund; Andreas Dress; Vincent Moulton
Journal:  Mol Biol Evol       Date:  2006-11-21       Impact factor: 16.240

5.  SuperQ: computing supernetworks from quartets.

Authors:  Stefan Grünewald; Andreas Spillner; Sarah Bastkowski; Anja Bögershausen; Vincent Moulton
Journal:  IEEE/ACM Trans Comput Biol Bioinform       Date:  2013 Jan-Feb       Impact factor: 3.710

6.  Constructing and drawing regular planar split networks.

Authors:  Andreas Spillner; Binh T Nguyen; Vincent Moulton
Journal:  IEEE/ACM Trans Comput Biol Bioinform       Date:  2011-08-04       Impact factor: 3.710

7.  FlatNJ: a novel network-based approach to visualize evolutionary and biogeographical relationships.

Authors:  Monika Balvočūtė; Andreas Spillner; Vincent Moulton
Journal:  Syst Biol       Date:  2014-01-15       Impact factor: 15.683

8.  Recombination between sequences of hepatitis B virus from different genotypes.

Authors:  P L Bollyky; A Rambaut; P H Harvey; E C Holmes
Journal:  J Mol Evol       Date:  1996-02       Impact factor: 2.395

  8 in total
  2 in total

1.  Genome sequencing of Rhinorhipus Lawrence exposes an early branch of the Coleoptera.

Authors:  Dominik Kusy; Michal Motyka; Carmelo Andujar; Matej Bocek; Michal Masek; Katerina Sklenarova; Filip Kokas; Milada Bocakova; Alfried P Vogler; Ladislav Bocak
Journal:  Front Zool       Date:  2018-05-02       Impact factor: 3.172

2.  Genome sequences identify three families of Coleoptera as morphologically derived click beetles (Elateridae).

Authors:  Dominik Kusy; Michal Motyka; Matej Bocek; Alfried P Vogler; Ladislav Bocak
Journal:  Sci Rep       Date:  2018-11-20       Impact factor: 4.379

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.