| Literature DB >> 35903557 |
Abstract
Phylogenetic trees are essential to evolutionary biology, and numerous methods exist that attempt to extract phylogenetic information applicable to a wide range of disciplines, such as epidemiology and metagenomics. Currently, the three main Python packages for trees are Bio.Phylo, DendroPy, and the ETE Toolkit, but as dataset sizes grow, parsing and manipulating ultra-large trees becomes impractical for these tools. To address this issue, we present TreeSwift, a user-friendly and massively scalable Python package for traversing and manipulating trees that is ideal for algorithms performed on ultra-large trees.Entities:
Keywords: Phylogenetics; Python; Scalable; Tree traversal
Year: 2020 PMID: 35903557 PMCID: PMC9328415 DOI: 10.1016/j.softx.2020.100436
Source DB: PubMed Journal: SoftwareX
Fig. 1.Runtimes of DendroPy, Bio.Phylo, the ETE Toolkit, and TreeSwift for a wide range of typical tree operations using trees of various sizes, as well as memory consumption after loading a tree (see Section 3 for details).
Fig. 2.Example Lineage-Through-Time (LTT) plot generated using TreeSwift.
| Current code version | 1.1.3 |
| Permanent link to code/repository used for this code version | |
| Legal Code License | GNU GPL v3.0 |
| Code versioning system used | git |
| Software code languages, tools, and services used | Python |
| Compilation requirements, operating environments & dependencies | Python, pip (optionally matplotlib for LTT plot visualization) |
| If available Link to developer documentation/manual | |
| Support email for questions |