| Literature DB >> 31791235 |
Yang Fang1, Chengcheng Liu2, Jiangyi Lin3, Xufeng Li1, Kambiz N Alavian4,5, Yi Yang6, Yulong Niu7.
Abstract
BACKGROUND: Phylogenetic species trees are widely used in inferring evolutionary relationships. Existing software and algorithms mainly focus on phylogenetic inference. However, less attention has been paid to intermediate steps, such as processing extremely large sequences and preparing configure files to connect multiple software. When the species number is large, the intermediate steps become a bottleneck that may seriously affect the efficiency of tree building.Entities:
Keywords: Automatic construction; Pipeline; Species tree
Mesh:
Substances:
Year: 2019 PMID: 31791235 PMCID: PMC6889546 DOI: 10.1186/s12862-019-1541-x
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
Comparison of phylogenetic tree construction software
a The automation refers to sequences download, preprocess, and preparation of configure files. Critical steps such as sequence alignment and tree construction can be manually adjusted with advanced options in PhySpeTree
Fig. 1The workflow of PhySpeTree. PhySpeTree retrieves and downloads corresponding sequences following with multiple sequence alignments, conserved blocks selection, and tree reconstruction. PhySpeTree also allows users to insert their own sequences of HCP or SSU rRNA. Output trees are in the “newick” format files. ①Automatic tree reconstruction ②Processing user-defined fasta files for unannotated organisms ③Reconstructing species trees with unannotated organisms
Run time test of PhySpe Tree
a The tree reconstruction pipeline was conducted by MAFFT (alignment), Gblocks (trim), and FastTree (tree building). Benchmark test was conducted with i7-4790 3.6GHz CPU (parallel on 6 threads) with 16GB memory on Fedora operating system. b "--auto" option was turn on in MAFFT. The alignment strategy was automatically chose according to the number and length of sequences
Consistency test of PhySpe Tree comparied with the updated tree of life [33]
a SSU rRNA sequences were retrieved by PhySpeTree, aligned by SINA, and tree reconsturction by RAxML (GTRCAT model). b HCP sequences were retrieved by PhySpTree, aligned by MUSCLE, and tree reconstruction by RAxML (PROTGAMMAJTTX model). c Normalized Robinson-Foulds (nRF) distance was calculated by ETE3 [10]
Fig. 2Extended tree of life with Lokiarchaeota. The Lokiarchaeum sp. GC14_75 was inserted into the tree of life. The bacterial, eukaryotic, and archaeal branches were clasped