| Literature DB >> 31396256 |
Ming Tan1, Haixia Long2, Bo Liao1,2, Zhi Cao1, Dawei Yuan3, Geng Tian3, Jujuan Zhuang4, Jialiang Yang2,5.
Abstract
Phylogenetic networks are used to estimate evolutionary relationships among biological entities or taxa involving reticulate events such as horizontal gene transfer, hybridization, recombination, and reassortment. In the past decade, many phylogenetic tree and network reconstruction methods have been proposed. Despite that they are highly accurate in reconstructing simple to moderate complex reticulate events, the performance decreases when several reticulate events are present simultaneously. In this paper, we proposed QS-Net, a phylogenetic network reconstruction method taking advantage of information on the relationship among six taxa. To evaluate the performance of QS-Net, we conducted experiments on three artificial sequence data simulated from an evolutionary tree, an evolutionary network involving three reticulate events, and a complex evolutionary network involving five reticulate events. Comparison with popular phylogenetic methods including Neighbor-Joining, Split-Decomposition, Neighbor-Net, and Quartet-Net suggests that QS-Net is comparable with other methods in reconstructing tree-like evolutionary histories, while it outperforms them in reconstructing reticulate events. In addition, we also applied QS-Net in real data including a bacterial taxonomy data consisting of 36 bacterial species and the whole genome sequences of 22 H7N9 influenza A viruses. The results indicate that QS-Net is capable of inferring commonly believed bacterial taxonomy and influenza evolution as well as identifying novel reticulate events. The software QS-Net is publically available at https://github.com/Tmyiri/QS-Net.Entities:
Keywords: bacterial taxonomy; influenza reassortment; phylogenetic network; reticulate evolution; sextet
Year: 2019 PMID: 31396256 PMCID: PMC6667645 DOI: 10.3389/fgene.2019.00607
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Figure 1Phylogenetic tree: a phylogenetic tree for illustration and a phylogenetic tree with 12 leaves. (A) A phylogenetic tree for illustration with the branch length indicating evolutionary distance. (B) A phylogenetic tree with 12 leaves used to generate the first simulation data.
Comparison of accuracy (the total bootstrap value obtained from the experimental results is divided by the bootstrap BV value) between QS-Net and four other methods.
| Data set | QS-Net | Quartet-Net | Neighbor-Net | Split-Decomposition | Neighbor-Joining |
|---|---|---|---|---|---|
| Tree | 100% | 100% | 100% | 100% | 100% |
| Network (3) | 100% | 100% | 70.16% | 67.24% | 36% |
| Network (5) | 100% | 94.74% | 58.89% | 46.76% | 23.68% |
Network (3) is the phylogenetic network with three reticulate events, while Network (5) is the phylogenetic network with five reticulate events.
The number of true-positive results can be obtained by five methods.
| Data set | True | QS-Net | Quartet-Net | Neighbor-Net | Split-Decomposition | Neighbor-Joining |
|---|---|---|---|---|---|---|
| Tree | 9 | 9 | 9 | 9 | 9 | 9 |
| Network (3) | 25 | 25 | 25 | 21 | 23 | 9 |
| Network (5) | 38 | 38 | 36 | 30 | 22 | 11 |
The “True” column represents the real number of true-positive splits of the simulated data.
The number of false-positive results obtained by five methods.
| Data set | QS-Net | Quartet-Net | Neighbor-Net | Split-Decomposition | Neighbor-Joining |
|---|---|---|---|---|---|
| Tree | 2 | 2 | 35 | 4 | 0 |
| Network (3) | 4 | 4 | 16 | 1 | 0 |
| Network (5) | 4 | 4 | 4 | 1 | 0 |
Figure 2Phylogenetic network with 3/5 reticulate events. (A) A phylogenetic network with three reticulate events A, B, and C. (B) A phylogenetic network with five reticulate events A, B, C, D, and E.
A comparison of runtime between QS-Net and Quartet-Net on all data sets.
| Method | Tree | Network (3) | Network (5) | GC rich | GC poor and rich | Influenza |
|---|---|---|---|---|---|---|
| QS-Net | 1.25 s | 6.02 s | 24.39 s | 0.92 s | 9.49 min | 3.22 min |
| Quartet-Net | 0.20 s | 1.05 s | 4.05 s | 0.19 s | 10.17 s | 4.54 s |
Figure 3The reconstructed QS-Net network of 11 GC-rich bacteria.
Figure 4The reconstructed network on 25 GC-poor or GC-rich bacteria. (A) The reconstructed QS-Net network of 25 GC-poor or GC-rich bacteria. (B) The reconstructed Quartet-Net network of the 25 bacteria.
The number of full splits reconstructed by five methods on bacterial data set and the influenza data set.
| Data set | QS-Net | Quartet-Net | Neighbor-Net | Split-Decomposition | Neighbor-Joining |
|---|---|---|---|---|---|
| GC rich | 26 | 22 | 29 | 23 | 19 |
| GC poor and rich | 48 | 45 | 77 | 48 | 47 |
| Influenza | 47 | 45 | 68 | 36 | 41 |
Figure 5The reconstructed network on influenza data. (A) The reconstructed Quartet-Net network related to H7N9 influenza A viruses. (B) The reconstructed QS-Net network related to H7N9 influenza A viruses.