| Literature DB >> 34698524 |
Daniel Novikov1, Sergey Knyazev1, Mark Grinshpon2, Pelin Icer1, Pavel Skums1, Alex Zelikovsky1,3.
Abstract
This article presents a novel scalable character-based phylogeny algorithm for dense viral sequencing data called SPHERE (Scalable PHylogEny with REcurrent mutations). The algorithm is based on an evolutionary model where recurrent mutations are allowed, but backward mutations are prohibited. The algorithm creates rooted character-based phylogeny trees, wherein all leaves and internal nodes are labeled by observed taxa. We show that SPHERE phylogeny is more stable than Nextstrain's, and that it accurately infers known transmission links from the early pandemic. SPHERE is a fast algorithm that can process >200,000 sequences in <2 hours, which offers a compact phylogenetic visualization of Global Initiative on Sharing All Influenza Data (GISAID).Entities:
Keywords: SARS-CoV-2 sequences; phylogenetic tree inference; recurrent mutations
Mesh:
Year: 2021 PMID: 34698524 PMCID: PMC8819518 DOI: 10.1089/cmb.2021.0306
Source DB: PubMed Journal: J Comput Biol ISSN: 1066-5277 Impact factor: 1.479