| Literature DB >> 35643449 |
Teemu Smura1,2, Ravi Kant3,1, Ilya Plyusnin4,5, Phuoc Thien Truong Nguyen1, Tarja Sironen3,1, Olli Vapalahti3,1,2.
Abstract
BACKGROUND: SARS-CoV-2 is the highly transmissible etiologic agent of coronavirus disease 2019 (COVID-19) and has become a global scientific and public health challenge since December 2019. Several new variants of SARS-CoV-2 have emerged globally raising concern about prevention and treatment of COVID-19. Early detection and in-depth analysis of the emerging variants allowing pre-emptive alert and mitigation efforts are thus of paramount importance.Entities:
Keywords: Cluster analysis; Phylogenetic analysis; SARS-CoV-2; Variant calling; Virus
Mesh:
Year: 2022 PMID: 35643449 PMCID: PMC9143711 DOI: 10.1186/s12859-022-04709-8
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.307
Fig. 1ClusTRace flowchart. VFT, VeryFastTree
Fig. 2Amino acid mutations for Finnish Alpha (A) and Beta (B) datasets. Plotting all mutations found in at least ten sequences in Alpha (5379 sequences) and Beta (1051 sequences). Mutations that have been reported as characteristic for a given lineage [37, 38] are plotted in purple, all other mutations are plotted in green. Numbers in cirles indicate the number of sequences with the given mutation. Graphics were created with the ClusTRace interface to g3viz [36]
Fig. 3Consensus tree for Finnish Alpha dataset with clusters collapsed. Bar plots on the right indicate the number of sequences in each cluster. For clarity, clusters with less than ten sequences and singletons were removed. Inner nodes with no large cluster descendants are plotted in grey
Fig. 4Consensus tree for Finnish Beta dataset with clusters collapsed. Bar plots on the right indicate the number of sequences in each cluster. For clarity, clusters with less than ten sequences were removed
Fig. 5Growth rates for the ten fastest growing clusters in Alpha (B.1.1.7)
Mutations in Alpha (B.1.1.7) clusters
| Cluster | S | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| D80A | D215G | 241_243del | K417N | E484K | N501Y | D614G | A701V | ||||
| *GISAID B.1.351 | 0 | 0 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 |
| Clust 10 | 0 | 1 | 1 | 1 | 1 | 0 | 1 | 1 | 1 | 1 | 0 |
| Clust 17 | 0 | 1 | 1 | 1 | 1 | 0 | 1 | 1 | 1 | 1 | 0 |
| Clust 19 | 0 | 1 | 1 | 0 | 0 | 0 | 1 | 1 | 1 | 1 | 0 |
| Clust 9 | 0 | 1 | 1 | 1 | 1 | 0 | 1 | 1 | 1 | 1 | 1 |
| Clust 4 | 0 | 0 | 1 | 1 | 1 | 0 | 1 | 1 | 1 | 1 | 0 |
| Clust 1 | 1 | 0 | 1 | 1 | 1 | 0 | 1 | 1 | 1 | 1 | 0 |
| Clust 8 | 0 | 1 | 1 | 1 | 1 | 0 | 1 | 1 | 1 | 1 | 0 |
| Clust 18 | 0 | 1 | 1 | 1 | 1 | 0 | 0 | 0 | 1 | 1 | 0 |
| Clust 3 | 0 | 0 | 1 | 1 | 1 | 0 | 1 | 1 | 1 | 1 | 0 |
| Clust 7 | 0 | 0 | 1 | 1 | 1 | 0 | 1 | 1 | 1 | 1 | 0 |
Here, aa mutations with frequencies exceeding 50% are listed in genomic order
*The first row depicts mutations characteristic for B.1.1.7 according to the lineage report [37]
Fig. 6Growth rates for the ten fastest growing clusters in Beta (B.1.351)
Mutations in Beta clusters
| Cluster | S | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| D80A | D215G | 241_243del | K417N | E484K | N501Y | D614G | A701V | ||||
| *GISAID B.1.351 | 0 | 0 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 |
| Clust 10 | 0 | 1 | 1 | 1 | 1 | 0 | 1 | 1 | 1 | 1 | 0 |
| Clust 17 | 0 | 1 | 1 | 1 | 1 | 0 | 1 | 1 | 1 | 1 | 0 |
| Clust 19 | 0 | 1 | 1 | 0 | 0 | 0 | 1 | 1 | 1 | 1 | 0 |
| Clust 9 | 0 | 1 | 1 | 1 | 1 | 0 | 1 | 1 | 1 | 1 | 1 |
| Clust 4 | 0 | 0 | 1 | 1 | 1 | 0 | 1 | 1 | 1 | 1 | 0 |
| Clust 1 | 1 | 0 | 1 | 1 | 1 | 0 | 1 | 1 | 1 | 1 | 0 |
| Clust 8 | 0 | 1 | 1 | 1 | 1 | 0 | 1 | 1 | 1 | 1 | 0 |
| Clust 18 | 0 | 1 | 1 | 1 | 1 | 0 | 0 | 0 | 1 | 1 | 0 |
| Clust 3 | 0 | 0 | 1 | 1 | 1 | 0 | 1 | 1 | 1 | 1 | 0 |
| Clust 7 | 0 | 0 | 1 | 1 | 1 | 0 | 1 | 1 | 1 | 1 | 0 |
Annotation as in Table 1