| Literature DB >> 35242782 |
Ricardo Ariel Zimerman1, Patrícia Aline Gröhs Ferrareze2, Flavio Adsuara Cadegiani3, Carlos Gustavo Wambier4, Daniel do Nascimento Fonseca5, Andrea Roberto de Souza5, Andy Goren6, Liane Nanci Rotta2, Zhihua Ren7, Claudia Elizabeth Thompson2,8.
Abstract
BACKGROUND: P.1 lineage (Gamma) was first described in the State of Amazonas, northern Brazil, in the end of 2020, and has emerged as a very important variant of concern (VOC) of SARS-CoV-2 worldwide. P.1 has been linked to increased infectivity, higher mortality, and immune evasion, leading to reinfections and potentially reduced efficacy of vaccines and neutralizing antibodies.Entities:
Keywords: COVID-19; NSP; P.1; SARS-CoV-2; VOC; genomic epidemiology; phylogenomics; spike
Year: 2022 PMID: 35242782 PMCID: PMC8885995 DOI: 10.3389/fmed.2022.806611
Source DB: PubMed Journal: Front Med (Lausanne) ISSN: 2296-858X
Figure 1Flowchart of SARS-CoV-2 RT-qPCR and genome sequencing.
Figure 2Mutations of the SARS-CoV-2 genomes from the Amazonas state, Northern Brazil sampled in February and March 2021. (A) Genome map for the 44 genomes sequenced. Nucleotide substitutions are colored in red and blank regions represent low sequencing coverage. (B) Frequency of SNPs per SARS-CoV-2 genome position along the 44 genomes. These mutations are corresponding to the red lines in (A) and only missense substitutions represented by >10 sequences have their respective amino acid changes indicated above the bars. Main Open Reading Frames (ORFs) and SARS-CoV-2 proteins are indicated at the bottom to allow a rapid visualization of the viral proteins affected.
Frequency of spike mutations identified in the P.1 sequenced genomes (this study) compared with sequences from Amazonas, Brazil and World (up to September 12, 2021) available on GISAID.
|
| ||||
|---|---|---|---|---|
| L18F | 100.00 (44) | 92.72 (777) | 99.03 (19,633) | 94.69 (50,703) |
| T20N | 97.73 (43) | 91.88 (770) | 97.87 (19,403) | 94.38 (50,540) |
| P26S | 100.00 (44) | 92.72 (777) | 98.69 (19,566) | 94.66 (50,689) |
| D138Y | 97.73 (43) | 92.24 (773) | 93.85 (18,607) | 94.82 (50,771) |
| N188del | 6.82 (3) | 0.00 (0) | 0.00 (0) | 0.002 (1) |
| L189del | 6.82 (3) | 0.00 (0) | 0.06 (12) | 0.009 (5) |
| R190S | 95.45 (42) | 91.05 (763) | 97.81 (19,391) | 90.96 (48,708) |
| P209H | 2.27 (1) | 0.00 (0) | 0.00 (0) | 0.00 (0) |
| D215G | 2.27 (1) | 1.07 (9) | 0.02 (4) | 0.009 (5) |
| A243del | 2.27 (1) | 0.24 (2) | 0.05 (10) | 0.06 (31) |
| L244del | 2.27 (1) | 0.24 (2) | 0.05 (10) | 0.06 (34) |
| K417T | 97.73 (43) | 93.08 (780) | 92.26 (18,292) | 96.53 (51,690) |
| E484K | 100.00 (44) | 92.96 (779) | 95.46 (18,926) | 95.57 (51,177) |
| N501Y | 100.00 (44) | 92.84 (778) | 95.34 (18,902) | 95.92 (51,360) |
| D614G | 100.00 (44) | 99.28 (832) | 99.37 (19,702) | 99.41 (53,234) |
| H655Y | 97.73 (43) | 98.45 (825) | 99.42 (19,711) | 92.47 (52,730) |
| A688V | 4.54 (2) | 0.00 (0) | 0.18 (36) | 5.34 (2,862) |
| T1027I | 97.73 (43) | 98.33 (824) | 99.52 (19,730) | 96.07 (51,445) |
| T1066A | 2.27 (1) | 0.00 (0) | 0.00 (0) | 0.00 (0) |
| V1176F | 100.00 (44) | 99.40 (833) | 99.25 (19,677) | 97.16 (52,029) |
AM, Amazonas; BR, Brazil. The genomes sequenced in this study are not included in AM counts. AM counts are not included in BR and BR counts are not included in World counts. This analysis included 74,255 P.1 genome sequences available on the GISAID database up to September 12, 2021. Mutations were verified on the GISAID database on September 22, 2021.
Frequency of ORF1a mutations identified in the P.1 sequenced genomes (this study) compared with sequences from Amazonas, Brazil and World (up to September 12, 2021) available on GISAID.
|
|
| ||||
|---|---|---|---|---|---|
| NSP1:T170I | ORF1a:T170I | 2.27 (1) | 0.00 (0) | 0.00 (0) | 0.01 (4) |
| NSP2:N9S | ORF1a:N189S | 2.27 (1) | 0.00 (0) | 0.00 (0) | 0.00 (0) |
| NSP2:L113F | ORF1a:L293F | 2.27 (1) | 0.24 (2) | 0.02 (3) | 0.01 (7) |
| NSP2:L400F | ORF1a:L580F | 2.27 (1) | 0.00 (0) | 0.10 (20) | 0.06 (30) |
| NSP2:K456R | ORF1a:K636R | 2.27 (1) | 0.60 (5) | 0.03 (5) | 0.004 (2) |
| NSP2:V469F | ORF1a:V649F | 2.27 (1) | 0.00 (0) | 0.05 (9) | 0.03 (17) |
| NSP3:A41V | ORF1a:A859V | 2.27 (1) | 0.12 (1) | 0.07 (14) | 0.04 (14) |
| NSP3:T133I | ORF1a:T951I | 25.00 (11) | 11.79 (99) | 1.59 (316) | 0.99 (530) |
| NSP3:T186P | ORF1a:T1004P | 2.27 (1) | 0.60 (5) | 0.02 (4) | 0.002 (1) |
| NSP3:A231V | ORF1a:A1049V | 2.27 (1) | 0.12 (1) | 0.37 (74) | 0.04 (19) |
| NSP3:S370L | ORF1a:S1188L | 97.73 (43) | 95.95 (806) | 97.63 (19,358) | 97.60 (52,286) |
| NSP3:K977Q | ORF1a:K1795Q | 56.82 (25) | 99.29 (834) | 99.80 (19,788) | 99.89 (52,976) |
| NSP3:T1189I | ORF1a:T2007I | 2.27 (1) | 0.60 (5) | 0.17 (33) | 0.03 (14) |
| NSP3:T1365A | ORF1a:T2183A | 2.27 (1) | 0.12 (1) | 0.07 (14) | 0.01 (3) |
| NSP3:S1437F | ORF1a:S2255F | 2.27 (1) | 0.00 (0) | 0.03 (6) | 0.002 (1) |
| NSP3:S1670F | ORF1a:S2488F | 2.27 (1) | 0.00 (0) | 0.11 (21) | 0.21 (110) |
| NSP4:V30A | ORF1a:V2793A | 2.27 (1) | 0.00 (0) | 0.00 (0) | 0.00 (0) |
| NSP4:T83I | ORF1a:T2846I | 2.27 (1) | 0.60 (5) | 1.45 (288) | 1.26 (676) |
| NSP4:H313Y | ORF1a:H3076Y | 2.27 (1) | 0.00 (0) | 0.05 (9) | 0.04 (23) |
| NSP4:S481L | ORF1a:S3244L | 2.27 (1) | 0.24 (2) | 0.15 (29) | 0.10 (51) |
| NSP5:A70V | ORF1a:A3333V | 6.82 (3) | 2.14 (18) | 0.06 (12) | 0.002 (1) |
| NSP5:V86I | ORF1a:V3349I | 6.82 (3) | 2.14 (18) | 0.01 (1) | 0.00 (0) |
| NSP5:P241L | ORF1a:P3504L | 2.27 (1) | 0.24 (2) | 0.65 (128) | 0.13 (72) |
| NSP6:A46V | ORF1a:A3615V | 2.27 (1) | 0.00 (0) | 0.13 (25) | 0.02 (13) |
| NSP6:A51V | ORF1a:A3620V | 6.82 (3) | 2.14 (18) | 0.07 (14) | 0.03 (14) |
| NSP6:S106del | ORF1a:S3675del | 27.27 (12) | 98.33 (826) | 96.55 (19,142) | 97.22 (52,081) |
| NSP6:G107S | ORF1a:G3676S | 4.55 (2) | 0.24 (2) | 1.06 (210) | 0.81 (436) |
| NSP6:G107del | ORF1a:G3676del | 27.27 (12) | 98.45 (827) | 96.81 (19,195) | 97.23 (52,085) |
| NSP6:F108L | ORF1a:F3677L | 11.36 (5) | 0.24 (2) | 1.85 (367) | 1.02 (546) |
| NSP6:F108del | ORF1a:F3677del | 27.27 (12) | 98.57 (828) | 96.86 (19,205) | 97.21 (52,077) |
| NSP6:V149A | ORF1a:V3718A | 2.27 (1) | 0.60 (5) | 0.02 (4) | 0.01 (4) |
| NSP8:E155G | ORF1a:E4097G | 2.27 (1) | 0.00 (0) | 0.06 (11) | 0.01 (8) |
Seq., Sequenced; AM, Amazonas; BR, Brazil. The genomes sequenced in this study are not included in AM counts. AM counts are not included in BR and BR counts are not included in World counts. This analysis included 74,281 P.1 genome sequences available on the GISAID database up to September 12, 2021. Mutations were verified on the GISAID database on September 23, 2021. There is no data on GISAID for mutations in ORF9b (alternative N gene ORF).
First occurrence (by collection date) registered on GISAID for spike mutations identified in the sequenced genomes from this study.
|
|
|
|
|
|
|---|---|---|---|---|
| L18F | 2020-12-03 | 2020-09-22/PE | 2020-09-22/PE | 2020-12-17/Peru |
| T20N | 2020-12-03 | 2020-09-22/PE | 2020-09-22/PE | 2020-12-17/Peru |
| P26S | 2020-12-03 | 2020-09-22/PE | 2020-09-22/PE | 2020-12-17/Peru |
| D138Y | 2020-12-03 | 2020-10-01/PB | 2020-10-01/PB | 2020-12-17/Peru |
| N188del |
|
| – | 2021-05-25/Suriname |
| L189del |
| 2021-01-27/AL | 2021-01-27/AL | 2021-01-28/Taiwan |
| R190S | 2020-12-03 | 2020-10-01/PB | 2020-10-01/PB | 2020-12-17/Peru |
| P209H |
|
| – | – |
| D215G | 2021-03-10 | 2021-03-08/GO | 2021-03-08/GO | 2021-04-07/USA |
| A243del |
|
| 2021-02-23/SE | 2021-03-23/USA |
| L244del |
|
| 2021-02-23/SE | 2021-03-23/USA |
| K417T | 2020-12-03 | 2020-09-22/PE | 2020-09-22/PE | 2020-12-17/Peru |
| E484K | 2020-12-03 | 2020-10-01/PB | 2020-10-01/PB | 2020-11-11/USA |
| N501Y | 2020-12-03 | 2020-10-01/PB | 2020-10-01/PB | 2020-12-17/Peru |
| D614G | 2020-12-03 | 2020-09-11/SP | 2020-09-11/SP | 2020-11-11/USA |
| H655Y | 2020-12-03 | 2020-09-11/SP | 2020-09-11/SP | 2020-12-17/Peru |
| A688V | 2021-02-22 | 2021-01-26/SP | 2021-01-26/SP | 2021-02-15/Peru |
| T1027I | 2020-12-03 | 2020-09-11/SP | 2020-09-11/SP | 2020-11-11/USA |
| T1066A |
|
| – | – |
| V1176F | 2020-12-03 | 2020-09-11/SP | 2020-09-11/SP | 2020-12-17/Peru |
Only complete collection dates were considered for this analysis. Date verification on September 23, 2021. The first occurrence in the world of some mutations that were identified in the genomes sequenced in this study are highlighted in bold.
According to the GISAID database, there are three P.1 genomes from the USA carrying these mutations with collection dates from April and May, 2020 (before their first occurrence in Brazil). However, we do not find any publication confirming this USA origin of P.1 lineage. AL, Alagoas; AM, Amazonas; GO, Goiás; PB, Paraíba; PE, Pernambuco; SE, Sergipe; SP, São Paulo.
Figure 3Genetic distance plot of the 44 sequenced genomes in comparison with the NC_045512.2 reference genome (Wuhan). (A) Genomic sequence identity. (B) Sequence identity for the spike gene region.
Figure 4Maximum Likelihood tree of 1,887 unique genomes from the North region—Brazil, added to the 44 sequenced samples of this study. Pangolin lineages represented by
Figure 5Phylogenetic analyses of spike and non-structural proteins from ORF1a including northern Brazilian SARS-CoV-2 genomes. (A) spike, (B) NSP1, (C) NSP3, (D) NSP5, (E) NSP6, and (F) NSP9.
Fast Unbiased AppRoximate Bayesian (FUBAR) analysis of pervasive diversifying selection on spike protein.
|
|
|
|
|
|---|---|---|---|
|
|
|
|
|
|
|
|
|
|
| 29 | 0.917 | 6.771 | 0.9244 |
| 49 | 1.045 | 8.832 | 0.9502 |
| 67 | 0.932 | 5.622 | 0.9036 |
| 95 | 0.920 | 6.822 | 0.9248 |
| 96 | 1.246 | 7.541 | 0.9075 |
|
|
|
|
|
| 222 | 0.894 | 8.019 | 0.9562 |
| 257 | 0.922 | 6.516 | 0.9204 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 613 | 1.266 | 12.121 | 0.9555 |
| 653 | 0.932 | 5.622 | 0.9036 |
|
|
|
|
|
| 677 | 1.246 | 8.376 | 0.9173 |
|
|
|
|
|
| 689 | 1.067 | 6.999 | 0.9136 |
| 769 | 1.361 | 8.428 | 0.9111 |
| 1174 | 0.932 | 5.623 | 0.9037 |
|
|
|
|
|
| 1264 |
|
|
|
Underlined data indicate codons also identified by FEL/MEME with positive selection evidence. Highlighted in bold are the mutations found in the 44 genomes of this study.
Fast Unbiased AppRoximate Bayesian (FUBAR) analysis of pervasive diversifying selection on non-structural proteins (NSPs) from ORF1a.
|
|
|
|
|
|
|
|---|---|---|---|---|---|
|
|
|
|
|
|
|
| NSP1 | 43 | 43 | 0.719 | 6.085 | 0.9032 |
| NSP1 | 77 | 77 | 1.102 | 13.438 | 0.9401 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1211 |
|
|
|
|
| NSP3 | 1228 | 2046 | 1.530 | 13.043 | 0.9525 |
| NSP3 | 1610 | 2428 | 1.229 | 11.464 | 0.9183 |
|
|
|
|
|
|
|
| NSP5 | 93 | 3356 | 2.001 | 9.761 | 0.9024 |
|
|
|
|
|
|
|
| NSP7 | 71 | 3930 | 1.158 | 11.157 | 0.9525 |
|
|
|
|
|
|
|
|
|
| 4187 |
|
|
|
| NSP9 | 77 | 4217 | 1.818 | 9.274 | 0.9120 |
| NSP10 | 49 | 4302 | 1.476 | 14.744 | 0.9301 |
Underlined data indicate codons also identified by FEL/MEME with positive selection evidence. Highlighted in bold are the mutations found in the 44 genomes of this study.
Figure 6Collapsed tree representing the assigned lineages of the 4,997 subsampled genomes from global phylogeny.
Figure 7Global phylogeny of 4,997 subsampled genomes from P.1 and B.1.1.28 lineages.
Figure 8Representative subtree of the 44 sequenced samples of this study and their nearest grouped neighbors inside the P.1 clade. The branch support values are represented by the SH-aLRT and ultrafast bootstrap percentages ≥70.