| Literature DB >> 35474215 |
Junxian Ou1,2, Wendong Lan2, Xiaowei Wu2, Tie Zhao1, Biyan Duan1, Peipei Yang1, Yi Ren1, Lulu Quan2, Wei Zhao2, Donald Seto3, James Chodosh4, Zhen Luo1,5, Jianguo Wu6,7, Qiwei Zhang8,9,10.
Abstract
The current pandemic of COVID-19 is fueled by more infectious emergent Omicron variants. Ongoing concerns of emergent variants include possible recombinants, as genome recombination is an important evolutionary mechanism for the emergence and re-emergence of human viral pathogens. In this study, we identified diverse recombination events between two Omicron major subvariants (BA.1 and BA.2) and other variants of concern (VOCs) and variants of interest (VOIs), suggesting that co-infection and subsequent genome recombination play important roles in the ongoing evolution of SARS-CoV-2. Through scanning high-quality completed Omicron spike gene sequences, 18 core mutations of BA.1 (frequency >99%) and 27 core mutations of BA.2 (nine more than BA.1) were identified, of which 15 are specific to Omicron. BA.1 subvariants share nine common amino acid mutations (three more than BA.2) in the spike protein with most VOCs, suggesting a possible recombination origin of Omicron from these VOCs. There are three more Alpha-related mutations in BA.1 than BA.2, and BA.1 is phylogenetically closer to Alpha than other variants. Revertant mutations are found in some dominant mutations (frequency >95%) in the BA.1. Most notably, multiple characteristic amino acid mutations in the Delta spike protein have been also identified in the "Deltacron"-like Omicron Variants isolated since November 11, 2021 in South Africa, which implies the recombination events occurring between the Omicron and Delta variants. Monitoring the evolving SARS-CoV-2 genomes especially for recombination is critically important for recognition of abrupt changes to viral attributes including its epitopes which may call for vaccine modifications.Entities:
Mesh:
Substances:
Year: 2022 PMID: 35474215 PMCID: PMC9039610 DOI: 10.1038/s41392-022-00992-2
Source DB: PubMed Journal: Signal Transduct Target Ther ISSN: 2059-3635
Comparison of Spike protein amino acid mutations between the Omicron subvariants and other VOCs and VOIs
| Spike region | Position | Mutation (BA.1) | Frequency (BA.1) | Percentage | Mutation (BA.2) | Frequency (BA.2) | Percentage | Mutation in VOCs/VOIs | New mutation | |
|---|---|---|---|---|---|---|---|---|---|---|
| 19 | – | – | – | T19I | 2953 | 100.00% | – | – | ||
| 24 | – | – | – | deletion | 2513 | 85.10% | – | – | ||
| 25 | – | – | – | deletion | 2513 | 85.10% | – | – | ||
| 26 | – | – | – | deletion | 2513 | 85.10% | – | – | ||
| 27 | – | – | – | A27S | 2513 | 85.10% | – | – | ||
| 67 | A67V | 49,475 | 99.73% | – | – | – | – | – | ||
| 69 | Deletion | 49,385 | 99.55% | – | – | – | Alpha(del69) | – | ||
| 70 | Deletion | 49,382 | 99.54% | – | – | – | Alpha(del70) | – | ||
| 95 | T95I | 49,513 | 99.81% | – | – | – | Mu(T95I) | – | ||
| NTD | 142 | G142D | 49,424 | 99.63% | G142D | 2934 | 99.36% | – | – | |
| 143 | Deletion | 49,439 | 99.66% | – | – | – | – | – | ||
| 144 | Deletion | 49,439 | 99.66% | – | – | – | Alpha(del144), Mu(Y144S) | – | ||
| 145 | Deletion | 49,440 | 99.66% | – | – | – | Mu(Y145N) | – | ||
| 211 | Deletion | 47,840 | 96.43% | – | – | – | – | – | ||
| 212 | N212I | 47,839 | 96.43% | – | – | – | – | – | ||
| 213 | – | – | – | V213G | 2950 | 99.90% | – | – | ||
| 214 | insertE | 44,368 | 89.44% | – | – | – | – | – | ||
| 214 | insertP | 44,353 | 89.41% | – | – | – | – | – | ||
| – | 214 | insertE | 44,365 | 89.43% | – | – | – | – | – | |
| 339 | G339D | 48,729 | 98.23% | G339D | 2953 | 100.00% | – | – | ||
| 346 | R346K | 16,819 | 33.90% | – | – | – | Mu(R346K) | Yes | ||
| 371 | S371L | 48,297 | 97.36% | S371F | 2949 | 99.86% | – | – | ||
| S1 | 373 | S373P | 48,322 | 97.41% | S373P | 2952 | 99.97% | – | – | |
| 375 | S375F | 48,316 | 97.39% | S375F | 2951 | 99.93% | – | – | ||
| 376 | – | – | – | T376A | 2949 | 99.86% | – | – | ||
| 405 | – | – | – | D405N | 2949 | 99.86% | – | – | ||
| 408 | – | – | – | R408S | 2946 | 99.76% | – | – | ||
| 417 | K417N | 44,711 | 90.13% | K417N | 2952 | 99.97% | Beta(K417N), Gamma(K417T) | – | ||
| 440 | N440K | 46,470 | 93.67% | N440K | 2926 | 99.09% | – | – | ||
| RBD | 446 | G446S | 46,892 | 94.52% | – | – | – | – | – | |
| 452 | L452R | 897 | 1.81% | – | – | – | Delta(L452R) | Yes | ||
| 477 | S477N | 48,185 | 97.13% | S477N | 2952 | 99.97% | – | – | ||
| 478 | T478K | 48,320 | 97.40% | T478K | 2952 | 99.97% | Delta(T478K) | – | ||
| 484 | E484A | 48,024 | 96.81% | E484A | 2952 | 99.97% | Beta/Gamma/Mu (E484K) | – | ||
| 493 | Q493R | 47,999 | 96.75% | Q493R | 2953 | 100.00% | – | – | ||
| 496 | G496S | 47,965 | 96.69% | – | – | – | – | – | ||
| 498 | Q498R | 47,917 | 96.59% | Q498R | 2953 | 100.00% | – | – | ||
| 501 | N501Y | 47,933 | 96.62% | N501Y | 2953 | 100.00% | Alpha/Beta/Gamma/Mu (N501Y) | – | ||
| – | 505 | Y505H | 47,888 | 96.53% | Y505H | 2952 | 99.97% | – | – | |
| 547 | T547K | 49,496 | 99.77% | – | – | – | – | – | ||
| 614 | D614G | 49,568 | 99.92% | D614G | 2953 | 100.00% | Alpha/Beta/Gamma/Delta (N501Y) | – | ||
| SD | 655 | H655Y | 49,509 | 99.80% | H655Y | 2953 | 100.00% | Gamma(H655Y) | – | |
| 679 | N679K | 49,523 | 99.83% | N679K | 2953 | 100.00% | – | – | ||
| – | 681 | P681H | 49,515 | 99.81% | P681H | 2953 | 100.00% | Alpha/Mu(P681H), Delta(P681R) | – | |
| 701 | A701V | 2729 | 5.50% | – | – | – | Beta(A701V) | Yes | ||
| FP | 764 | N764K | 49,046 | 98.87% | N764K | 2953 | 100.00% | – | – | |
| 796 | D796Y | 49,338 | 99.45% | D796Y | 2952 | 99.97% | – | – | ||
| S2 | – | 856 | N856K | 49,488 | 99.76% | – | – | – | – | – |
| 954 | Q954H | 49,559 | 99.90% | Q954H | 2953 | 100.00% | – | – | ||
| HR1 | 969 | N969K | 49,537 | 99.85% | N969K | 2953 | 100.00% | – | – | |
| 981 | L981F | 49,373 | 99.52% | – | – | – | – | – | ||
VOCs variants of concern, VOIs variants of interest
52,563 high-quality Omicron spike gene sequences (49,609 BA.1 sequences, and 2954 BA.2 sequences) released before 15 January 2022 were analyzed. The mutations that have appeared in more than 800 sequences were used in this analysis
Fig. 1Spike protein amino acid mutations of the Omicron subvariants (BA.1 and BA.2) compared with mutations from the other four variants of concern (VOCs). a Venn diagram noting mutations of Omicron (BA.1) and those of VOCs. b Venn diagram of Omicron (BA.2) mutations compared to ones of VOCs. c Venn diagram of mutations between Omicron (BA.1) and Omicron (BA.2). d Spike protein amino acid mutation counts of Omicron subvariants (BA.1 and BA.2) compared with mutations of VOCs
Novel mutations identified in the spike protein of the recently emerged Omicron subvariants (emerged before 15 January 2022; frequency > 50 sequences)
| Spike region | Position | New Mutation | Frequency | Percentage | Mutation in VOCs & VOIs | Early event occurrence time | First country emerging | |
|---|---|---|---|---|---|---|---|---|
| S1 | – | 5 | L5F | 184 | 0.37% | Iota(L5F) | 2021.11.19 | South Africa |
| NTD | 76 | T76I | 51 | 0.10% | Lambda (T76I) | 2021.11.26 | Netherlands | |
| 141 | L141F | 56 | 0.11% | – | 2021.11.19 | South Africa | ||
| 142 | G142Y | 51 | 0.10% | – | 2021.12.16 | USA | ||
| 156 | deletion | 68 | 0.14% | Delta(del156) | 2021.12.13 | USA | ||
| 157 | deletion | 71 | 0.14% | Delta(del157) | 2021.12.13 | USA | ||
| 158 | R158G | 69 | 0.14% | Delta(E158G) | 2021.12.15 | USA | ||
| RBD | 346 | R346K | 16819 | 33.90% | Mu(R346K) | 2021.11.4 | Canada | |
| 452 | L452R | 897 | 1.81% | Delta(L452R) | 2021.11.11 | South Africa | ||
| SD | 643 | F643L | 138 | 0.28% | – | 2021.11.29 | Thailand | |
| 681 | P681R | 62 | 0.12% | Delta(P681R) | 2021.11.23 | Botswana | ||
| S2 | – | 701 | A701V | 2729 | 5.50% | Beta(A701V) | 2021.11.10 | South Africa |
| – | 1081 | I1081V | 351 | 0.71% | – | 2021.11.18 | U.K. | |
| 1147 | S1147L | 120 | 0.24% | – | 2021.12.13 | USA | ||
| 1162 | P1162S | 60 | 0.12% | – | 2021.12.10 | Canada | ||
Fig. 2Structure of the Spike protein with amino acid mutations detected in Omicron BA.1 subvariant. a Structure of human ACE2 receptor complexed with SARS-CoV-2 Omicron RBD, mapped with the recent mutations. b Structure of SARS-CoV-2 Omicron spike protein mapped with the novel mutations. Mutated residues in each domain of the spike protein are annotated in color (red: RBD; yellow: NTD; green: S1/S2; blue: S2) using with Pymol 2.0 software through SARS-CoV-2 Omicron model PDB:7WBL and 7QO7
Fig. 3Phylogenetic network and scanning of the spike gene from representative Omicron subvariant sequences. a Representative Omicron spike protein haplotypes (each consisting of at least 50 sequences) were constructed with PopART using the median-joining method. Nucleotide changes were notated with lines. The spike gene from Wuhan-Hu-1 strain was set as the root. The number of sequences in each haplotype were modified into different orders of magnitude, and subgroups based on the mutation types were delineated by color. b BootScan analysis of revertant and representative haplotypes of Omicron spike gene. Representative spike Omicron haplotypes (H3, H4, H7) sequences and selected reversion haplotypes (H18, H39, H44) sequences are included. Bootscan map was constructed by Simplot 3.5.1 (https://sray.med.som.jhmi.edu/SCRoftware/SimPlot/) using neighboring-joining method with 100 bootstrap replicates. Wuhan-Hu-1 spike sequences were set as references and reversion region was annotated. c Overview of possible evolution mechanism of reversion haplotypes and haplotypes with mutations from Delta and other variants
Fig. 4Geographic distribution and whole genome analyses of “Delatcron”-like BA.1 subvariants (with S:L452R mutations). a Geographic distribution of “Delatcron”-like BA.1 subvariants (with S:L452R), with the number of genome sequences noted. b Whole genome annotation of amino acid mutations highlighting the “Deltacron”-like BA.1 subvariants (with S:L452R). 897 full genome sequences of BA.1 with S:L452R mutation were shown with red circles. c Phylogenetic analysis of the “Deltacron”-like BA.1 subvariants (with S:L452R). Maximum likelihood tree was constructed using Iqtree2 with 1000 bootstrap replicates and TIM + F + R3 model. Low quality sequences were excluded. 897 SARS-CoV-2 BA.1 full genome sequences with S:L452R mutation submitted to the GISAID database before 15 January 2022 and reference sequences from VOCs were included. Pango sublineages BA.1, BA.1.1, BA.1.15, and BA.1.17 are marked in colors