| Literature DB >> 35450165 |
Guo-Li Zheng1, Zheng-Xi Zhu1, Jia-le Cui1, Jie-Mei Yu1.
Abstract
GII.2[P16] and GII.4 Sydney [P16] are currently the two predominant norovirus genotypes. This study sought to clarify their evolutionary patterns by analyzing the major capsid VP1 and RNA-dependent RNA polymerase (RdRp) genes. Sequence diversities were analyzed at both nucleotide and amino acid levels. Selective pressures were evaluated with the Hyphy package in different models. Phylogenetic trees were constructed by the maximum likelihood method from full VP1 sequences, and evolutionary rates were estimated by the Bayesian Markov Chain Monte Carlo approach. The results showed that (1) several groups of tightly linked mutations between the RdRp and VP1 genes were detected in the GII.2[P16] and GII.4[P16] noroviruses, and most of these mutations were synonymous, which may lead to a better viral fitness to the host; (2) although the pattern of having new GII.4 variants every 2-4 years has been broken, both the pre- and the post-2015 Sydney VP1 had comparable evolutionary rates to previously epidemic GII.4 variants, and half of the major antigenic sites on GII.4 Sydney had residue substitutions and several caused obvious changes in the carbohydrate-binding surface that may potentially alter the property of the virus; and (3) GII.4 Sydney variants during 2018-21 showed geographical specificity in East Asia, South Asia, and North America; the antigenic sites of GII.2 are strictly conserved, but the GII.2 VP1 chronologically evolved into nine different sublineages over time, with sublineage IX being the most prevalent one since 2018. This study suggested that both VP1 and RdRp of the GII.2[P16] and GII.4 Sydney [P16] noroviruses exhibited different evolutionary directions. GII.4[P16] is likely to generate potential novel epidemic variants by accumulating mutations in the P2 domain, similar to previously epidemic GII.4 variants, while GII.2[P16] has conserved predicted antigenicity and may evolve by changing the properties of nonstructural proteins, such as polymerase replicational fidelity and efficiency. This study expands the understanding of the evolutionary dynamics of GII.2[P16] and GII.4[P16] noroviruses and may predict the emergence of new variants.Entities:
Keywords: GII.2[P16]; GII.4[P16]; evolutionary pattern; genetic diversity; human norovirus; selective pressure
Year: 2022 PMID: 35450165 PMCID: PMC9019527 DOI: 10.1093/ve/veac030
Source DB: PubMed Journal: Virus Evol ISSN: 2057-1577
Numbers of combined genotype or P-type of the downloaded sequences.
| GII.2 VP1 | GII.4 Sydney VP1 | GII.P16 | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Before 2016 | After 2016 | Before 2015 | After 2015 | Before 2016 | After 2016 | ||||||
| P-type |
| P-type |
| P-type |
| P-type |
| Genotype |
| Genotype |
|
| GII.P2 | 44 (37.6) | GII.P2 | 27 (4.8) | GII.P4 | 1 (0.7) | GII.P4 | 15 (4.3) | GII.2 | 54 (47.8) | GII.1 | 12 (2.1) |
| GII.P12 | 1 (0.9) | GII.P16 | 535 (95.2) | GII.P16 | 23 (16.2) | GII.P16 | 182 (52.6) | GII.3 | 15 (13.3) | GII.2 | 413 (71.8) |
| GII.P16 | 56 (47.9) | GII.P31 | 118 (83.1) | GII.P31 | 145 (41.9) | GII.4 | 12 (10.6) | GII.3 | 11 (1.9) | ||
| GII.P21 | 2 (1.7) | GII.P12 | 3 (0.9) | GII.5 | 1 (0.9) | GII.4 | 119 (20.7) | ||||
| GII.P30 | 3 (2.7) | GII.P13 | 1 (0.3) | GII.13 | 23 (20.4) | GII.12 | 11 (1.9) | ||||
| GII.P31 | 9 (7.7) | GII.16 | 6 (5.3) | GII.13 | 9 (1.6) | ||||||
| GII.P34 | 1 (0.9) | GII.17 | 2 (1.8) | ||||||||
| GII.P35 | 1 (0.9) | ||||||||||
Figure 1.Information for the SNPs and amino acid substitutions. (A) Numbers of SNPs in the VP1 and RdRp genes. GII.4 VP1 was more diversified than GII.2 VP1. GII.P16 RdRp associated with GII.2 had more SNPs than that associated with GII.4 but had less of high-frequency (>50 per cent) SNPs. (B) High-frequency SNP sites on the GII.4-associated GII.P16 RdRp. Six of the twenty-two sites showed very low frequencies or even no mutation in the GII.2-associated GII.P16 RdRp. (C) Geographic distributions of the sequences involved in the changed amino acid sites in the GII.2- and GII.4-associated GII.P16 RdRp. The GII.2-associated RdRp sequences were mostly from East Asia, and the GII.4-associated RdRp sequences were mostly from North America.
Tightly linked mutations on RdRp and VP1 genes of GII.2- and GII.4-associated GII.P16 norovirus.
| RdRp | VP1 | ||||||
|---|---|---|---|---|---|---|---|
| Groups (frequency) | nt site | Mutation | aa change | nt site | Mutation | aa change | |
| GII.4 VP1—GII.P16 RdRp | Group 1 (31%) | 276 | T-A | 92 (–) | 438 | A-G | 146 (–) |
| Group 2 (27%) | 279 | C-T | 93 (–) | 1252 | C-T | 418 (–) | |
| Group 3 (43%) | 1370 | A-G | K457R | 732 | T-C | 243 (–) | |
| 753 | G-A | 251 (–) | |||||
| 834 | G-T | 278 (–) | |||||
| GII.2 VP1—GII.P16 RdRp | Group 4 (31%) | 126 | G-A | 42 (–) | 387 | G-A | 129 (–) |
| 1224 | T-C | 408 (–) | |||||
| Group 5 (12%) | 165 | T-C | 55 (–) | 1212 | A-G | 404 (–) | |
| Group 6 (38%) | 171 | T-C | 57 (–) | 126 | C-T | 42 (–) | |
Notes: ‘nt’, nucleotide. ‘aa’, amino acid. ‘–’ stands for a synonymous mutation. Sites on the RdRp and VP1 that are in the same row with the same color were linked.
Figure 2.Informative amino acid substitutions on the VP1 and RdRp. (A, B) Pre-2016 and post-2016 GII.2 VP1. (C, D) Pre-2015 and post-2015 GII.4 Sydney VP1. (E, F) GII.P16 RdRp associated with GII.2 and GII.4 Sydney VP1. Multiple changed sites of VP1 were located on major antigenic motifs. The changed sites with >5 per cent frequencies in the GII.2-associated and GII.4-associated RdRp were totally distinct. Antigenic sites on the GII.2 that extrapolated from GII.4 were marked with ‘▲’, the major antigenic sites on GII.4 Sydney were marked with ‘↑’, and amino acid substitutions with frequencies of >5 per cent on the GII.P16 RdRp were marked with ‘●’.
Positively selected sites in the VP1 and RdRp genes by different methods.
| Methods | |||
|---|---|---|---|
| Genotype or P-type of the sequences | FUBAR | MEME | SLAC |
| GII.2 VP1 1971–2020 |
|
|
|
| GII.4 VP1 2011–14 | 6, 9, | 6, 8, 9, |
|
| GII.4 VP1 2015–21 |
|
|
|
| GII.2-combined GII.P16 RdRp |
| 59, 270, 274, |
|
| GII.4-combined GII.P16 RdRp | – | 137 | – |
| other genotype-combined GII.P16 RdRp | – | 138 | – |
Note: ‘MEME’, ‘FUBAR’, and ‘SLAC’ represent the ‘mixed-effects model of evolution’, the ‘fast unbiased Bayesian approximation’, and the ‘single-likelihood ancestor counting’, respectively. ‘–’ stands for ‘none’. Strong positively selected sites are marked in bold and those located at major antigenic sites are in italics and underlined.
Figure 3.Phylogenic analysis using VP1 nucleotide sequences of GII.2 and GII.4 Sydney by the maximum likelihood method and spatial distributions of the cluster/subclusters in the GII.4 Sydney norovirus. Sequences of the early GII.2 VP1 in the 1970s and the previously reported epidemic GII.4 norovirus strains (New Orleans, DenHaag, Hunter, Farmington, Grimsby, and Osaka) were used as their roots, respectively. (A) GII.2 VP1. The sequences from 2018–20 evolved into nine determined sublineages. (B) GII.4 Sydney VP1. The sequences evolved into three lineages (I–III). (C) Geographical distributions of the clusters in the GII.4 Sydney VP1. Lineage I only contained eleven sequences from North America, lineage III contained most sequences from South Asia, and Lineage II from North America. (D) Geographical distributions of the latest GII.4 Sydney VP1. The sequences from East Asia mainly distributed in sublineage III-1, while the ones from North America and South Asia mostly distributed in sublineage II-1.
Figure 4.Evolutionary rate comparison of VP1 and RdRp genes between different groups. The mean rate of VP1 for GII.4 Sydney was higher than that for GII.2, and the rates of the pre-2016 GII.2 VP1, pre-2015 GII.4 Sydney VP1, and GII.2-associated GII.P16 RdRp were, respectively, comparable to those for the post-2016 GII.2 VP1, post-2015 GII.4 Sydney VP1, and GII.4-associated GII.P16 RdRp.
Figure 5.Spatial structures of the major epitopes on GII.4 Sydney norovirus. (A) Wild-type GII.4 Sydney VP1 (from the USA in 2015); (B–G) GII.4 Sydney norovirus mutants. Substitutions of T294A, G295N, R297H/Q, and H373R in epitope A, N412K/S/D and H414L/P in epitope E, and H396P in epitope D had greatly altered the binding surface, while E368Q, D372N, and H373N in epitope A, S393G in epitope D, and Y352L and A359S in epitope G exhibited marginal effects. The regions with amino acid change were marked in color.
Figure 6.Seven organized motifs on the GII.P16 RdRp. Motifs A, B, D, and E were strictly conserved in different P-types of RdRp. Site 163 in F motif and 337 in C were different in some P-types. Numbers of GII.P16 RdRps from 2016–9 and GII.P2 had the same amino acid at site 121 in motif G.