| Literature DB >> 35446251 |
Peter N Mwangi1, Nicola A Page2,3, Mapaseka L Seheri4, M Jeffrey Mphahlele4,5,6, Sandrama Nadan2, Mathew D Esona4, Benjamin Kumwenda7, Arox W Kamng'ona7, Celeste M Donato8,9,10, Duncan A Steele4, Valantine N Ndze11, Francis E Dennis12, Khuzwayo C Jere13,14, Martin M Nyaga1.
Abstract
The transient upsurge of G2P[4] group A rotavirus (RVA) after Rotarix vaccine introduction in several countries has been a matter of concern. To gain insight into the diversity and evolution of G2P[4] strains in South Africa pre- and post-RVA vaccination introduction, whole-genome sequencing was performed for RVA positive faecal specimens collected between 2003 and 2017 and samples previously sequenced were obtained from GenBank (n=103; 56 pre- and 47 post-vaccine). Pre-vaccine G2 sequences predominantly clustered within sub-lineage IVa-1. In contrast, post-vaccine G2 sequences clustered mainly within sub-lineage IVa-3, whereby a radical amino acid (AA) substitution, S15F, was observed between the two sub-lineages. Pre-vaccine P[4] sequences predominantly segregated within sub-lineage IVa while post-vaccine sequences clustered mostly within sub-lineage IVb, with a radical AA substitution R162G. Both S15F and R162G occurred outside recognised antigenic sites. The AA residue at position 15 is found within the signal sequence domain of Viral Protein 7 (VP7) involved in translocation of VP7 into endoplasmic reticulum during infection process. The 162 AA residue lies within the hemagglutination domain of Viral Protein 4 (VP4) engaged in interaction with sialic acid-containing structure during attachment to the target cell. Free energy change analysis on VP7 indicated accumulation of stable point mutations in both antigenic and non-antigenic regions. The segregation of South African G2P[4] strains into pre- and post-vaccination sub-lineages is likely due to erstwhile hypothesized stepwise lineage/sub-lineage evolution of G2P[4] strains rather than RVA vaccine introduction. Our findings reinforce the need for continuous whole-genome RVA surveillance and investigation of contribution of AA substitutions in understanding the dynamic G2P[4] epidemiology.Entities:
Keywords: G2P[4] group A rotavirus strains; rotavirus; sub-lineages; whole-genome analysis
Mesh:
Substances:
Year: 2022 PMID: 35446251 PMCID: PMC9453071 DOI: 10.1099/mgen.0.000809
Source DB: PubMed Journal: Microb Genom ISSN: 2057-5858
Fig. 1.VP7 Maximum likelihood phylogenetic tree. Maximum likelihood phylogenetic tree based on the ORF of the VP7 encoding genome segment. The evolutionary model used was Tamura-3 parameter + Gamma (T92 + G). South African pre-vaccine G2 sequences are indicated with black circles while post-vaccine strains are highlighted in red circles. Thirteen previously sequenced G2 strains (from 1984 - 1996) were included for reference and are indicated with black triangles. The reference sequences used to construct the tree are indicated with blue circles. The pie-chart summarizes the number of pre- and post-vaccine sequences. Lineages are indicated in roman numerals. Only bootstrap values > 70% are shown adjacent to each branch node. The scale bar indicates the number of NT substitutions per site.
Fig. 2.Non-synonymous amino acid substitutions in South African G2 sequences. Non-synonymous amino acid substitutions shared in each sub-lineage of G2 lineage IV where South African G2 strains in this study clustered in respect to the outgroup G2 strain, RVA/Human-tc/USA/DS-1/1976/G2P[4], are shown. South African pre-vaccine G2 sequences are indicated with black circles while post-vaccine strains are highlighted in red circles. Nodes with defining non-synonymous amino acid substitutions are highlighted in blue rectangles.
Fig. 3.VP4 Maximum likelihood phylogenetic tree. Maximum likelihood phylogenetic tree based on the ORF of the VP4 encoding genome segment. The evolutionary model used was Tamura-3 parameter + Gamma (T92 + G). South African pre-vaccine P[4] sequences are indicated with black circles while post-vaccine sequences are highlighted in red circles. The reference sequences used to construct the tree are indicated with blue circles. The pie-chart summarizes the number of pre- and post-vaccine sequences. Lineages are indicated in roman numerals. Only bootstrap values > 70% are shown adjacent to each branch node. The scale bar indicates the number of NT substitutions per site.
Fig. 4.Non-synonymousamino acid substitutions shared in each sub-lineage of G2 lineage IV where South African P[4] strains in this study clustered in respect to the outgroup P[4] strain, RVA/Human-tc/USA/DS-1/1976/G2P[4], are shown. South African pre-vaccine G2 sequences are indicated with black circles while post-vaccine strains are highlighted in red circles. Nodes with defining non-synonymous amino acid substitutions are highlighted in blue rectangles.
Fig. 5.Protein structural and free energy change analysis of VP7. The superimposed VP7 protein structure models of strain, RVA/Human-wt/ZAF/UFS-NGS-NICD18920/2017/G2P[4], representative of the current post-vaccine G2P[4] strains with that of an old G2 South African strain, RVA/Human/ZAF/SA405GR/87/1987/G2P[4]. The observed AA substitutions, A87T, D96N, I113T, N125T, V129M, S178N, N213D, N242S and I287V were shared by G2 strains in lineage IV, where all the South African G2 strains in this study clustered, in relation to the G2 strains in lineage I where a 1987 G2 strain RVA/Human/ZAF/SA405GR/87/1987/G2P[4] and also the prototype DS-1-like strain, RVA/Human-tc/USA/DS-1/1976/G2P[4], clustered. The free energy change difference after mutations was -1.592 kcal/mol. The structure of the pre-vaccine strain is in deep-teal while that of the post-vaccine strain is dark gray. The AA residues are represented in green and firebrick red to examine the structural conformation of the protein structure. The AA residues highlighted in firebrick red represents the pre-vaccine strain while that in green represents the post-vaccine strain.
Protein structural and free energy change analysis of VP7
|
Pre-vaccine strain |
Post-vaccine strain |
Energy change in kcal/mol |
Alignment value in RMSD (Å) |
|---|---|---|---|
|
RVA/Human-wt/ZAF/SA296GR/84/1984 /G2P[4] |
RVA/Human-wt/ZAF/UFS-NGS-NICD18920/2017 /G2P[4] |
−0.523 |
0.018 |
|
RVA/Human-wt/ZAF/SA410GR/85/1985 /G2P[4] |
RVA/Human-wt/ZAF/UFS-NGS-MRC-DPRU68/2013 /G2P[4] |
−1.017 |
0.018 |
|
RVA/Human-wt/ZAF/SA659GR/86/1986 /G2P[4] |
RVA/Human-wt/ZAF/UFS-NGS-NICD13522/2014 /G2P[4] |
−0.786 |
0.017 |
|
RVA/Human-wt/ZAF/SA405GR/87/1987 /G2P[4] |
RVA/Human-wt/ZAF/UFS-NGS-NICD15034/2015 /G2P[4] |
−1.592 |
0.017 |
|
RVA/Human-wt/ZAF/SA514GR/87/1987 /G2P[4] |
RVA/Human-wt/ZAF/UFS-NGS-NICD12041/2013 /G2P[4] |
−0.528 |
0.019 |
The table includes five G2 strains that circulated in South Africa from 1984−1987 and clustered in lineage I alongside the prototype DS-1-like strain. Five post-vaccine G2 strains were randomly selected for the protein structural and free energy change analyses. Protein stability was predicted by estimating the energy change, expressed in kcal mol−1, between a pre-vaccine strain and post-vaccine strain. The energy change o ±0.5 was regarded significant for stabilizing or destabilizing effect. A positive (+) value indicates destabilizing effect while a negative (−) value indicates stabilizing effect. The impact of mutation on the structural conformation of the protein was assessed by superposing a pre-vaccine strain with a post-vaccine strain and the RMSD (Root Mean Square Deviation) alignment value expressed in Angstroms (Å). An RMSD value of zero indicates absolute structural alignment
Positively selected codon sites in 103 South African G2P[4] pre- and post-RVA amino acid residues as identified by FEL, FUBAR and MEME analysis
|
Method |
Amino acid sites in the gene segments under positive selection | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
|
|
VP1 |
VP2 |
VP3 |
VP4 |
VP6 |
VP7 |
NSP1 |
NSP2 |
NSP3 |
NSP4 |
NSP5 |
|
|
2,62,90, 97 128, 157 158, 187 332, 364 371, 406 546, 654,1033,1041 |
5,13, 723 869, 873 874, 877 878 |
346 473, 704 788, 828 835 |
625 627 |
|
315 |
10 212, 271 313 |
249 |
3, 16,27 |
2,3, 16,17,51,79,170 |
3,7 |
|
|
60 364 |
|
|
|
|
66 |
10,11, 103 380 402 |
75 |
|
|
– |
|
|
– |
|
|
|
|
– |
319 |
|
|
|
– |
Bold and underlined sites are those identified by all the three methods: Fixed Effects Likelihood (FEL), Fast Unconstrained Bayesian AppRoximation (FUBAR) and Mixed Effects Model of Evolution (MEME). For MEME and FEL, statistical significance was assessed at P≤0.1 while for FUBAR it was assessed at posterior probability ≥0.9. The dash (-) sign indicates no site was identified. Further analysis reports are provided in the Supplementary Excelspreadsheet1.