| Literature DB >> 35627294 |
Craig H Decker1, Naomi Rapier-Sharman1, Brett E Pickett1.
Abstract
H1N1 influenza A virus is a respiratory pathogen that undergoes antigenic shift and antigenic drift to improve viral fitness. Tracking the evolutionary trends of H1N1 aids with the current detection and the future response to new viral strains as they emerge. Here, we characterize antigenic drift events observed in the hemagglutinin (HA) sequence of the pandemic H1N1 lineage from 2015-2019. We observed the substitutions S200P, K147N, and P154S, together with other mutations in structural, functional, and/or epitope regions in 2015-2019 HA protein sequences from the Mountain West region of the United States, the larger United States, Europe, and other Northern Hemisphere countries. We reconstructed multiple phylogenetic trees to track the relationships and spread of these mutations and tested for evidence of selection pressure on HA. We found that the prevalence of amino acid substitutions at positions 147, 154, 159, 200, and 233 significantly changed throughout the studied geographical regions between 2015 and 2019. We also found evidence of coevolution among a subset of these amino acid substitutions. The results from this study could be relevant for future epidemiological tracking and vaccine prediction efforts. Similar analyses in the future could identify additional sequence changes that could affect the pathogenicity and/or infectivity of this virus in its human host.Entities:
Keywords: H1N1; HA; bioinformatics; comparative genomics; hemagglutinin; influenza virus; phylogenetic tree; selection pressure; virology
Mesh:
Substances:
Year: 2022 PMID: 35627294 PMCID: PMC9141826 DOI: 10.3390/genes13050909
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.141
Figure 1A visual representation of the analytical workflow implemented in the current study.
Figure 2Geographical regions represented by each dataset. (A) Mountain West dataset; (B) USA dataset; (C) European dataset; (D) Northern Hemisphere dataset.
Total number of samples in each dataset.
| Mountain West | USA | Europe | Northern Hemisphere |
|---|---|---|---|
| 933 | 1924 | 309 | 2389 |
Significant results from each HyPhy selection pressure algorithm by codon position, with the MW, US, EU, and NH datasets represented in blue, lavender, red, and yellow (respectively).
| MW | MW | MW | US | US | US | EU | EU | EU | NH | NH | NH |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 3 | 3 | 3 | 3 | 3 | |||||||
| 6 | 6 | 6 | 6 | 6 | |||||||
| 11 | 11 | 11 | 11 | 11 | 11 | 11 | 11 | 11 | |||
| 36 | |||||||||||
| 57 | |||||||||||
| 60 | |||||||||||
| 65 | 65 | ||||||||||
| 86 | |||||||||||
| 90 | 90 | ||||||||||
| 137 | 137 | ||||||||||
| 145 | |||||||||||
| 146 | |||||||||||
| 156 | 156 | 156 | 156 | ||||||||
| 158 | |||||||||||
| 180 | 180 | ||||||||||
| 200 | 200 | 200 | 200 | ||||||||
| 203 | 203 | 203 | 203 | 203 | 203 | 203 | |||||
| 273 | |||||||||||
| 278 | 278 | 278 | |||||||||
| 289 | |||||||||||
| 472 | |||||||||||
| 505 | |||||||||||
| 513 | 513 | ||||||||||
| 524 | 524 | ||||||||||
| 545 | 545 | 545 | 545 | 545 | 545 | ||||||
| 550 | 550 | 550 | |||||||||
| 566 |
Evolutionary timeline of hemagglutinin K147N variant by flu season and dataset, with the MW, US, EU, and NH datasets represented in blue, lavender, red, and yellow (respectively).
| MW | US | EU | NH | |||||
|---|---|---|---|---|---|---|---|---|
| Year | %K | %N | %K | %N | %K | %N | %K | %N |
| Summer 2015 | 100% (5/5) | 0% (0/5) | 100% (11/11) | 0% (0/11) | 100% (25/25) | 0% (0/25) | 100% (37/37) | 0% (0/37) |
| Winter 2015–2016 | 100% (297/297) | 0% (0/297) | 100% (568/569) | 0% (0/569) | 100% (64/64) | 0% (0/64) | 100% (665/666) | 0% (1/666) |
| Summer 2016 | 100% (34/34) | 0% (0/34) | 100% (59/59) | 0% (0/59) | 100% (4/4) | 0% (0/4) | 100% (66/66) | 0% (0/66) |
| Winter 2016–2017 | 100% (69/69) | 0% (0/69) | 100% (143/143) | 0% (0/143) | 90% (9/10) | 10% (1/10) | 99% (153/154) | 1% (1/154) |
| Summer 2017 | 100% (12/12) | 0% (0/12) | 96% (25/26) | 0% (0/26) | NA * | NA * | 96% (25/26) | 4% (1/26) |
| Winter 2017–2018 | 100% (122/122) | 0% (0/122) | 100% (316/317) | 0% (0/317) | % (53/53) | 0% (0/53) | 100% (469/470) | 0% (1/470) |
| Summer 2018 | 97% (28/29) | 3% (1/29) | 93% (42/45) | 0% (0/45) | 100% (1/1) | 0% (0/1) | 94% (47/50) | 6% 3/50 |
| Winter 2018–2019 | 91% (292/322) | 8% (26/322) | 90% (595/658) | 9% (56/658) | 95% (116/122) | 5% (6/122) | 91% (722/791) | 8% 62/791 |
| Summer 2019 | 44% (19/43) | 56% (24/43) | 55% (53/96) | 44% (42/96) | 100% (30/30) | 0% (0/30) | 67% (86/129) | 33% 42/129 |
K: lysine; N: asparagine; winter season includes October through March; Summer season includes April through September. MW: Mountain West region: US: USA region; EU: Europe region; NH: Northern Hemisphere region. * NA values were recorded for absence of sequences during specific time periods.
Evolutionary timeline of hemagglutinin P154S variant by flu season and dataset, with the MW, US, EU, and NH datasets represented in blue, lavender, red, and yellow (respectively).
| MW | US | EU | NH | |||||
|---|---|---|---|---|---|---|---|---|
| Year | %P | %S | %P | %S | %P | %S | %P | %S |
| Summer 2015 | 100% (5/5) | 0% (0/5) | 100% (11/11) | 0% (0/11) | 60% (15/25) | 32% (8/25) | 70% (26/37) | 24% (9/37) |
| Winter 2015–2016 | 99% (293/297) | 1% (4/297) | 99% (563/569) | 1% (6/569) | 84% (54/64) | 9% (6/64) | 98% (650/666) | 2% (12/666) |
| Summer 2016 | 91% (31/34) | 9% (3/34) | 95% (56/59) | 5% (3/59) | 50% (2/4) | 25% (1/4) | 92% (61/66) | 6% (4/66) |
| Winter 2016–2017 | 100% (69/69) | 0% (0/69) | 99% (142/143) | 1% (1/143) | 40% (4/10) | 60% (6/10) | 95% (147/154) | 5% (7/154) |
| Summer 2017 | 100% (12/12) | 0% (0/12) | 100% (26/26) | 0% (0/26) | NA * | NA * | 100% (26/26) | 0% (0/26) |
| Winter 2017–2018 | 98% (120/122) | 2% (2/122) | 98% (311/317) | 2% (6/317) | 87% (46/53) | 13% (7/53) | 95% (447/470) | 5% (22/470) |
| Summer 2018 | 97% (28/29) | 3% (1/29) | 93% (42/45) | 7% (3/45) | 0% (0/1) | 100% (1/1) | 92% (46/50) | 8% (4/50) |
| Winter 2018–2019 | 75% (240/322) | 25% (80/322) | 76% (503/658) | 23% (153/658) | 94% (115/122) | 6% (7/122) | 80% (629/791) | 20% (160/791) |
| Summer 2019 | 93% (40/43) | 7% (3/43) | 91% (87/96) | 9% (9/96) | 100% (30/30) | 0% (0/30) | 93% (120/129) | 7% (9/129) |
P: proline; S: serine; winter season includes October through March; Summer season includes April through September. MW: Mountain West region: US: USA region; EU: Europe region; NH: Northern Hemisphere region. * NA values were recorded for absence of sequences during specific time periods.
Evolutionary timeline of hemagglutinin S200P variant by flu season and dataset, with the MW, US, EU, and NH datasets represented in blue, lavender, red, and yellow (respectively).
| MW | US | EU | NH | |||||
|---|---|---|---|---|---|---|---|---|
| Year | %S | %P | %S | %P | %S | %P | %S | %P |
| Summer 2015 | 100% (5/5) | 0% (0/5) | 100% (11/11) | 0% (0/11) | 64% (16/25) | 36% (9/25) | 73% (27/37) | 27% (10/37) |
| Winter 2015–2016 | 99% (295/297) | 1% (2/297) | 100% (567/569) | 0% (2/569) | 84% (54/64) | 16% (10/64) | 98% (654/666) | 2% (12/666) |
| Summer 2016 | 91% (31/34) | 9% (3/34) | 95% (56/59) | 5% (3/59) | 50% (2/4) | 50% (2/4) | 92% (61/66) | 8% (5/66) |
| Winter 2016–2017 | 43% (30/69) | 57% (39/69) | 55% (79/143) | 45% (64/143) | 20% (2/10) | 80% (8/10) | 53% (82/154) | 47% (72/154) |
| Summer 2017 | 83% (10/12) | 17% (2/12) | 81% (21/26) | 19% (5/26) | NA * | NA * | 81% (21/26) | 19% (5/26) |
| Winter 2017–2018 | 25% (31/122) | 75% (91/122) | 19% (61/317) | 81% (256/317) | 75% (40/53) | 25% (13/53) | 33% (153/470) | 67% (316/470) |
| Summer 2018 | 14% (4/29) | 86% (25/29) | 16% (7/45) | 84% (38/45) | 0% (0/1) | 100% (1/1) | 18% (9/50) | 82% (41/50) |
| Winter 2018–2019 | 4% (13/322) | 96% (309/322) | 3% (22/658) | 97% (636/658) | 7% (9/122) | 93% (113/122) | 4% (31/791) | 96% (760/791) |
| Summer 2019 | 5% (2/43) | 95% (41/43) | 3% (3/96) | 97% (93/96) | 3% (1/30) | 97% (29/30) | 3% (4/129) | 97% (125/129) |
P: proline; S: serine; winter season includes October through March; Summer season includes April through September. MW: Mountain West region: US: USA region; EU: Europe region; NH: Northern Hemisphere region. * NA values were recorded for absence of sequences during specific time periods.
Mutual information coevolution for the 20 highest-scoring pairs of residues *.
| 1° Residue | 1° Residue | 2° Residue | 2° Residue | Mutual Information Value ** |
|---|---|---|---|---|
| 181 | S | 312 | I | 1175.141846 |
| 91 | S | 181 | S | 1163.426392 |
| 91 | S | 312 | I | 1159.654785 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 421 | I | 523 | E | 307.183044 |
* Bold text emphasizes rows containing at least one position from the current work (e.g., 62, 147, 154, 177, 190, 200, 202, 233, 252, 277, 299, and 313). ** Larger MI values signify higher levels of coevolution.