| Literature DB >> 19924230 |
Jun Shen1, Jianpeng Ma, Qinghua Wang.
Abstract
The Pandemic (H1N1) 2009 is spreading to numerous countries and causing many human deaths. Although the symptoms in humans are mild at present, fears are that further mutations in the virus could lead to a potentially more dangerous outbreak in subsequent months. As the primary immunity-eliciting antigen, hemagglutinin (HA) is the major agent for host-driven antigenic drift in A(H3N2) virus. However, whether and how the evolution of HA is influenced by existing immunity is poorly understood for A(H1N1). Here, by analyzing hundreds of A(H1N1) HA sequences since 1918, we show the first evidence that host selections are indeed present in A(H1N1) HAs. Among a subgroup of human A(H1N1) HAs between 1918 approximately 2008, we found strong diversifying (positive) selection at HA(1) 156 and 190. We also analyzed the evolutionary trends at HA(1) 190 and 225 that are critical determinants for receptor-binding specificity of A(H1N1) HA. Different A(H1N1) viruses appeared to favor one of these two sites in host-driven antigenic drift: epidemic A(H1N1) HAs favor HA(1) 190 while the 1918 pandemic and swine HAs favor HA(1) 225. Thus, our results highlight the urgency to understand the interplay between antigenic drift and receptor binding in HA evolution, and provide molecular signatures for monitoring future antigenically drifted 2009 pandemic and seasonal A(H1N1) influenza viruses.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19924230 PMCID: PMC2773012 DOI: 10.1371/journal.pone.0007789
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
The values of log-likelihood (l), d N/d S, and parameter estimates in CODEML analysis of human A(H1N1) Has.
| Model | l |
| Parameters estimates |
|
| |||
| M0 (one-ratio) | −806.78 | 0.516 | ω = 0.516 |
| M1a (nearly neutral) | −805.91 | 0.323 |
|
| M2a (positive selection) | −804.19 | 0.564 |
|
| M7 (beta) | −805.92 | 0.300 |
|
| M8a (beta&ω = 1) | −805.91 | 0.323 |
|
| M8 (beta&ω>1) | −804.19 | 0.561 |
|
|
| |||
| M0 (one-ratio) | −2489.23 | 0.223 | ω = 0.223 |
| M1a (nearly neutral) | −2486.25 | 0.242 |
|
| M2a (positive selection) | −2486.25 | 0.242 |
|
| M7 (beta) | −2485.63 | 0.232 |
|
| M8a (beta&ω = 1) | −2485.63 | 0.232 |
|
| M8 (beta&ω>1) | −2485.63 | 0.232 |
|
|
| |||
| M0 (one-ratio) | −1686.21 | 0.279 | ω = 0.279 |
| M1a (nearly neutral) | −1684.58 | 0.261 |
|
| M2a (positive selection) | −1683.09 | 0.287 |
|
| M7 (beta) | −1684.52 | 0.265 |
|
| M8a (beta&ω = 1) | −1684.58 | 0.261 |
|
| M8 (beta&ω>1) | −1683.10 | 0.287 |
|
|
| |||
| M0 (one-ratio) | −2720.83 | 0.187 | ω = 0.187 |
| M1a (nearly neutral) | −2709.20 | 0.181 |
|
| M2a (positive selection) | −2708.27 | 0.187 |
|
| M7 (beta) | −2708.99 | 0.183 |
|
| M8a (beta&ω = 1) | −2709.40 | 0.180 |
|
| M8 (beta&ω>1) | −2708.82 | 0.186 |
|
|
| |||
| M0 (one-ratio) | −3871.33 | 0.303 | ω = 0.303 |
| M1a (nearly neutral) | −3813.28 | 0.241 |
|
| M2a (positive selection) | −3782.07 | 0.313 |
|
| M7 (beta) | −3814.21 | 0.246 |
|
| M8a (beta&ω = 1) | −3812.40 | 0.231 |
|
| M8 (beta&ω>1) | −3781.55 | 0.305 |
|
|
| |||
| M0 (one-ratio) | −2374.98 | 0.277 | ω = 0.277 |
| M1a (nearly neutral) | −2373.20 | 0.282 |
|
| M2a (positive selection) | −2372.81 | 0.282 |
|
| M7 (beta) | −2373.21 | 0.281 |
|
| M8a (beta&ω = 1) | −2374.03 | 0.282 |
|
| M8 (beta&ω>1) | −2372.81 | 0.282 |
|
Due to the inclusion of two partial sequences of A/London/1/1918 and A/London/1/1919 in this subgroup, the analysis was performed on a total of 187 amino-acid residues that covered the antigenic and receptor-binding sites in the region of HA1 (51∼237) [11]. 2 The analysis was performed on the first 340 residues of HA1 including the signal peptide.
LRT tests for HA1 sequences of human A(H1N1) influenza viruses.
| LRT (M2a − M1a) (2Δl) ( | LRT (M8 − M7) (2Δl) ( | LRT (M8 − M8a) (2Δl) ( | |
| I- | 3.44 (0.1791) | 3.46 (0.1773) | 3.44 (0.0318) |
| I- | 0 | 0 | 0 |
| I- | 2.98 (0.2254) | 2.84 (0.2417) | 2.96 (0.0427) |
| I- | 1.86 (0.3946) | 0.34 (0.8437) | 1.16 (0.1407) |
| I- | 62.42 (0.0000) | 65.32 (0.0000) | 61.70 (0.0000) |
| II 2007∼2009 (74 strains) | 0.78 (0.6771) | 0.80 (0.6703) | 2.44 (0.0591) |
We used the degree of freedom of 2 for these LRT tests that is expected to be too conservative. 2 The p-values were calculated from χ2 distribution using degree of freedom of 1 that was then divided by a factor of 2 for the mixture distribution, as suggested by the author of PAML 4.0.
Codons under positive selection in HA1 of human A(H1N1) influenza viruses.
| Positively selected sites | ||
|
| M2a | 156 (97.5%), 190 (100%) |
| M8 | 156 (99.7%), 190 (100%) | |
Positively selected sites from PAML 4.0 [45] using Bayes Empirical Bayes analysis [72]. Only codons with greater than 95% posterior probabilities to be under positive selection were listed with the corresponding posterior probabilities shown in parentheses.
Figure 1Antigenic structure and positive selection of A(H1N1) HA.
a) Antigenic structure of A/PR/8/34 (H1N1) HA (PDB accession code 1RU7 [78]). Five antigenic sites were identified by using a large number of monoclonal antibodies [73], [74]: Sa (cyan), Sb (red), Ca1 (yellow), Ca2 (green), Cb (blue), using H3 HA numbering. The receptor-binding site (RBS) was labeled for reference. b) Codons on A(H1N1) HA that were identified to be under various selection in PAML and HyPhy analysis.
The values of log-likelihood (l), d N/d S, and parameter estimates in CODEML analysis of egg-adapted human A(H1N1) HAs between 1933-1979.
| Model | l |
| Parameters estimates |
|
| |||
| M0 (one-ratio) | −3336.01 | 0.411 | ω = 0.411 |
| M1a (nearly neutral) | −3283.75 | 0.336 |
|
| M2a (positive selection) | −3275.24 | 0.454 |
|
| M7 (beta) | −3285.56 | 0.345 |
|
| M8a (beta&ω = 1) | −3283.78 | 0.336 |
|
| M8 (beta&ω>1) | −3275.41 | 0.452 |
|
|
| |||
| M0 (one-ratio) | −2068.01 | 0.435 | ω = 0.435 |
| M1a (nearly neutral) | −2056.27 | 0.337 |
|
| M2a (positive selection) | −2046.27 | 0.501 |
|
| M7 (beta) | −2056.44 | 0.324 |
|
| M8a (beta&ω = 1) | −2056.28 | 0.337 |
|
| M8 (beta&ω>1) | −2046.29 | 0.501 |
|
|
| |||
| M0 (one-ratio) | −1759.58 | 0.385 | ω = 0.385 |
| M1a (nearly neutral) | −1751.46 | 0.260 |
|
| M2a (positive selection) | −1747.17 | 0.408 |
|
| M7 (beta) | −1751.61 | 0.300 |
|
| M8a (beta&ω = 1) | −1751.46 | 0.260 |
|
| M8 (beta&ω>1) | −1747.19 | 0.411 |
|
The analysis was performed on the first 337 residues of HA1 including the signal peptide.
LRT tests and codons under positive selection for HA1 sequences of egg-adapted human A(H1N1) influenza viruses between 1933-1979.
| LRT (2Δl) ( | Positively selected sites | |||
|
| M2a | M2a-M1a | 17.02 (0.0002) |
|
| M8 | M8-M7 | 20.30 (0.0000) |
| |
| M8-M8a | 16.74 (0.0000) | |||
|
| M2a | M2a-M1a | 20.0 (0.0000) |
|
| M8 | M8-M7 | 20.30 (0.0000) |
| |
| M8-M8a | 19.98 (0.0000) | |||
|
| M2a | M2a-M1a | 8.58 (0.0137) | 225 (99.1%) |
| M8 | M8-M7 | 8.84 (0.0120) | 225 (99.8%) | |
| M8-M8a | 8.54 (0.0017) | |||
Positively selected sites from PAML 4.0 [45] using Bayes Empirical Bayes analysis [72]. Only codons with greater than 95% posterior probabilities to be under positive selection were listed with the corresponding posterior probabilities shown in parentheses. Highlighted in bold were codons that were not associated with egg-adapted substitutions [62], [63], [64].
Codons at HA1 190 and 225 in human and swine A(H1N1) influenza viruses.
| D190 | Non-D190 | D225 | G225 | Non-D225/G225 | |
| Human 1979∼2008 Epidemic (575 sequences) | 477 (83.0%) | 98 (17.0%) | 565 (98.2%) | 2 (0.4%) | 8 (1.4%) |
| Human 1918 Pandemic (5 sequences) | 5 (100%) | 0 | 3 (60%) | 2 (40%) | 0 |
| Human 2009 Pandemic (73 sequences) | 73 (100%) | 0 | 69 (94.5%) | 1 (1.4%) | 3 (4.1%) |
| Human 1947∼1957 (12 strains) (egg-adapted) | 9 (75%) | 3 (25%) | 2 (16.7%) | 10 (83.3%) | 0 |
| Human 1948∼1979 (17 strains) (egg-adapted) | 16 (94.1%) | 1 (5.9%) | 4 (23.5%) | 13 (76.5%) | 0 |
| Swine 1990∼2009 (42 sequences) | 41 (97.6%) | 1 (2.4%) | 28 (66.6%) | 12 (28.6%) | 2 (4.8%) |
The number of cases that a particular type of residues occurs at each site. Shown in parenthesis was the occurrence in percentage.
The values of log-likelihood (l), d N/d S, and parameter estimates in CODEML analysis of swine A(H1N1) HAs between 1990–2009.
| Model | l |
| Parameters estimates | LRT (2Δl) ( |
| M0 (one-ratio) | −6021.08 | 0.158 | ω = 0.158 | |
| M1a (nearly neutral) | −5949.30 | 0.217 |
| LRT (M2a-M1a) = 1.24 (0.5379) |
| M2a (positive selection) | −5948.68 | 0.224 |
| |
| M7 (beta) | −5925.79 | 0.174 |
| LRT (M8-M7) = 6.78 (0.0337) LRT (M8-M8a) = 3.40 (0.0326) |
| M8a (beta&ω = 1) | −5924.10 | 0.171 |
| |
| M8 (beta&ω) | −5922.40 | 0.177 |
|
The analysis was performed on the first 338 residues of HA1 including the signal peptide.
Directional selection analysis on human A(H1N1) Has.
| Tree L | Residue |
| Bias | Proportion (%) | No. of Sites | |
| Human I- | 0.474 | T | 0.0002 | 32.995 | 5.4 | 2 |
| R | 0.0002 | 12.583 | 11.8 | 1 | ||
| V | 0.0004 | 34.478 | 4.0 | 2 | ||
| K | 0.0023 | 29.301 | 7.2 | 1 | ||
| Human 1933∼1979 (32 strains) (egg-adapted) | 0.613 | D | 0.0000 | 78.148 | 3.4 | 1 |
| Human 1948∼1979 (17 strains) (egg-adapted) | 0.091 | D | 0.0007 | 133.611 | 5.1 | 1 |
Sites found to be under directional selection in human A(H1N1) Has.
| Sites | Composition | Root | Preferred | Inferred Substitutions | |
| Human I- | 143 | V99T1 | V | V | T→1V |
| 156 | G90R9E1 | R | R | G→1E, G→7R | |
| 158 | N96K4 | N | K | N→4K | |
| 190 | D67N25V8 | D | V | D→16N, D→4V | |
| 193 | A52T48 | A | T | A→6T | |
| 197 | T82K18 | T | T | K→2T | |
| Human 1933∼1979 (32 strains) (egg-adapted) | 225 | G23D9 | D | D | D→1G, G→6D |
| Human 1948∼1979 (17 strains) (egg-adapted) | 225 | G13D4 | G | D | G→4D |