| Literature DB >> 24415946 |
Alexey D Neverov1, Ksenia V Lezhnina2, Alexey S Kondrashov3, Georgii A Bazykin4.
Abstract
Reassortments and point mutations are two major contributors to diversity of Influenza A virus; however, the link between these two processes is unclear. It has been suggested that reassortments provoke a temporary increase in the rate of amino acid changes as the viral proteins adapt to new genetic environment, but this phenomenon has not been studied systematically. Here, we use a phylogenetic approach to infer the reassortment events between the 8 segments of influenza A H3N2 virus since its emergence in humans in 1968. We then study the amino acid replacements that occurred in genes encoded in each segment subsequent to reassortments. In five out of eight genes (NA, M1, HA, PB1 and NS1), the reassortment events led to a transient increase in the rate of amino acid replacements on the descendant phylogenetic branches. In NA and HA, the replacements following reassortments were enriched with parallel and/or reversing replacements; in contrast, the replacements at sites responsible for differences between antigenic clusters (in HA) and at sites under positive selection (in NA) were underrepresented among them. Post-reassortment adaptive walks contribute to adaptive evolution in Influenza A: in NA, an average reassortment event causes at least 2.1 amino acid replacements in a reassorted gene, with, on average, 0.43 amino acid replacements per evolving post-reassortment lineage; and at least ~9% of all amino acid replacements are provoked by reassortments.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24415946 PMCID: PMC3886890 DOI: 10.1371/journal.pgen.1004037
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
Characteristics of RCBs and amino acid replacements.
| PB2 | PB1 | PA | HA | NP | NA | M1 | NS1 | |
| Mean dN/dS | 0.07 | 0.07 | 0.08 | 0.27 | 0.07 | 0.25 | 0.05 | 0.35 |
| Sequence length, nucleotides | 2271 | 1986 | 2142 | 1692 | 1485 | 1377 | 687 | 471 |
| RCBs | ||||||||
| Internal | 19 | 20 | 21 | 16 | 19 | 20 | 12 | 7 |
| With descendant substitutions | 6 | 11 | 10 | 10 | 9 | 11 | 3 | 5 |
| Without descendant substitutions | 6 | 4 | 3 | 1 | 4 | 3 | 3 | 0 |
| Pre-terminal | 7 | 5 | 8 | 5 | 6 | 5 | 6 | 2 |
| Terminal | 15 | 16 | 16 | 13 | 15 | 16 | 6 | 7 |
| Internal branches with substitutions | ||||||||
| All | 125 | 117 | 139 | 271 | 91 | 257 | 27 | 89 |
| Descendant to RCBs | 104 | 99 | 105 | 223 | 68 | 216 | 14 | 68 |
| Substitutions on internal branches | ||||||||
| All | 154 | 146 | 173 | 460 | 118 | 411 | 29 | 107 |
| Descendant to RCBs | 127 | 123 | 127 | 336 | 83 | 320 | 14 | 77 |
| Distance between post-RCB substitutions and most recent RCB, dS units | ||||||||
| Mean | ||||||||
| expected | 0.050 |
| 0.027 | 0.036 | 0.029 |
| 0.024 | 0.038 |
| observed | 0.047 |
| 0.026 | 0.035 | 0.032 |
| 0.021 | 0.038 |
| Median | ||||||||
| expected | 0.028 |
| 0.023 | 0.029 | 0.022 |
| 0.024 | 0.040 |
| observed | 0.028 |
| 0.020 | 0.026 | 0.028 |
| 0.023 | 0.049 |
| Wilcoxon one-tailed p-value | 0.211 |
| 0.149 | 0.106 | 0.952 |
| 0.196 | 0.065 |
The mean distances between the amino acid replacements and most recent RCBs that were significantly (p<0.05) lower than expected are in boldface.
Pre-terminal branches are those immediately ancestral to the terminal branches.
Figure 1Reassortment events inferred from comparison of phylogenies of HA and NA segments of Influenza A H3N2 virus, and subsequent accumulation of amino acid replacements.
(A) The tanglegram [77] of the NA (left) and HA (right) genes for the same set of strains. Topologies of the phylogenies of the two genes are incongruent; such incongruence can be resolved by assuming reassortments. The clades corresponding to inferred reassortments are shown in color, with the lines of the corresponding color connecting the inferred reassortants for the two genes. (B) Zoom-in of the clade in rectangle in A. Red dot, inferred RCB (node id 62); green hatches, amino acid replacements on internal branches following the RCB. Some clades are collapsed into black triangles for clarity, and substitutions on such clades are not shown.
Reassortments involving NA.
| Distance between post-RCB substitutions and their most recent ancestral RCB, excluding the current RCB, dS units | Reassortment-provoked substitutions very soon after reassortments | |||||||||
| Node id | No. descendant substitutions | Mean, observed | Mean, expected | Median, observed | Median, expected | Wilcoxon one-tailed p-value | Total | Phylogenetically independent | Number of lineages | Length of adaptive walk |
| 1090 | 32 | 0.0347 | 0.0355 | 0.0208 | 0.0224 | 0.02616 | 0 | 0 | 18 | 0.00 |
| 62 | 77 | 0.0318 | 0.0353 | 0.0256 | 0.0305 | 0.00045 | 26 | 12 | 27 | 0.96 |
| 49 | 78 | 0.0454 | 0.0519 | 0.0191 | 0.0223 | 0.00011 | 2 | 1 | 4 | 0.50 |
| 1092 | 19 | 0.0265 | 0.0302 | 0.0190 | 0.0208 | 0.00005 | 0 | 0 | 6 | 0.00 |
| 57 | 49 | 0.0391 | 0.0446 | 0.0233 | 0.0324 | 0.00004 | 0 | 0 | 1 | 0.00 |
| 257 | 5 | 0.0261 | 0.0298 | 0.0176 | 0.0207 | 0.00004 | 0 | 0 | 1 | 0.00 |
| 35 | 9 | 0.0260 | 0.0299 | 0.0160 | 0.0192 | 0.00002 | 0 | 0 | 3 | 0.00 |
| 38 | 42 | 0.0300 | 0.0338 | 0.0168 | 0.0208 | 0.00002 | 0 | 0 | 1 | 0.00 |
| 986 | 4 | 0.0260 | 0.0298 | 0.0160 | 0.0203 | 0.00002 | 0 | 0 | 1 | 0.00 |
| 1446 | 1 | 0.0261 | 0.0299 | 0.0160 | 0.0207 | 0.00002 | 0 | 0 | 1 | 0.00 |
| 952 | 4 | 0.0260 | 0.0298 | 0.0160 | 0.0203 | 0.00002 | 2 | 2 | 2 | 1.00 |
| 602 | 0 | 0.0260 | 0.0298 | 0.0160 | 0.0203 | 0.00001 | 0 | 0 | 1 | 0.00 |
| 941 | 0 | 0.0260 | 0.0299 | 0.0160 | 0.0203 | 0.00001 | 0 | 0 | 1 | 0.00 |
| 1433 | 0 | 0.0260 | 0.0299 | 0.0160 | 0.0207 | 0.00001 | 0 | 0 | 2 | 0.00 |
| All | 320 | 0.0260 | 0.0297 | 0.0160 | 0.0203 | 0.00002 | 30 | 15 | 69 | 0.43 |
The 14 non-terminal, non-pre-terminal RCBs are shown. The table shows the observed and expected distances between the amino acid replacements and the preceding reassortment, and the Wilcoxon p-value for the difference between the observed and expected values. The RCBs are ordered by the Wilcoxon p-value.
a Substitutions at phylogenetic distances up to 0.003 ds units after the reassortment.
b Phylogenetically independent substitutions are such that none of them are descendant to any of the remaining ones.
c Number of substitutions accumulated per lineage.
d RCBs also described in [7].
e RCBs confirmed by sampling times.
Figure 2Excess of amino acid replacements after RCBs in PB1.
Top: frequencies of amino acid replacements observed at various phylogenetic distances from the most recent ancestral reassortment; pink: observed; gray: expected; red: overlayed observed and expected. Bottom: the excess of amino acid replacements, compared to the expectation. Gray shaded area indicates the 90% CI for the expected value. The expected distribution was obtained from 10,000 reshuffling trials.
Figure 3Excess of amino acid replacements after RCBs in NA.
Notations as in Figure 2.
Phylogenetic distances from reassortments for different classes of amino acid replacements.
| Distance between post-RCB nonsynonymous substitutions and most recent RCB, dS units | |||||||
| Gene | Subset | No. substitutions | Mean, subset | Mean, complement | Median, subset | Median, complement | Wilcoxon one-tailed p-value |
| NA |
|
|
|
|
|
|
|
| Epitopes | 88 | 0.0257 | 0.0261 | 0.0160 | 0.0176 | 0.772 | |
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
| |
| HA | Positively selected | 91 | 0.0317 | 0.0366 | 0.0238 | 0.0291 | 0.165 |
| Epitopes | 215 | 0.0342 | 0.0372 | 0.0251 | 0.0277 | 0.273 | |
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
| |
| Revertions | 60 | 0.0315 | 0.0361 | 0.0221 | 0.0269 | 0.218 | |
|
|
|
|
|
|
|
| |
| Epistatic trailing | 169 | 0.0357 | 0.0349 | 0.0251 | 0.0274 | 0.898 | |
The phylogenetic distances to the most recent ancestral RCB are compared between a given class of amino acid replacements and all remaining replacements. Classes that are significantly (p<0.05) closer to the most recent RCB are in boldface, and classes that are farther from RCB, in italic.