| Literature DB >> 30429836 |
Cyril Barbezange1,2,3,4, Louis Jones2,3,4,5, Hervé Blanc1,3, Ofer Isakov6, Gershon Celniker6, Vincent Enouf2,3,4, Noam Shomron6, Marco Vignuzzi1,3, Sylvie van der Werf2,3,4.
Abstract
After a pandemic wave in 2009 following their introduction in the human population, the H1N1pdm09 viruses replaced the previously circulating, pre-pandemic H1N1 virus and, along with H3N2 viruses, are now responsible for the seasonal influenza type A epidemics. So far, the evolutionary potential of influenza viruses has been mainly documented by consensus sequencing data. However, like other RNA viruses, influenza A viruses exist as a population of diverse, albeit related, viruses, or quasispecies. Interest in this quasispecies nature has increased with the development of next generation sequencing (NGS) technologies that allow a more in-depth study of the genetic variability. NGS deep sequencing methodologies were applied to determine the whole genome genetic heterogeneity of the three categories of influenza A viruses that circulated in humans between 2007 and 2012 in France, directly from clinical respiratory specimens. Mutation frequencies and single nucleotide polymorphisms were used for comparisons to address the level of natural intrinsic heterogeneity of influenza A viruses. Clear differences in single nucleotide polymorphism profiles between seasons for a given subtype also revealed the constant genetic drift that human influenza A virus quasispecies undergo.Entities:
Keywords: NGS; genetic drift; influenza season; influenza virus; quasispecies
Year: 2018 PMID: 30429836 PMCID: PMC6220372 DOI: 10.3389/fmicb.2018.02596
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
FIGURE 1Comparison of LoFreq and ViVAN. (A) Correlation of the mean depths of sequencing (in number of reads per position) obtained by both pipelines. X-axis: LoFreq; Y-axis: ViVAN. Each dot represents one segment of one sample: polymerase sub-unit PB2: olive; polymerase sub-unit PB1: green; polymerase sub-unit PA: purple; hemagglutinin HA: red; nucleoprotein NP: dark blue; neuraminidase NA: pink; matrix and ion channel proteins M: orange; non-structural protein and nuclear export protein NS: light blue. (B) Mean number of variant positions identified by the pipelines, without or with setting a mutation frequency threshold. For each bar, from left to right: gray, specific to LoFreq; light gray, common to LoFreq and ViVAN; dark gray, specific to ViVAN. Error bars represent the Standard Error to the Mean. (C) Correlation of the global mutation frequencies obtained with both pipelines with or without setting a mutation frequency threshold. X-axis: LoFreq; Y-axis: ViVAN. Each dot represents one segment of one sample: PB2: olive; PB1: green; PA: purple; HA: red; NP: dark blue; NA: pink; M: orange; NS: light blue.
Correlation between global mutation frequencies obtained with LoFreq and ViVAN, evaluated by Spearman non-parametric test.
| Virus | Threshold | Genomic segment | Pairs | Spearman correlation test | |
|---|---|---|---|---|---|
| rs | 95% CI | ||||
| sH1N1 | 0.0 | PB2 | 27 | 0.9695 | 0.9318–0.9865 |
| PB1 | 27 | 0.9811 | 0.9574–0.9917 | ||
| PA | 27 | 0.9536 | 0.8973–0.9794 | ||
| HA | 27 | 0.9470 | 0.8832–0.9764 | ||
| NP | 27 | 0.9707 | 0.9344–0.9870 | ||
| NA | 27 | 0.9690 | 0.9307–0.9863 | ||
| M | 26 | 0.9096 | 0.8020–0.9600 | ||
| NS | 22 | 0.9367 | 0.8476–0.9744 | ||
| 0.5 | PB2 | 27 | 0.9858 | 0.9679–0.9937 | |
| PB1 | 27 | 0.9750 | 0.9439–0.9889 | ||
| PA | 27 | 0.8630 | 0.7129–0.9375 | ||
| HA | 27 | 0.9321 | 0.8516–0.9696 | ||
| NP | 27 | 0.9807 | 0.9565–0.9915 | ||
| NA | 27 | 0.9670 | 0.9263–0.9854 | ||
| M | 26 | 0.9304 | 0.8456–0.9694 | ||
| NS | 22 | 0.9133 | 0.7948–0.9647 | ||
| pH1N1 | 0.0 | PB2 | 38 | 0.9941 | 0.9883–0.9970 |
| PB1 | 35 | 0.9950 | 0.9897–0.9975 | ||
| PA | 39 | 0.9921 | 0.9845–0.9959 | ||
| HA | 39 | 0.9883 | 0.9771–0.9940 | ||
| NP | 38 | 0.9932 | 0.9866–0.9966 | ||
| NA | 39 | 0.9954 | 0.9910–0.9976 | ||
| M | 39 | 0.9860 | 0.9728–0.9928 | ||
| NS | 39 | 0.9775 | 0.9564–0.9885 | ||
| 0.5 | PB2 | 38 | 0.9965 | 0.9931–0.9982 | |
| PB1 | 35 | 0.9988 | 0.9976–0.9994 | ||
| PA | 39 | 0.9860 | 0.9727–0.9928 | ||
| HA | 39 | 0.9931 | 0.9866–0.9965 | ||
| NP | 38 | 0.9928 | 0.9859–0.9964 | ||
| NA | 39 | 0.9945 | 0.9893–0.9972 | ||
| M | 39 | 0.9880 | 0.9767–0.9939 | ||
| NS | 39 | 0.9642 | 0.9310–0.9815 | ||
| sH3N2 | 0.0 | PB2 | 35 | 0.9549 | 0.9101–0.9776 |
| PB1 | 35 | 0.9826 | 0.9649–0.9915 | ||
| PA | 35 | 0.9504 | 0.9014–0.9754 | ||
| HA | 35 | 0.9437 | 0.8884–0.9720 | ||
| NP | 35 | 0.9894 | 0.9784–0.9948 | ||
| NA | 35 | 0.9731 | 0.9459–0.9867 | ||
| M | 35 | 0.9717 | 0.9431–0.9860 | ||
| NS | 32 | 0.9065 | 0.8120–0.9547 | ||
| 0.5 | PB2 | 35 | 0.9661 | 0.9320–0.9832 | |
| PB1 | 35 | 0.9694 | 0.9386–0.9849 | ||
| PA | 35 | 0.9569 | 0.9139–0.9786 | ||
| HA | 35 | 0.9383 | 0.8779–0.9693 | ||
| NP | 35 | 0.9874 | 0.9744–0.9938 | ||
| NA | 35 | 0.9837 | 0.9671–0.9920 | ||
| M | 35 | 0.9935 | 0.9868–0.9968 | ||
| NS | 32 | 0.9781 | 0.9542–0.9896 | ||
FIGURE 2Representation of the global mutation frequencies according to season and group of interest. Values were calculated from the data obtained with LoFreq at a 0.005 threshold. Each symbol represents one segment of one sample: polymerase sub-unit PB2: olive; polymerase sub-unit PB1: green; polymerase sub-unit PA: purple; hemagglutinin HA: red; nucleoprotein NP: dark blue; neuraminidase NA: pink; matrix and ion channel proteins M: orange; non-structural protein and nuclear export protein NS: light blue. For pre-pandemic sH1N1 viruses, groups were based on the oseltamivir-resistance mutation H275Y (275H: sensitive; 275Y: resistant) in the neuraminidase; for both H3N2 and pandemic pH1N1 viruses, groups were based on the severity according to the network of sampling (mild through community physicians, severe through hospitals).
Median global mutation frequency per segment for each virus.
| Virus | Genomic segment | Specific season | Sample number | Median | 95% CI of median | |
|---|---|---|---|---|---|---|
| Actual % | Lower to upper | |||||
| sH1N1 | PB2 | 27 | 2.5 × 10-5 | 98.08 | 6.4 × 10-6 to 5.0 × 10-5 | |
| PB1 | 27 | 1.4 × 10-4 | 98.08 | 3.5 × 10-5 to 8.3 × 10-4 | ||
| PA | 27 | 2.9 × 10-5 | 98.08 | 1.1 × 10-5 to 4.0 × 10-5 | ||
| HA | 27 | 5.1 × 10-5 | 98.08 | 9.4 × 10-6 to 1.2 × 10-4 | ||
| NP | 27 | 4.0 × 10-5 | 98.08 | 9.9 × 10-6 to 2.1 × 10-4 | ||
| NA | 27 | 4.3 × 10-5 | 98.08 | 1.6 × 10-5 to 1.1 × 10-4 | ||
| M | 26 | 2.3 × 10-5 | 97.10 | 7.0 × 10-6 to 8.2 × 10-5 | ||
| NS | 22 | 6.7 × 10-6 | 98.31 | 2.0 × 10-7 to 6.3 × 10-5 | ||
| sH3N2 | PB2 | 31 | 2.3 × 10-5 | 97.06 | 7.0 × 10-6 to 7.5 × 10-5 | |
| PB1 | 31 | 2.9 × 10-5 | 97.06 | 9.4 × 10-6 to 8.3 × 10-5 | ||
| PA | 31 | 2.0 × 10-5 | 97.06 | 7.8 × 10-6 to 1.1 × 10-4 | ||
| HA | 31 | 2.5 × 10-5 | 97.06 | 8.0 × 10-6 to 6.2 × 10-5 | ||
| NP | 31 | 3.2 × 10-5 | 97.06 | 1.8 × 10-5 to 6.7 × 10-5 | ||
| NA | 31 | 1.0 × 10-5 | 97.06 | 3.0 × 10-6 to 5.3 × 10-5 | ||
| M | 31 | 1.9 × 10-5 | 97.06 | 3.8 × 10-6 to 6.7 × 10-5 | ||
| NS | 28 | 2.3 × 10-5 | 96.43 | 9.0 × 10-6 to 4.5 × 10-5 | ||
| pH1N1 | PB2 | 35 | 4.2 × 10-4 | 95.90 | 1.0 × 10-4 to 5.4 × 10-4 | |
| PB1 | 32 | 1.7 × 10-4 | 97.99 | 6.1 × 10-5 to 7.0 × 10-4 | ||
| PA | 2009–2010 | 20 | 6.9 × 10-5 | 95.86 | 3.9 × 10-5 to 2.3 × 10-4 | |
| 2010–2011 | 16 | 3.3 × 10-4 | 97.87 | 1.0 × 10-4 to 1.2 × 10-3 | ||
| HA | 36 | 1.7 × 10-4 | 97.12 | 6.4 × 10-5 to 5.4 × 10-4 | ||
| NP | 35 | 1.3 × 10-4 | 95.90 | 5.9 × 10-5 to 2.6 × 10-4 | ||
| NA | 36 | 1.1 × 10-4 | 97.12 | 5.6 × 10-5 to 2.8 × 10-4 | ||
| M | 2009–2010 | 20 | 6.1 × 10-5 | 95.86 | 1.0 × 10-6 to 1.6 × 10-4 | |
| 2010–2011 | 16 | 2.5 × 10-4 | 97.87 | 2.2 × 10-5 to 7.6 × 10-4 | ||
| NS | 36 | 1.3 × 10-4 | 97.12 | 6.6 × 10-5 to 2.9 × 10-4 | ||
FIGURE 3Differences between subtypes. Results of the pairwise comparisons by Mann–Whitney–Wilcoxon test of the global mutation frequency distributions (Supplementary Table 4). The genomic segments considered are mentioned on the left. X-axis: difference between the medians (symbol) and 95% CI of the difference (line). Symbols: circle, sH1N1 vs. sH3N2; square, sH1N1 vs. pH1N1 (for MP and PA, open square for pH1N1 season 2009–2010, plain square for pH1N1 season 2010–2011); diamond, sH3N2 vs. pH1N1 (for MP and PA, open square for pH1N1 season 2009–2010, plain square for pH1N1 season 2010–2011). ns, non-significant; ∗0.05 ≥p-value > 0.01; ∗∗0.01 ≥p-value > 0.001; ∗∗∗0.001 ≥p-value > 0.0001; ∗∗∗∗p-value ≤ 0.0001.
FIGURE 4Distribution of vSNPs along the genome. A viral Single Nucleotide Polymorphism (vSNP) is a position with a significant variant that has been identified in at least 15% of the samples in a group. X-axis: nucleotide position along each genomic segment; Y-axis: percentage of samples sharing a significant variant. (A) sH3N2 samples. In purple: season 2008–2009; in blue: season 2011–2012. (B) pH1N1 samples. In green: season 2009–2010; in orange: season 2010–2011.