Cooperating H3N2 Influenza Virus Variants Are Not Detectable in Primary Clinical Samples.

Katherine S Xue, Alexander L Greninger, Ailyn Pérez-Osorio, Jesse D Bloom.

Abstract

The high mutation rates of RNA viruses lead to rapid genetic diversification, which can enable cooperative interactions between variants in a viral population. We previously described two distinct variants of H3N2 influenza virus that cooperate in cell culture. These variants differ by a single mutation, D151G, in the neuraminidase protein. The D151G mutation reaches a stable frequency of about 50% when virus is passaged in cell culture. However, it is unclear whether selection for the cooperative benefits of D151G is a cell culture phenomenon or whether the mutation is also sometimes present at appreciable frequency in virus populations sampled directly from infected humans. Prior work has not detected D151G in unpassaged clinical samples, but those studies have used methods like Sanger sequencing and pyrosequencing, which are relatively insensitive to low-frequency variation. We identified nine samples of human H3N2 influenza virus collected between 2013 and 2015 in which Sanger sequencing had detected a high frequency of the D151G mutation following one to three passages in cell culture. We deep sequenced the unpassaged clinical samples to identify low-frequency viral variants. The frequency of D151G did not exceed the frequency of library preparation and sequencing errors in any of the sequenced samples. We conclude that passage in cell culture is primarily responsible for the frequent observations of D151G in recent H3N2 influenza virus strains.

IMPORTANCE

Viruses mutate rapidly, and recent studies of RNA viruses have shown that related viral variants can sometimes cooperate to improve each other's growth. We previously described two variants of H3N2 influenza virus that cooperate in cell culture. The mutation responsible for cooperation is often observed when human samples of influenza virus are grown in the lab before sequencing, but it is unclear whether the mutation also exists in human infections or is exclusively the result of lab passage. We identified nine human isolates of influenza virus that had developed the cooperating mutation after being grown in the lab and performed highly sensitive deep sequencing of the unpassaged clinical samples to determine whether the mutation existed in the original human infections. We found no evidence of the cooperating mutation in the unpassaged samples, suggesting that the cooperation arises primarily under laboratory conditions.

Keywords:  D151G; cooperation; deep sequencing; influenza virus; neuraminidase; quasispecies

Year:  2018        PMID: 29299533      PMCID: PMC5750391          DOI: 10.1128/mSphereDirect.00552-17

Source DB:  PubMed          Journal:  mSphere        ISSN: 2379-5042            Impact factor:   4.389


INTRODUCTION

RNA viruses like influenza virus mutate rapidly to form genetically diverse quasispecies. Several recent studies have suggested that interactions between different variants in a quasispecies can promote overall population fitness. In poliovirus, variants generated through spontaneous mutation are important for neurotropism, innate immune suppression, and overall pathogenesis in mouse models (1–3). Other groups have identified cooperative interactions in measles virus (4), West Nile virus (5), hepatitis B virus (6), and coxsackievirus (7). These cooperative interactions have been observed primarily in cell culture or animal models rather than clinical infections. We previously described two distinct variants of H3N2 influenza virus that cooperate in cell culture (8). The two variants differ by a single mutation at amino acid 151 of neuraminidase (NA), the protein that releases new virions from host cells. The D151 viral variant, typically encoded as GAT, predominates among clinical influenza virus samples, and it grows robustly in cell culture. The G151 viral variant, typically encoded as GGT, binds sialic acid receptors rather than cleaving them (9, 10) and grows extremely poorly in isolation. However, a mixed population of D151 and G151 viral variants outgrows either single variant in cell culture. An important question is whether cooperation between these two viral variants is purely a cell culture phenomenon or whether the D151 and G151 variants coexist in natural infections. The D151G mutation is frequently observed when influenza virus is passaged through cell culture (9, 11–16), but it remains unclear whether the G151 variant exists within natural human infections or is primarily a cell culture artifact. 
Prior groups that have performed matched clinical sequencing of unpassaged and passaged clinical samples have failed to detect the G151 variant before passaging (13, 15), but those studies have used methods like Sanger sequencing and pyrosequencing, which are relatively insensitive to rare variation. More sensitive characterization of clinical samples that give rise to D151G upon lab passage can determine whether this mutation reaches high frequencies in cell culture because it is amplified from low- to modest-frequency standing diversity or whether it arises spontaneously in the lab. We sought to determine whether the D151G mutation is present in viral populations isolated from natural human infections. We identified nine clinical samples that, on the basis of prior Sanger sequencing, consisted of a mixture of D151 and G151 viruses after passage in cell culture. We deep sequenced the original unpassaged nasal swab samples to survey the variation present prior to laboratory growth. The D151G mutation did not exceed the frequency of library preparation and sequencing errors in any of these samples. These results suggest that most of the variation observed at site 151 results from passage in cell culture rather than standing variation in human infections.

RESULTS

Most influenza virus sequences in public databases are determined by Sanger sequencing of clinical isolates that have been passaged one or more times in cell culture (17). A substantial number of recent human H3N2 influenza virus sequences in these databases contain an ambiguous nucleotide at NA site 151 because the lab-passaged samples often converge to a mixture of the D151 and G151 variants (8). We compared passaged samples that contain this ambiguous nucleotide at site 151 with unpassaged samples from the same viral infections. We first identified strains from western Washington State in the Global Initiative on Sharing All Influenza Data (GISAID) EpiFlu database (18) for which Sanger sequencing had reported an ambiguous nucleotide at NA site 151 corresponding to a mixture of the D151 and G151 variants (8). On the basis of the annotations available in the GISAID EpiFlu database, most of these strains had been passaged in cell culture prior to Sanger sequencing. We obtained original, unpassaged nasal swab samples of the nine strains in Table 1 that contained a mixture of D151 and G151 variants after passage in cell culture. These samples had been collected between 2013 and 2015 and had undergone one to three passages in cell culture prior to sequencing. We performed whole-genome sequencing of the influenza virus genome from the unpassaged clinical samples by influenza virus-specific reverse transcription and PCR (19). For each sample, we prepared sequencing libraries in duplicate, beginning from separate reverse transcription reaction mixtures (20). We sequenced each viral sample to an average depth of 100× to 10,000× (Fig. 1), which allowed us to observe viral variants at frequencies below the limit of detection by Sanger sequencing or pyrosequencing.
TABLE 1

Strains deep sequenced in this study

Sample   Strain                 Passage history   Site 151 genotype   Ct value
WSPHL1   A/Washington/10/2013   C1/C1             X                   23.19
WSPHL2   A/Washington/13/2013   C1                X                   17
WSPHL3   A/Washington/17/2013   C2                X                   24.57
WSPHL4   A/Washington/18/2013   C3                X                   21.52
WSPHL5   A/Washington/08/2014   C1                X                   22.8
WSPHL6   A/Washington/07/2015   S3                X                   23.78
WSPHL7   A/Washington/24/2015   S3                X                   17.4
WSPHL8   A/Washington/32/2015   S3                X                   18.69
WSPHL9   A/Washington/36/2015   S3                X                   25.03

Genotypes were determined by Sanger sequencing of passaged isolates and are taken from those reported in the GISAID EpiFlu database. Annotations of passage history are not standardized, but Cn generally refers to n passages of the virus in cell culture prior to sequencing and Sn generally refers to n passages of the virus in MDCK-SIAT1 cells (17). For the genotype at site 151, an annotation of X indicates a mixture of D151 and G151 in the original Sanger sequencing. The Ct value is the amount of viral material in the original clinical sample as determined by quantitative PCR.

FIG 1

Sequencing coverage along the influenza virus genome. Average sequencing coverage is plotted for 50-bp bins across the genome, with library replicates shown as solid and dashed lines.
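The binning described in this caption can be sketched as follows. This is a minimal illustration, not the authors' plotting code: the function name `binned_coverage` and the list-of-depths input are assumptions made for the example.

```python
def binned_coverage(depths, bin_size=50):
    """Average per-base sequencing depth over consecutive bins.

    depths   : list of read depths, one entry per genome position
               (hypothetical input format)
    bin_size : bin width in base pairs (50 bp in the figure)
    The final bin may be shorter than bin_size and is averaged
    over the positions it actually contains.
    """
    means = []
    for start in range(0, len(depths), bin_size):
        window = depths[start:start + bin_size]
        means.append(sum(window) / len(window))
    return means
```

Calling the function once per replicate library would give the two curves that the figure draws as solid and dashed lines.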

We identified all of the minor viral variants present at a frequency of at least 3% in the viral genome in both library replicates (Table 2). We did not observe the D151G variant in any of the nine clinical samples under these variant-calling criteria. To ensure that we were not missing extremely low-frequency variation, we calculated the frequency of D151G in each clinical sample on the basis of the frequency of A-to-G mutations at the second nucleotide position of NA site 151. We compared this frequency to the frequency of A-to-G mutations at other sites across the genome (Fig. 2). Minor-variant frequencies at NA site 151 fell well within the range expected from library preparation and sequencing errors. Therefore, we conclude that the D151G variant was not present at appreciable frequencies in the original clinical infections. Instead, the mutation must have arisen de novo or been enriched from an extremely low frequency during passage in cell culture.
TABLE 2

Within-host variants identified by deep sequencing

Sample   Variant      Frequency
WSPHL1   NS1-G47S     0.042
WSPHL3   HA-D513Y     0.035
WSPHL4   NA-E83K      0.32
WSPHL4   PB2-E40G     0.035
WSPHL4   PB2-R175K    0.042
WSPHL4   HA-E325K     0.06
WSPHL6   PB1-M372I    0.038
WSPHL6   PB1-H562Y    0.059
WSPHL7   PB1-F254F    0.34
WSPHL7   PA-P238P     0.119
WSPHL7   HA-I202V     0.115
WSPHL7   NP-P419P     0.268
WSPHL8   NA-F42F      0.153
WSPHL8   NA-N86T      0.204
WSPHL8   PB2-M631V    0.061
WSPHL8   PB1-I392M    0.079
WSPHL8   HA-R208S     0.081
WSPHL8   HA-A425A     0.161
WSPHL9   PB1-N518N    0.248
WSPHL9   PB1-E731E    0.21

Sites were called as variable if a nonconsensus base exceeded a frequency of 0.03, given a sequencing coverage of at least 100×, in both sequencing replicates.
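The calling rule in this footnote can be sketched in a few lines. This is an illustration under stated assumptions, not the authors' scripts: the input format (a dict mapping each site to per-base read counts), the function names, the requirement that both replicates share a consensus base, and the choice to report the mean of the two replicate frequencies are all hypothetical.

```python
MIN_FREQ = 0.03      # nonconsensus base must exceed this frequency
MIN_COVERAGE = 100   # minimum depth required at the site

def minor_frequencies(counts):
    """Return (coverage, consensus base, {minor base: frequency})."""
    total = sum(counts.values())
    if total == 0:
        return 0, None, {}
    consensus = max(counts, key=counts.get)
    minors = {base: n / total for base, n in counts.items()
              if base != consensus and n > 0}
    return total, consensus, minors

def call_variants(rep1, rep2):
    """Call variants that pass both thresholds in BOTH replicates.

    rep1, rep2 : dict mapping site -> {base: read count} (assumed format)
    Returns {(site, base): mean frequency across the two replicates};
    averaging is an assumption made for this sketch.
    """
    calls = {}
    for site in rep1.keys() & rep2.keys():
        cov1, cons1, minors1 = minor_frequencies(rep1[site])
        cov2, cons2, minors2 = minor_frequencies(rep2[site])
        if cov1 < MIN_COVERAGE or cov2 < MIN_COVERAGE or cons1 != cons2:
            continue
        for base in minors1.keys() & minors2.keys():
            if minors1[base] > MIN_FREQ and minors2[base] > MIN_FREQ:
                calls[(site, base)] = (minors1[base] + minors2[base]) / 2
    return calls
```

Requiring the variant in both replicates means that an error introduced in a single reverse transcription or PCR reaction is unlikely to be called.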

FIG 2

D151G does not exceed the frequency of library preparation and sequencing errors in unpassaged clinical samples. Shown is the distribution of frequencies of A-to-G mutations across the genome for each clinical sample. Typically, the D151 viral variant is encoded by the nucleotides GAT, and the G151 variant is encoded by GGT, meaning that D151G arises as the result of an A-to-G mutation. The red vertical line shows the proportion of A-to-G mutations at codon position 2 of amino acid site 151 of NA, which corresponds to the frequency of D151G. In cases where no A-to-G mutations were identified at this site, this red line is not shown. At each nucleotide site in the genome with consensus identity A, we calculated the total proportion of reads reporting an identity of G at that site and averaged this proportion between the two replicate libraries. As expected, A-to-G mutations make up <0.1% of the total sequencing reads at most sites in the genome and are probably errors introduced through library preparation and sequencing.
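The per-site calculation described in this caption can be sketched as follows. The data layout (a consensus string plus one per-position count dict per replicate) and both function names are assumptions for illustration. The final rank-style summary is also an assumption: the figure compares the site 151 frequency with the genome-wide distribution visually and does not report such a statistic.

```python
def g_proportion(counts):
    """Proportion of reads reporting G at one site, or None if uncovered."""
    total = sum(counts.values())
    return counts.get("G", 0) / total if total else None

def a_to_g_frequencies(consensus, rep1_counts, rep2_counts):
    """Replicate-averaged A-to-G frequency at every consensus-A site.

    consensus   : consensus genome sequence as a string
    rep*_counts : list of {base: read count}, one dict per position
                  (hypothetical input format)
    Sites lacking coverage in either replicate are skipped.
    """
    freqs = {}
    for pos, base in enumerate(consensus):
        if base != "A":
            continue
        p1 = g_proportion(rep1_counts[pos])
        p2 = g_proportion(rep2_counts[pos])
        if p1 is None or p2 is None:
            continue
        freqs[pos] = (p1 + p2) / 2
    return freqs

def fraction_of_sites_at_or_above(freqs, focal_pos):
    """Share of consensus-A sites whose A-to-G frequency is at least
    as high as the frequency at the focal site (e.g. NA codon 151,
    position 2). A value near 1 means the focal site is typical."""
    focal = freqs[focal_pos]
    return sum(f >= focal for f in freqs.values()) / len(freqs)
```

If the focal site sits inside the bulk of this background distribution, its A-to-G reads cannot be distinguished from library preparation and sequencing errors.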

DISCUSSION

The results of our deep-sequencing study support prior studies that failed to detect the D151G mutation in unpassaged clinical samples by Sanger sequencing or pyrosequencing (13, 15). In the GISAID EpiFlu database, mixed populations of D151 and G151 viral variants are common in clinical samples that have been passaged in cell culture, but these mixed populations are rare among unpassaged and egg-passaged populations (8). It is impossible to rule out the possibility that the D151G mutation reaches appreciable frequencies in some natural human infections, but strong and repeated selection for cooperation in cell culture seems to account for its prevalence among sequences in public databases. It is interesting to speculate about what biological factors might cause a variant that is rare in natural human infections to be strongly selected in cell culture. Influenza virus strains often acquire stereotypical mutations when they are grown in eggs (21, 22), but these passage adaptations appear to be less common in cell culture, particularly in MDCK-SIAT1 cells (17, 23). Nevertheless, differences in the types and distributions of cell surface receptors between MDCK-SIAT1 cells and human airways could account for some of the differences in genotypes we observe at NA site 151. We also previously observed that cooperation is stronger at high multiplicities of infection (MOIs) (8). Viral loads can be large during natural infections (Table 1), but recent studies of natural human infections have found that the effective reassortment rate is limited, suggesting that spatial heterogeneity within the host may limit viral circulation and coinfection (24). Moreover, human influenza virus infections, as well as those in animal models (25), experience a severe transmission bottleneck that greatly limits the genetic diversity initially present in an infection (26–28). In contrast, viral populations can rapidly reach high MOIs in cell culture (29). 
These different growth conditions may promote the emergence of D151G in cell culture but not in natural infections. Our study also underscores the importance of sequencing directly from unpassaged clinical samples. Mutations like D151G accumulate in cell culture within just a few passages and affect downstream analyses like inferences of positive selection (17). Careful records of passage histories combined with deep sequencing of unpassaged clinical samples can help distinguish natural variation from that generated in the lab.

MATERIALS AND METHODS

Viral samples.

We downloaded the set of 66 sequences in the GISAID EpiFlu database (18) corresponding to all of the full-length NA coding regions from human H3N2 influenza A virus isolates collected from 1 January 2000 to 26 August 2015 and submitted from Seattle, WA, or Shoreline, WA (see Table S1 in the supplemental material). We aligned each sequence pairwise with the A/Hanoi/Q118/2007 (H3N2) coding sequence (GenBank accession number CY104446) by using the program needle from EMBOSS version 6.6.0 (30). For each sequence, we determined the genotype at site 151 and designated the genotype X if there was an ambiguous nucleotide at that site. We identified sequences with ambiguous identities at site 151, suggesting the presence of mixed viral populations, and we extracted passage histories based on the metadata available in the GISAID EpiFlu database. For the nine strains described in Table 1, we were able to obtain aliquots of the original, unpassaged nasal swab samples in viral transport medium.

TABLE S1. GISAID acknowledgment table for the H3N2 influenza virus isolates analyzed in this study (XLS file, 0.03 MB).
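The genotype assignment at site 151 can be sketched as follows. This is a minimal illustration that assumes the NA coding sequence has already been aligned and is gap-free and in frame with the reference numbering; it does not reproduce the needle alignment step. The function name `genotype_at_codon` is hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = ("FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

def genotype_at_codon(coding_seq, codon_number=151):
    """Return the amino acid at a codon, or 'X' if the codon contains
    an ambiguous nucleotide (anything other than A, C, G, or T).

    coding_seq   : in-frame, gap-free coding sequence (assumption)
    codon_number : 1-based codon index, counting from the start codon
    """
    start = 3 * (codon_number - 1)
    codon = coding_seq[start:start + 3].upper()
    if len(codon) < 3:
        raise ValueError("sequence is too short to contain this codon")
    if any(base not in "ACGT" for base in codon):
        return "X"
    return CODON_TABLE[codon]
```

For example, a sequence whose codon 151 reads GRT (R is the IUPAC code for A or G) is reported as X, which corresponds to a Sanger trace with a mixture of GAT (D151) and GGT (G151).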

Viral deep sequencing.

We performed viral deep sequencing as previously described (19). In brief, we extracted viral RNA from unpassaged clinical samples with the QIAamp Viral RNA Minikit (Qiagen) in accordance with the manufacturer’s instructions. We reverse transcribed the viral RNA with Superscript III First-Strand Reaction Mix (Thermo Fisher) and an equimolar mixture of influenza virus-specific primers 5′ TATTGGTCTCAGGGAGCAAAAGCAGG 3′ and 5′ TATTGGTCTCAGGGAGCGAAAGCAGG 3′, which both bind to the conserved U12 region at one end of each influenza virus gene. The two primers differ by a single nucleotide to account for a known polymorphism in the region. We incubated the reverse transcription reaction mixtures at 25°C for 10 min (to help the short primer anneal), 50°C for 50 min, and 85°C for 5 min. We amplified the influenza virus genome with a mixture of 24 primers that bind to the ends of each influenza virus gene (31). For each gene, one primer binds to the conserved U13 region at one end of the gene and two primers bind to the conserved U12 region at the other end of the gene, allowing for the known polymorphism in the U12 region. We performed 35 cycles of PCR with an annealing temperature of 55°C and an extension time of 3 min. We purified the PCR product with 1× AMPure beads (Beckman Coulter, Inc.) and prepared libraries for Illumina sequencing by Nextera XT (Illumina) tagmentation. We sequenced the libraries on a NextSeq 500 platform (Illumina) with 150-bp paired-end reads. We performed all library preparation and sequencing in duplicate, starting from independent reverse transcription reaction mixtures (20).

Analysis of deep-sequencing data.

We first used Bowtie2 (32) to filter out reads that mapped to the human genome. Remaining reads are available in the SRA as BioProject PRJNA412675. We trimmed adapters from the raw reads with cutadapt version 1.8.3 (33). We first aligned the reads with the A/Victoria/361/2011 genome by using Bowtie2 and the --very-sensitive setting, then we used custom scripts to generate a new consensus genome sequence for each viral sample. We then realigned the reads with the corresponding consensus sequence and removed PCR duplicates with Picard version 1.43. We used custom scripts to filter out base calls with a quality score below 20, tally the total number of high-quality bases at each genome position, and annotate each variant’s codon position. We performed these initial analyses separately for each replicate library. We reported only variants that were located in a protein-coding sequence.
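The quality filter, base tally, and codon-position annotation described above can be sketched as follows. This is not the authors' custom code (which is in the GitHub repository listed under "Availability of data"); the pileup-style input and both function names are assumptions for illustration.

```python
from collections import Counter

MIN_QUALITY = 20  # base calls with a quality score below this are dropped

def tally_high_quality_bases(pileup, min_quality=MIN_QUALITY):
    """Count high-quality base calls at each genome position.

    pileup : dict mapping position -> list of (base, quality score)
             tuples, one per aligned read (hypothetical input format)
    Returns a dict mapping position -> Counter of bases.
    """
    return {
        pos: Counter(base for base, quality in calls if quality >= min_quality)
        for pos, calls in pileup.items()
    }

def codon_position(cds_index):
    """Codon position (1, 2, or 3) of a 0-based index within a
    coding sequence."""
    return cds_index % 3 + 1
```

With this indexing, the second nucleotide of NA codon 151 has 0-based index 3 * 150 + 1 = 451 in the NA coding sequence, and `codon_position(451)` returns 2.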

A note on codon numbering and gene annotation.

We numbered HA codons in accordance with the H3 numbering system. This HA numbering scheme assigns position 1 to codon 17 of the full HA-encoding gene, which is the beginning of the mature HA protein. The codons for all other genes are numbered sequentially, beginning with 1 at the N-terminal methionine. The M1 and M2 genes have 27 bp of in-frame overlap and 44 bp of out-of-frame overlap, and the NS1 and NEP genes have 30 bp of in-frame overlap and 251 bp of out-of-frame overlap. We annotated variants separately for each gene if they occurred in these overlap regions.
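The relationship between sequential HA codon numbers and H3 numbering stated above amounts to a fixed offset of 16. A minimal sketch follows, with hypothetical function names; how codons in the first 16 positions (before the mature protein) should be labeled is not specified in the text, so the sketch simply returns non-positive numbers for them.

```python
H3_OFFSET = 16  # codon 17 of the full HA gene is H3 position 1

def sequential_to_h3(codon_number):
    """Convert a 1-based codon number in the full HA gene to H3
    numbering. Codons 1 to 16 map to values of 0 or below."""
    return codon_number - H3_OFFSET

def h3_to_sequential(h3_number):
    """Convert an H3-numbered position back to the 1-based codon
    number in the full HA gene."""
    return h3_number + H3_OFFSET
```

Under this scheme, a variant reported at H3 position 202 corresponds to codon 218 counted from the initiating methionine of the full HA gene.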

Availability of data.

Sequencing reads are available in the Sequence Read Archive as BioProject PRJNA412675. The computer code that performs the analyses is available at GitHub (https://github.com/ksxue/D151G-clinical-public).

REFERENCES (10 of 30 shown)

1.  Density-dependent selection in vesicular stomatitis virus.

Authors:  Isabel S Novella; Daniel D Reissig; Claus O Wilke
Journal:  J Virol       Date:  2004-06       Impact factor: 5.103

2.  Quasispecies diversity determines pathogenesis through cooperative interactions in a viral population.

Authors:  Marco Vignuzzi; Jeffrey K Stone; Jamie J Arnold; Craig E Cameron; Raul Andino
Journal:  Nature       Date:  2005-12-04       Impact factor: 49.962

3.  Coexistence of hepatitis B virus quasispecies enhances viral replication and the ability to induce host antibody and cellular immune responses.

Authors:  Liang Cao; Chunchen Wu; Hui Shi; Zuojiong Gong; Ejuan Zhang; Hui Wang; Kaitao Zhao; Shuhui Liu; Songxia Li; Xiuzhu Gao; Yun Wang; Rongjuan Pei; Mengji Lu; Xinwen Chen
Journal:  J Virol       Date:  2014-05-21       Impact factor: 5.103

4.  Fast gapped-read alignment with Bowtie 2.

Authors:  Ben Langmead; Steven L Salzberg
Journal:  Nat Methods       Date:  2012-03-04       Impact factor: 28.547

5.  Sequential passage of influenza virus in embryonated eggs or tissue culture: emergence of mutants.

Authors:  C Brand; P Palese
Journal:  Virology       Date:  1980-12       Impact factor: 3.616

6.  Recent H3N2 influenza virus clinical isolates rapidly acquire hemagglutinin or neuraminidase mutations when propagated for antigenic analyses.

Authors:  Benjamin S Chambers; Yang Li; Richard L Hodinka; Scott E Hensley
Journal:  J Virol       Date:  2014-07-02       Impact factor: 5.103

7.  Cooperation between different RNA virus genomes produces a new phenotype.

Authors:  Yuta Shirogane; Shumpei Watanabe; Yusuke Yanagi
Journal:  Nat Commun       Date:  2012       Impact factor: 14.919

8.  Measurements of Intrahost Viral Diversity Are Extremely Sensitive to Systematic Errors in Variant Calling.

Authors:  John T McCrone; Adam S Lauring
Journal:  J Virol       Date:  2016-07-11       Impact factor: 5.103

9.  Increased fidelity reduces poliovirus fitness and virulence under selective pressure in mice.

Authors:  Julie K Pfeiffer; Karla Kirkegaard
Journal:  PLoS Pathog       Date:  2005-10-07       Impact factor: 6.823

10.  Poliovirus intrahost evolution is required to overcome tissue-specific innate immune responses.

Authors:  Yinghong Xiao; Patrick Timothy Dolan; Elizabeth Faul Goldstein; Min Li; Mikhail Farkov; Leonid Brodsky; Raul Andino
Journal:  Nat Commun       Date:  2017-08-29       Impact factor: 14.919
