Literature DB >> 27658193

Sensitive Detection and Simultaneous Discrimination of Influenza A and B Viruses in Nasopharyngeal Swabs in a Single Assay Using Next-Generation Sequencing-Based Diagnostics.

Jiangqin Zhao1, Jikun Liu1, Sai Vikram Vemula1, Corinna Lin1, Jiying Tan1, Viswanath Ragupathy1, Xue Wang1, Christelle Mbondji-Wonje1, Zhiping Ye2, Marie L Landry3, Indira Hewlett1.   

Abstract

Reassortment of 2009 (H1N1) pandemic influenza virus (pdH1N1) with other strains may produce more virulent and pathogenic forms, detection and their rapid characterization is critical. In this study, we reported a "one-size-fits-all" approach using a next-generation sequencing (NGS) detection platform to extensively identify influenza viral genomes for diagnosis and determination of novel virulence and drug resistance markers. A de novo module and other bioinformatics tools were used to generate contiguous sequence and identify influenza types/subtypes. Of 162 archived influenza-positive patient specimens, 161(99.4%) were positive for either influenza A or B viruses determined using the NGS assay. Among these, 135(83.3%) were A(H3N2), 14(8.6%) were A(pdH1N1), 2(1.2%) were A(H3N2) and A(pdH1N1) virus co-infections and 10(6.2%) were influenza B viruses. Of the influenza A viruses, 66.7% of A(H3N2) viruses tested had a E627K mutation in the PB2 protein, and 87.8% of the influenza A viruses contained the S31N mutation in the M2 protein. Further studies demonstrated that the NGS assay could achieve a high level of sensitivity and reveal adequate genetic information for final laboratory confirmation. The current diagnostic platform allows for simultaneous identification of a broad range of influenza viruses, monitoring emerging influenza strains with pandemic potential that facilitating diagnostics and antiviral treatment in the clinical setting and protection of the public health.

Entities:  

Year:  2016        PMID: 27658193      PMCID: PMC5033603          DOI: 10.1371/journal.pone.0163175

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

Influenza viruses belonging to the family Orthomyxoviridae, are classified into influenzavirus A, B, and C. While influenza A and C viruses infect humans and many other species, influenza B viruses only infect humans and seals [1,2,3]. The Influenza virus genome consists of eight negative single-stranded RNA segments encoding eleven proteins: hemagglutinin (HA), neuraminidase (NA), nucleoprotein (NP), polymerase proteins (PB1, PB2, and PA), matrix proteins (M1 and M2), and nonstructural proteins (NS1 and NS2) [4]. Based on reactivity with surface glycoproteins HA and NA, influenza A viruses are classified into eighteen HA subtypes (H1-H18) and eleven NA subtypes (N1-N11) with a total of 144 subtypes possible, while influenza B virus has no subtypes [4,5,6,7,8]. New influenza viral variants emerge either due to mutations in the virus surface glycoproteins (antigenic drift) or reassortment between viral gene segments from different strains (antigenic shift). While, seasonal influenza outbreaks are thought to arise primarily due to antigenic drift, pandemics are thought to result from complex reassortment events among swine, human, and avian reservoirs. Some of them have been reported to directly infect humans, or transmit avian influenza A viruses from domestic poultry to human [4,9,10]. In addition, influenza viruses normally circulate in pigs are called “variant” viruses when found in people. There was a large outbreak of influenza A(H3N2) variant virus (H3N2v) during the 2010–13 seasons in U.S. and a highly pathogenic avian influenza (HPAI) H5N1 virus was identified in early 2014 in North America [10]. According to a USDA report in January 21 2015, HPAI H5N1, H5N2, and H5N8 viruses, which can be potentially transmitted to domestic poultry population leading to an outbreak [11], have been detected in U.S. wild birds in the winter of 2015 [12]. Identification of H7N9, H10N8 and other viruses in humans outside the U.S. has also been reported [9,13]. Reassortment of a 2009 H1N1 pandemic influenza A virus (pdH1N1) and influenza A(H3N2) variant virus (H3N2v) with other circulating seasonal strains in the U.S. can produce virus variants with transmissibility and altered virulence for humans [14,15]. Multiple influenza types/subtypes usually circulate during each season, for example, pdH1N1 were the predominant viruses circulating in December 2013 with fewer influenza B viruses, but influenza B virus became the predominant circulating virus by late March 2014, while seasonal A(H3N2), novel H3N2v, and mixed A/B viruses infections were also identified during the season in the U.S. [16]. It was reported that rapid influenza diagnostic tests showed low and variable sensitivity [17,18] depending on the influenza A subtype [19], while some FDA-cleared diagnostic tests did not detect the novel H3N2v viruses [20]. Diagnosis of influenza infection from an unknown risk respiratory specimen involves multiple assays [17,21]. Identification of the first case of A(H5N1) virus infection in North America involved multiple laboratories [10]. Current methods that evaluate antiviral resistance require two independent tests for the S31N substitutions conferring amantadine-resistance [22,23] and the H274Y substitution conferring oseltamivir-resistance [22,24]. Conventional methods used by diagnostic or reference laboratories for characterization of novel influenza viruses are ineffective since the tests are based on a small conserved region of each HA, NA, and M genes for detection and subtyping, requires the use of many sets of primers, steps and multiple reactions. In addition, this approach may be ineffective for newly emerging influenza subtypes due to mismatch of PCR primers and culture-based genome sequencing which requires high biosafety levels, preventing its application in clinics. Next-generation sequencing (NGS) analysis has been previously reported to greatly improve manipulation procedure for identification of influenza virus strains, these studies focused on examining influenza A [25,26,27] or B [28] viruses using cultured virus [27], random [27], universal [25], and multiple [28] primers. None of these studies reported the use of a paired set of degenerate universal primers to simultaneously identify whole-genome sequences of influenza A and B viruses from a large set of clinical specimens using an NGS assay. We previously reported development of a genomic nanomicroarray and NGS combination approach for influenza virus screening and laboratory confirmation [29]. Here, we extended the NGS approach to potential application for influenza virus surveillance, whole-genome characterization of putative antiviral resistance markers and virulence factors that may confer high virulence in human hosts. We developed a novel universal RT-PCR-NGS platform by testing 162 clinical influenza-positive specimens and demonstrated a “one-size-fits-all” approach to simultaneously identify unknown influenza infections and co-infections in a single tube reaction per sample. The analytical sensitivity of the NGS platform was also evaluated in our studies.

Materials and Methods

Viruses, Clinical Specimens and Tests

Pre-titrated stocks of reference strains of influenza viruses were procured from Dr. Stephen Lindstrom (Centers for Disease Control and Prevention, CDC, Atlanta, GA) or cultured in embryonated chicken eggs at US Food and Drug Administration (FDA). Clinical nasopharyngeal swab specimens were collected from patients with symptoms of influenza-like illness, submitted during the 2010–11 and 2012–13 influenza seasons to the Clinical Virology Laboratory (CVL), Yale New Haven Hospital, New Haven, CT. These specimens were tested as requested by the patients’ physicians using a direct fluorescent antibody (DFA) immunoassay (SimulFluor Respiratory Virus Screen Reagent, Millipore, Billerica, MA) and/or by quantitative RT-PCR (RT-qPCR), as previously described [21]. A total of 162 nasopharyngeal swab specimens that tested as influenza A- and/or B-positive were de-identified and sent to the Laboratory of Molecular Virology (LMV) at FDA (Institutional Review Board (IRB) approval# Research Involving Human Subjects Committee (RIHSC):13-048B) for sequencing (Fig 1). All participants, or their parents or guardians, provided written informed consent when their specimens collected in Yale New Haven Hospital.
Fig 1

Study profile.

Enrollment of 162 patients aged 20 months to 89 years with symptoms of influenza-like illness during February 19, 2011 to March 8, 2013 from over twenty cities and towns in Connecticut, and submitted to the Clinical Virology Laboratory (CVL) at Yale New Haven Hospital, New Haven, Connecticut prior to detect by universal RT-PCR and NGS assays at the Laboratory of Molecular Virology (LMV) in FDA. A total of 381 genome sequences identified from 55 specimens including A(H3N2), A(pdH1N1), and Influenza B viruses were submitted into GenBank. DFA, direct fluorescent antigen test; RT-qPCR, quantitative RT-PCR; NGS, next-generation sequencing.

Study profile.

Enrollment of 162 patients aged 20 months to 89 years with symptoms of influenza-like illness during February 19, 2011 to March 8, 2013 from over twenty cities and towns in Connecticut, and submitted to the Clinical Virology Laboratory (CVL) at Yale New Haven Hospital, New Haven, Connecticut prior to detect by universal RT-PCR and NGS assays at the Laboratory of Molecular Virology (LMV) in FDA. A total of 381 genome sequences identified from 55 specimens including A(H3N2), A(pdH1N1), and Influenza B viruses were submitted into GenBank. DFA, direct fluorescent antigen test; RT-qPCR, quantitative RT-PCR; NGS, next-generation sequencing.

RNA Extraction and Universal RT-PCR

The RNA extraction, some primers, and Reverse Transcription-PCR (RT-PCR) procedures were described previously [29] and in S1 File. According to previous reports and our testing algorithm using multiple alternative options [29,30,31,32], a new set of degenerate universal primers for RT (uniflu, 5’-IAGCARAAGC -3’) and fusion primers for PCR (unifluF, 5’-; unifluR, 5’-) were designed and tested. The 13bp flanking sequences italicized were added at the 5’ end of each PCR primer to enhance the annealing temperature and achieve high fidelity and yield in PCR amplification. These primers target highly conserved regions at the 5'- and 3'- terminus of each segment of influenza virus to amplify the whole-genome concomitantly, and have coverage of sequence variants for influenza A, B, and C viruses to achieve improvements in analytical sensitivity and inclusivity.

MiSeq Sequencing

Sample preparation and bioinformatics data analysis was performed as described [29]. Briefly, the concentration of mega-amplicons were measured by using the Qubit dsDNA BR Assay System (Covaris, Woburn, MA, USA), and 1 ng of DNA product was processed for NGS sample preparation by using a Illumina Nextera XT DNA Sample Preparation Kit. Mega-amplicons for each specimen were internally marked with dual-index primers (S2 Fig). Up to 96 specimens were barcoded and run in one lane of the sequencer. NGS was performed using a MiSeq v2 kit (500 cycles) to produce 2x250 paired-end reads (Illumina, San Diego, CA). After automated cluster generation in MiSeq, the sequencing was processed and genomic sequence reads obtained.

De novo Assembly Strategy

Sequencing reads of approximately 250bp in length were trimmed and the sequence data verified using FastQC software prior to de novo assembly. Contigs for individual gene segments from influenza viruses were generated using the de novo assembly module on the CLC bio software (v6.0.6, CLC bio, Cambridge, MA). The parameter for mapping reads back to contiguous was set on as similarity fraction 0.9; length fraction 0.5; mismatch cost 2; insertion cost 3 and deletion cost 3. For characterization of influenza whole-genomes with high confidence coverage, minimum contig length was set at 800bp to assemble the consensus sequences and minimum coverage was 1000 reads [33]. For diagnosis of influenza virus infections, minimum contig length used was 200bp and higher lengths for 2x reads per base in de novo assembly (low confidence coverage, Fig 1 and S3 Fig) [34].

Bioinformatics Data Analysis

A master file containing all unique contigous sequences of each mega-amplicon was generated and used to perform an all-by-all blast search in Influenza Research Database (IRD, www.fludb.org), the Global Initiative on Sharing All Influenza Data (GISAID, platform.gisaid.org) and NCBI database. The match with the highest score was retained to identify the specific influenza genome. Assembled sequences were aligned by the CLUSTAL W program, and phylogenetic analysis was performed using MEGA 5 and the neighbor-joining method [35]. The amino acid sequence for each segment identified was generated and aligned using the Vector NTI Advance package (v·11·5·2, Life Technologies, Grand Island, NY). The serotype and genotype according to the genetic lineage of influenza A viruses were determined using FluGenome [36].

Coverage of Depth and Breadth

The depth of sequence coverage (DOC) was calculated using formulae LN/G, where L is the read length, N is the number of reads and G is the theoretical genome length. The breadth of sequence coverage (BOC) was calculated as percentage of actual testing contig length (≥1000 reads or ≥30x of DOC) divided by the theoretical genome length [34,37]. Total average length of influenza A and B virus genomes was 13,588bp and 13,559bp, respectively. Representative lengths of individual gene segments from influenza A virus (A/Puerto Rico/8/1934(H1N1)) were as follows; 2341bp (PB2), 2341bp (PB1), 2233bp (PA), 1778bp (HA), 1565bp (NP), 1413bp (NA), 1027bp (M) and 890bp (NS).

Sequence Accession Numbers

The newly reported influenza genome sequences are available in GenBank under the following accession numbers: KM654599-KM654931, and KM654933-KM654980.

Results

NGS Detects Influenza Whole-Genome Segments

To determine the sensitivity of NGS detection, ten-fold serial dilutions of H5N1 viral RNA (1.8x109 TCID50/mL) were first produced followed by RT-PCR exponential amplification for whole-genome segments. As shown in Fig 2A, 10−7 dilution of A(H5N1) viral RNA (1.8x103 TCID50/mL) was detected in the RT-PCR assay shown as multiple faint visible bands. A total of ten PCR mega-amplicons from the dilutions were next tested using MiSeq, multiple contigs were finally automatically generated for each mega-amplicons, and all contig sequences were correctly identified as genome of A/Viet Nam/1203/2004(H5N1) virus. The overall average of DOC was 4,715. The average of BOC for each identified contig was 96% (≥30x depth) indicating that the near full-length genome sequence was covered (Table 1). Analysis of reads at each dilution level showed that the whole-genome DOC decreases from 4,073 in 10−1 dilution to 1,167 in 10−9 dilution (Fig 2B), and the whole-genome BOC decreases from 98% to 11% at the 10−9 dilution. We observed that the NP gene (segment 5) represented 99% of BOC at all detected titration points with the lowest detectable level at 10−9 dilution (1.8x101 TCID50/mL) with high confidence coverage (Fig 2C).
Fig 2

(A) Agarose gel illustrating the sensitivity of the universal RT-PCR assay. The picture shows PCR mega-amplicons in 2% agarose gel representing the whole-genome segments of A/Viet Nam/1203/2004(H5N1) virus in dilution series, and detection of viral RNA in a dilution of 10−7 (Table 1). (B) Illustrating average depth of coverage (DOC). (C) average breadth of coverage (BOC) in NGS characterization of the influenza genome.

Table 1

Length, reads and coverage from serial dilutions of influenza A(H5N1) virus.

gene segment (bp)serial dilutions
10−110−210−310−410−510−610−710−810−910−10
PB2 (2341)length (bp)2341213222302323185420240000
reads (x103)22·321·417·627·215·529·30000
depth2381228518802905165531290000
breadth100%91%95%99%79%86%0%0%0%0%
PB1 (2341)length (bp)2109212321022158189516482174000
reads (x103)16·217·613·421·720·115·227·7000
depth1730188014302317214716232958000
breadth90%91%90%92%81%70%93%0%0%0%
PA (2233)length (bp)2233219221712233201620852224000
reads (x103)2021·216·925·922·739·125·5000
depth2239237318922900254143782855000
breadth100%98%97%100%90%93%100%0%0%0%
HA (1779)length (bp)1772174717531757176017511160000
reads (x103)19·620·216·726·729·430·61·9000
depth275428392347375241324300267000
breadth100%98%99%99%99%98%65%0%0%0%
NP (1565)length (bp)1556155615561556155615541556155615430
reads (x103)31·433·127·22934·2120·537·819·663·30
depth501652884345463354631924960383131101120
breadth99%99%99%99%99%99%99%99%99%0%
NA (1398)length (bp)1371138913981398138913890000
reads (x103)17·717·714·419·32134·90000
depth3165316525753451375562410000
breadth98%99%100%100%99%99%0%0%0%0%
M (1027)length (bp)1027101610271003101610271027101800
reads (x103)5357·150·453·978·194·986·53000
depth12902139001226913121190122310121056730300
breadth100%99%100%98%99%100%100%99%0%0%
NS (875)length (bp)88488388989100087700
reads (x103)40·844·138·645·700014·400
depth11657126001102913057000410300
breadth100%100%100%100%0%0%0%100%0%0%
Whole Genome (13559)length (bp)1329313038131261331911486114788141345115430
reads (x103)221232·5195·1249·3220·9364·5179·36463·30
depth*4073428735974597407367213306118011670
breadth*98%96%97%98%85%85%60%25%11%0%
No. of genes.8888775310

Ten-fold serial dilutions were used in titrations of influenza A(H5N1) virus to establish the analytical sensitivity by universal RT-PCR and NGS assays. Minimum contiguous length sets at 800bp to assemble the consensus sequences for characterization of each segment full-length sequence and minimum coverage is 1000 reads. A calculated whole-genome base of A/Viet Nam/1203/2004 virus is 13559bp.

* calculation for virus whole-genome.

(A) Agarose gel illustrating the sensitivity of the universal RT-PCR assay. The picture shows PCR mega-amplicons in 2% agarose gel representing the whole-genome segments of A/Viet Nam/1203/2004(H5N1) virus in dilution series, and detection of viral RNA in a dilution of 10−7 (Table 1). (B) Illustrating average depth of coverage (DOC). (C) average breadth of coverage (BOC) in NGS characterization of the influenza genome. Ten-fold serial dilutions were used in titrations of influenza A(H5N1) virus to establish the analytical sensitivity by universal RT-PCR and NGS assays. Minimum contiguous length sets at 800bp to assemble the consensus sequences for characterization of each segment full-length sequence and minimum coverage is 1000 reads. A calculated whole-genome base of A/Viet Nam/1203/2004 virus is 13559bp. * calculation for virus whole-genome.

Molecular Confirmation of Simultaneous Detection of Influenza A and B Viruses in a Single Specimen/Test

Since some influenza related hospitalized cases were found to be associated with mixed A and B virus co-infections [1,16], analytical inclusivity studies were performed next by using a new set of degenerate universal primers to achieve improved coverage for varieties of both influenza A and B viruses. We tested A/Fujian Gulou/1896/09(H1N1), A/Perth/16/2009(H3N2), and B/Wisconsin/01/2010 reference viruses. Total RNA was extracted from spiked samples containing either individual or a mixture of two strains 1:1, and RT-PCR was performed. After the NGS test and following de novo assembly, eight contigs were generated for A(H1N1), A (H3N2), and B viruses, respectively. Fifteen and sixteen contigs were generated from the mixed A(H1N1), and A(H3N2) with B viruses supported by high confidence coverage, respectively (S1 Table). A validation assessment for these contigs from mixed viruses confirmed that the genome sequences are identical to the gene segments of the A(H1N1), A(H3N2), or B viruses, respectively. This is the first report of analytical validation of degenerate universal primers that can simultaneously amplify influenza A and B viruses for NGS identification and discrimination of influenza viruses in a single sample/test.

Detection of Influenza Virus in Nasopharyngeal Swab Specimens

By using the aforementioned strategy, we next tested 162 clinical nasopharyngeal swab specimens (Fig 1) which previous detection in CVL showed 74%(120/162) DFA test positive, 60%(98/162) RT-qPCR test positive, and 6.2%(10/162) for influenza B viruses. The range of Ct values for the 162 specimens is between 17.4 and 39.2. The PCR mega-amplicons prepared for each of the clinical specimens were analyzed using MiSeq. 99.4%(161/162) specimens detected influenza viral genome (see description below, S4 Table). To simplify the table, we reported only 57 isolates with whole-genome segments including 35 A(H3N2) isolates, twelve A(pdH1N1), and ten influenza B viruses identified in NGS data analysis. The results from four different detection methods including DFA, RT-qPCR, universal RT-PCR and NGS assays were summarized in Table 2 to represent the types/subtypes for influenza viruses. Results of NGS data analysis representing full-length sequences were listed in S2 Table. The average whole-genome DOC for these viruses was between 0.1x103 and 13.7x103, and average whole-genome BOC was between 13% and 100% supporting high confidence coverage.
Table 2

Detection of influenza A and B viruses in nasopharyngeal swab specimens from naturally infected patients collected in the 2010–11 and 2012–13 seasons in Connecticut.

Patient IDAgeGenderCollected DateCity, StateDetection Methods
DFART-qPCR(Ct)Univ·PCRNGS Verified
Flu0039M12/29/2012Hamden, CTBB(22·9)++?B
Flu05328m.M12/30/2012New Haven, CTBnot done+?B
Flu0969F2/5/2013New Haven, CTBnot done+?B
Flu10172M3/5/2013West Haven, CTBB(21·2)++?B
Flu10333F3/7/2013New Haven, CTBnot done+++?B
Flu10723m.M3/5/2013New Haven, CTBnot done+?B
Flu10815M3/4/2013Hamden, CTBB(18·7)++?B
Flu11033F3/8/2013New Haven, CTBB(21·4)++?B
Flu11111F3/8/2013East Haddam, CTBnot done+?B
Flu1183M1/27/2013West Haven, CTBnot done++?B
Flu04276M12/28/2012Cheshire, CTAnot done+++H3N2
Flu05079F12/29/2012Essex, CTAA(21·1)++++H3N2
Flu05136F12/30/2012New Haven, CTAnot done++++H3N2
Flu05526F12/27/2012East Haven, CTAnot done++++H3N2
Flu05662F12/30/2012Clinton, CTAA(24·4)++++H3N2
Flu06068F12/30/2012New Haven, CTAnot done++++H3N2
Flu06629F12/30/2012East Haven, CTAnot done++H3N2
Flu06884M12/29/2012Hamden, CTAA(17·9)++++H3N2
Flu07228M12/29/2012New Haven, CTAnot done+++H3N2
Flu07489M12/29/2012West Haven, CTnot doneA(18·0)++++H3N2
Flu07547M12/30/2012Killingworth, CTAA(19·2)++++H3N2
Flu08571M12/31/2012Branford, CTAnot done++++H3N2
Flu08671M12/31/2012Branford, CTAnot done+++H3N2
Flu09049M1/1/2013Wallingford, CTAA(17·4)++++H3N2
Flu09882M2/4/2013Milford, CTAA(21·8)+++H3N2
Flu09944F1/3/2013Madison, CTAA(17·9)++++H3N2
Flu10520m.F3/4/2013Hamden, CTAnot done++++H3N2
Flu11761F1/25/2013New Haven, CTAnot done++++H3N2
Flu1196M1/24/2013Shelton, CTnot doneA(20·2)++++H3N2
Flu12285F1/27/2013Bridgeport, CTAnot done++++H3N2
Flu12456F1/25/2013New Haven, CTAnot done+++H3N2
Flu13514F1/28/2013North Haven, CTAnot done++++H3N2
Flu13822m.M1/30/2013New Haven, CTAA(19·5)++++H3N2
Flu1397M2/1/2013Derby, CTAnot done++++H3N2
Flu14016M1/29/2013New Haven, CTAnot done++++H3N2
Flu14156M1/28/2013New Haven, CTAnot done++++H3N2
Flu14644F2/8/2013New Haven, CTAA(23·1)++++H3N2
Flu1476F2/1/2013New Haven, CTAnot done++++H3N2
Flu15984M2/10/2013Hamden, CTAA(18·0)++++H3N2
Flu16637F1/26/2013New Haven, CTAA(28·8)++++H3N2
Flu16924F1/25/2013West Haven, CTAA(25·4)++++H3N2
Flu17066M1/25/2013West Haven, CTnot doneA(19·4)++++H3N2
Flu17539M1/25/2013New Haven, CTAA(28·5)-H3N2
Flu17784F1/27/2013New Haven, CTAA(27·5)-H3N2
Flu17839M1/31/2013Hamden, CTAnot done++++H3N2
Flu10039M2/5/2013Meriden, CTnot doneA(23·9)+++pdH1N1
Flu11342F1/25/2013West Haven, CTAnot done++++pdH1N1
Flu15039M2/6/2013Meriden, CTAnot done++++pdH1N1
Flu16161F1/31/2013Hamden, CTAnot done++++pdH1N1
Flu1658M1/30/2013North Haven, CTAnot done++++pdH1N1
Flu18255M4/7/2011New Haven, CTAA++++pdH1N1
Flu18376F3/7/2011West Haven, CTnot doneA+++pdH1N1
Flu18430F2/22/2011Old Lyme, CTnot doneA+++pdH1N1
Flu18655M2/19/2011Naugatuck, CTnot doneA+++pdH1N1
Flu18781F2/23/2011New Haven, CTAA++++pdH1N1
Flu18810M2/28/2011Stamford, CTAA++++pdH1N1
Flu18946M3/6/2011New Haven, CTAA++++pdH1N1

Abbreviations: M, male; F, female; m., months; DFA, direct fluorescent antigen test; RT-qPCR, quantitative RT-PCR; Ct, cycle threshold; Inad, inadequate cells for DFA; A, influenza A virus; B, influenza B virus; “+”, universal PCR positive in agarose gel assay for influenza A virus; “-”, universal PCR negative in agarose gel assay; “?”, PCR positive with different pattern from influenza A virus.

Abbreviations: M, male; F, female; m., months; DFA, direct fluorescent antigen test; RT-qPCR, quantitative RT-PCR; Ct, cycle threshold; Inad, inadequate cells for DFA; A, influenza A virus; B, influenza B virus; “+”, universal PCR positive in agarose gel assay for influenza A virus; “-”, universal PCR negative in agarose gel assay; “?”, PCR positive with different pattern from influenza A virus.

NGS Assay Shows High Detection Sensitivity

When we evaluated analytical sensitivity of the NGS method and compared it with other clinical tests (S3 Table), the criteria of minimum coverage length of 200bp and more than 2 reads was used to assemble contigs mapping to the influenza genome. Both M (278bp/810 reads) and NS (716bp/16 reads) genes were identified at the 10−10 dilution level in the reference H5N1 strain indicating highly sensitive detection. Likewise, we further re-analyzed ten universal RT-PCR negative specimens and 19 universal RT-PCR positive but NGS negative specimens when using the criteria of minimum coverage length of 800bp and 1000 reads (Fig 1). The NGS data showed that multiple partial influenza genomic sequences were identified in each of 28 specimens representing different influenza genomic profiles (S3 Table) and confirmed the presence of 24 A(H3N2) and two A(pdH1N1) infections. There were two cases identified as A(H3N2) and A(pdH1N1) viruses co-infections (Flu087 and Flu185). The specimen Flu175 contained full-length segments for M (1033bp/3033 reads) and NS genes (920bp/1203 reads), while Flu177 contained full-length gene sequence for the M gene (1031bp/3645 reads).

Overall Reporting of Influenza A and B Virus Infections

A total of 27.4 million sequence reads representing a total length of 1.73 million base-pairs were obtained from 162 clinical specimens that were processed (S4 Table). Of these, approximately 26 million reads (95%) matched the influenza virus genome representing 78% of total read length with an average of 30,000 reads per contig from a total of 867 contigs. 99.4% (161/162) of specimens were identified to contain influenza genomes and their types/subtypes were determined. Among the influenza positive specimens, 83.3% (135/162) were seasonal A(H3N2), 8.6% (14/162) were A(pdH1N1), and 6.2% (10/162) were influenza B viruses. Two cases (1.2%) were identified as A(H3N2) and A(pdH1N1) virus co-infections, and influenza genome was not found in specimen Flu199. Among 123 influenza A viruses with whole-genome sequence data supported by high confidence reads (S5 Table), we obtained 83.6% (823/8 x123) contigs in total. The highest DOC and BOCs were observed for the HA, NP, NA, M, and NS genes with each demonstrating 99–100% coverage. The minimum average DOC observed for the PB1 gene was 1,467. The frequency of being detected by the RT-PCR-NGS assay was highest for gene segment M, followed by NS, NP, PA, NA, HA, PB1, and PB2. Genotype results and representative phylogenetic trees for HA, NA, and M genes are described in S1 Fig. The details of key signature amino-acid mutations from 47 selected influenza A viruses are summarized in Table 3, The molecular biomarkers for pandemic risk, drug-resistant-, transmission-associated signature mutations, and virulence factors were assessed and further described in the discussion section. We concluded that we have simultaneously and correctly identified genetically distinct lineages of influenza A(H3N2 and pdH1N1) and B viruses in clinical specimens.
Table 3

Comparison of the amino-acid substitutions in the proteins of 47 influenza A(H3N2, and pdH1N1) viruses.

StrainSubtypePB2PB1PANPNAM2NS1
19947562732710035640910011915211203192
Flu042H3N2SMKKARNVVYINND
Flu050H3N2SMKKAKSIVYINND
Flu051H3N2SMKKARSVVYINND
Flu055H3N2SMKKARNVVYINND
Flu056H3N2SMKKAKSIVYINND
Flu060H3N2SLKKARSIVYINND
Flu066H3N2SLKKARSVVYINND
Flu068H3N2SMKKARSVVYINND
Flu072H3N2SLKKARSVVYINND
Flu074H3N2SMKKARSVVYINND
Flu075H3N2SMKKARSVVYINND
Flu085H3N2SMKKARSVVYINND
Flu086H3N2SMKKARSVVYINND
Flu090H3N2SLKKARSVVYINND
Flu098H3N2SMKKARSIVYINND
Flu099H3N2SMKKARSVVYINND
Flu105H3N2SLKKARSVVYINND
Flu117H3N2SLKKARSVVYINND
Flu119H3N2SMKKARSVVYINND
Flu122H3N2SMKKARSVVYINND
Flu124H3N2SMKKARSVVYINND
Flu135H3N2SMKKARSVVYINND
Flu138H3N2SMKKARSVVYINND
Flu139H3N2SMKKARNVVYINND
Flu140H3N2SMKKARSVVYINND
Flu141H3N2SMKKARSVVYINND
Flu146H3N2SMKKARSVVYINND
Flu147H3N2SMKKARSVVYINND
Flu159H3N2SMKKARNVVYINND
Flu166H3N2SLKKARSVVYINND
Flu169H3N2SMKKARSVVYINND
Flu170H3N2SMKKAKSIVYINND
Flu175H3N2SMKKARSVVYINND
Flu177H3N2SMKKARSVVYINND
Flu178H3N2SMKKARSVVYINND
Flu100pdH1N1ALER-RNISRTSND
Flu113pdH1N1ALERIRNISRTSND
Flu150pdH1N1ALERVRNISRTSND
Flu161pdH1N1ALERIRNISRTSND
Flu165pdH1N1ALEK-RNISRTSND
Flu182pdH1N1ALERVRNISRTNND
Flu183pdH1N1ALERVRNISRTSND
Flu184pdH1N1ALERVRN-SRTSND
Flu186pdH1N1ALERVRNISRTSND
Flu187pdH1N1ALER-RNISRTSND
Flu188pdH1N1ALER-RNISRTNND
Flu189pdH1N1ALERVRNISRTSND
A/California/04/09pdH1N1ALERVRNVSRTSND
A/Panama/07/99H3N2SMKKARNVVYINSD
A/Alberta/01/2014H5N1ALERVKSRSRTSSE

The reference sequences are based on the inspection of amino-acids found in the majority of human pdH1N1, H3N2, and H5N1 viruses. The human-specific substitution residues were highlighted in bold and italic. A dashed line indicates that a space was inserted in the sequence to preserve the alignment.

The reference sequences are based on the inspection of amino-acids found in the majority of human pdH1N1, H3N2, and H5N1 viruses. The human-specific substitution residues were highlighted in bold and italic. A dashed line indicates that a space was inserted in the sequence to preserve the alignment.

Discussion

We performed NGS as part of a new detection platform for diagnosis of influenza virus infections using clinical specimens. A method for identification and discrimination of functional significance of diverse influenza viruses has been established. We report (i) multiplexed detection and simultaneous discrimination of influenza virus infections in nasopharyngeal swabs and identification of genetically distinct lineages of influenza A(H3N2 and pdH1N1) and B viruses in a single sequencing run, (ii) identification of influenza virus co-infections in a single specimen/test, (iii) development of new degenerate universal primer pairs for identification of influenza viruses and demonstration of highly sensitive detection of influenza virus infections using the NGS assay. We proposed a “one-size-fits-all” approach using an NGS diagnostic platform for extensive identification of a broad range of influenza virus infections including co-infections, by revealing their comprehensive genetic context and providing matrix reporting information for final sequence-based confirmation. A total of 162 influenza-positive de-identified nasopharyngeal swab specimens collected from patients in the Connecticut were tested in a blinded fashion using the RT-PCR-NGS platform. No reassortants were found but multiple mutations were detected in the specimens tested. Sequence analysis of 123 influenza A viruses revealed that 66.7%(82/123) of A(H3N2) viruses had a single signature mutation of E627K in the PB2 protein, and 88%(108/123) of influenza A(H3N2 and pdH1N1) viruses contained the S31N mutation in the M2 protein. A mutation of PB2 E627K has been reported to confer high virulence to the virus by enhancing replication efficiency, and increasing polymerase activity and disease severity of avian influenza viruses in mammals [38]. The S31N mutation in the transmembrane region of the M2 protein confers resistance to amantadine [9,39]. The emergence of E627K(PB2) and S31N(M2) mutations raise concerns of increased human disease severity. Influenza viruses contain multi-basic amino-acid motif at the proteolytic cleavage site of HAs, which is associated with broad tissue tropism and organ dissemination and determines viral pathogenicity [40]. Investigation of HAs in tested specimens showed a single arginine that appeared at the site (PSIQSR↓G) in A(pdH1N1) and (PEKQTR↓G) in A(H3N2) viruses, suggesting that these viruses belonged to a low pathogenic strain that poses less risk to humans. We aligned all protein sequences of 47 influenza A viruses and compared their sequences with those in humans that have been reported as amino-acid signatures in past influenza outbreaks (Table 3). The emergence of zanamivir- and oseltamivir-resistant viruses is facilitated by mutations in the NA protein, which provides a major target for developing anti-influenza drugs [41,42]. Sequence analysis revealed that the E119V signature mutations in these specimens may be susceptible to oseltamivir in A(H3N2) but not in A(pdH1N1) viruses. Three signature residues in the PA protein, PA-100A, PA-356A, and PA-409N, were reported in most H1N1, H2N2, H3N2 or H5N1 strains and could cause pandemics [43]. A mammalian-adapting V100A substitution was identified in most A(H3N2) but not in A(pdH1N1) viruses, and S409N was identified in all A(pdH1N1) and some A(H3N2) viruses, indicating their pandemic potential. NP protein plays an important role in viral RNA replication and cross-species transmission [44]. An amino-acid signature, valine at position NP-100 reported in 2009 pandemic H1N1 strain [45], was found as V100I mutation in all A(pdH1N1) and five A(H3N2) viruses, suggesting genetic changes of influenza A viruses in the 2012–13 seasons in this region. Influenza B virus is typically considered a mild disease and has been causing 20% to 50% of total influenza incidence [46]. An E105K point mutation at the NA tetramer in influenza B virus was reported leading to reduced susceptibility to NA inhibitor drugs [47]. This mutation was not identified in the B viruses tested in current study. It has been recognized that clinical influenza tests have variable sensitivity and multiple tests may be required for accurate diagnosis. NGS analysis of influenza viruses has been previously reported [25,26,27,28]. However, the sensitivity and simultaneous detection of various influenza viruses from a large set of clinical specimens in a single test has not been reported. Our analytical sensitivity study demonstrated that NGS can identify the full-length NP gene of influenza at a 10−9 dilution level (1.8x101 TCID50/mL) in the characterization study, and detect partial sequences of M and NS genes at 10−10 dilution level in the analytical study (S3 Table). When we performed universal RT-PCR-NGS assays in the analytical study using specimens with low virus concentrations [21], we further confirmed that multiple influenza genomic sequences were identified in each of the 28 human specimens demonstrating that NGS based diagnostics can achieve highly sensitive detection equivalent to the clinical RT-qPCR test at low virus concentrations. To determine whether the current NGS platform could detect influenza virus in different vertebrate species in addition to humans, we tested chicken specimens obtained from Egypt and identified A(H5N1) [48] and A(H9N2) (unpublished data) subtypes. Influenza C virus infection is not routinely tested in clinical settings, therefore, specimens and reference materials were not evaluated in current study. Since the degenerate universal primers reported here target conserved sequences for influenza A, B and C viruses, it is anticipated that the current NGS based detection platform would enable detection and accurate characterization of influenza infection including novel, emerging strains and reassortants arising during outbreaks. This platform allows multiplex identification and simultaneous discrimination of functional significance in a single test and provides the whole spectrum of the influenza genome. The method described here will have significant implications from the perspective of screening, monitoring, drug resistance, and vaccine development. These features of the assay will facilitate diagnostics and antiviral treatment in the clinical setting and enhance protection of public health. With the aid of bioinformatics, mathematics and epidemiologic studies, a typical genetic matrix composition from a known, and a novel emerging influenza infection can be determined. Development of an automated assembly and analysis pipeline will help uncover new levels of innovation and efficiency of transferring raw reads to specific genomic identification which will facilitate future use of an NGS diagnostics platform for public health surveillance and in clinical microbiology laboratory [49].

Phylogenetic analysis of the HA, NA and M gene sequences.

(TIF) Click here for additional data file.

Experimental flow (provided in response to reviewers queries).

(TIF) Click here for additional data file.

Comparison of NGS data presented in Table 1 and S3 Table (provided in response to reviewers queries).

(TIFF) Click here for additional data file.

Supporting Information.

(PDF) Click here for additional data file.

NGS discrimination of influenza A and B viruses in a single specimen using a set of degenerate universal primers in RT-PCR amplification.

(DOC) Click here for additional data file.

de novo assembly and BLASTn analysis of 57 nasopharyngeal swab specimens.

(DOC) Click here for additional data file.

Determination of analytical sensitivity using NGS sequencing-based diagnostics.

(DOC) Click here for additional data file.

Summary of NGS data analysis of 162 nasopharyngeal swab specimens.

(DOC) Click here for additional data file.

Summary of data analysis for characterization study of 123 influenza A viruses.

(DOC) Click here for additional data file.
  47 in total

1.  Incidence of amantadine-resistant influenza A viruses in sentinel surveillance sites and nursing homes in Niigata, Japan.

Authors:  H Masuda; H Suzuki; H Oshitani; R Saito; S Kawasaki; M Nishikawa; H Satoh
Journal:  Microbiol Immunol       Date:  2000       Impact factor: 1.955

2.  Comparison of Simplexa Flu A/B & RSV PCR with cytospin-immunofluorescence and laboratory-developed TaqMan PCR in predominantly adult hospitalized patients.

Authors:  Marie L Landry; David Ferguson
Journal:  J Clin Microbiol       Date:  2014-05-21       Impact factor: 5.948

Review 3.  Evolution and ecology of influenza A viruses.

Authors:  R G Webster; W J Bean; O T Gorman; T M Chambers; Y Kawaoka
Journal:  Microbiol Rev       Date:  1992-03

4.  Universal influenza B virus genomic amplification facilitates sequencing, diagnostics, and reverse genetics.

Authors:  Bin Zhou; Xudong Lin; Wei Wang; Rebecca A Halpin; Jayati Bera; Timothy B Stockwell; Ian G Barr; David E Wentworth
Journal:  J Clin Microbiol       Date:  2014-02-05       Impact factor: 5.948

5.  Simultaneous and complete genome sequencing of influenza A and B with high coverage by Illumina MiSeq Platform.

Authors:  Wiriya Rutvisuttinunt; Piyawan Chinnawirotpisan; Sriluck Simasathien; Sanjaya K Shrestha; In-Kyu Yoon; Chonticha Klungthong; Stefan Fernandez
Journal:  J Virol Methods       Date:  2013-07-12       Impact factor: 2.014

6.  Reassortment of pandemic H1N1/2009 influenza A virus in swine.

Authors:  D Vijaykrishna; L L M Poon; H C Zhu; S K Ma; O T W Li; C L Cheung; G J D Smith; J S M Peiris; Y Guan
Journal:  Science       Date:  2010-06-18       Impact factor: 47.728

7.  Emergence of H274Y oseltamivir-resistant A(H1N1) influenza viruses in Japan during the 2008-2009 season.

Authors:  Tatiana Baranovich; Reiko Saito; Yasushi Suzuki; Hassan Zaraket; Clyde Dapat; Isolde Caperig-Dapat; Taeko Oguma; Iman Ibrahim Shabana; Takehiko Saito; Hiroshi Suzuki
Journal:  J Clin Virol       Date:  2009-12-03       Impact factor: 3.168

8.  Identification and characterization of a highly pathogenic H5N1 avian influenza A virus during an outbreak in vaccinated chickens in Egypt.

Authors:  O Amen; S V Vemula; J Zhao; R Ibrahim; A Hussein; I K Hewlett; S Moussa; S K Mittal
Journal:  Virus Res       Date:  2015-09-09       Impact factor: 3.303

9.  Nanomicroarray and multiplex next-generation sequencing for simultaneous identification and characterization of influenza viruses.

Authors:  Jiangqin Zhao; Viswanath Ragupathy; Jikun Liu; Xue Wang; Sai Vikram Vemula; Haja Sittana El Mubarak; Zhiping Ye; Marie L Landry; Indira Hewlett
Journal:  Emerg Infect Dis       Date:  2015-03       Impact factor: 6.883

10.  New world bats harbor diverse influenza A viruses.

Authors:  Suxiang Tong; Xueyong Zhu; Yan Li; Mang Shi; Jing Zhang; Melissa Bourgeois; Hua Yang; Xianfeng Chen; Sergio Recuenco; Jorge Gomez; Li-Mei Chen; Adam Johnson; Ying Tao; Cyrille Dreyfus; Wenli Yu; Ryan McBride; Paul J Carney; Amy T Gilbert; Jessie Chang; Zhu Guo; Charles T Davis; James C Paulson; James Stevens; Charles E Rupprecht; Edward C Holmes; Ian A Wilson; Ruben O Donis
Journal:  PLoS Pathog       Date:  2013-10-10       Impact factor: 6.823

View more
  17 in total

1.  Biographical Feature: Marie-Louise Landry, M.D.

Authors:  Yi-Wei Tang
Journal:  J Clin Microbiol       Date:  2019-10-23       Impact factor: 5.948

Review 2.  Application of viromics: a new approach to the understanding of viral infections in humans.

Authors:  Mageshbabu Ramamurthy; Sathish Sankar; Rajesh Kannangai; Balaji Nandagopal; Gopalan Sridharan
Journal:  Virusdisease       Date:  2017-12-05

3.  Genome characterization and mutation analysis of human influenza A virus in Thailand.

Authors:  Somruthai Rattanaburi; Vorthon Sawaswong; Pattaraporn Nimsamer; Oraphan Mayuramart; Pavaret Sivapornnukul; Ariya Khamwut; Prangwalai Chanchaem; Kritsada Kongnomnan; Nungruthai Suntronwong; Yong Poovorawan; Sunchai Payungporn
Journal:  Genomics Inform       Date:  2022-06-30

4.  Multiplex Reverse Transcription-PCR for Simultaneous Surveillance of Influenza A and B Viruses.

Authors:  Bin Zhou; Yi-Mo Deng; John R Barnes; October M Sessions; Tsui-Wen Chou; Malania Wilson; Thomas J Stark; Michelle Volk; Natalie Spirason; Rebecca A Halpin; Uma Sangumathi Kamaraj; Tao Ding; Timothy B Stockwell; Mirella Salvatore; Elodie Ghedin; Ian G Barr; David E Wentworth
Journal:  J Clin Microbiol       Date:  2017-10-04       Impact factor: 5.948

5.  A robust and cost-effective approach to sequence and analyze complete genomes of small RNA viruses.

Authors:  Kiril M Dimitrov; Poonam Sharma; Jeremy D Volkening; Iryna V Goraichuk; Abdul Wajid; Shafqat Fatima Rehmani; Asma Basharat; Ismaila Shittu; Tony M Joannis; Patti J Miller; Claudio L Afonso
Journal:  Virol J       Date:  2017-04-07       Impact factor: 4.099

6.  Distinctive variation in the U3R region of the 5' Long Terminal Repeat from diverse HIV-1 strains.

Authors:  Christelle Mbondji-Wonje; Ming Dong; Xue Wang; Jiangqin Zhao; Viswanath Ragupathy; Ana M Sanchez; Thomas N Denny; Indira Hewlett
Journal:  PLoS One       Date:  2018-04-17       Impact factor: 3.240

7.  Seasonal Genetic Drift of Human Influenza A Virus Quasispecies Revealed by Deep Sequencing.

Authors:  Cyril Barbezange; Louis Jones; Hervé Blanc; Ofer Isakov; Gershon Celniker; Vincent Enouf; Noam Shomron; Marco Vignuzzi; Sylvie van der Werf
Journal:  Front Microbiol       Date:  2018-10-31       Impact factor: 5.640

8.  INSaFLU: an automated open web-based bioinformatics suite "from-reads" for influenza whole-genome-sequencing-based surveillance.

Authors:  Vítor Borges; Miguel Pinheiro; Pedro Pechirra; Raquel Guiomar; João Paulo Gomes
Journal:  Genome Med       Date:  2018-06-29       Impact factor: 11.117

Review 9.  Current Approaches for Diagnosis of Influenza Virus Infections in Humans.

Authors:  Sai Vikram Vemula; Jiangqin Zhao; Jikun Liu; Xue Wang; Santanu Biswas; Indira Hewlett
Journal:  Viruses       Date:  2016-04-12       Impact factor: 5.048

10.  Whole Genome Sequencing of Influenza A and B Viruses With the MinION Sequencer in the Clinical Setting: A Pilot Study.

Authors:  Kazuo Imai; Kaku Tamura; Tomomi Tanigaki; Mari Takizawa; Eiko Nakayama; Takahiko Taniguchi; Misako Okamoto; Yasumasa Nishiyama; Norihito Tarumoto; Kotaro Mitsutake; Takashi Murakami; Shigefumi Maesaki; Takuya Maeda
Journal:  Front Microbiol       Date:  2018-11-13       Impact factor: 5.640

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.