Literature DB >> 32550541

Evaluating methods for Avian avulavirus-1 whole genome sequencing.

Saar Tal1, Meirav Ben Izhak2, Chaim Wachtel3, Anat Wiseman1,4, Tzipi Braun2, Elinor Yechezkel1, Einav Golan1, Ruth Hadas5, Adi Turjeman6, Caroline Banet-Noach7, Michal Bronstein6, Avishai Lublin5, Elyakum Berman1, Ziv Raviv4, Michael Pirak1, Eyal Klement4, Yoram Louzoun8.   

Abstract

BACKGROUND: Avian avulavirus-1 (AAvV-1, previously Newcastle Disease Virus) is responsible for poultry and wild birds' disease outbreaks. Numerous whole genome sequencing methods were reported for this virus. These methods included cloning, specific primers amplification, shotgun PCR approaches, Sequence Independent Single Primer Amplification and next generation sequencing platform kits.
METHODS: Three methods were used to sequence 173 Israeli Avian avulavirus-1 field isolates and one vaccine strain (VH). The sequencing was performed on Proton and Ion Torrent Personal Genome Machine and to a lesser extent, Illumina MiSeq and NextSeq sequencers. Target specific primers (SP) and Sequence Independent Single Primer Amplification (SISPA) products sequenced via the Ion torrent sequencer had a high error rate and truncated genomes. All the next generation sequencing platform sequencing kits generated high sequence accuracy and near-complete genomic size.
RESULTS: A high level of mutations was observed in the intergenic regions between the avian avulavirus-1 genes. Within genes, multiple regions are more mutated than the Fusion region currently used for typing.
CONCLUSIONS: Our findings suggest that the whole genome sequencing by the Ion torrent sequencing kit is sufficient. However, when higher fidelity is desired, the Illumina NextSeq and Proton torrent sequencing kits were found to be preferable.
© 2019 Published by Elsevier B.V.

Entities:  

Keywords:  AAvV-1; AAvV-1, Newcastle Disease virus; Mutations; NCBI, National Center for Biotechnology Information; NGS; NGS, Next Generation Sequencing; Newcastle disease virus; PGM, Personal Genome Machine; SISPA, Sequence Independent Single Primer Amplification; SP, Specific primers; Sequencing; WGS; WGS, Whole Genome Sequence or sequencing; nt, nucleotide

Year:  2019        PMID: 32550541      PMCID: PMC7285907          DOI: 10.1016/j.gene.2019.100004

Source DB:  PubMed          Journal:  Gene X        ISSN: 2590-1583


Introduction

Newcastle Disease Virus, currently named Avian avulavirus 1 (Amarasinghe et al., 2017) is the causative agent of Newcastle disease (ND). The disease affects over 200 species of birds including wild birds, pigeons and poultry Newcastle disease outbreaks can have a devastating effect on flocks with mortality approaching 100%. AAvV-1 was first described in 1926 in Newcastle-on-Tyne England (hence the name) and on the island of Java (Alexander, 2001). AAvV-1 is a member of the order Mononegavirales, family Paramyxoviridae, genus Avulaviruse (Afonso et al., 2016; Amarasinghe et al., 2017; De Battisti et al., 2013). The virus has an approximately 15.2 kb ssRNA genome with varying lengths (Shi et al., 2011). The viral genome is composed of six genes expressing six proteins: nucleoprotein [NP], matrix, phosphoprotein [P], [M], fusion [F], hemagglutinin-neuraminidase [HN], and RNA polymerase [L]. AAvV-1 has only one serotype but has varying degrees of virulence (Gogoi et al., 2015). The F protein is cleaved between amino acid 111–117. The AAvV-1 virulence is correlated with the frequency of basic amino acids in the F protein proteolytic cleavage site (Gogoi et al., 2015). Accordingly, AAvV-1 has five virulence categories or pathotypes. These virulence pathotypes are according to severity, starting with the highly virulent to the non-virulent: Velogenic neurotropic, Velogenic viscerotropic, Mesogenic, Lentogenic and asymptomatic (Ganar et al., 2014). The AAvV-1 genome was previously sequenced using several methods, including: cloning (De Leeuw and Peeters, 1999), sequencing portions of the viral genome (Barbezange and Jestin, 2003, Barbezange and Jestin, 2005), or sequencing the full genome using target specific primers (Absalón et al., 2014; Meng et al., 2012; Munir et al., 2011). Prior to whole genome sequencing kits, most publications used target specific primer amplification of the viral genome. This method utilizes AAvV-1-universal primers that can be applied to any AAvV-1 subtype as well as a rapid amplification of cDNA ends (RACE) to obtain the viral genome 5′ and 3′ end sequences (Chenchik et al., 1996; Li et al., 2005; Schaefer, 1995; Shi et al., 2011; Shi and Jarvis, 2006). Pyrosequencing and shotgun PCR methods were also successful in producing a whole genome sequence or parts of the whole genome (Campana et al., 2014; Ma et al., 2014). SISPA, a shotgun PCR approach, was also shown to successfully sequence the AAvV-1 genome (Allander et al., 2001; Chrzastek et al., 2017; Djikeng et al., 2008; Karlsson et al., 2013; Van Borm et al., 2013). The AAvV-1 whole genome was successfully sequenced using the Illumina NGS (Dimitrov et al., 2017; Forrester et al., 2013; Frey et al., 2014; Satharasinghe et al., 2016) as well as via the Ion Torrent sequencing kits (Jakhesara et al., 2016; Karlsson et al., 2013). In this study we evaluated the performances of three different methods for AAvV-1 whole genome sequencing during AAvV-1 epizootic research. The three methods used were the target specific primer (SP) approach followed by SISPA and finally the whole genome sequencing platforms: Ion Torrent PGM and Proton Torrent. Few samples were sequenced using the Illumina NextSeq and MiSeq sequencing kits for comparison.

Materials and methods

Viral isolation

Five tracheal or cloacal swabs were pooled in a tube containing 2-3 ml PBS solution (Hylabs, Cat. BP 532/100S). The sample was incubated for 1 h, at room temperature. A 250 μl aliquot of the sample was added to a 2 ml Eppendorf containing 1.5 ml of PBS and 250 μl of X10 antibiotics solution for injection (HyLabs, Cat. TT359). The solution was incubated for 1 h at room temperature before injection to 9–11 days old embryonated chicken eggs (ECE's). A total of 200 μl of the prepared solution was injected per egg. Eggs were examined for seven days post injection using an egg lamp. Eggs containing dead embryos were opened and the allantoic fluid was collected and stored at −20 °C.

Viral RNA purification

AAvV-1 RNA purification was based on previously published methods (Dimitrov et al., 2014; Jakhesara et al., 2016; Hussain et al., 1989). Briefly, a total of 6 ml thawed allantoic fluid were added to 4 ml PBS. The samples were centrifuged in the Beckman Avanti J-E centrifuge at 10,000g for 15 min at 4 °C to remove tissues and cells. The clear supernatant was transferred into clear ultracentrifuge tubes (Beckman #344058) and was supplemented with 25 ml of PBS to a final volume of 35 ml. The samples were centrifuged in the Beckman Optima XE-90K using the swing rotor (SW 28) at 70,000g for 3 h at 4 °C. Following centrifugation, the supernatant was decanted and the pellet containing the virus was resuspended in 110 μl PBS (final volume 170 ul). The viral suspension was transferred to an Eppendorf tube containing 30 μl of DNase-RNase buffer. The DNase-RNase buffer was comprised of 10 units of DNaseI and 4.5 units of RNaseA suspended in x1 DNaseI buffer (NEB Cat. M0303L and Sigma Cat. R4875, respectively). The viral suspension was incubated at 37 °C for 2 h to decrease host DNA and RNA. RNA purification was performed using the Trizol LS kit (Thermo Fisher, Cat 10296028) according to the manufacturer's instructions. Viral RNA was stored at −80 °C.

Sequence specific primer amplification of AAvV-1 RNA

A total of 260 AAvV-1 full genome sequences from NCBI and one Israeli AAvV-1 vaccine (VH) sequence were used (sequence courtesy of Dr. Banet-Noach, Phibro Israel). The sequences were aligned using Clustal Omega v1.1.0 (Sievers et al., 2011). The resulting reverse transcription primers and PCR primers (see Table 1) were designed to amplify 3.5 kb overlapping amplicons of the AAvV-1 genome.
Table 1

AAvV-1 universal reverse transcription primers (AAvV-1uniRT) and amplification primers (AAvV-1final).

Final primersSequenceFromToProduct
AAvV-1uni RT_1gactttgYgcaRgcgatgatg24272461
AAvV-1uni RT_2caaatgcYtctccYcaggtDgcYaagatac42074236
AAvV-1uni RT_3aaccaatgaRgctgtRcaYgaRgtcac50275053
AAvV-1uni RT_4acYcgRccaggtagtRtccc77577776
AAvV-1uni RT_5gcRgagcaYcagatYatcctacc84098431
AAvV-1uni RT_6gYcagaagctRtggacRatgatctc10,54610,570
AAvV-1uni RT_7gaYggRtcacaccaRcttgc12,97512,994
AAvV-1uni RT_8tatgcMtgtMgaggRgatatg14,22314,243
AAvV-1final_IFgccatgactgcRtatgagac6566753281
AAvV-1final_IRtcYacStccacatYaatagtgac39153937
AAvV-1final_IIFagYctgtaYaatctYgcRctcaatgtcac389139473533
AAvV-1final_IIRccctccRtaRactgRgaacc73537424
AAvV-1final_IIIFgcYtgYatgtaYtcaaagac565256712851
AAvV-1final_IIIRagYggtagcccRgtYaatttcc84828503
AAvV-1final_IVFctMtaYtactggaaattRacYgggctacc847285003442
AAvV-1final_IVRtctatattgctYggRagatgRaacc11,89011,914
AAvV-1final_VFaacacYgtaatgtcYtgtgc10,90510,9243791
AAvV-1final_VRctgYttRagtgatgtYctct14,67714,696
AAvV-1 universal reverse transcription primers (AAvV-1uniRT) and amplification primers (AAvV-1final). Optimized reverse transcription was performed using Agilent's Accuscript high fidelity Reverse Transcriptase (Cat. 600089) with the following modifications. A total of 100 ng RNA template was supplemented with 2 μl of accuscript buffer (×10), 2 μl of 10 μΜ the reverse transcription primers mix, 0.8 μl of dNTP's 100 mM and DEPC treated DDW to a final volume of 10 μl. Pre-boil was performed at 65 °C for 15 min and the tubes were allowed to cool slowly to room temperature, according to the manufacturer's instructions. The pre-boiled mix was supplemented with 3 μl of DEPC DDW, 2 μl DTT, 3 μl of 20% DMSO solution, 1 μl of Betain 5 M solution and 1 μl of Accuscript reverse transcriptase enzyme. The reverse transcription was performed at 37 °C for 2 h followed by a heat inactivation step at 70 °C for 15 min. The Q5 high fidelity PCR mix (NEB, Cat. M0493) was optimized to efficiently amplify up to 6 kb of the viral genome. To this end, 2 μl of the reverse transcription reaction were used as template supplemented with 2 μl of 10 μM forward and reverse primers, 2 μl of 10 mM dNTP's, 4 μl of NEB Q5 buffer (×5), 0.4 μl Q5 enzyme, 3 μl of 20% DMSO solution, 3 μl of Betain 5 M solution and 3.6 μl DEPC DDW (20 μl final reaction volume). The PCR reaction was performed in the S1000 thermal cycler (Biorad). The reaction conditions were as follows: 95 °C for 3 min, 35 cycles of 95 °C – 20 s, 35 °C – 30 s, 72 °C – 45 s followed by a final step of 72 °C - 10 min. The PCR results were examined in a 1.5% agarose gel using Ethidium bromide prior to sequencing. The PCR reactions were purified using Stratec's PCR and enzymatic reaction purification kit (MSB PCRapace kit, Statec, Cat. 1,020,220,300) according to the manufacturer's instructions.

SISPA

The SISPA protocol was performed according to the literature with the modifications listed below (Djikeng et al., 2008). Briefly, 100 ng of purified RNA were reverse transcribed using Agilent Accuscript reverse transcriptase using 1 μl of 10 mM FR26RV-N primer (5′ GCCGGAGCTCTGCAGATATCNNNNNN 3′), 2 μl Agilent RT buffer, 0.8 μl of 100 mM dNTP's and DEPC DDW were added to a final volume of 13 μl. The pre-boil was performed at 65 °C for 15 min and the samples were allowed to cool to room temperature. The reverse transcription reaction supplemented contained 2 μl DTT, 3 μl DMSO 20%, 1 μl Betain 5 M and 1 μl Agilent accuscript reverse transcriptase, reaching a final reaction volume of 20 μl. The reaction mix was incubated for 1 h at 37 °C followed by heat inactivation at 75 °C for 15 min. The reverse transcription reaction was supplemented with Klenow fragment reaction mix. The Klenow reaction contained 5 μl NEB buffer 2 (X10), 1 μl FR26RV-N primer, 1 μl (5 units) of Klenow fragment (NEB Cat. M0210S), 1 μl of 10 mM dNTP's and DEPC DDW to a mix volume of 30 μl. The Klenow reaction was incubated at 37 °C for 1 h followed by heat inactivation at 70 °C for 10 min. The reaction was purified using Stratec's PCR and enzymatic reaction purification kit, as before, having a final elution volume of 20 μl. The PCR reaction was performed using the Q5 high fidelity polymerase kit, as listed above, using 2 μl of the RT-Klenow purified reaction as template and 2 μl of 10 mM primer FR26RV (5′ GCCGGAGCTCTGCAGATATC 3′), 2 μl dNTP's, 4 μl Q5 buffer X10, 0.4 μl Q5 enzyme, 3 μl Betain 5 M, 3 μl DMSO 20% and 3.6 DEPC DDW to a final reaction volume of 20 μl. The reaction conditions were as follows: 95 °C-5 min, 40 cycles of 95 °C-30 s, 55 °C-35 s, 72 °C-7 s and a final elongation step of 72 °C-5 min. The PCR results were examined in a 1.5% agarose gel using Ethidium bromide prior to sequencing.

Sequencing platforms

The Specific Primers amplified DNA, SISPA amplified DNA and purified RNA samples were sent for sequencing using the Ion Torrent, Ion Proton (HyLabs) and Illumina NextSeq and MiSeq platforms (The Center for Genomic Technologies, the Hebrew university of Jerusalem).

SISPA and SP

The amplified samples were fragmented by sonication using a Bioruptor sonicator (Diagenode). The fragmented DNA was then used to prepare barcoded libraries using the The Ion Xpress™ DNA library prep kit (Life Technologies) according to manufacturer's instructions. Sequencing was performed using the Ion Torrent PGM (Bar-Ilan university).

Ion torrent PGM sequencing kit

RNA libraries were prepared from purified viral RNA using the Ion ress™ RNA library prep kit (Life Technologies) and the resulting barcoded DNA libraries were then sequenced on the Ion Torrent Proton according to the manufacturer's instructions. Sequencing was performed on the Ion and Proton Torrent PGM (HyLabs).

Illumina sequencing

Samples were sequenced using the Illumina MiSeq and NextSeq. The cDNA synthesis, sample denaturation, loading and Illumina kits were used according to the manufacturer's instructions. The samples were pair ended (251 × 2 and 38 × 2 for MiSeq and NextSeq, respectively) using the Illumina TruSeq sample preparation kit. The samples were sequenced either using the MiSeq reagent kit V2 (500 cycles) or the NextSeq 500 high output kit V2 (75 cycle).

Genome assembly

As a part of the quality control, nucleotides from the edges were trimmed and reads with poor quality scores were removed. Short reads were discarded. The remaining reads were used to de novo reconstruct the consensus sequence using Trinity v2.0.6 (Grabherr et al., 2011). Reads were mapped to the viral consensus sequence using bowtie2 v2.2.4 (Langmead and Salzberg, 2012). The resulting sam files were used to determine the quality of the sequence. Positions that had coverage of less than 5 reads were considered undetermined and written as N.

Accession numbers

The sequences of the AAvV-1 isolates included in the study were submitted to GenBank under the accession numbers MH371022-MH371102, MH377246-MH377325. Accession numbers are detailed in Supplementary Table S1.

Results

A total of 174 AAvV-1 samples were sequenced (See Appendix A and Supplementary Information). Of these, 173 were field isolates (10 wild bird; 31 backyard birds; 132 commercial poultry) and the Israeli AAvV-1 vaccine (VH) were sequenced using different platforms. The viruses were sequenced using three different methods including the specific primers (34), SISPA (57) and 96 samples via NGS sequencing kits (8 IonTorrent, 84 Proton Torrent, 2 Illumina MiSeq and 2 Illumina NextSeq). Several samples were sequenced using multiple methods. Except for the VH, all these methods utilized enriched viral RNA that was purified from the allantoic fluid (Jakhesara et al., 2016). The VH genome was purified from a commercial vaccination tablet. For the template-specific primers method, AAvV-1 universal primers were designed using the entire Pubmed database AAvV-1 sequences. Eight forward AAvV-1 universal primers were used for the reverse transcription reaction step and five universal primer pairs amplified 3.5 kb overlapping segments of the 15.2 kb viral cDNA. The resulting PCR products were purified prior to library preparation and sequencied via the Ion Torrent PGM platform kits followed by genome assembly. Preliminary tests using the lentogenic VH vaccine RNA as a template generated a contig of 14,000 nt's. The resulting VH contig covered ~94% of the viral genome (680–14,680 nt's). Following this test, 34 AAvV-1 velogenic field isolates were sequenced using this method. However, in all the field isolates the first amplicon (656–3937) was missing, resulting in a partial contig covering 72% of the viral genome (3890 nt–14,680 nt, see Fig. 1). The first amplicon PCR failure presumably occurred due to a poly-C region in the AAvV-1 genome located in a ~100 bp region near 1700 nt (1656–1761). The velogenic AAvV-1 field isolates have a poly-C region containing 3–4 repeats of 5–6 tandem Poly-Cytosine while the VH sequence had a smaller poly-C region containing only 3 repeats of 3–4 tandem repeats (data not shown). This region has a highly consistent drop in viral genome coverage in all the velogenic samples sequenced, regardless of the amplification method or NGS platform used (see Fig. 1, Fig. 2). This is likely due to G-quadruplexe (Chambers et al., 2015).
Fig. 1

Average coverage of each position in the AAvV-1 viral genome sequences, per method. The viruses were sequenced using three different methods including the specific primers (34), SISPA (57) and 96 samples via NGS sequencing kits (8 IonTorrent, 84 Proton Torrent, 4 Illumina). In all the methods a drop in the coverage near nucleotide 1700 is seen, possibly due to the G-quadruplexes. The Illumina and Proton Torrent have the highest coverage. The lowest coverage was obtained by the specific primers and SISPA methods (sequenced via the Ion Torrent).

Fig. 2

Fraction of covered samples in each position along the sequence per method as measured by the fraction of samples that have at least five reads for a given position (otherwise, the position is assigned an N). The coverage drop at position1, 700 is clearly observed, as well as the coverage drop in the 5′ and 3′ ends of the viral genome. Intragenic regions also have a high error rate.

Average coverage of each position in the AAvV-1 viral genome sequences, per method. The viruses were sequenced using three different methods including the specific primers (34), SISPA (57) and 96 samples via NGS sequencing kits (8 IonTorrent, 84 Proton Torrent, 4 Illumina). In all the methods a drop in the coverage near nucleotide 1700 is seen, possibly due to the G-quadruplexes. The Illumina and Proton Torrent have the highest coverage. The lowest coverage was obtained by the specific primers and SISPA methods (sequenced via the Ion Torrent). Fraction of covered samples in each position along the sequence per method as measured by the fraction of samples that have at least five reads for a given position (otherwise, the position is assigned an N). The coverage drop at position1, 700 is clearly observed, as well as the coverage drop in the 5′ and 3′ ends of the viral genome. Intragenic regions also have a high error rate. The SISPA method was developed to sequence novel RNA viruses (Froussard, 1992). It has been shown to be efficient and to successfully sequence the AAvV-1 genome as well as other RNA viruses (Allander et al., 2001; Djikeng et al., 2008; Djikeng and Spiro, 2009). The standard SISPA primers were examined in silico to verify no match to the AAvV-1 known sequences, thus avoiding technical issues (Karlsson et al., 2013). The SISPA method resulted in a contig coverage of over 99% of the viral genome. Additional 56 AAvV-1 isolates were sequenced using the SISPA method via the Ion Torrent sequencer. This resulted in an average coverage of 98–99% of the viral genomes (See Fig. 1, Fig. 2). The Ion Torrent, Illumina MiSeq and NextSeq RNA library preparation and sequencing kits were also examined using the VH vaccine and Israeli isolates. Purified VH RNA sequenced using the Ion Torrent sequencing kit resulted in 99.98% of the viral genome with the small exception of 5–7 mismatches compared to the known sequence. These mismatches were found only at the ends of the viral genome (at the sense 5′ end). The VH RNA was successfully sequenced using Illumina NextSeq reaching an average depth of 80,000 reads/nt, and a perfect 100% match to the known VH sequence. A total of 6 AAvV-1 field isolates were sequenced using the Ion Torrent sequencing kit followed by a second set of 82 isolates sequenced using the ProtonTorrent sequencing kit with a final average cover of >99% of the viral genome (see Fig. 1, Fig. 2). All three sequencing platforms and their kits (IonTorrrent, ProtonTorrent and Illumina) had similar nucleotide profile with few N reads or gaps. The SP and SISPA methods had higher N reads compared to the NGS sequencer kit average (8.9 and 10.3 respectively) (see Fig. 3).
Fig. 3

The mean nucleotide percentage, N reads and average gap per sequencing method. The graph indicates a similarity in the nucleotide consistency between the sequencing methods. The SP method has an 8.9 and SISPA has a 10.3 times higher N reads compared to the NGS sequencer kit average. The Proton torrent had the least N reads, approximately 10 times lower than the other NGS sequencing kits. All the sequencing kits (Illumina, IonTorrrent and Proton Torrent) had the lowest N frequency.

The mean nucleotide percentage, N reads and average gap per sequencing method. The graph indicates a similarity in the nucleotide consistency between the sequencing methods. The SP method has an 8.9 and SISPA has a 10.3 times higher N reads compared to the NGS sequencer kit average. The Proton torrent had the least N reads, approximately 10 times lower than the other NGS sequencing kits. All the sequencing kits (Illumina, IonTorrrent and Proton Torrent) had the lowest N frequency. Given the high sequencing accuracy and coverage of the NGS methods, we addressed the mutational load at each site within a host (sample). The high coverage allows us to detect not only the main consensus sequence, but also the quasi-species of the virus genomic population. A high mutation load was seen in SISPA and the specific primers sequences all along the viral genome, suggesting that these observed mutations are mainly sequencing errors. However, in the NGS based methods, mutations are limited to specific regions. The high mutation and deletion rate is observed in all sequencing methods near the 3′ and 5′ ends of the viral genome is possibly due to the folding of the viral genome ends leading to misreading. The mutation hotspots in the viral genome mutations are mainly between the genes, regardless of the method or NGS sequencer used (see Fig. 4a and b). Note that the average mutation rate with a sample in the F gene typically used to classify the virus into sub-types is lower than in most other genes.
Fig. 4

A: Mutation frequency per position in the sequence. The rectangles represent the gene locations along the sequence. The SISPA and Specific primer methods were unable to reproduce the sequence near the 3′ and 5′ ends (Qiu et al., 2014). 4b. The average mutation fraction in each segment of the sequence for each method. It is notable that the segments between the genes, and in the 5′ and 3′ ends are the ones with higher percentage of mutations.

A: Mutation frequency per position in the sequence. The rectangles represent the gene locations along the sequence. The SISPA and Specific primer methods were unable to reproduce the sequence near the 3′ and 5′ ends (Qiu et al., 2014). 4b. The average mutation fraction in each segment of the sequence for each method. It is notable that the segments between the genes, and in the 5′ and 3′ ends are the ones with higher percentage of mutations. To further estimate the variability, we computed the fraction of NS mutations. While, there is no practically no difference between sequencing methods, as expected intergenic regions have significantly higher NS fractions compared with genic regions (Data not shown).

Discussion

Brute force NGS sequencing approach (i.e. total DNA or RNA purification and sequencing) has its own advantages (Li et al., 2016). It is mostly used for sequencing a small number of samples at low coverage. However, for sequencing of tens to hundreds of samples it is more cost-effective to remove the host genomic DNA or RNA, as much as possible (Dimitrov et al., 2017; Thomson et al., 2016; Tyler et al., 2016). This can be done using the physical properties of the virion (Jakhesara et al., 2016; Hussain et al., 1989) or by using purification kits post DNA or RNA purifications (Dimitrov et al., 2014, Dimitrov et al., 2017). The latter involves a considerable loss of target genomic material due to the purification steps losses. During this study we opted for the former approach for viral RNA purification (Temmam et al., 2015). This method resulted in 90–95% viral RNA sequences (data not shown). The low host DNA and RNA enabled loading the NGS platforms with a large pool of tagged cDNA samples while still obtaining high coverage. For the Ion Torrent and Proton Torrent up to 12 and 16 samples were loaded per sequencing run, respectively. Several methods and sequencing platforms were reported to amplify the AAvV-1 genome. The specific primer approach was mostly used for AAvV-1 whole genome sequencing (Absalón et al., 2014; Meng et al., 2012; Munir et al., 2011; Wei et al., 2008). However, the partial sequence obtained, the need for the RACE method to amplify the viral genome ends, the sequence inaccuracies and the amplification termination near position 1700 nt's of the field isolate genomes all suggest that such a method is not appropriate for whole genome sequencing of AAvV-1 (Chenchik et al., 1996; Li et al., 2005; Liu and Gorovsky, 1994; Schaefer, 1995; Shi and Jarvis, 2006). The SISPA method was also shown to reliably sequence the AAvV-1 genome (Allander et al., 2001; Chrzastek et al., 2017; Djikeng et al., 2008; Karlsson et al., 2013). The sequencing of other viruses using this method has failed, possibly due to technical issues (Karlsson et al., 2013). In our hands, SISPA successfully sequenced the AAvV-1 genome via the Ion Torrent PGM. Similarly to the specific primers method, it was found to have high error rates resulting in a considerably higher mutation rate compared to the sequencing kits of the NGS platforms. The SISPA was usually unable to sequence 100-200 bp from the 5′ and 3′ ends of the viral genome. This problem could possibly be resolved using a higher coverage, at the cost of running fewer samples per chip. Our findings concur with the reports of PCR amplification bias in NGS sequencing which were recently reported in several articles (Chrzastek et al., 2017; Dimitrov et al., 2017; Jones et al., 2015; Pinard et al., 2006; Thomson et al., 2016; Tyler et al., 2016; Van Borm et al., 2016). While PCR amplified samples can be used for sequencing low titer samples, the bias involved should be considered and avoided when possible (Aigrain et al., 2016; Chrzastek et al., 2017; Dimitrov et al., 2017; Huggett et al., 2015; Thomson et al., 2016; Tyler et al., 2016; Van Borm et al., 2016). In our hands, the Illumina NextSeq, ProtonTorrent and Ion Torrent sequencing kits generated nearly full viral genome (98–99% for the Ion and proton torrent systems and 100% for the Illumina NextSeq) with few mismatches at the 3′ and 5′ ends. From our experience, it is advisable to opt for a high coverage sequence (a minimum coverage of 5000, preferably 50,000) to cover the 1700 nt position (the poly-C region) as well as the 7000 nt region. We therefore recommend using the Proton Torrent or the Illumina Nextseq sequencing kits for whole genome sequencing of AAvV-1 isolates. Given the high fidelity of the NGS based sequencing, it is possible to study beyond the main variant. The quasi-species cloud (surrounding each main variant) is highly mutated in the intergenic regions, and in the 3′ and 5′. We have found a mutation rate of 30 mutations/year in the viral genomes examined (data not shown). Such mutations are expected, since their effect on the viral fitness may be limited. Considering the viral reverse transcriptase mutation rate, we expected a higher mutation in the viral genome (Menéndez-Arias, 2002; Menéndez-Arias et al., 2017). The AAvV-1 found mutation rate suggests there is a natural selection process. This process is possibly affecting the expected mutation rate and limiting the regions that can be affected (Fan et al., 2017; Miller et al., 2009).

Conclusions

We have evaluated different methods of AAvV-1 full genome sequencing. The main conclusions are a clear recommendation to purify the viral genome prior to sequencing. PCR amplifications of the viral genomes, either using shotgun PCR (SISPA in this case) or via target specific primers have generated poor results compared to NGS platform kits. Sequencing kits of the Illumina NextSeq and Proton torrent PGM have successfully resulted in 99–100% of the viral genome at a high coverage. The high coverage obtained via NGS methods shows a map of mutations in the AAvV-1 genome which are mainly in intergenic regions. The following are the supplementary data related to this article.

Table S1

Average Coverage per position of the virus per method in nucloetides. Coverage is defined as the fraction of samples that had at least one read in the appropriate position.

Table S2

Average Coverage per position of the virus per method in nucloetides. Depth is defined as the average number of read per sample in the appropriate position.

Table S3

Properties of each run, including average depth per position for each run, fraction of covered positions, Number of reads per run and the method used in each run. The names of the runs and the run numbers are for internal use and can be ignored.

Table S4

Submitted sequences.

Table S5

Time and cost per method.

Availability of data and material

The datasets used and/or analysed during the current study are available from the corresponding author on reasonable request.

Competing interests

The authors declare that they have no competing interests.

Funding

This work was supported by the [Grant No. 22-2222-14].

Authors' contributions

Tal Saar, Ben Izhak Meirav, Wachtel Chaim and Wiseman Anat and Broun Tzipi have made substantial contribution to the experimental design and acquisition of data. Yechezkel Elinor, Golan Einav, Hadash Ruth, Adi Turjeman, Banet Noach Caroline, Michal Bronstein and Avishai Lublin have all have made substantial contributions to acquisition of data, methodology and resources. Berman Elyakum, Raviv Ziv, Pirak Michael, Klement Eyal and Louzoun Yoram have made substantial contributions to analysis and interpretation of data, as well as funding acquisition. Tal Saar, Wiseman Anat and Louzoun Yoram have been involved in drafting the original manuscript draft as well as reviewing and editing the final draft. Pirak Michael, Klement Eyal, Raviv Ziv and Louzoun Yoram have been involved in Project administration.
Sample numberSourceSpecific primersSISPAPlatform sequencing kits
Ion torrentProton torrentMiSeqNextSeq
1818EPBv
11278EPBv
120420EPBv
173849EPBv
176674EPBv
120147EPBv
120430EPBv
195810EPBvvvv
LasotaPhibrovvvv
120809EPBv
124026EPBv
141353EPBv
120860EPBv
120654EPBv
120423EPBv
124027EPBvv
123498EPBv
117699EPBv
120807EPBv
137495EPBv
123526EPBv
121808EPBv
104477EPBvv
138692EPBv
137592EPBv
122509EPBv
137569EPBv
99188EPBv
114599EPBv
121118EPBv
125877EPBv
118279EPBv
115038EPBv
114112EPBv
141759EPBv
142551EPBv
142626EPBv
142784EPBv
143114EPBv
143696EPBv
144702EPBv
147004EPBv
149165EPBv
152628EPBv
153812EPBv
156916EPBv
159057EPBv
166631EPBv
168757EPBv
169730EPBv
170338EPBv
173294EPBvv
175571EPBv
176851EPBv
201029EPBv
203145EPBv
203308EPBvv
203762EPBv
205659EPBv
205770EPBv
206519EPBv
207436EPBv
209108EPBv
217142EPBv
218637EPBv
223030EPBv
223068EPBv
223759EPBv
123168EPBv
123945EPBv
125057EPBv
127508EPBv
127935EPBv
129688EPBv
131986EPBv
133813EPBv
137866EPBv
138629EPBv
139013EPBv
139141EPBv
140266EPBv
140339EPBv
147993EPBv
151754EPBv
208243EPBv
224349EPBv
224903EPBv
225193EPBv
226419EPBv
226501EPBv
28469EPBv
30507EPBv
34617EPBv
36911EPBv
37114EPBv
40978EPBv
76558EPBv
87809EPBv
88629EPBv
95252EPBv
95493EPBv
97202EPBv
116035EPBv
117126EPBv
117244EPBv
119228EPBv
121082EPBv
121551EPBv
125051EPBv
141932EPBv
143483EPBv
145002EPBv
149777EPBv
150791EPBv
154728EPBv
156566EPBv
167788EPBv
169208EPBv
173633EPBv
204094EPBv
2041EPBv
5620EPBv
5869EPBv
9273EPBv
11365EPBv
11543EPBv
23536EPBv
24317EPBv
24639EPBv
25318EPBv
27571EPBv
17222EPBv
238796EPBv
24153EPBv
24993EPBv
151347EPBv
147406EPBv
223397EPBv
232809EPBv
238651EPBv
237660EPBv
204642EPBv
208252EPBv
204895EPBv
174621EPBv
176533EPBv
251453EPBv
251612EPBv
251747EPBv
253817EPBv
253685EPBv
257758EPBv
264746/1KVIv
264747/1KVIv
264748/1KVIv
264750/1KVIv
264751/1KVIv
264752/1KVIv
264753/1KVIv
264754/1KVIv
264755/1KVIv
264756/1KVIv
264757/1KVIv
264758/1KVIv
246332EPBv
249275EPBv
251542EPBv
255913EPBv
264220EPBv
265941EPBv
265559EPBv
266018EPBv
256987EPBv
253633EPBv
245632EPBv
264759EPBv
265752EPBv

Note: Specific primers and SISPA samples were sequenced using the IonTorrent.

Acronyms: EPB - egg and poultry board. KVI - Kimron Veterinary institute.

  51 in total

1.  Whole genome sequencing and biological characterization of Duck/JS/10, a new lentogenic class I Newcastle disease virus.

Authors:  Chunchun Meng; Xvsheng Qiu; Shiqiang Jin; Shengqing Yu; Hongjun Chen; Chan Ding
Journal:  Arch Virol       Date:  2012-02-05       Impact factor: 2.574

2.  Full-length cDNA cloning and determination of mRNA 5' and 3' ends by amplification of adaptor-ligated cDNA.

Authors:  A Chenchik; L Diachenko; F Moqadam; V Tarabykin; S Lukyanov; P D Siebert
Journal:  Biotechniques       Date:  1996-09       Impact factor: 1.993

Review 3.  Next-generation sequencing workflows in veterinary infection biology: towards validation and quality assurance.

Authors:  S Van Borm; J Wang; F Granberg; A Colling
Journal:  Rev Sci Tech       Date:  2016-04       Impact factor: 1.181

4.  Pathotypic and Sequence Characterization of Newcastle Disease Viruses from Vaccinated Chickens Reveals Circulation of Genotype II, IV and XIII and in India.

Authors:  S J Jakhesara; V V S P Prasad; J K Pal; M K Jhala; K S Prajapati; C G Joshi
Journal:  Transbound Emerg Dis       Date:  2014-11-18       Impact factor: 5.005

5.  What's in a strain? Viral metagenomics identifies genetic variation and contaminating circoviruses in laboratory isolates of pigeon paramyxovirus type 1.

Authors:  Steven Van Borm; Toon Rosseel; Mieke Steensels; Thierry van den Berg; Bénédicte Lambrecht
Journal:  Virus Res       Date:  2012-12-05       Impact factor: 3.303

6.  Host-Associated Metagenomics: A Guide to Generating Infectious RNA Viromes.

Authors:  Sarah Temmam; Sonia Monteil-Bouchard; Catherine Robert; Hervé Pascalis; Caroline Michelle; Priscilla Jardot; Rémi Charrel; Didier Raoult; Christelle Desnues
Journal:  PLoS One       Date:  2015-10-02       Impact factor: 3.240

7.  Direct next-generation sequencing of virus-human mixed samples without pretreatment is favorable to recover virus genome.

Authors:  Dingchen Li; Zongwei Li; Zhe Zhou; Zhen Li; Xinyan Qu; Peisong Xu; Pingkun Zhou; Xiaochen Bo; Ming Ni
Journal:  Biol Direct       Date:  2016-01-12       Impact factor: 4.540

8.  Use of Sequence-Independent, Single-Primer-Amplification (SISPA) for rapid detection, identification, and characterization of avian RNA viruses.

Authors:  Klaudia Chrzastek; Dong-Hun Lee; Diane Smith; Poonam Sharma; David L Suarez; Mary Pantin-Jackwood; Darrell R Kapczynski
Journal:  Virology       Date:  2017-06-21       Impact factor: 3.616

9.  Full-length transcriptome assembly from RNA-Seq data without a reference genome.

Authors:  Manfred G Grabherr; Brian J Haas; Moran Yassour; Joshua Z Levin; Dawn A Thompson; Ido Amit; Xian Adiconis; Lin Fan; Raktima Raychowdhury; Qiandong Zeng; Zehua Chen; Evan Mauceli; Nir Hacohen; Andreas Gnirke; Nicholas Rhind; Federica di Palma; Bruce W Birren; Chad Nusbaum; Kerstin Lindblad-Toh; Nir Friedman; Aviv Regev
Journal:  Nat Biotechnol       Date:  2011-05-15       Impact factor: 54.908

10.  Quantitation of next generation sequencing library preparation protocol efficiencies using droplet digital PCR assays - a systematic comparison of DNA library preparation kits for Illumina sequencing.

Authors:  Louise Aigrain; Yong Gu; Michael A Quail
Journal:  BMC Genomics       Date:  2016-06-13       Impact factor: 3.969

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.