| Literature DB >> 27809232 |
Christopher Noune1, Caroline Hauxwell2.
Abstract
Complete genome sequences of two Australian isolates of H. armigera single nucleopolyhedrovirus (HaSNPV) and nine strains isolated by plaque selection in tissue culture identified multiple polymorphisms in tissue culture-derived strains compared to the consensus sequence of the parent isolate. Nine open reading frames (ORFs) in all tissue culture-derived strains contained changes in nucleotide sequences that resulted in changes in predicted amino acid sequence compared to the parent isolate. Of these, changes in predicted amino acid sequence of six ORFs were identical in all nine derived strains. Comparison of sequences and maximum likelihood estimation (MLE) of specific ORFs and whole genome sequences were used to compare the isolates and derived strains to published sequence data from other HaSNPV isolates. The Australian isolates and derived strains had greater sequence similarity to New World SNPV isolates from H. zea than to Old World isolates from H. armigera, but with characteristics associated with both. Three distinct geographic clusters within HaSNPV genome sequences were identified: Australia/Americas, Europe/Africa/India, and China. Comparison of sequences and fragmentation of ORFs suggest that geographic movement and passage in vitro result in distinct patterns of baculovirus strain selection and evolution.Entities:
Keywords: Helicoverpa; Next Generation Sequencing; SNPV; baculovirus; strain selection; virus evolution
Mesh:
Year: 2016 PMID: 27809232 PMCID: PMC5127010 DOI: 10.3390/v8110280
Source DB: PubMed Journal: Viruses ISSN: 1999-4915 Impact factor: 5.048
Figure 1Isolation of H. armigera single nucleopolyhedrovirus (HaSNPV) strains in tissue culture (TC).
HaSNPV-AC53 (AC53) Isolated Strains.
| Strain | Time (h) Post-Infection (pi) | First Round Isolation Method |
|---|---|---|
| 48 and 48 | Agar Overlay | |
| 48 and 48 | Agar Overlay | |
| 48 and 48 | Agar Overlay | |
| 48 and 48 | Agar Overlay | |
| 72 and 72 | Agar Overlay | |
| 48 and 48 | Tissue Culture—No Overlay | |
| 72 and 96 | Tissue Culture—No Overlay | |
| 72 and 96 | Tissue Culture—No Overlay | |
| 72 and 120 | Tissue Culture—No Overlay |
Eight isolates were harvested from haemolymph at different times post-infection (passages 1 and 2). Strains C1, C5, C6, C9 and C3 were selected by Method 1, and strains T2, T4 and T5 by Method 2; * Selected from strain T4 at two time points (168 h and 288 h pi) during final passage in neonate larvae.
Sequenced H. armigera single nucleopolyhedrovirus (HaSNPV) and H. zea single nucleopolyhedrovirus (HzSNPV) nucleotide identity compared to AC53 with sequence identity ranging between 81.692% (L1 strain) and 99.604% (AC53-T5).
| Common Name | Genbank Accession | Sequence Length (bp) | Nucleotide Identity to AC53 (%) | Country/Region of Origin |
|---|---|---|---|---|
| KU738896 | 130,460 | 99.624 | Australia | |
| KU738898 | 130,439 | 99.600 | Australia | |
| KU738899 | 130,435 | 99.601 | Australia | |
| KU738902 | 130,440 | 99.602 | Australia | |
| KU738904 | 130,442 | 99.603 | Australia | |
| KU738897 | 130,437 | 99.599 | Australia | |
| KU738901 | 130,440 | 99.596 | Australia | |
| KU738896 | 130,443 | 99.530 | Australia | |
| KU738897 | 130,437 | 99.595 | Australia | |
| KJ922128 | 130,436 | 99.423 | Australia | |
| KJ004000 | 130,890 | 99.220 | Unknown—Sequenced in Russia | |
| AF334030 | 130,869 | 99.208 | USA—Sequenced in China | |
| KM596835 | 129,694 | 98.277 | Brazil | |
| AP010907 | 132,425 | 96.203 | Kenya | |
| KJ701029 | 131,966 | 96.012 | Iberia | |
| KJ701032 | 132,481 | 95.961 | Iberia | |
| KJ701033 | 132,265 | 95.810 | Iberia | |
| KJ701030 | 130,949 | 95.799 | Iberia | |
| KJ701031 | 130,992 | 95.798 | Iberia | |
| AF303045 | 130,759 | 95.353 | China | |
| JN584482 | 130,992 | 94.860 | Australia—Sequenced in China | |
| AF271059 | 131,405 | 94.442 | China |
AC53-derived, H25EA1, and HzSNPV strains are all within 2% nucleotide identity, whereas the remaining HaSNPV strains are within 5.5% nucleotide identity—excluding the L1 strain, which seems to be an outlier.
Amino acid (AA) and nucleotide (N) identity (%) of the regions that are not identical to AC53. The main difference between AC53 and the derived strains occur with both baculovirus repeated open reading frame (BRO)-A and BRO-B, Hr1–Hr5 and HOAR, which is to be expected due to the known hypervariability of the regions. However, open reading frame (ORF) 7 and the hypothetical ORF contains an early stop resulting in a smaller sequence, whereas ORF61 is longer due to the derived strains containing an early stop.
| ORF | Protein | AC53-C1 | AC53-C3 | AC53-C5 | AC53-C6 | AC53-C9 | AC53-T2 | AC53-T4.1 | AC53-T4.2 | AC53-T5 | Notes | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| AA | N | AA | N | AA | N | AA | N | AA | N | AA | N | AA | N | AA | N | AA | N | |||
| HOAR | 95.6 | 95.7 | 95.8 | 96.1 | 96.9 | 97.1 | 96.8 | 96.9 | 96.9 | 96.9 | 96.7 | 96.9 | 96.7 | 96.9 | 96.9 | 96.9 | 96.7 | 96.9 | ||
| 34.9 | 97.2 | 98.3 | 98.9 | 34.9 | 97.2 | 34.9 | 97.2 | 34.9 | 97.2 | 34.9 | 97.2 | 34.9 | 97.2 | 34.9 | 97.2 | 34.9 | 97.2 | AC53 and AC53-C3 have identical length | ||
| 99.3 | 99.3 | 99.3 | 99.3 | 99.3 | 99.3 | 99.3 | 99.3 | 99.3 | 99.3 | 99.3 | 99.3 | 99.3 | 99.3 | 99.3 | 99.3 | 99.3 | 99.3 | |||
| 94.1 | 99.3 | 94.1 | 99.3 | 94.1 | 99.3 | 94.1 | 99.3 | 94.1 | 99.3 | 94.1 | 99.3 | 94.1 | 99.3 | 94.1 | 99.3 | 94.1 | 99.3 | AC53 is 85 bp shorter | ||
| Hr1 | N.A | 99.7 | N.A | 99.7 | N.A | 99.7 | N.A | 99.8 | N.A | 99.8 | N.A | 99.7 | N.A | 99.7 | N.A | 99.8 | N.A | 99.7 | ||
| Hr2 | N.A | 95.5 | N.A | 95.2 | N.A | 95.2 | N.A | 95.2 | N.A | 95.3 | N.A | 95.2 | N.A | 95.2 | N.A | 95.2 | N.A | 95.2 | ||
| Hypothetical Protein | 70.7 | 98.0 | 70.7 | 98.0 | 70.7 | 98.0 | 70.7 | 98.0 | 70.7 | 98.0 | 70.7 | 98.0 | 70.7 | 98.0 | 70.7 | 98.0 | 70.7 | 98.0 | AC53 is 24 bp shorter | |
| BRO-A | 90.9 | 92.1 | 90.9 | 92.1 | 90.9 | 92.1 | 90.9 | 92.1 | 90.9 | 92.1 | 91.3 | 92.1 | 90.9 | 92.1 | 90.9 | 92.1 | 90.9 | 92.1 | ||
| BRO-B | 90.4 | 94.0 | 90.4 | 94.0 | 90.4 | 94.0 | 90.4 | 94.0 | 90.4 | 94.0 | 90.4 | 94.0 | 90.4 | 94.0 | 90.4 | 94.0 | 90.4 | 94.0 | ||
| Hr3 | N.A | 99.6 | N.A | 99.6 | N.A | 99.6 | N.A | 99.6 | N.A | 99.6 | N.A | 99.69 | N.A | 99.6 | N.A | 99.8 | N.A | 99.6 | ||
| 80.0 | 86.9 | 86.8 | 86.9 | 80.0 | 86.9 | 80.0 | 86.9 | 80.0 | 86.9 | 80.0 | 86.9 | 80.0 | 86.9 | 80.0 | 86.9 | 80.0 | 86.9 | AC53 is 41 bp longer | ||
| DNA polymerase | 100 | 100 | 99.9 | 99.9 | 99.9 | 99.9 | 100 | 100 | 100 | 100 | 99.9 | 99.9 | 100 | 100 | 100 | 100 | 100 | 100 | ||
| 100 | 100 | 100 | 100 | 100 | 100 | 76.3 | 99.4 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | Split in two with AC53-C6 | ||
| Hr4 | N.A | 99.5 | N.A | 99.0 | N.A | 99.0 | N.A | 99.0 | N.A | 99.0 | N.A | 99.0 | N.A | 99.0 | N.A | 99.0 | N.A | 99.0 | ||
| Hr5 | N.A | 99.3 | N.A | 99.1 | N.A | 99.1 | N.A | 99.5 | N.A | 99.2 | N.A | 99.1 | N.A | 99.4 | N.A | 99.1 | N.A | 99.4 | ||
| 38.7K | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 99.7 | 99.4 | 100 | 100 | 100 | 99.8 | 100 | 100 | ||
| 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 23.2 | 87.6 | 100 | 100 | Split in two with AC53-T4.2 | ||
| PKIP-1 | 100 | 100 | 100 | 99.8 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 | ||
| 97.2 | 99.3 | 97.2 | 99.3 | 97.2 | 99.3 | 97.2 | 99.3 | 97.2 | 99.3 | 97.2 | 99.3 | 97.2 | 99.3 | 97.2 | 99.3 | 97.2 | 99.3 | |||
| 9 | 14 | 10 | 16 | 10 | 15 | 10 | 15 | 8 | 14 | 11 | 16 | 9 | 14 | 10 | 16 | 9 | 14 | |||
N.A. = not applicable.
Comparison of the nucleotide and amino acid sequence similarity of AC53 derived strains to each other. The greatest diversity was within the five homologous repeat regions, DNA polymerase and HOAR. Only AC53-T2 contained an amino acid difference in BRO-A. ORF5 is shorter in length in all strains except AC53-C3 than in AC53. AC53-C6 and AC53-T4.2 contained unique mutations within ORF78 and ORF128, respectively, due to an inserted stop.
| ORF/Region | Nucleotide Similarity and Clusters within Derived Strains | Amino Acid Similarity and Clusters within Derived Strains |
|---|---|---|
| HOAR | - AC53-T4.1, AC53-T5 = 100% | - AC53-T4.1 and AC53-T5 = 100% |
| - AC53-C6, AC53-C9, AC53-T4.2 = 100% | - AC53-C6, AC53-C9 ,AC53-T4.2 = 100% | |
| - Remaining 4 strains all different at 96.7% to 99.9% | - Remaining 4 strains all different at 95.8% to 99.8% | |
| ORF5 * | - AC53-C3 = 96.1% | - AC53-C3 = 33.3% |
| - Remaining strains all identical | - Remaining strains all identical | |
| BRO-A | - AC53-T2 = 99.9% | - AC53-T2 = 99.5% |
| - Remaining strains all identical | - Remaining strains all identical | |
| DNA-Polymerase | - AC53-T5, AC53-T4.2, AC53-T4.1, AC53-C9, AC53-C6, AC53-C1 = 100% | - AC53-C3, AC53-C5, AC53-T2 = 100% |
| - AC53-C5, AC53-C3 = 100% | - AC53-T5, AC53-T4.2, AC53-T4.1, AC53-C9, AC53-C6 | |
| - AC53-T2 = 99.9% | - AC53-C1 = 100% | |
| ORF78/ORF78a and 78b in AC53-C6 | - AC53-C6 = 99.4% | - AC53-C6 = 77.9% |
| - Remaining strains all identical | - Remaining strains all identical | |
| 38.7K Protein | - AC53-T2 = 99.4% to other 7 strains and 99.6% to AC53-T4.2 | - AC53-T2 = 99.7% |
| - AC53-T4.2 = 99.8% to other 7 strains | - Remaining 8 strains all identical | |
| - Remaining 7 strains all identical | ||
| ORF128/ORF128a and 128b in AC53-T4.2 | - AC53-T4.2 = 87.6% | - AC53-T4.2 = 23.2% |
| - Remaining strains all identical | - Remaining strains all identical | |
| PKIP-1 | - AC53-C3 = 99.8% | - All strains = 100% |
| - Remaining strains all identical | ||
| Hr1 | - AC53-T4.2, AC53-C9, AC53-C6 = 100% | - Not Applicable |
| - AC53-T5, AC53-T4.1, AC53-T2, AC53-C5, AC53-C3, AC53-C1 = 100% | ||
| - 99.9% when both groups compared | ||
| Hr2 | - AC53-C6, AC53-T4.1 = 100% | - Not Applicable |
| - Remaining strains all identical | ||
| Hr3 | - AC53-T4.2 = 99.8% | - Not Applicable |
| - Remaining strains all identical | ||
| Hr4 | - AC53-C1 = 99.2% | - Not Applicable |
| - Remaining strains all identical | ||
| Hr5 | - AC53-T2, AC53-C5, AC53-C3 = 100% | - Not Applicable |
| - Remaining strains all identical |
* ORF5 is 87 bp longer in AC53-C3.
Figure 2Amino acid sequence and predicted protein structure of hypothetical proteins identified using the EMBOSS garnier [63]. Alpha helix (purple rectangle), beta strands (yellow arrows), coils (gray wavy lines), and the turns (blue curved arrows) are depicted here. AC53-derived strains contain an additional turn and an additional beta strand; CDS, coding DNA sequence.
Figure 3Phylogenetic Relationship of all of the HaSNPV and H. zea single nucleopolyhedrovirus (HzSNPV) strains (with bootstrap support as a percentage) and rooted to Autographica californica multiple nucleopolyhedrovirus (AcMNPV). Tree has been collapsed based on a minimum of 70% support. Geographically, three distinct groups can be observed; an Australian/American group consisting of AC53, H25EA1, AC53-derived strains and the HzSNPV isolates (purple), a Chinese group containing the HaSNPV C1, G4 and AU strains (green), and a third African/European group containing the Iberian (SP, LB) and Kenyan (NNg1) isolates (red).
Figure 4The root for both trees is the Heliothis virescens Ascovirus 3e isolate and have been collapsed based on 70% bootstrap support. (A) baculovirus repeated open reading frame (BRO)-A maximum likelihood estimation (MLE) identified two distinct clusters, AC53 derived and non-AC53 derived; (B) BRO-B also identified two distinct clusters, however the parent AC53 strain was grouped with non-AC53 derived. A third group in BRO-B consisting of HzSNPV isolates could also be identified.
Figure 5All trees have not been collapsed, due to poor bootstrapping support, and would not be differentiated otherwise, and have all been rooted to Busseola fusca NPV A2-4. (A) The geographic relationships of lef-8 have at least three clusters, Africa/India/Europe (red), China (green), Australia/Americas (purple) and the Australian H25EA1 isolate (yellow-green); (B) The geographic relationships of lef-9 have at least four clusters, Africa/India/Europe (red), Poland (aqua), the United States of America (blue) and North and South America (black), Australia (purple) and China (green); (C) The geographic relationships of polh have at least four clusters, India (yellow-green), Africa/Indian/Europe (red), China (green), South Africa (light green), Australia (yellow) and Australia/Americas (purple).
Figure 6MLEs using (A) open reading frame (ORF) 42 and its homologs; and (B) ORF78 and its homologs identify the same three geographic clusters as the whole-genome sequences. Both trees have been rooted to Plasmodium falciparum 3D7 with a similar nucleotide sequence to ORF42 and ORF78, and collapsed to 60% minimum support values. Coloring code is identical to Figure 5.
Figure 7ORF61 (ORF62 homolog), rooted to MacoNPV-A, showing only six strains clustering based on geography.