Literature DB >> 25329074

Genomic sequencing and analysis of Sucra jujuba nucleopolyhedrovirus.

Xiaoping Liu1, Feifei Yin1, Zheng Zhu1, Dianhai Hou1, Jun Wang1, Lei Zhang1, Manli Wang1, Hualin Wang1, Zhihong Hu1, Fei Deng1.   

Abstract

The complete nucleotide sequence of Sucra jujuba nucleopolyhedrovirus (SujuNPV) was determined by 454 pyrosequencing. The SujuNPV genome was 135,952 bp in length with an A+T content of 61.34%. It contained 131 putative open reading frames (ORFs) covering 87.9% of the genome. Among these ORFs, 37 were conserved in all baculovirus genomes that have been completely sequenced, 24 were conserved in lepidopteran baculoviruses, 65 were found in other baculoviruses, and 5 were unique to the SujuNPV genome. Seven homologous regions (hrs) were identified in the SujuNPV genome. SujuNPV contained several genes that were duplicated or copied multiple times: two copies of helicase, DNA binding protein gene (dbp), p26 and cg30, three copies of the inhibitor of the apoptosis gene (iap), and four copies of the baculovirus repeated ORF (bro). Phylogenetic analysis suggested that SujuNPV belongs to a subclade of group II alphabaculovirus, which differs from other baculoviruses in that all nine members of this subclade contain a second copy of dbp.

Entities:  

Mesh:

Year:  2014        PMID: 25329074      PMCID: PMC4201490          DOI: 10.1371/journal.pone.0110023

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

Baculoviruses are rod-shaped, insect-specific viruses with double-stranded, circular DNA 80–180 kb genomes [1]. Baculoviruses have been widely used as bio-pesticides to control insect pests in agriculture and forestry [2], as vectors for protein expression, and as potential vectors for gene therapy [3], [4]. The family Baculoviridae used to be grouped into two genera: Nucleopolyhedroviruses (NPVs) and Granuloviruses (GVs), dependent upon differing morphologies of occlusion bodies (OBs) [5]. More recently, a new classification has subdivided the Baculoviridae into four genera, based on phylogeny and host specificities: Alphabaculovirus (lepidopteran-specific NPVs), Betabaculovirus (lepidopteran-specific GVs), Gammabaculovirus (hymenopteran-specific NPVs) and Deltabaculovirus (dipteran-specific NPVs) [6]. Alphavaculoviruses can be further gathered into group I and group II based on phylogenetic analyses, The NPVs are also characterized as single nucleocapsid NPVs (SNPVs) and multiple-nucleocapsid NPVs (MNPVs) according to the number of nucleocapsids per virion. To date, 62 baculovirus reference genomes are available in the National Centre for Biotechnology Information (NCBI) database; 42 of them are alphabaculoviruses, 15 betabaculoviruses, three gammabaculoviruses, one deltabaculovirus and one unclassified baculovirus. Sucra jujuba Chu (Lepidopteral: Geometridae) is an important pest of jujube, and it is widespread in the jujube-growing regions of China. The larvae feed on the young leaves and buds of jujube, apple, pear and mulberry. In 2009, 1250 square hectometers of mulberry became infested with Sucra jujuba Chu in China [7]. Sucra jujuba NPV (SujuNPV) is a SNPV, which was first isolated from naturally diseased Sucra jujuba larvae in the early 1980s [8]. The virus is highly infectious to Sucra jujuba with an LC50 of 3.5×105 PIBs/mL in the third instar larvae [9]. It appears to be specific to Sucra jujuba as bioassay studies showed that it did not infect Antheraea pernyi, Arge captiva, Bombyx mori, Culcula panterinaria, Euproctis flava, Leucoma salicis, Lymantria dispar, Macaria elongaria, Phthonandria atrilineata, Plusia agnate or Semiothisa cineraria [8], [10], [11]. In the present study, the genome of SujuNPV is completely sequenced and annotated, and compared with those of the other representative baculoviruses. Results indicate that SujuNPV is a novel species belonging to a unique subclade of group II alphabaculoviruses, which contain a second copy of the DNA binding protein gene (dbp).

Materials and Methods

DNA extraction of the viral genome

The SujuNPV were purified from the dead Sucra jujube preserved in “Chinese general virus collection center” (CGVCC) with collection Number IVCAS 1.0048, which was originally isolated from Shandong Province, China, in 1983 [12]. The ODVs were purified as previously reported [13]. To extract DNA, the ODVs were incubated with four times volume 1 M DAS (5 M Nacl, 5 M NaCO3 and 0.5 M EDTA (pH8), mixed in the ratio of 3∶3∶0.6) at 37°C for 30 min. Then, the same volume of 1 M Tris (pH 7.4) was added followed by centrifugation at 10,000 rpm (5 min) to obtain the viral DNA.

Sequencing and sequence analysis of the SujuNPV genome

The SujuNPV genome sequence was determined by 454 pyrosequencing. A total of 92,684 reads were obtained and assembled into 10 contigs using GS De Novo Assembler software, covering 97.8% of the whole genome with a sequencing depth of 225x. The remaining gaps were filled using PCR and Sanger sequencing. Briefly, the genome was broken randomly into small fragments of about 600–900 bp by nebulization and adapters were added to construct a genomic library. Subsequently, the library was amplified by emPCR before sequencing. The SujuNPV genome was assembled using a GS De Novo Assembler providing 454 programs. Additional verifications were performed for gaps and ambiguous sequences using sequence-specific primers. The hypothetical ORFs of the SujuNPV genome were predicted by fgenesV0 (http://www.softberry.com/berry.phtml) [14], adopting the criteria of a size of at least 50 aa with a minimal overlap with other ORFs. Predicted aa sequences were compared with homologues of typical baculoviruses of the four genera, including AcMNPV (NC_001623), HearNPV-G4 (NC_002654), CpGV (NC_002816), NeleNPV (NC_005906) and CuniNPV (NC_003084), and similarities were obtained by DNAStar software with default parameters. Gene parity plots were generated in order to analyze the gene order of SujuNPV relative to three other closely related baculoviruses (ApciNPV, EcobNPV and OrleNPV) and the five representative viruses mentioned above. Consensus promoter motifs were searched for in the upstream 150 bp region from the start codon of each ORF based on the characterization of baculovirus' promoters, that’s a TATA box linked with a CAKT motif 20–40 bp downstream and a DTAAG box.

Phylogenetic analysis

Phylogenetic analysis of baculoviruses was performed using the concatenated aa sequence of 37 core genes [15] from 62 baculovirus reference genomes (http://www.ncbi.nlm.nih.gov/genomes/GenomesGroup.cgi?taxid=10442, data update until Jan.5th, 2014). The sequences were aligned by ClustalW with default parameters of MEGA5. And the maximum likelihood (ML) phylogenetic tree was reconstructed according to the previous report [16] with 1000 bootstrap values. The phylogenetic trees of dbp, helicase and p26 were constructed based on the same parameters.

Prediction of secondary structure

The secondary structures of DNA sequences were predicted by the Mfold Web Server using default parameters [17].

Results and Discussion

Characteristics of the SujuNPV genome sequence

The full SujuNPV genome [GeneBank: KJ676450] was 135,952 bp in length with an A+T content of 61.34%. Following convention, the adenine coding for the start methionine of the polyhedrin gene (ph) was chosen as the zero point of the SujuNPV genome and ph was designated as the first ORF. Overall, 131 putative ORFs were detected in the SujuNPV genome with the criteria of a length of at least 50 amino acids (aas) and a minimal overlap with adjacent ORFs. The total ORFs covered 89.2% of the whole genome, distributed with 60 ORFs in a forward orientation and 71 ORFs in a reverse orientation. In addition, seven homologous regions (hrs) were identified in SujuNPV (Fig. 1).
Figure 1

Circular map of the SujuNPV genome.

The arrows inside or outside the circle indicate the orientation of putative ORFs. Arrows, red represent the core genes, blue represent Lepidoptera baculovirus conserved genes, grey represent genes common to baculoviruses, open are genes unique to SujuNPV, and yellow rectangles indicate hrs. The collinear region conserved in Lepidoptera baculoviruses is also shown.

Circular map of the SujuNPV genome.

The arrows inside or outside the circle indicate the orientation of putative ORFs. Arrows, red represent the core genes, blue represent Lepidoptera baculovirus conserved genes, grey represent genes common to baculoviruses, open are genes unique to SujuNPV, and yellow rectangles indicate hrs. The collinear region conserved in Lepidoptera baculoviruses is also shown. BLAST comparisons of the 131 protein sequences of the SujuNPV, deduced from the homologous sequences of other baculoviruses, revealed that SujuNPV has 37 core genes (shown in red in Fig. 1) and 24 other genes conserved in lepidopteran baculoviruses (shown in blue in Fig. 1). It also contains 65 additional genes commonly found in various baculoviruses (shown in grey in Fig. 1) and five unique genes (shown as open arrows in Fig. 1). Consensus promoter motifs were searched for in the upstream 150 bp region of the start codon of each ORF. Amongst all 131 ORFs identified in the SujuNPV genome, 24 ORFs possessed the early promoter motif (a TATA box linked with a CAKT motif 20–40 bp downstream), whereas 61 ORFs had the late promoter motif DTAAG and 10 ORFs contained both the early and late promoter motifs (Table 1). No obvious baculoviral promoter motifs were detected for the remaining 36 ORFs.
Table 1

SujuNPV Genome Annotation.

ORFnamemotifstartendlength (aa)str.ORF positionamino acid identity (%)
AcMNPVHearNPVCpGVNeleNPVCuniNPVAcMNPVHearNPVCpGVNeleNPVCuniNPV
1 polyhedrin E1741246+811186.588.253.745.1
2 orf1629 L774247456692214.919.617.2
3 pk-1 24673270267+103332.644.630.3
4 hoar E33135259506415
5 orf5 56636724353+
6 pif-5* L69628059365+14815182310253.453.745.137.519.7
7 bro-1 L81228466114+5923.7
8 cg30-2 85709388272+887715.512.1
9 p10 L9436969686137212227.435.731
10 p26-1 E,L9747106042851362234.643.8
hr1 1066611269
11 ac29 114381165672+292325.440.3
12 lef-6 L117621250824828248019.726.216.8
13 dbp-2 12532134643102525811418.123.914.114.8
hr2 1354614517
14 orf14 136381388983+
15 p74* L14521164796521382060477457.854.439.438.932.8
hr3 1658017170
16 me-53 E172191834937613916–1714316.822.617.2
17 ie-0 E,L1869819495265+141824.527.5
18 p49* L1959521028477+142915603044.455.826.319.26.3
19 odv-e18* L210582132488+1431014623161.351.938.118.87.9
20 odv-ec27* L2142822306292+1441197633245.952.122.22115.3
21 chtb L223362261492+1451296445.544.631.527.2
22 ep23 L226392326220714613829.425.619.3
23 ie-1 E2332525439704+14714724.927.211.7
24 ac34 L2558626134182342723.650
25 orf25 E2622426802192
26 ubiquitin L269882725488+3528547472.368.2
27 39k 275292842829936315733.135.55.8
28 lef-11 28430287861183732583330.527.1
29 bv-e31 L287112943324038336953.748.339.5
30 dbp-1 E,L2978330712309+25258124.93412.4
31 p47* 3082732011394403568467353.650.845.225.214
32 lef-12 E3227233114280413630.929.6
33 lef-8* 3332136038905503813178266368.949.130.518.3
34 orf34 3350434118204+34orf34
35 djbp 3607837316412+513915.420.1
hr4 3732337964
36 iap-1 L38038388442682710317111724.323.124.312.73.1
37 ac52 3922139895224524219.530.6
38 ac53 L3985540328157+534313477284652.217.312.18.9
39 orf39 L40332414593754417.3
40 orf40 L414734170978
41 vp1054 L4176042869369+544713883839.747.930.118.818.5
42 ac55 430394324267+554835.856.7
43 ac56 L4318443534116+564911.932.8
44 ac57 E,L4371744208163+575035.438.7
45 chaB E,L4427944800173595137.734.4
46 chaB L448764514589605241.439.8
47 bro-2 L45281456971386027.5
48 fp/25k L4590546549214615311851.956.128
49 lef-9 4668748216509+6255117375966.266.652.33418.9
50 dna ligase 4854750361604+12023.5
51 bro-3 504365142833021053613.3
52 gp37 E515345239128564581345.355.641.4
53 orf53 5247153133220+
54 chitinase L5331955037572126411068.863.757.3
55 v-cath L5514456136330+127561166.64742.4
56 p26-2 E56200569192391362217.215.9
57 helicase-2 570355839045112627.5
58 ac150 L5843558752105150127925.321.725.7
59 iap-2 E587565968230871621728.935.216
60 pif-6 59711600881256864114385835.2442421.616
61 lef-3 6008761340417+676511317.923.74.5
62 desmoplakin L61434640438696666112219217.617.112.311.710.7
63 dna polymerase 64042672481068+6567111209144.551.630.924.316.5
64 ac75 L672876767913075691082034.610.8
65 ac76 L677746803185767010741.765.935.7
66 vlf-1 L68185693543897771106421867.569.428.625.118.2
67 ac78 L6937869710110787210543343345.513.621.316.7
68 gp41 L69758709634018073104443343.153.428.426.311.5
69 ac81 709327162122981741034510648.952.441.933.116.1
70 tlp-20 L7151572237240827510226.733.811.6
71 vp91 L7220674731841+8376101823537.642.821.823.422.3
72 cg30 E7479275739315887719.718.7
73 vp39 L7586176835324897896882436.44222.51914.5
74 lef-4 7683478264476+907995599645.745.130.32513.7
75 orf75 L7830178726141779.9
76 p33 E,L7882279580252928093161450.457.935.119.419
77 p18 L7957980055158+938192171352.5623117.14.8
78 odv-e25 L8005780728223+94829118153959.248.81311.2
79 helicase-1 L8077784505124295849058894146.922.517.411.1
80 pif-4 8445984980173+968589579050.357.835.426.225.4
81 38k L8500785927306988688568742.850.339.527.224.8
82 lef-5 L8581186662283+998787558846.854.43827.27.9
83 p6.9 867398699986+100888628237.38.14.18.113.8
84 p40 E,L87012881543801018985292240.241.217.613.77.6
85 p12 L8819488553119102908421.820.213.8
86 p48/p45 E,L88540897273951039183315542.447.531.915.66
87 vp80 8982092120766+104921217.2
88 ac110 921609233457+110935328.640.425
89 odv-ec43 L9234393482379+1099455676948.358.430.415.310.3
90 ac108 L935329378985+1089529.441.2
91 orf91 9380794313168
92 endonuclease L9440394762119+79652426.6
93 ac112 9474695765339+11231
hr5 9579696554
94 nrk1 9675397811352+331626.429.6
95 p43 E97850990103863918.2
96 iap-3 9900999479156+271039423.124.427.6
97 ac106 E,L99518100228236106101523250.849.625.416.5
98 parg L10028610197456210016.7
99 orf99 1020551025221559917.8
100 pif-3 L1025121031352071159835664644.147.235.728.531.5
101 orf101 103207103581124
102 sod L103683104159158+311065960.962.746.2
103 ac117 104202104513103+11711022.139.8
104 calyx/pep L104559105578339131120225022.635.712.112.1
105 orf105 105634106980448+
106 orf106 107008107532174
107 orf107 L108086108496136+6813.2
hr6 108513109329
108 p24 L109521110312263+1291187137.95224.1
109 orf109 L110450110818122+
110 lef-2 110826111515229+611741542536.236.718.715.411.6
111 pkip L111630112145171+241301329.6
112 orf112 112189112524111
113 pif-2 L112615113769384+2213248523860.268.150.344.346.4
114 ac111 1138271140336811111655.230.9
hr7 114159114766
115 F E,L114888116945685231333110414.237.824.515.3
116 orf116 117225120083952+12924.6
117 ac17 1201961209092371712816.529.1
118 orf118 E,L12095912160021312721.4
119 egt E1218121233535131512614145.151.735.5
120 orf120 L123560123916118
121 lef-1 124016124717233+1412474654535.642.133.928.421.5
122 38.7k 124783126009408+131237322.931.913.1
123 ac19 L1260651265081471911521.323.3
124 orf124 L126507127745412+
125 alk-exo L127807129066419+133114125335436.340.332.424.421.5
126 orf126 L129111129875254
127 fgf 130158131180340+3211312323.819.69.1
128 orf128 13120113144079112
129 pif-1 L13144713304553211911175762952.146.433.329.126
130 bro-4 133123133962279213.6
131 dna photolyase E134303135796497+

ORFs listed are those predicted in the SujuNPV genome and their homologues in the five representative genomes (AcMNPV, HearNPV-G4, CpNPV, NeleNPV and CuniNPV). The start gene is polyhedrin and core genes were marked with*. E and L indicate the Early and Late promoter motifs, respectively. ‘+’ and ‘−’ means the transcription direction; ‘+’ clockwise; ‘−’ anticlockwise.

ORFs listed are those predicted in the SujuNPV genome and their homologues in the five representative genomes (AcMNPV, HearNPV-G4, CpNPV, NeleNPV and CuniNPV). The start gene is polyhedrin and core genes were marked with*. E and L indicate the Early and Late promoter motifs, respectively. ‘+’ and ‘−’ means the transcription direction; ‘+’ clockwise; ‘−’ anticlockwise.

Relationship with other baculoviruses

Phyogenetic analysis of the 37 core genes of the 62 reference baculoviruses revealed that SujuNPV is a group II alphabaculovirus (Fig. 2). The virus is a novel member of a subclade containing eight other baculoviruses, including Apocheima cinerarium NPV (ApciNPV), Clanis bilineata NPV (ClbiNPV) [18], Ectropis obliqua NPV (EcobNPV) [19], Euproctis pseudoconspersa NPV (EupsNPV) [20], Hemileuca sp. NPV (HespNPV) [21], Lymantria dispar MNPV (LdMNPV) [22], Lymantria xylina MNPV (LyxyMNPV) [23] and Orgyia leucostigma NPV (OrleNPV) [24].
Figure 2

Phylogenic analysis of 62 complete baculovirus genomes.

The maximum likelihood (ML) tree was generated based on the concatenated protein sequences of 37 core genes with default parametes and 1000 randoms. The SujuNPV was labeled by a red point and the number on the branch means bootstrap values (only the values over 50 were shown). Pink branches indicate the unique subclade containing a second copy of dbp.

Phylogenic analysis of 62 complete baculovirus genomes.

The maximum likelihood (ML) tree was generated based on the concatenated protein sequences of 37 core genes with default parametes and 1000 randoms. The SujuNPV was labeled by a red point and the number on the branch means bootstrap values (only the values over 50 were shown). Pink branches indicate the unique subclade containing a second copy of dbp. Five representative baculoviruses were chosen for the comparative study of SujuNPV: Autographa californica MNPV (AcMNPV, group I alphabaculovirus) [25], Helicoverpa armigera SNPV (HearNPV, group II alphabaculovirus) [26], Cydia pomonella GV (CpGV, betabaculovirus), Neodiprion lecontei NPV (NeleNPV, gammabaculovirus) and Culex nigripalpus NPV (CuniNPV, deltabaculovirus). SujuNPV shared 102 ORFs with AcMNPV, 108 with HearNPV, 78 with CpGV, 43 with NeleNPV, and 39 with CuniNPV, with an average amino acid (aa) identity of 36.0%, 39.0%, 28.4%, 23.0% and 16.3%, respectively. Gene-parity plots of SujuNPV against three viruses in the same subclade and the five representative baculoviruses are shown in Fig. 3. The gene order between SujuNPV and ApciNPV, EcobNPV or OrleNPV revealed a high collinearity along the genomes, with some inversions and drifts. The plots of SujuNPV with representative lepidopteran baculoviruses (AcMNPV, HearNPV and CpGV) showed that SujuNPV is largely collinear with AcMNPV and HearNPV, less collinear with CpGV, but all contains a collinear region from Suju60 to Suju86, containing 20 core genes and five additional lepidopteran baculovirus conserved genes. This region has been suggested to exist in the ancestor of lepidopteran baculoviruses [27]. No obvious collinear region could be found between SujuNPV and NeleNPV or CuniNPV (Fig. 3).
Figure 3

Gene-parity plot analysis.

Gene-parity plots of SujuNPV against three close viruses (EcobNPV, ApciNPV and OrleNPV) and five representative baculoviruses (AcMNPV, HearNPV, CpGV, NeleNPV and CuniNPV).

Gene-parity plot analysis.

Gene-parity plots of SujuNPV against three close viruses (EcobNPV, ApciNPV and OrleNPV) and five representative baculoviruses (AcMNPV, HearNPV, CpGV, NeleNPV and CuniNPV).

Homologous regions

Homologous regions (hrs) are common elements in many baculoviruses, with characteristically high A+T contents, tandem repeats and imperfect palindromes. Hrs vary in location within genomes, number of copies and nucleotide sequences between different baculoviruses. These regions are suggested to act as replication origins and transcription enhancers [28], [29]. The SujuNPV genome contains seven homologous regions, covering 3.7% of the genome, as displayed in Fig. 4A. The length of the hrs ranges from 590 bp-971 bp, and each hr consists of four to eight palindromic repeats of 99 bp in length (Fig. 4A and 4B). Fig. 4B shows the arrangement of palindrome repeats in each homologous region. These palindromic repeats share at least 97.6% identity. The predicted secondary structure of the hr1–3 revealed that it contains a core palindrome region, colored by orange in Fig. 4C, and it is highly conserved in all counterparts, with about 99.5% identity on average. While the other two loop were not such conservative, neither on the size nor sequence.
Figure 4

Analysis of SujuNPV hrs.

A. The location and distribution of hrs in the SujuNPV genome. Black bars indicate hrs in the SujuNPV linear map. The number in brackets refers to the number of palindrome repeats in the homologous region. B. The arrangement of palindrome repeats in each homologous region. The rectangle above or below the black line indicates the orientation of repeats and number in the bracket represent the corresponding size of each hr. Different colors means the different fragments within the repeats, and orange indicates the core palindrome region, which is conserved in all the hr repeats. One sequence of each orientation was displayed at the top as a sample. C. The second structure of the hr1–3 palindrome repeats. The background color is in line with the sequence displayed in Fig. 4B.

Analysis of SujuNPV hrs.

A. The location and distribution of hrs in the SujuNPV genome. Black bars indicate hrs in the SujuNPV linear map. The number in brackets refers to the number of palindrome repeats in the homologous region. B. The arrangement of palindrome repeats in each homologous region. The rectangle above or below the black line indicates the orientation of repeats and number in the bracket represent the corresponding size of each hr. Different colors means the different fragments within the repeats, and orange indicates the core palindrome region, which is conserved in all the hr repeats. One sequence of each orientation was displayed at the top as a sample. C. The second structure of the hr1–3 palindrome repeats. The background color is in line with the sequence displayed in Fig. 4B.

DNA replication genes

Five core genes, five additional lepidopteran baculovirus conserved genes and eight other common genes involved in DNA replication were found in the SujuNPV genome (Table 2) [30]–[32]. Among these genes: helicase unwinds DNA [32]; dna polymerase is involved in DNA synthesis; late expression factor gene 3 (lef-3) and DNA binding protein gene (dbp) are involved in single-strand binding [33], [34]; dna-ligase in ligation and alkaline exonuclease (alk-exo) in rectification [35], [36], together with some other stimulators are required in the process of replication.
Table 2

Classification of gene function.

Core genesLepidoptera baculovirus conserved genesCommon genes
Replication alk-exo(Suju125), dna polymerase(Suju63), helicase(Suju79), lef-1(Suju121), lef-2(Suju110)dbp-1(Suju30), ie-1(Suju23), lef11(Suju28), lef-3(Suju61), me53(Suju16)dbp-2(Suju13), dnaphotolyase(Suju131), dna-ligase(Suju50), endonuclease(Suju92), helicase-2(Suju57), ie-0(Suju17), nrk1(Suju94), parg(Suju98),
Transcription lef-4(Suju74), lef-5(Suju82), lef8(Suju33), lef-9(Suju49), P47(Suju31), vlf-1(Suju66)39k(Suju27), lef-6(Suju12), pk-1(Suju3)lef12(Suju32)
Structure 38k(Suju81), ac53(Suju38), ac78(Suju67), ac81(Suju69), desmoplakin(Suju62), gp41(Suju68), odv-e18(Suju19), odv-e25(Suju78), odv-ec27(Suju20), odv-ec43(Suju89), p18(Suju77), p33(Suju76), p40(Suju84), p48/p45(Suju86), p49(Suju18), p6.9(Suju83), vp1054(Suju41), vp39(Suju73), vp91(Suju71)F(Suju115), fp/25k(Suju48), orf1629(Suju2), p12(Suju85), p24(Suju108), polyhedrin(Suju1), tlp-20(Suju70)calyx/pep(Suju104), cg30-1(Suju72), cg30-2(Suju8), p10(Suju9), pkip(Suju111), vp80(Suju87)
Auxiliary fgf(Suju127)bro-1(Suju7), bro-2(Suju47), bro-3(Suju51), bro-4(Suju130), chitinase(Suju54), egt(Suju119), gp37(Suju52), iap-1(Suju36), iap-2(Suju59), iap-3(Suju96), sod(Suju102), ubiquitin(Suju26), v-cath(Suju55)
Pifs p74(Suju15), pif-1(Suju129), pif-2(Suju113), pif-3(Suju100), pif-4(Suju80), pif-5(Suju6), pif-6(Suju60)
Unknown 38.7k(Suju122), ac106(Suju97), ac110(Suju88),ac75(Suju64), ac76(Suju65), chtb(Suju21), ep23(Suju22), bv-e31(Suju29)ac108(Suju90), ac111(Suju114), ac112(Suju93), ac117(Suju103), ac150(Suju58), ac17(Suju117), ac19(Suju123), ac29(Suju11), ac34(Suju24), ac52(Suju37), ac55(Suju42), ac56(Suju43), ac57(Suju44), chaB(Suju45), chaB(Suju46), djbp(Suju35), hoar(Suju4), p26-1 (Suju10), p26-2(Suju56), p43(Suju95), Suju101, Suju105, Suju107, Suju109, Suju112, Suju116, Suju118, Suju120, Suju124, Suju126, Suju128, Suju34, Suju39, Suju40, Suju75, Suju91, Suju99

Functional classification of the genes in the SujuNPV genome columns indicate classification by function and rows represent conservatism. Genes in the SujuNPV genome were arranged according to their functions and conservatism in alphabetical order.

Functional classification of the genes in the SujuNPV genome columns indicate classification by function and rows represent conservatism. Genes in the SujuNPV genome were arranged according to their functions and conservatism in alphabetical order. Functional classification of the genes in the SujuNPV genome; columns indicate classification by function and rows represent conservatism. Genes in the SujuNPV genome were arranged according to their functions and conservatism in alphabetical order.Some common genes involved in baculovirus replication were not present in SujuNPV. For example, lef-7 which has been shown to be a replication enhancer in baculoviruses [37], was absent from SujuNPV. SujuNPV also lacked certain genes associated with nucleotide biosynthesis, such as the ribonucleotide reductase subunits (rr1, rr2) and dUTPase, which are involved in dTTP biosynthesis [38]. Amongst the DNA replication genes, there are two copies of helicase and dbp in the SujuNPV genome. A full length helicase (Suju79, 1242aa) is a core gene found in all sequenced baculoviruses, whilst a second copy of truncated helicase (helicase-2) (Suju57, 451aa) is present in only six alphabaculoviruses (HearMNPV, LdMNPV, LyxyMNPV, MacoNPV-B, OrleNPV and SpliNPV) and 13 GVs (all sequenced GVs except for ClanGV and CaLGV) [39]. The phylogenetic tree of helicase homologies showed that they can be clearly divided into two groups (Fig. 5A). It is very likely that they were acquired from different sources during evolution. The research of AcMNPV helicase reveals that it belongs to Superfamily 1 helicase, which contain 7 conserved motifs [40], [41]. Motifs I and II are two NPT-binding motifs, together with another four motifs to fulfill the function of helicase [42], [43]. The alignment of the conserved motifs with AcMNPV and E.coli UrvD (representative of Superfamily 1 helicase) reveals that they share the same motifs (Fig. 5B) and that helicase-2 is seemingly more conservative. It appears that the two copies have a common ancestor, but understanding how they evolved and came to balance their specialization and cooperation within one genome requires further research.
Figure 5

Analysis of the duplicated gene helicase and its conservative motifs.

A. The tree was reconstructed based on protein sequences by MEGA5. The second copy was colored by purple branches and pink background and the number on the branch indicates a bootstrap value of 1000 randoms. B. Conservative motifs of E.coli UrvD, AcMNPV, SujuNPV Helicase (SujuNPV-1) and SujuNPV Helicase-2 (SujuNPV-2) were displayed. The blank line indicates the relevant protein with length in the bracket. The colored boxes on the line indicate motifs I-IV and the numbers above and below the box mean the start and end position of each motif in the protein, respectively.

Analysis of the duplicated gene helicase and its conservative motifs.

A. The tree was reconstructed based on protein sequences by MEGA5. The second copy was colored by purple branches and pink background and the number on the branch indicates a bootstrap value of 1000 randoms. B. Conservative motifs of E.coli UrvD, AcMNPV, SujuNPV Helicase (SujuNPV-1) and SujuNPV Helicase-2 (SujuNPV-2) were displayed. The blank line indicates the relevant protein with length in the bracket. The colored boxes on the line indicate motifs I-IV and the numbers above and below the box mean the start and end position of each motif in the protein, respectively. SujuNPV is the ninth baculovirus identified to have double copies of dbp; the other eight are ApciNPV, ClbiNPV, EcobNPV, EupsNPV, HespNPV, LdMNPV, LyxyMNPV and OrleNPV. Interestingly all these viruses belong to the same subclade (Fig. 2). Dbp is a conserved gene in lepidopteran baculoviruses. Phylogenetic analysis indicated that the dbp duplicates of these nine baculoviruses may have evolved separately to the conserved dbp in alphabaculoviruses (Fig. 6). We propose to name the alphabaculovirus-conserved dbp gene as dbp-1, and the second copy as dbp-2. Dbp-2 appears to be more close to the dbp of betabaculovirus. In SujuNPV, dbp-1 (Suju30) and dbp-2 (Suju13) encode 309 aa and 310 aa proteins respectively, with 25% aa identity. Although the significance of SujuNPV and other bacuoviruses carrying two copies of dbp is unclear, it clearly marks out the subclade of these nine group II alhpabaculoviruses.
Figure 6

Analysis of the duplicated gene dbp.

The tree was reconstructed based on protein sequences by MEGA5. The second copy was colored by purple branches and pink background and the number on the branch indicates a bootstrap value of 1000 randoms.

Analysis of the duplicated gene dbp.

The tree was reconstructed based on protein sequences by MEGA5. The second copy was colored by purple branches and pink background and the number on the branch indicates a bootstrap value of 1000 randoms.

Transcriptional genes

In a baculovirus life cycle, the genes are transcribed in cascades by different polymerase. Early stage genes are transcribed by host RNA polymerase II, while genes expressed during the late period of the life cycle are transcribed by the virus-encoded RNA polymerase, comprising four core gene transcripts: LEF-4, LEF-8, LEF-9, P47 [44]. Two other core genes are involved in late phase transcription: lef-5 and very late factor (vlf-1), acting as an initiation factor [45] and a regulatory factor participating in the hyper-expression of very late genes [46], respectively. These core genes, in addition to genes such as 39k, lef-6, lef-10 and lef-12, are required for late transcription [47]. All of these genes appear in SujuNPV, except for lef-10 (Table 2) and among all the other alphabaculoviruses this gene was only absent from ClbiNPV and OrleNPV.

Structural genes

Nineteen core genes and seven additional lepidopteran-conserved genes related to structure were found in the SujuNPV genome (Table 2) [48]–[50]. In addition, six other common genes were also identified in the SujuNPV genome (Table 2). Cg30 is duplicated in SujuNPV: cg30-1 (Suju72, 315 aa) and cg30-2 (Suju8, 272 aa). Among all the baculoviruses sequenced, two copies of cg30 are only present in SpliNPV-(SpliNPV82 and SpliNPV89) and in SpliGV (SpliGV52 and SpliGV124). Cg30-1 of SujuNPV has many homologies with other baculoviruses, while cg30-2 groups with SpliNPV89 and SpliGV52 at the outmost of the phylogenetic tree (Fig. 7), sharing an aa identity of 15% and 14%, respectively.
Figure 7

Analysis of the duplicated gene cg30.

The tree was reconstructed based on protein sequences by MEGA5. The second copy was colored by purple branches and pink background and the number on the branch indicates a bootstrap value of 1000 randoms.

Analysis of the duplicated gene cg30.

The tree was reconstructed based on protein sequences by MEGA5. The second copy was colored by purple branches and pink background and the number on the branch indicates a bootstrap value of 1000 randoms.

Per os infectivity factors

So far seven genes have been identified as per os infectivity factors (PIFs), including p74, pif1, pif2, pif3, pif4 (odv-e28), pif5 (odv-e56) and pif6, which are essential for the oral infection of insect larvae [51]–[54]. PIF-1, PIF-2 and PIF-3 in association with P74 form a conserved complex on the surface of ODV and were proposed to perform an essential function in the early stages of virus infection [51]. PIF-4 is an envelope-associated protein found in both ODV and BV [52], whereas, PIF-5 and PIF6 have been recently demonstrated to be PIF members [53], [54]. All seven of their genes are conserved within the SujuNPV genome and share 44%–61.2% identity with their homologues in group II representative baculovirus HearNPV.

Auxiliary genes

Auxiliary genes are those not essential for replication, transcription or structures, but provide the virus with the stronger adaptive ability [55], such as affecting the host’s cellular metabolism for successful infection or by promoting the progeny yields of the virus. Examples are fibroblast growth factor (fgf) and gp37, which are proposed to help to spread virions from the primary infection site [56], [57], egt, which promotes viral progeny by delaying larval molting [58], and cathepsin and chitinase, which aid the horizontal spread of viruses [59]. Superoxide dismutase (sod) has been suggested to migrate the effects of free radicals in infected hemocytes [60] and ubiquitin is proposed to stabilize viral proteins against being degraded by hosts [61]. Among these auxiliary genes, no core gene has been found and only fgf is a lepidopteran-conserved gene. SujuNPV was found to contain all the genes above (Table 2). Anti-apoptosis genes are those encoded by viruses in order to resist the programmed death of infected cells, hence ensuring a successful infection [62]. SujuNPV possesses two types of anti-apoptotic genes: p49 (Suju18) and three copies of inhibitor of apoptosis gene (iaps): iap-1 (Suju36), iap-2 (Suju59) and iap-3 (Suju96). Among the three iaps, iap-2 and iap-3 have a C3HC4 motif at the C terminal, with DNA-binding properties [63]. Baculovirus repeated ORFs (bros) are repetitive genes, which are widespread in baculoviruses and some other insect virus DNA [64]. Research of BmNPV showed that bros contained DNA-binding activity that could influence host DNA replication and transcription [65]. Four bro genes were identified in SujuNPV, and named bro-1 to bro-4, based upon their order of appearance in the genome (Fig. 1). SujuNPV bro-3 had an aa similarity to its homologues in AcMNPV, OpMNPV, LdMNPV and HearNPV-G4, with 36%, 38.6%, 35.2% and 13.3% sequence identity, respectively. The other three bros only shared a C-terminal region with Ld-bro-m, Ld-bro-p and Ld-bro-n.

Unknown genes

SujuNPV contained an additional eight lepidoptera-conserved genes and 37 common genes with unknown functions (Table 2). P26 is an alphabaculovirus-specific gene. Among the 42 alphabaculoviruses previously sequenced, 19 contained a second copy of p26 and 16 of these belonged to group II. SujuNPV also contains two copies of p26, Suju10 (p26-1, 285 aa) and Suju56 (p26-2, 239 aa), which share 13.8% similarity. We name the one conserved in alphabaculoviruses as p26-1, and the second copy as p26-2. Phylogenetic analysis of p26 showed that the second copies of p26 could be classified into a unique subclade (colored pink in Fig. 8), with the exception of three group I baculoviruses (CfMNPV, ChocNPV and ChroNPV). Interestingly, the group II baculoviruses, except for LeseNPV, all specifically contain a conserved gene cluster that is p10, p26, ac29, lef-6 and dbp (dbp-2 in the 9 dbp-duplicated baculoviruses) in order. Although the significance of this gene cluster is unknown, it can provide us with more information for evolutionary analysis.
Figure 8

Analysis of the duplicated gene p26.

The tree was reconstructed based on protein sequences by MEGA5. The second copy was colored by purple branches and pink background and the number on the branch indicates a bootstrap value of 1000 randoms.

Analysis of the duplicated gene p26.

The tree was reconstructed based on protein sequences by MEGA5. The second copy was colored by purple branches and pink background and the number on the branch indicates a bootstrap value of 1000 randoms.

Unique genes

Five genes are unique to the SujuNPV genome, including Suju5 (353 aa), Suju14 (83 aa), Suju25 (192 aa), Suju53 (220 aa) and Suju106 (174 aa) which were not included in Table 2. Suju5 has a similar location and length to the ORF5 of Buzura suppressaria SNPV (BusuNPV) with14.8% aa identity, indicating they may have similar function [27]. Suju25 has an early promoter and a BLAST search showed it to have a slight similarity to the ATP-binding protein of Lysinibacillus sphaericus C3–41 with an E-value of 0.89. No homologues were found in GenBank for the other three ORFs, whether these are functional ORFs of SujuNPV requires further experimentation.

Conclusion

Our analyses revealed that SujuNPV is a novel baculovirus within a unique subclade of group II alphabaculovirues, the members of which all contain a second copy of dbp. The SujuNPV genome contains seven hrs and five unique ORFs, as well as several genes with two or more copies. The presence of duplicated genes in this virus raises the question on the mechanisms of its acquisition (duplication of virus genes or independent horizontal transfer) and maintain, which needs further researches. These findings will facilitate future applications of SujuNPV to pest control and provide new data for the elucidation of the evolutionary pathways of baculoviruses.
  50 in total

Review 1.  Viruses and apoptosis.

Authors:  A Roulston; R C Marcellus; P E Branton
Journal:  Annu Rev Microbiol       Date:  1999       Impact factor: 15.500

Review 2.  The genome sequence and evolution of baculoviruses.

Authors:  Elisabeth A Herniou; Julie A Olszewski; Jennifer S Cory; David R O'Reilly
Journal:  Annu Rev Entomol       Date:  2001-09-28       Impact factor: 19.686

3.  In vitro activity of the baculovirus late expression factor LEF-5.

Authors:  Linda A Guarino; Wen Dong; Jianping Jin
Journal:  J Virol       Date:  2002-12       Impact factor: 5.103

4.  Phylogenetic analysis and possible function of bro-like genes, a multigene family widespread among large double-stranded DNA viruses of invertebrates and bacteria.

Authors:  Dennis K Bideshi; Sylvaine Renault; Karine Stasiak; Brian A Federici; Yves Bigot
Journal:  J Gen Virol       Date:  2003-09       Impact factor: 3.891

5.  Evolutionary history and higher order classification of AAA+ ATPases.

Authors:  Lakshminarayan M Iyer; Detlef D Leipe; Eugene V Koonin; L Aravind
Journal:  J Struct Biol       Date:  2004 Apr-May       Impact factor: 2.867

6.  The Autographa californica nuclear polyhedrosis virus p143 gene encodes a DNA helicase.

Authors:  V V McDougal; L A Guarino
Journal:  J Virol       Date:  2000-06       Impact factor: 5.103

7.  Evidence for nucleic acid binding ability and nucleosome association of Bombyx mori nucleopolyhedrovirus BRO proteins.

Authors:  E A Zemskov; W Kang; S Maeda
Journal:  J Virol       Date:  2000-08       Impact factor: 5.103

8.  The sequence of the Helicoverpa armigera single nucleocapsid nucleopolyhedrovirus genome.

Authors:  Xinwen Chen; Wilfred F J IJkel; Renato Tarchini; Xiulian Sun; Hans Sandbrink; Hualin Wang; Sander Peters; Douwe Zuidema; René Klein Lankhorst; Just M Vlak; Zhihong Hu
Journal:  J Gen Virol       Date:  2001-01       Impact factor: 3.891

9.  Baculovirus alkaline nuclease possesses a 5'-->3' exonuclease activity and associates with the DNA-binding protein LEF-3.

Authors:  Victor S Mikhailov; Kazuhiro Okano; George F Rohrmann
Journal:  J Virol       Date:  2003-02       Impact factor: 5.103

10.  Genome sequence and analysis of Buzura suppressaria nucleopolyhedrovirus: a group II Alphabaculovirus.

Authors:  Zheng Zhu; Feifei Yin; Xiaoping Liu; Dianhai Hou; Jun Wang; Lei Zhang; Basil Arif; Hualin Wang; Fei Deng; Zhihong Hu
Journal:  PLoS One       Date:  2014-01-24       Impact factor: 3.240

View more
  7 in total

1.  Genome Characteristics of the Cyclophragma Undans Nucleopolyhedrovirus: A Distinct Species in Group I of Alphabaculovirus.

Authors:  Zheng Zhu; Jun Wang; Qianran Wang; Feifei Yin; Xiaoping Liu; Dianhai Hou; Lei Zhang; Haizhou Liu; Jiang Li; Basil M Arif; Hualin Wang; Fei Deng; Zhihong Hu; Manli Wang
Journal:  Virol Sin       Date:  2018-08-28       Impact factor: 4.327

2.  The complete sequence of the first Spodoptera frugiperda Betabaculovirus genome: a natural multiple recombinant virus.

Authors:  Paola E Cuartas; Gloria P Barrera; Mariano N Belaich; Emiliano Barreto; Pablo D Ghiringhelli; Laura F Villamizar
Journal:  Viruses       Date:  2015-01-20       Impact factor: 5.048

3.  The Operophtera brumata Nucleopolyhedrovirus (OpbuNPV) Represents an Early, Divergent Lineage within Genus Alphabaculovirus.

Authors:  Robert L Harrison; Daniel L Rowley; Joseph D Mowery; Gary R Bauchan; John P Burand
Journal:  Viruses       Date:  2017-10-21       Impact factor: 5.048

4.  A Novel Alphabaculovirus from the Soybean Looper, Chrysodeixis includens, that Produces Tetrahedral Occlusion Bodies and Encodes Two Copies of he65.

Authors:  Robert L Harrison; Daniel L Rowley; Holly J R Popham
Journal:  Viruses       Date:  2019-06-26       Impact factor: 5.048

5.  Identification of Multiple Replication Stages and Origins in the Nucleopolyhedrovirus of Anticarsia gemmatalis.

Authors:  Solange A B Miele; Carolina S Cerrudo; Cintia N Parsza; María Victoria Nugnes; Diego L Mengual Gómez; Mariano N Belaich; P Daniel Ghiringhelli
Journal:  Viruses       Date:  2019-07-15       Impact factor: 5.048

6.  Genome Analysis of a Novel Clade II.b Alphabaculovirus Obtained from Artaxa digramma.

Authors:  Jiang Li; Xiaoyan Duan; Qianran Wang; Lei Zhang; Fei Deng; Hualin Wang; Zhihong Hu; Manli Wang; Jun Wang
Journal:  Viruses       Date:  2019-10-09       Impact factor: 5.048

7.  Genome of Cnaphalocrocis medinalis Granulovirus, the First Crambidae-Infecting Betabaculovirus Isolated from Rice Leaffolder to Sequenced.

Authors:  Guangjie Han; Jian Xu; Qin Liu; Chuanming Li; Hongxing Xu; Zhongxian Lu
Journal:  PLoS One       Date:  2016-02-05       Impact factor: 3.240

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.