Solange Ana Belen Miele, Matías Javier Garavaglia, Mariano Nicolás Belaich, Pablo Daniel Ghiringhelli.
Abstract
The Baculoviridae is a large group of insect viruses with circular double-stranded DNA genomes of 80 to 180 kbp. In this study, genome sequences from 57 baculoviruses were analyzed to reevaluate the number and identity of core genes and to understand the distribution of the remaining coding sequences. Thirty-one core genes with orthologs in all genomes were identified, along with 895 other genes that differ in their degree of representation among the reported genomes. Many of these latter genes are common to well-defined lineages, whereas others are unique to one or a few of the viruses. Phylogenetic analyses based on core gene sequences and the gene composition of the genomes supported the current division of the Baculoviridae into 4 genera: Alphabaculovirus, Betabaculovirus, Gammabaculovirus, and Deltabaculovirus.
Year: 2011 PMID: 21716740 PMCID: PMC3119482 DOI: 10.4061/2011/379424
Source DB: PubMed Journal: Int J Evol Biol ISSN: 2090-052X
Table 1: Baculovirus complete genomes.
| Abbreviation | Code | Accession number | Genome (bp) | Annotated ORFs | GC% | Ref. | | |
|---|---|---|---|---|---|---|---|---|
| AnpeNPV-Z | APN | NC_008035 | 126629 | 145 | 53.5 | [ | ||
| AnpeNPV-L2 | AP2 | EF207986 | 126246 | 144 | 53.5 | [ | ||
| AgMNPV-2D | AGN | NC_008520 | 132239 | 152 | 44.5 | [ | ||
| AcMNPV-C6 | ACN | NC_001623 | 133894 | 154 | 40.7 | [ | ||
| BmNPV | BMN | NC_001962 | 128413 | 137 | 40.4 | [ | ||
| BomaNPV | BON | NC_012672 | 126770 | 141 | 40.2 | [ | ||
| CfDEFMNPV | CDN | NC_005137 | 131160 | 149 | 45.8 | [ | ||
| CfMNPV | CFN | NC_004778 | 129593 | 145 | 50.1 | [ | ||
| EppoNPV | EPN | NC_003083 | 118584 | 136 | 40.7 | [ | ||
| HycuNPV | HCN | NC_007767 | 132959 | 148 | 45.5 | [ | ||
| MaviMNPV | MVN | NC_008725 | 111953 | 126 | 38.6 | [ | ||
| OpMNPV | OPN | NC_001875 | 131995 | 152 | 55.1 | [ | ||
| PlxyMNPV | PXN | NC_008349 | 134417 | 149 | 40.7 | U | ||
| RoMNPV | RON | NC_004323 | 131526 | 146 | 39.1 | [ | ||
| AdhoNPV | AHN | NC_004690 | 113220 | 125 | 35.6 | [ | ||
| AdorNPV | AON | NC_011423 | 111724 | 121 | 35.0 | [ | ||
| AgipNPV | AIN | NC_011345 | 155122 | 163 | 48.6 | U | ||
| AgseNPV | ASN | NC_007921 | 147544 | 153 | 45.7 | [ | ||
| ApciNPV | APO | FJ914221 | 123876 | 118 | 33.4 | U | ||
| ChChNPV | CCN | NC_007151 | 149622 | 151 | 39.0 | [ | ||
| ClbiNPV | CBN | NC_008293 | 135454 | 129 | 37.7 | [ | ||
| EcobNPV | EON | NC_008586 | 131204 | 126 | 37.6 | [ | ||
| EupsNPV | EUN | NC_012639 | 141291 | 139 | 40.4 | [ | ||
| HearNPV-C1 | HA1 | NC_003094 | 130759 | 135 | 38.9 | [ | ||
| HearNPV-G4 | HA4 | NC_002654 | 131405 | 135 | 39.0 | [ | ||
| HearMNPV | HAN | NC_011615 | 154196 | 162 | 40.1 | [ | ||
| HearSNPV-NNg1 | HAS | NC_011354 | 132425 | 143 | 39.2 | [ | ||
| HzSNPV | HZN | NC_003349 | 130869 | 139 | 39.1 | U | ||
| LeseNPV-AH1 | LSN | NC_008348 | 168041 | 169 | 48.6 | [ | ||
| LdMNPV | LDN | NC_001973 | 161046 | 163 | 57.5 | [ | ||
| LyxyMNPV | LXN | NC_013953 | 156344 | 157 | 53.5 | [ | ||
| MacoNPV-90-2 | MCN | NC_003529 | 155060 | 169 | 41.7 | [ | ||
| MacoNPV-90-4 | MC4 | AF539999 | 153656 | 168 | 41.7 | [ | ||
| MacoNPV-B | MCB | NC_004117 | 158482 | 169 | 40.0 | [ | ||
| OrleNPV | OLN | NC_010276 | 156179 | 135 | 39.9 | U | ||
| SeMNPV | SEN | NC_002169 | 135611 | 142 | 43.8 | U | ||
| SfMNPV-3AP2 | SF2 | NC_009011 | 131330 | 143 | 40.2 | [ | ||
| SfMNPV-19 | SF9 | EU258200 | 132565 | 141 | 40.3 | [ | ||
| SpliNPV-II | SLN | NC_011616 | 148634 | 147 | 45.0 | U | ||
| SpliNPV-G2 | SL2 | NC_003102 | 139342 | 141 | 42.8 | [ | ||
| TnSNPV | TNN | NC_007383 | 134394 | 144 | 39.0 | [ | ||
| AdorGV | AOG | NC_005038 | 99657 | 119 | 34.5 | [ | ||
| AgseGV | ASG | NC_005839 | 131680 | 132 | 37.3 | U | ||
| ChocGV | COG | NC_008168 | 104710 | 116 | 32.7 | [ | ||
| CrleGV | CLG | NC_005068 | 110907 | 129 | 32.4 | [ | ||
| CpGV | CPG | NC_002816 | 123500 | 143 | 45.3 | [ | ||
| HearGV | HAG | NC_010240 | 169794 | 179 | 40.8 | [ | ||
| PhopGV | POG | NC_004062 | 119217 | 130 | 35.7 | [ | ||
| PlxyGV | PXG | NC_002593 | 100999 | 120 | 40.7 | [ | ||
| PiraGV | PRG | GQ884143 | 108592 | 120 | 33.2 | U | ||
| PsunGV | PUG | EU678671 | 176677 | 183 | 39.8 | U | ||
| SpliGV | SLG | NC_009503 | 124121 | 136 | 38.8 | [ | ||
| XnGV | XCG | NC_002331 | 178733 | 181 | 40.7 | [ | ||
| NeabNPV | NAN | NC_008252 | 84264 | 93 | 33.4 | [ | ||
| NeleNPV | NLN | NC_005906 | 81755 | 93 | 33.3 | [ | ||
| NeseNPV | NSN | NC_005905 | 86462 | 90 | 33.8 | [ | ||
| CuniNPV | CNN | NC_003084 | 108252 | 109 | 50.9 | [ | ||
This table lists all baculoviruses used in the bioinformatic studies, sorted by genus and, within each genus, in alphabetical order. MNPV: multicapsid nucleopolyhedrovirus; NPV: nucleopolyhedrovirus; SNPV: single nucleopolyhedrovirus; GV: granulovirus. The accession numbers are from the National Center for Biotechnology Information (NCBI, http://www.ncbi.nlm.nih.gov/) and correspond to the sequences of complete genomes. Code is a three-letter acronym used for practicality. U: unpublished.
Figure 1: GC content in baculovirus genomes. The histograms show the distribution of baculovirus genomes according to their GC content and genus classification. Black bars highlight genomes with a GC content higher than 50%.
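The GC classification described for Figure 1 can be illustrated with a minimal Python sketch. The GC% values are copied from Table 1 for six genomes only; the function names `gc_content` and `high_gc` are hypothetical helpers introduced here and are not part of the original study.

```python
# GC% values copied from Table 1 for a handful of genomes (code -> GC%).
gc_percent = {
    "ACN": 40.7,
    "OPN": 55.1,
    "LDN": 57.5,
    "CPG": 45.3,
    "NSN": 33.8,
    "CNN": 50.9,
}


def gc_content(sequence: str) -> float:
    """Return the GC percentage of a nucleotide sequence, counting A/C/G/T only."""
    seq = sequence.upper()
    counted = sum(seq.count(base) for base in "ACGT")
    if counted == 0:
        raise ValueError("sequence contains no A/C/G/T bases")
    return 100.0 * (seq.count("G") + seq.count("C")) / counted


def high_gc(values: dict, threshold: float = 50.0) -> list:
    """Codes of genomes whose GC% is strictly above the threshold, sorted by code."""
    return sorted(code for code, gc in values.items() if gc > threshold)


print(high_gc(gc_percent))    # ['CNN', 'LDN', 'OPN']
print(gc_content("ATGCGC"))   # 4 of 6 bases are G or C
```

With the full Table 1 loaded into `gc_percent`, `high_gc` would return the genomes drawn as black bars in the figure.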
Core genes.
| Gene | ACN | LDN | CPG | NSN | CNN |
|---|---|---|---|---|---|
| lef-1 [ | 14 | 123 | 74 | 68 | 45 |
| lef-2 [ | 6 | 137 | 41 | 57 | 25 |
| DNA pol [ | 65 | 83 | 111 | 28 | 91 |
| Helicase [ | 95 | 97 | 90 | 61 | 89 |
| lef-4 [ | 90 | 93 | 95 | 62 | 96 |
| lef-8 [ | 50 | 51 | 131 | 81 | 26 |
| lef-9 [ | 62 | 64 | 117 | 40 | 59 |
| p47 [ | 40 | 48 | 68 | 49 | 73 |
| lef-5 [ | 99 | 100 | 87 | 58 | 88 |
| p6.9 [ | 100 | 101 | 86 | 36 | 23 |
| vp39 [ | 89 | 92 | 96 | 89 | 24 |
| vlf-1 [ | 77 | 86 | 106 | 45 | 18 |
| alk-exo [ | 133 | 157 | 125 | 31 | 53 |
| vp1054 [ | 54 | 57 | 138 | 85 | 8 |
| vp91/p95 [ | 83 | 91 | 101 | 84 | 35 |
| gp41 [ | 80 | 88 | 104 | 47 | 33 |
| 38 k [ | 98 | 99 | 88 | 59 | 87 |
| p33 [ | 92 | 94 | 93 | 24 | 14 |
| odv-ec43 [ | 109 | 107 | 55 | 70 | 69 |
| p49 [ | 142 | 20 | 15 | 63 | 30 |
| odv-nc42 [ | 68 | 80 | 114 | 41 | 58 |
| odv-e18 [ | 143 | 19 | 14 | 65 | 31 |
| desmoplakin [ | 66 | 82 | 112 | 29 | 92 |
| odv-e27 [ | 144 | 18 | 97 | 66 | 32 |
| ac81 [ | 81 | 89 | 103 | 48 | 106 |
| pif-0/p74 [ | 138 | 27 | 60 | 50 | 74 |
| pif-1 [ | 119 | 155 | 75 | 79 | 29 |
| pif-2 [ | 22 | 119 | 48 | 55 | 38 |
| pif-3 [ | 115 | 143 | 35 | 69 | 46 |
| pif-4/19k/odv-e28 [ | 96 | 98 | 89 | 60 | 90 |
| pif-5/odv-e56 [ | 148 | 14 | 18 | 38 | 102 |
Virus names are given as the three-letter codes established in Table 1.
Numbers in each column indicate the corresponding ORF in that genome.
Figure 2: Baculovirus core genes. The circles represent the 4 baculovirus genera (yellow, Alphabaculovirus; green, Betabaculovirus; red, Gammabaculovirus; blue, Deltabaculovirus). The numbers within the overlapping regions indicate the number of genes shared by all members of the overlapping genera. The numbers within the circles but outside the overlapping regions indicate the number of genes shared by all members of that genus with no orthologous sequences in the remaining genera. These estimates were inferred with the BLASTP algorithm (http://www.ncbi.nlm.nih.gov/), using E = 0.001 as the cutoff value and comparing all reported baculovirus ORFs against each other. The identity of the common genes is provided in the Supplementary data available at doi:10.4061/2011/379424
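As a rough illustration of how shared genes could be derived from all-against-all BLASTP hits with the E = 0.001 cutoff, here is a minimal Python sketch. It assumes hits are already parsed into `(query ORF, subject ORF, E-value)` tuples and that ORF identifiers follow the three-letter code plus ORF number convention (for example `ACN65`). The function names, the inclusive treatment of the cutoff, and the E-values are assumptions made for illustration; `LDN1` and `CPG1` are placeholder identifiers.

```python
from collections import defaultdict

E_CUTOFF = 0.001  # cutoff value stated in the figure caption


def genome_of(orf_id: str) -> str:
    """Assumed ID convention: three-letter genome code + ORF number, e.g. 'ACN65'."""
    return orf_id[:3]


def shared_orfs(hits, reference, genomes, cutoff=E_CUTOFF):
    """Return ORFs of `reference` with a hit at or below `cutoff` in every other genome."""
    found = defaultdict(set)  # reference ORF -> genomes in which it has a passing hit
    for query, subject, evalue in hits:
        if genome_of(query) == reference and evalue <= cutoff:
            found[query].add(genome_of(subject))
    others = set(genomes) - {reference}
    return sorted(orf for orf, seen in found.items() if others <= seen)


# Hypothetical hits; the E-values are invented for the example.
hits = [
    ("ACN65", "LDN83", 1e-120),   # DNA pol ORFs as listed in the core gene table
    ("ACN65", "CPG111", 1e-80),
    ("ACN8", "LDN1", 1e-90),      # placeholder subject ORF
    ("ACN8", "CPG1", 0.5),        # fails the cutoff, so ACN8 is not shared
]

print(shared_orfs(hits, "ACN", ["ACN", "LDN", "CPG"]))  # ['ACN65']
```

Passing the genomes of a single genus as `genomes` would give the genes shared by all of its members. Passing all 57 genomes would give the candidate core set.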
Shared genes*.
| lef-2 (ACN6), lef-1 (ACN14), pif-2 (ACN22), p47 (ACN40), lef-8 (ACN50), vp1054 (ACN54), lef-9 (ACN62), DNA polymerase (ACN65), Desmoplakin (ACN66), ACN68, vlf-1 (ACN77), gp41 (ACN80), ACN81, vp91/p95 (ACN83), vp39 (ACN89), lef-4 (ACN90), p33 (ACN92), helicase (ACN95), 19K (ACN96), 38 K (ACN98), lef-5 (ACN99), p6.9 (ACN100), odv-ec43 (ACN109), PIF-3 (ACN115), pif-1 (ACN119), alkaline exonuclease (ACN133), p74 (ACN138), p49 (ACN142), odv-e18 (ACN143), odv-e27 (ACN144), odv-e56 (ACN148) |
| Polh (ACN8), dbp (ACN25), p48 (ACN103), ACN145, pp34/PEP (ACN131), odv-e25 (ACN94), p40 (ACN101), ACN106/107 |
| F-protein (ACN23) |
| pk-1 (ACN10), 38,7 kDa (ACN13), lef-6 (ACN28), pp31/39K (ACN36), ACN38, ACN53, 25K FP (ACN61), LEF-3 (ACN67), ACN75, ACN76, tlp20 (ACN82), p18 (ACN93), P12 (ACN102), ACN108, p24 (ACN129), me53 (ACN139), ACN146, ie-1 (ACN147) |
| orf1629 capsid (ACN9), ACN19, pkip-1 (ACN24), ACN34, ACN51, iap-2 (ACN58/59), ACN104, p87/vp80 (ACN141), ie-0 (ACN71) |
| ptp-1/bvp (ACN1), ACN5, odv-e26 (ACN16), iap-1 (ACN27), ACN30, ACN72, ACN73, ACN114, ACN124, gp64 (ACN128), p25 (ACN132), ie-2 (ACN151) |
| CPG4, CPG5, CPG20, CPG23, CPG29, CPG33, CPG39, CPG45, Metalloproteinase (CPG46), CPG62, FGF-1 (CPG76), CPG79, CPG99, CPG100, CPG115, IAP-5 (CPG116), CPG123, CPG135, FGF-3 (CPG140) |
| NSN3, NSN9, NSN11, NSN12, NSN13, NSN16, NSN18, NSN19, NSN20, NSN26, NSN29, NSN34, NSN37, NSN39, NSN42, NSN43, NSN44, NSN51, NSN52, NSN53, NSN54, NSN56, NSN64, NSN72, NSN74, NSN76, NSN77, NSN79, NSN82, NSN85, NSN86, NSN89 |
| CNN2, CNN3, CNN6, CNN7, CNN9, CNN10, CNN11, CNN12, CNN13, CNN15, CNN16, CNN17, CNN20, CNN21, CNN22, CNN27, CNN28, CNN31, CNN36, CNN37, CNN39, CNN40, CNN41, CNN42, CNN43, CNN44, CNN47, CNN48, CNN49, CNN50, CNN51, CNN52, CNN53, CNN55, CNN56, CNN57, CNN60, CNN61, CNN62, CNN63, CNN64, CNN65, CNN66, CNN67, CNN68, CNN70, CNN71, CNN72, CNN75, CNN76, CNN77, CNN78, CNN79, CNN80, CNN81, CNN82, CNN83, CNN84, CNN85, CNN86, CNN93, CNN94, CNN97, CNN98, CNN99, CNN100, CNN101, CNN103, CNN105, CNN107 |
*Shared genes are indicated for only one selected species. See the supplementary tables for the respective ORF numbers in each species.
Figure 3: Whole baculovirus gene content. The histogram shows the number of different reported genes in each baculovirus genus or recognized lineage (pink bars) and the subset of genes shared by all members of the corresponding phylogenetic clade (green bars). The bar graph was built from the comparison of all ORFs reported in the 57 baculoviruses with known genomes, analyzed all against all with the BLASTP algorithm (http://www.ncbi.nlm.nih.gov/) using E = 0.001 as the cutoff value.
Figure 4: Baculovirus genome phylogeny. Cladogram based on the amino acid sequences of core genes. The 31 core genes identified in the family Baculoviridae were aligned independently using the MEGA 4 program [25] with gap open penalty = 10, gap extension penalty = 1, and the Dayhoff matrix [26]. A concatemer was then generated and the phylogeny inferred using the same software (UPGMA; bootstrap with 1000 replicates; gap/missing data = complete deletion; model = amino (Dayhoff matrix); patterns among sites = same (homogeneous); rates among sites = different (gamma distributed); gamma parameter = 2.25). Baculoviruses are identified by the acronyms given in Table 1, and the accepted distribution into lineages and genera is also indicated. Gammabaculovirus and Deltabaculovirus are referenced by Greek letters. The proposed clades of betabaculoviruses are shown in bold letters.
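The concatenate-then-cluster workflow in this caption can be sketched in plain Python. This is a simplified stand-in rather than the MEGA 4 procedure: it uses a plain p-distance (fraction of differing sites) instead of the Dayhoff model with gamma-distributed rates, and it omits bootstrapping. The toy sequences are invented and all function names are hypothetical.

```python
from itertools import combinations


def concatenate(alignments, taxa):
    """Join per-gene aligned sequences into one concatemer per taxon."""
    return {t: "".join(gene[t] for gene in alignments) for t in taxa}


def complete_deletion(seqs):
    """Drop every alignment column that has a gap ('-') in any taxon."""
    taxa = list(seqs)
    columns = zip(*(seqs[t] for t in taxa))
    keep = [i for i, col in enumerate(columns) if "-" not in col]
    return {t: "".join(seqs[t][i] for i in keep) for t in taxa}


def p_distance(a, b):
    """Fraction of differing sites (stand-in for the Dayhoff/gamma distance)."""
    if not a:
        raise ValueError("no alignment columns left to compare")
    return sum(x != y for x, y in zip(a, b)) / len(a)


def upgma(names, dist):
    """UPGMA clustering; `dist` maps frozenset({a, b}) -> distance. Returns nested tuples."""
    clusters = {name: (name, 1) for name in names}  # key -> (subtree, number of leaves)
    d = dict(dist)
    next_id = 0
    while len(clusters) > 1:
        a, b = min(combinations(clusters, 2), key=lambda p: d[frozenset(p)])
        (tree_a, n_a), (tree_b, n_b) = clusters.pop(a), clusters.pop(b)
        new = f"node{next_id}"  # assumes no taxon is named 'nodeN'
        next_id += 1
        for c in clusters:
            # Size-weighted average of the distances from the merged clusters to c.
            d[frozenset((new, c))] = (
                n_a * d[frozenset((a, c))] + n_b * d[frozenset((b, c))]
            ) / (n_a + n_b)
        clusters[new] = ((tree_a, tree_b), n_a + n_b)
    (tree, _), = clusters.values()
    return tree


# Invented toy alignments for two "genes" in three taxa.
taxa = ["ACN", "BMN", "CPG"]
genes = [
    {"ACN": "MKLV", "BMN": "MKLV", "CPG": "MRIV"},
    {"ACN": "AT-G", "BMN": "ATSG", "CPG": "GTSA"},
]
seqs = complete_deletion(concatenate(genes, taxa))
dist = {frozenset(p): p_distance(seqs[p[0]], seqs[p[1]]) for p in combinations(taxa, 2)}
tree = upgma(taxa, dist)
print(tree)  # ('CPG', ('ACN', 'BMN'))
```

In the toy data ACN and BMN are identical after gap removal, so they merge first and CPG joins last.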
Figure 5: Baculovirus core gene variability. Histograms show the average PAM250 distance for each core gene with its standard deviation. These values were calculated using the MEGA 4 program (UPGMA; bootstrap with 1000 replicates; gap/missing data = complete deletion; model = amino (Dayhoff matrix); patterns among sites = same (homogeneous); rates among sites = different (gamma distributed); gamma parameter = 2.25). PAM (point accepted mutation) matrices model the evolutionary distance between pairs of sequences. Given the weak similarity between several core proteins, the PAM250 matrix was selected. The divergence assumed in this matrix is 250 accepted point mutations per 100 amino acid residues, so it is suited to the analysis of more distantly related sequences. PAM250 is considered a good general matrix for protein similarity searches.
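The per-gene summary plotted in Figure 5 (mean pairwise distance with its standard deviation) can be sketched as follows. The distance values are invented placeholders, not results from the study, and the use of the sample standard deviation is an assumption, since the caption does not say which estimator was used.

```python
from statistics import mean, stdev


def summarize(distances_by_gene):
    """Map each gene to (mean, sample standard deviation) of its pairwise distances."""
    return {gene: (mean(v), stdev(v)) for gene, v in distances_by_gene.items()}


def rank_by_variability(summary):
    """Gene names ordered from most to least divergent (highest mean distance first)."""
    return sorted(summary, key=lambda gene: summary[gene][0], reverse=True)


# Hypothetical pairwise distances per core gene (placeholder numbers).
pairwise = {
    "lef-8": [0.2, 0.4, 0.6],
    "ac81": [1.0, 1.5, 2.0],
}

summary = summarize(pairwise)
for gene in rank_by_variability(summary):
    avg, sd = summary[gene]
    print(f"{gene}: mean={avg:.2f} sd={sd:.2f}")
```

Each list would normally hold the distances between every pair of the 57 orthologs of one core gene.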