| Literature DB >> 19464330 |
Mary R Odom1, R Curtis Hendrickson, Elliot J Lefkowitz.
Abstract
To investigate the evolutionary origins of proteins encoded by the Poxviridae family of viruses, we examined all poxvirus protein coding genes using a method of characterizing and visualizing the similarity between these proteins and taxonomic subsets of proteins in GenBank. Our analysis divides poxvirus proteins into categories based on their relative degree of similarity to two different taxonomic subsets of proteins such as all eukaryote vs. all virus (except poxvirus) proteins. As an example, this allows us to identify, based on high similarity to only eukaryote proteins, poxvirus proteins that may have been obtained by horizontal transfer from their hosts. Although this method alone does not definitively prove horizontal gene transfer, it allows us to provide an assessment of the possibility of horizontal gene transfer for every poxvirus protein. Potential candidates can then be individually studied in more detail during subsequent investigation. Results of our analysis demonstrate that in general, proteins encoded by members of the subfamily Chordopoxvirinae exhibit greater similarity to eukaryote proteins than to proteins of other virus families. In addition, our results reiterate the important role played by host gene capture in poxvirus evolution; highlight the functions of many genes poxviruses share with their hosts; and illustrate which host-like genes are present uniquely in poxviruses and which are also present in other virus families.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19464330 PMCID: PMC2779260 DOI: 10.1016/j.virusres.2009.05.006
Source DB: PubMed Journal: Virus Res ISSN: 0168-1702 Impact factor: 3.303
Fig. 1(A) Best BLASTP scores of poxvirus proteins against all eukaryote proteins vs. against proteins of all viruses except poxviruses. Inset shows regions described in the text. (B) Best BLASTP scores of poxvirus proteins against all bacteria proteins vs. against proteins of all viruses except poxviruses. (C) Best BLASTP scores of poxvirus proteins against all eukaryote proteins vs. against all bacteria proteins. Plotted chordopoxvirus proteins (6443 points) are represented by black squares, and plotted entomopoxvirus proteins (278 points) are represented by red squares. Notable points mentioned in the text are circled. Points circled in blue are proteins of an avian retrovirus integrated into the genome of fowlpox virus. The cluster of points circled in purple are the large subunit of ribonucleotide reductase.
Number and percentage of poxvirus points in each region of Fig. 1.
| Region | No. of points | % of points |
|---|---|---|
| A, near | 657 | 2.3% |
| B, between | 65 | 0.2% |
| C, near diagonal | 1,657 | 5.7% |
| D, between diagonal and | 1,377 | 4.8% |
| E, near | 2,965 | 10.3% |
| F, near origin | 22,190 | 76.8% |
| A, near | 1,695 | 5.8% |
| B, between | 619 | 2.1% |
| C, near diagonal | 1,065 | 3.7% |
| D, between diagonal and | 622 | 2.1% |
| E, near | 1,307 | 4.5% |
| F, near origin | 23,783 | 81.8% |
| A, near | 258 | 0.9% |
| B, between | 45 | 0.2% |
| C, near diagonal | 1,000 | 3.4% |
| D, between diagonal and | 1,882 | 6.5% |
| E, near | 3,371 | 11.6% |
| F, near origin | 22,535 | 77.5% |
Fig. 2Phylogenetic reconstructions to investigate evolutionary histories of three poxvirus proteins appearing in different regions of the plot in Fig. 1A. All pictured trees were constructed by the method of Bayesian inference using MrBayes. The resulting topology for each tree agrees exactly with topology produced from the same alignment by the Maximum Likelihood method using Garli, and either agrees exactly or is very similar to topology produced by the Maximum Parsimony method using MEGA. MrBayes simulations for all three alignments were run with the GTR nucleotide substitution model and gamma distributed rate variation with an estimated proportion of invariable sites. The legend below each tree shows the scale for branch lengths as measured in expected nucleotide substitutions per site. The number to the right of each taxon name is the protein GI number for that sequence. (A) Variola virus B22R (plotted in region A near the virus axis) is a large surface glycoprotein and appears outside the poxvirus family only in the carp herpesvirus CyHV-3. (B) The interleukin-10 inhibitory cytokine (plotted on the diagonal) is evidently of eukaryote origin but has several apparent homologs in diverse virus genomes, potentially acquired in distinct gene transfer events. (C) Monoglyceride lipase (plotted in region E near the eukaryote axis) is an enzyme which may facilitate use of cellular fatty acids, and may have been acquired from a fish or reptilian host by a poxvirus ancestral to the orthopoxvirus and yatapoxvirus genera.
Fig. 3All proteins of cowpox strain GRI-90 were analyzed by taxonomic group plots, to compare the relationships of core and non-core protein subsets with proteins of eukaryotes and with proteins of viruses outside the poxvirus family. Panel (A) represents proteins classified according to genomic locus, as non-centrally located (black squares, 99 points) or centrally located (red squares, 115 points). Panel (B) represents proteins classified according to the number of poxvirus species with conserved orthologs, with genes in only 1–10 species in black (20 points), genes in 11–20 species in purple (59 points), genes in 21–30 species in blue (28 points), genes in 31–35 species in green (21 points), and genes in 36–40 species in red (86 points).
Fig. 4A genome map of cowpox strain Gri-90 is color coded (see legend) according to the degree of similarity of each cowpox protein to its best hit when compared against all virus (non-pox) or eukaryote proteins.
Similarity of each cowpox virus Gri-90 protein to the best blastp hit in the eukaryote and virus (non-pox) protein datasets.
| Region | CPXV protein | Protein description | Eukaryote with best score | Euk. Bitscore | Virus with best score | Virus Bitscore |
|---|---|---|---|---|---|---|
| A | B22R | Surface glycoprotein | 51 | Cyprinid herpesvirus 3 | 156 | |
| E11L | NPH-I/Helicase, virion | 60 | 135 | |||
| B | J6R | Topoisomerase type I | 64 | 97 | ||
| E6R | Morph, VETF-s (early transcription factor small) | 67 | Lymphocystis disease virus 1 | 99 | ||
| E5R | NTPase, DNA replication | 41 | 60 | |||
| C | A41R | Semaphorin/CD100 antigen | 135 | Ovine herpesvirus 2 | 146 | |
| B4R | Complement control/CD46/EEV | 89 | 91 | |||
| D9L | C-type lectin | 86 | Rat cytomegalovirus | 86 | ||
| O4R | RNA pol (RPO147) | 201 | 186 | |||
| A51R | Thymidylate kinase | 167 | Chilo iridescent virus | 151 | ||
| G4L | Ribonucleotide Reductase small subunit | 506 | 457 | |||
| F9L | DNA-directed DNA polymerase | 113 | Human herpesvirus 7 | 102 | ||
| A47L | Hydroxysteroid dehydrogenase | 279 | Rana grylio virus 9506 | 247 | ||
| A42R | Lectin homolog | 62 | African swine fever virus | 54 | ||
| C17L | Complement binding (secreted) | 190 | 166 | |||
| J1L | Tyr/Ser phosphatase | 69 | Chilo iridescent virus | 61 | ||
| C3L | Ankyrin | 88 | 76 | |||
| A25R | RNA pol 132 (RPO132) | 239 | 207 | |||
| G2L | DeoxyUTP pyrophosphatase (dUTPase) | 164 | 138 | |||
| L4L | Ribonucleotide reductase large subunit | 1226 | 1011 | |||
| B1R | Ser/Thr kinase | 246 | Chilo iridescent virus | 194 | ||
| D | B11R | Ser/Thr kinase | 154 | Chilo iridescent virus | 120 | |
| K1R | Ankyrin | 84 | 62 | |||
| B18R | Ankyrin | 79 | 58 | |||
| B3R | Ankyrin | 112 | 82 | |||
| C7R | Ubiquitin Ligase/host defense modulator | 71 | Rock bream iridovirus | 52 | ||
| M1L | Ankyrin/NFkB inhib | 71 | 51 | |||
| L8R | RNA helicase/NPH-II | 70 | 50 | |||
| C1L | Ankyrin | 64 | 45 | |||
| A56R | TNF receptor (CrmC) | 108 | Grouper iridovirus | 74 | ||
| D14L | Ankyrin | 87 | 59 | |||
| B16R | Ankyrin | 91 | 62 | |||
| C11L | Ankyrin | 68 | 45 | |||
| P1L | Ankyrin | 94 | 61 | |||
| K3R | TNF-a receptor/CD27 cysteine-rich region | 136 | Singapore grouper iridovirus | 87 | ||
| K2R | TNF receptor (CrmD) | 120 | Singapore grouper iridovirus | 72 | ||
| S2R | Thymidine kinase | 248 | Cyprinid herpesvirus 3 | 147 | ||
| D2L, I4R | TNF-α receptor II (CrmB) | 149 | Grouper iridovirus | 87 | ||
| M2L | Proteinase inhibitor I4, serpin | 160 | 92 | |||
| D3L, I3R | Ankyrin | 97 | 55 | |||
| B12R | Serpin | 196 | 102 | |||
| D13L | Unknown | 84 | Lymphocystis disease virus—isolate China | 43 | ||
| D7L | Kelch-like | 71 | 34 | |||
| A26L | A type inclusion protein | 134 | 64 | |||
| O1R | Poly(A) polymerase-small (VP39) | 66 | Vibrio phage CTX | 31 | ||
| B20R | Serpin | 202 | 94 | |||
| C5R | Epidermal growth factor | 63 | Crimean-Congo hemorrhagic fever virus | 29 | ||
| A53R | DNA ligase | 622 | 274 | |||
| C18L | Kelch-like | 111 | Pseudomonas phage phiEL | 48 | ||
| A44R | Profilin homolog | 72 | Bacteriophage phi-MhaA1-PHL101 | 30 | ||
| B19R | Kelch-like (EV-M-167) | 107 | 45 | |||
| A57R | Kelch-like | 202 | Human papillomavirus type 68 | 71 | ||
| D11L | Kelch-like | 138 | Pseudomonas phage phiKZ | 43 | ||
| E | R2L | Glutaredoxin 1 | 105 | 33 | ||
| B9R | Kelch-like | 115 | Pseudomonas phage phiKZ | 35 | ||
| G3L | Kelch-like | 119 | Pseudomonas phage phiKZ | 35 | ||
| A40L | CD47-like | 119 | Ranid herpesvirus 2 | 34 | ||
| G13L | Phospholipase EEV | 118 | Heliothis zea virus 1 | 32 | ||
| B14R | IL-1 beta receptor | 144 | Enterobacteria phage RB69 | 35 | ||
| E8L | Carbonic anhydrase/Virion | 143 | 32 | |||
| B2R | Schlafen | 231 | 47 | |||
| A59R | Guanylate kinase | 202 | Sapovirus SaKaeo-15/Thailand | 35 | ||
| M5L | Putative monoglyceride lipase | 280 | 42 | |||
| T1R | NMDA receptor-like protein | 256 | Chimpanzee cytomegalovirus | 32 | ||
| M4L | Nicking-joining enzyme | 368 | 32 | |||
| F | E10R | mutT motif/NPH-PPH/RNA levels regulator | 40 | 53 | ||
| A19R | DNA Helicase, transcription | 48 | 59 | |||
| D10L | CPV-B-012 | 49 | Rat cytomegalovirus | 51 | ||
| E1R | Large capping enzyme | 58 | 52 | |||
| D4L, I2R | Ankyrin | 59 | 46 | |||
| A48R | Superoxide dismutase-like | 57 | 44 | |||
| C9L | Ankyrin/host range | 59 | 45 | |||
| D8L | Ankyrin | 59 | 42 | |||
| A22R | DNA processivity factor | 52 | Bacteriophage 85 | 35 | ||
| C15L | Unknown | 55 | 36 | |||
| F3L | IFN resistance/PKR inhibitor (Z-DNA binding) | 52 | 33 | |||
| B17R | IFN-alpha/beta receptor | 56 | Listeria phage A118 | 32 | ||
| B7R | IFN-gamma receptor | 57 | 31 | |||
| C10L | Unknown | 20 | Enterobacteria phage K1-5 | 19 | ||
| A39R | Unknown | 24 | Rabies virus | 23 | ||
| H6R | RNA pol | 27 | 24 | |||
| N2R | Unknown | 27 | Pseudomonas phage D3 | 25 | ||
| A14L | Virion maturation | 29 | Equid herpesvirus 2 | 26 | ||
| A31L | Virion morphogenesis | 29 | Plum pox virus | 25 | ||
| NULL | Unknown | 29 | Rice tungro bacilliform virus | 25 | ||
| G8L | Cytoplasmic protein | 28 | 27 | |||
| G14L | Unknown | 29 | 27 | |||
| B10R | Unknown | 29 | Tomato chlorosis virus | 27 | ||
| A46R | Unknown | 30 | Human immunodeficiency virus 1 | 26 | ||
| G17R | DNA-binding phosphoprotein | 30 | Murid herpesvirus 1 | 26 | ||
| NULL | Unknown | 31 | Influenza A virus (A/seal/Massachusetts/1/80(H7N7)) | 27 | ||
| L5L | IMV protein VP13 | 32 | Staphylococcus phage Twort | 28 | ||
| A13L | Structural protein | 32 | Bacteriophage phBC6A51 | 28 | ||
| A10L | Membrane protein | 34 | Influenza A virus (A/Hong Kong/481/97(H5N1)) | 26 | ||
| A3L | Thioredoxin-like | 31 | Impatiens necrotic spot virus | 30 | ||
| A18L | IMV MP PO4 | 35 | Mycoreovirus 3 | 27 | ||
| L2L | Unknown | 30 | Cyanophage phage S-PM2 | 31 | ||
| A15L | IMV PO4 MP | 32 | 30 | |||
| C2L | MPV-Z-N3R | 32 | Avian infectious bronchitis virus | 30 | ||
| N5R | Entry and fusion IMV protein | 33 | 29 | |||
| G15L | Unknown conserved | 34 | Influenza A Virus (A/Fujian/555/2003(H3N2)) | 28 | ||
| B13R | Unknown | 34 | American plum line pattern virus | 29 | ||
| G7L | Unknown | 37 | 26 | |||
| F10R | Disulfide bond formation | 31 | African swine fever virus | 32 | ||
| H2L | Unknown | 34 | 30 | |||
| A16L | Unknown | 34 | Rice stripe virus | 30 | ||
| B5R | Unknown | 34 | Little cherry virus 1 | 29 | ||
| J2R | Entry and cell–cell fusion | 34 | 29 | |||
| E7R | RNA pol 18(RPO18) | 34 | 30 | |||
| A5L | Core protein | 34 | Hibiscus latent Fort Pierce virus | 30 | ||
| G6L | Unknown | 35 | Bacteriophage 933W | 29 | ||
| D6L | Alpha-amanitin sensitivity | 34 | Bacillus thuringiensis phage MZTP02 | 31 | ||
| A36R | Unknown | Schizosaccharomyces pombe | 34 | 30 | ||
| G16L | Unknown | 34 | Maize dwarf mosaic virus | 30 | ||
| H7R | Unknown | 35 | 29 | |||
| A32R | Unknown | 35 | Avian infectious bronchitis virus | 30 | ||
| M6R | Unknown | 34 | Feline calicivirus | 31 | ||
| A34R | EEV Glycoprotein | 35 | Feline leukemia virus (strain Sarma) | 30 | ||
| F7R | Soluble/Myristyl EEV | 35 | Chilo iridescent virus | 29 | ||
| J5R | VLTF-4 (late transcription factor 4) | 35 | Measles virus | 30 | ||
| H9R | VLTF-1 | 35 | 31 | |||
| P2L | NFkB inh | 35 | Lymphocystis disease virus 1 | 31 | ||
| B21R | Unknown | 36 | Human adenovirus type 13 | 30 | ||
| B8R | Virulence factor | 34 | Simian immunodeficiency virus | 32 | ||
| A50L | Unknown | 35 | 31 | |||
| H4L | Glutaredoxin 2 | 37 | 30 | |||
| L6L | Telomere-binding protein | 35 | Staphylococcus phage Twort | 32 | ||
| Q1L | Virokine/NFkB inh/Str resemblence to apoptotic reg | 34 | 33 | |||
| A37R | IEV-specific | 35 | Influenza A virus (A/Chicken/NY/29878/91 (H2N2)) | 32 | ||
| C12L | Unknown | 34 | 34 | |||
| N1R | Myristylated MP IMV | 35 | Lymphocystis disease virus 1 | 33 | ||
| A23R | Holliday junction resolvase | 35 | 32 | |||
| G11L | Unknown | 35 | Human enterovirus 94 | 32 | ||
| F11L | Virion core protein | 35 | 32 | |||
| C4L | Unknown | 37 | 31 | |||
| C8L | IL-18 BP | 37 | Mamestra configurata nucleopolyhedrovirus B | 31 | ||
| D12L | TNF receptor (CrmB) | 35 | Ilesha virus | 33 | ||
| A1L | VLTF-2 (late transcription factor 2) | 36 | KI polyomavirus Stockholm 60 | 32 | ||
| B6R | Virulence, ER resident | 36 | Human papillomavirus type 50 | 32 | ||
| S1R | Virion morph | 38 | Cherry chlorotic rusty spot associated totiviral-like dsRNA 3 | 30 | ||
| A20L | Unknown | 37 | Mycobacteriophage Halo | 32 | ||
| A29L | IMV MP/virus entry | 35 | Porcine epidemic diarrhea virus | 33 | ||
| A21L | Entry and cell–cell Fusion | 39 | Human immunodeficiency virus type 1 | 30 | ||
| A55R | Intracellular TLR and IL-1 signaling inhibitor | 34 | Bacteriophage 2638A | 35 | ||
| E2L | Virion core | 34 | Bovine enteric calicivirus | 34 | ||
| J3L | IMV heparin binding surface protein | 37 | Enterobacteria phage JS98 | 32 | ||
| J7R | Unknown | 37 | 32 | |||
| H10R | Entry-fusion complex protein | 38 | 32 | |||
| C14L | Unknown | 37 | Bluetongue virus 22 | 32 | ||
| H8L | Virion assembly protein | 37 | 32 | |||
| A30L | RNA pol 35(RPO35) | 37 | Fiji disease virus | 33 | ||
| A6R | RNA pol 19 (RPO19) | 38 | 32 | |||
| E9R | mutT motif/NTP-PPH | 38 | Chilo iridescent virus | 32 | ||
| A43L | Virulence/secreted | 37 | Clostridium phage c-st | 34 | ||
| F5R | Virosome component | 38 | Citrus tristeza virus | 32 | ||
| D5L, I1R | Unknown | 35 | Lymphocystis disease virus 1 | 35 | ||
| A38R | Unknown | 38 | Murid herpesvirus 4 | 32 | ||
| A28L | Fusion protein | 40 | Lymphocystis disease virus—isolate China | 31 | ||
| A54R | Unknown | 38 | Taura syndrome virus | 33 | ||
| A12R | Viral membrane formation | 38 | 33 | |||
| D1L, I5R | Chemokine binding protein | 39 | 32 | |||
| E4R | Uracil-DNA glycosylase | 35 | Gallid herpesvirus 1 | 37 | ||
| O2R | RNA pol (RPO22) | 41 | 30 | |||
| F8R | ER-localized MP | 38 | 34 | |||
| A9R | VITF-3 34kda subunit | 39 | 34 | |||
| N3L | Internal virion protein | 40 | Adeno-associated virus | 32 | ||
| G12L | IEV associated | 37 | Bat coronavirus (BtCoV/133/2005) | 35 | ||
| G1L | Apoptosis inhibitor (mitochondrial-associated) | 39 | 34 | |||
| L3L | DNA-binding phosphoprotein | Tetrahymena thermophila SB210 | 40 | 32 | ||
| L7L | Virion core protease | 39 | Lactococcus phage Q54 | 34 | ||
| E3R | Virion core | 38 | 35 | |||
| G9L | Disulfide bond formation | 40 | Human immunodeficiency virus 1 | 33 | ||
| A17L | Myristylated entry/cell–cell fusion protein | 40 | Lymphocystis disease virus—isolate China | 33 | ||
| O3L | Unknown MP | 34 | Lymphocystis disease virus 1 | 39 | ||
| F6R | Unknown | 39 | Human immunodeficiency virus 1 | 35 | ||
| A24R | VITF-3 45kda subunit | 40 | 34 | |||
| A27L | P4c precursor | 41 | Bacteriophage RM 378 | 32 | ||
| C19L | Unknown | 39 | Bacteriophage 66 | 35 | ||
| C6L | IL-1 receptor antagonist | 42 | Human papillomavirus type 14D | 32 | ||
| G10L | Ser/Thr kinase Morph | 41 | 34 | |||
| Q2L | Alpha-amanitin sensitivity | 41 | 33 | |||
| G5L | 36 kDa major membrane protein | 39 | Cyanophage phage S-PM2 | 36 | ||
| A45R | Membrane glycoprotein-class I | 37 | Maize dwarf mosaic virus | 38 | ||
| A52R | Putative Phosphotransferase/anion transport protein | 40 | Chilo iridescent virus | 35 | ||
| E12L | Small capping enzyme | 39 | Porcine rotavirus | 37 | ||
| N4R | Core package/transcription | 41 | Staphylococcusphage CNPH82 | 34 | ||
| L1L | DNA-binding protein | 42 | 34 | |||
| A49R | IL-1 signaling inhibitor | 45 | 30 | |||
| C13L | Host range virulence factor | 41 | 35 | |||
| H3R | VLTF (late transcription elongation factor) | 41 | Chilo iridescent virus | 35 | ||
| H5R | Unknown | 41 | 36 | |||
| B15L | Unknown | 42 | Tomato leaf curl Madagascar virus | 35 | ||
| E13L | Trimeric virion coat protein (rifampicin res) | 42 | 35 | |||
| F1L | Poly (A) polymerase-large (VP55) | 42 | Staphylococcus phage 187 | 35 | ||
| H1L | Predicted metallo-protease | 39 | 39 | |||
| A11L | P4a precursor | 43 | 35 | |||
| C16L | IL-1 receptor antagonist | 42 | 37 | |||
| A35R | C-type lectin-like EEV protein | 44 | African swine fever virus | 35 | ||
| A33L | ATPase/DNA packaging protein | 43 | 36 | |||
| A58R | Hemagglutinin | 48 | 33 | |||
| F4L | RNA pol (RPO30) | 41 | African swine fever virus | 41 | ||
| J4L | RAP94 (RNA pol assoc protein) | 45 | 37 | |||
| F2L | Unknown | 44 | 39 | |||
| R1L | Unknown | 41 | 42 | |||
| M3L | IFN resistance/eIF2 alpha-like PKR inhibitor | 40 | 44 | |||
| A2L | VLTF-3 (late transcription factor 3) | 48 | 37 | |||
| A7L | Virion morphogenesis | 48 | 40 | |||
| A8L | VETF-L (early transcription factor large) | 45 | 44 | |||
| A4L | P4b precursor | 40 | 49 | |||