Literature DB >> 19464330

Poxvirus protein evolution: family wide assessment of possible horizontal gene transfer events.

Mary R Odom1, R Curtis Hendrickson, Elliot J Lefkowitz.   

Abstract

To investigate the evolutionary origins of proteins encoded by the Poxviridae family of viruses, we examined all poxvirus protein coding genes using a method of characterizing and visualizing the similarity between these proteins and taxonomic subsets of proteins in GenBank. Our analysis divides poxvirus proteins into categories based on their relative degree of similarity to two different taxonomic subsets of proteins such as all eukaryote vs. all virus (except poxvirus) proteins. As an example, this allows us to identify, based on high similarity to only eukaryote proteins, poxvirus proteins that may have been obtained by horizontal transfer from their hosts. Although this method alone does not definitively prove horizontal gene transfer, it allows us to provide an assessment of the possibility of horizontal gene transfer for every poxvirus protein. Potential candidates can then be individually studied in more detail during subsequent investigation. Results of our analysis demonstrate that in general, proteins encoded by members of the subfamily Chordopoxvirinae exhibit greater similarity to eukaryote proteins than to proteins of other virus families. In addition, our results reiterate the important role played by host gene capture in poxvirus evolution; highlight the functions of many genes poxviruses share with their hosts; and illustrate which host-like genes are present uniquely in poxviruses and which are also present in other virus families.

Entities:  

Mesh:

Substances:

Year:  2009        PMID: 19464330      PMCID: PMC2779260          DOI: 10.1016/j.virusres.2009.05.006

Source DB:  PubMed          Journal:  Virus Res        ISSN: 0168-1702            Impact factor:   3.303


Introduction

Poxviruses

The Poxviridae are a large family of double stranded DNA viruses whose members have a linear genome of 130–300 kbp, and replicate in the cytoplasm of eukaryote cells. The poxvirus family is composed of two subfamilies: the Entomopoxvirinae, comprised of viruses that infect insects, and the Chordopoxvirinae, comprised of viruses that infect vertebrates. Both subfamilies are further divided into genera, groups of viral species with genetic and antigenic similarity to one another. Chordopoxviruses are categorized into 9 genera: Avipoxvirus, Capripoxvirus, Cervidpoxvirus, Leporipoxvirus, Molluscipoxvirus, Orthopoxvirus, Parapoxvirus, Suipoxvirus, and Yatapoxvirus. Entomopoxviruses are categorized into the genera Alphaentomopoxvirus, Betaentomopoxvirus, and Gammaentomopoxvirus. The genus Orthopoxvirus is the most well characterized, and contains the species Variola virus, isolates of which are the causative agent of smallpox, as well as the species Vaccinia virus, containing less virulent but better studied viruses. The poxvirus family is postulated to have a common evolutionary origin with four other families of large eukaryotic DNA viruses, collectively referred to as nucleocytoplasmic large DNA viruses (NCLDV) (Iyer et al., 2001). These families include Asfarviridae, containing viruses which infect both pig and parasitic arthropods, Iridoviridae, whose members infect invertebrates, fish or amphibians, Phycodnaviridae, containing viruses which infect eukaryotic algae, and the recently discovered Mimiviridae, whose members are only known to infect amoeba.

Poxvirus replication cycle

Poxvirus virions are ovoid or brick-shaped, and consist of an envelope surrounding an outer membrane, which itself surrounds a densely packed and membrane bound core containing a double-stranded DNA genome, enzymes, and transcription factors. Unlike most viruses, poxvirus virions do not rely on particular cell surface receptors, but are capable of binding and penetrating the outer membrane of nearly any cell type. Virion cores are released into the cytoplasm where they immediately synthesize early mRNAs that are translated into growth factors, cell signaling and immune defense molecules, enzymes, and other factors necessary for DNA replication and intermediate transcription. Uncoating of the core next allows the DNA genome to be replicated to form concatemeric molecules along with the transcription of intermediate genes that when translated, provide late transcription factors. Subsequent transcription and translation of the late genes produces virion structural proteins, enzymes, and early transcription factors for packaging into virions (Moss, 2001). During virion formation, the concatemeric DNA genomes are resolved into individual genomes, packaged into the core membranes, and mature within the cytoplasm to form infectious mature virions (MV). These are subsequently wrapped in modified Golgi membranes and transported to the periphery of the cell via attached actin filaments. Fusion of the wrapped virions with the plasma membrane results in release of enveloped virus (EV) (Condit et al., 2006).

Viral evolution

DNA viruses have much lower mutation rates and genetic variability than RNA viruses, with nucleotide substitution rates closer to those of their hosts, on the order of 10−7 to 10−9 mutations per site per round of replication (Drake and Hwang, 2005, Duffy et al., 2008). Their resulting genetic stability, together with their high levels of host specificity have led in part to the hypothesis that many DNA viruses cospeciate with their hosts (DeFilippis and Villarreal, 2001). Intricate relationships with their hosts are evidenced by the many immunological and cellular factors these viruses have obtained through host gene capture via recombination between viral and host DNA. Such acquisition of new coding information by poxviruses may contribute to the ability of the virus to manipulate the host immune response and other cellular machinery to provide a selective advantage for virus replication. Evidence suggests many orthopoxviruses occasionally cross into other mammals from rodent reservoir populations, either as zoonotic infections of humans or via mutations that allow colonization of new host species (Esposito and Fenner, 2001, Li et al., 2007, Likos et al., 2005). Understanding species crossing events of both types is essential to understanding the threats poxviruses pose today. Investigation of evolutionary clues buried within the genome sequence of the virus, such as captured host genes left by such historical host interactions, may help us to better understand mechanisms of zoonotic infection, viral tropism, evolutionary adaptation, and pathogenesis.

Poxvirus recombination

In general, recombination may occur via homologous recombination, site-specific recombination, or non-homologous end joining, and may be between viral genomes or between a viral genome and some other genetic entity, such as the genome or cDNA of the viral host, or a co-infecting parasite or a plasmid. Recombination may be an important source of genetic variation among viruses, where it is often associated with rapid evolutionary divergence, due to the potential of providing a selective advantage much more quickly than through the accumulation of point mutations. Recombination has been detected in both DNA and RNA viruses including species of the families Caulimoviridae, Flaviviridae, Herpesviridae, Papillomaviridae, Picornaviridae, Potyviridae, Poxviridae, Polyomaviridae, Retroviridae, the genus Tobamovirus, and bacteriophages in the order Caudovirales (DeFilippis and Villarreal, 2001). Such evidence has led to the “modular” theory of virus evolution, whereby many viral genomes represent mosaics of genetic material obtained through multiple recombination events (Botstein, 1980, Shackelton and Holmes, 2004).

Roles of host-acquired proteins

Acquisition of host genes and apparent selection for maintenance of those genes has been documented in many virus families (Iyer et al., 2001, McFadden and Murphy, 2000). Many of the apparently host-derived genes fall into one of two well-defined categories of gene function: immunomodulatory genes and genes involved in nucleic acid metabolism. Viral proteins that are very similar to host genes are documented to interfere with a variety of host immune defense mechanisms including antigen display, cytokines and their receptors, cytoplasmic signaling resulting from immune activation, and genes involved in resistance of cells to oxidative stress and apoptosis (Hughes and Friedman, 2005, Shackelton and Holmes, 2004). Many DNA viruses encode genes involved in nucleic acid metabolism, with which they redirect the host nucleotide precursor pool to viral DNA synthesis (Iyer et al., 2006). These enzymes are often clearly similar to host enzymes, and are often very highly conserved, probably due to functional restraints on structure and biochemical properties. There are many additional, possibly host-derived genes, whose functions have not yet been fully explored, but based at least on similarity to other proteins, seem to manipulate various intracellular processes to facilitate steps in the viral life cycle. Examples of these are genes involved in signaling pathways, lipid and carbohydrate metabolism, vesicle transport, and protein–protein interactions (Afonso et al., 2000, Geserick et al., 2004, Laidlaw et al., 1998, Werden and McFadden, 2008).

Detection and analysis of horizontally transferred proteins

Several methods are available to detect genes that may have been horizontally transferred into virus genomes from hosts or other sources. These include phylogenetic inference, compositional features such as codon and nucleotide bias, and patterns of presence and absence of genes within genomes. A well accepted and widely used method to detect horizontal gene transfer (HGT) is demonstration of phylogenetic clustering of the gene of interest with taxa unrelated to the current genome in which the gene is found, to the exclusion of taxa more closely related to the current genome. This method provides information about the potential donor and recipient organisms, but its potential caveats include limited phylogenetic samples, undetected presence of paralogs, and unequal rates of evolution between lineages (Katz, 2002). Compositional features that may be used to detect recently horizontally acquired genes include nucleotide composition, oligonucleotide frequencies, and codon usage (Koonin and Wolf, 2008), but these methods work only for very recent HGT because the anomalous signatures of such genes decay rapidly due to continued evolution of the host genome (Katz, 2002, Koonin and Wolf, 2008, Monier et al., 2007), and these methods do not give information about the donor lineage (Katz, 2002). Presence of a gene within only a related subset of a taxonomic group is a possible indicator of HGT if apparent orthologs of the gene are present in unrelated taxa. Sequence similarity alone is not accepted as a definitive demonstration of HGT or of close evolutionary relationship, since, for example, such results may be dependent on sampling biases present in the search databases used (Koski and Golding, 2001). However, sequence similarity measures can be a powerful tool for scanning very large amounts of data to find promising individual protein candidates for further analysis. Such sequence similarity analyses may also provide evidence for possible large-scale evolutionary trends across an entire virus taxon. This report therefore presents our effort to assess overall trends in HGT for members of the family Poxviridae and to identify individual poxvirus genes that show evidence of HGT for more detailed, subsequent studies.

Materials and methods

Protein databases for various taxonomic groups were assembled and searched using BLASTP for best matches to query sets of viral proteins. Results were processed with perl scripts, and displayed in two-dimensional taxonomic group plots. This straightforward method of visually comparing two sets of BLAST scores for a set of proteins has been utilized previously to compare proteins of a single genome to the proteins of two other genomes (NCBI, 2006, Rasko et al., 2005), and to compare proteins of one taxonomically grouped set of genomes to the proteins of two other taxonomically grouped sets of genomes (Lefkowitz et al., 2006).

Taxonomic group plots

Protein sequences were downloaded from GenBank (Benson et al., 2008) and from the Viral Bioinformatics Resource Center (Lefkowitz et al., 2005, VBRC, 2008) and were sorted by taxonomic divisions into BLAST-formatted databases. Datasets were downloaded in June 2007 and August 2007. These datasets from 2007 were the primary datasets used for the analyses presented in this report. Datasets were again downloaded in March 2009 and the analyses repeated to detect any significant changes that might occur in the results due to additions and changes in GenBank sequences. No significant differences were obtained. (Table S1 provides a list of downloaded protein groups used for analysis.) The set of all proteins predicted to be encoded by poxviruses, and subsets of this set, were used to query individual taxonomic databases using NCBI command line blast version 2.2.15. The single best BLASTP hit (based on BLAST bit score) for each query protein against each database was identified. Query proteins that gave low scores against each taxonomic database and would therefore be plotted, along with every other insignificant hit, near the origin of the graph, were removed from the analysis. The threshold for exclusion from the analysis was usually a bitscore of less than 60 with an E value of 10−5 or larger. Results for each query protein were plotted on two-dimensional graphs with axes corresponding to mutually exclusive taxonomic databases. This allowed visual comparison of the query set's relationship to one taxonomic group to its relationship to another taxonomic group. This was done with a custom Java applet. Normalization of bitscores using alignment length, or use of other measures such as percent identity or similarity, did not significantly change the results.

Interpretation of results

When the two taxonomic group databases being compared yield similar scores for a query protein, this indicates that each group contains at least one protein with about the same degree of pairwise similarity to the query protein. A set of query proteins with scores that are similar between the two databases creates a diagonal between the two axes. When a point lies closer to one axis than to the other, this indicates that its best blast hit in the taxonomic group represented by the nearer axis has a greater degree of pairwise similarity to the query protein than its best match in the taxonomic group representing the opposing axis. Any tendency of a set of query proteins to skew towards a particular taxonomic group might suggest a common evolutionary origin for those sequences either through descent from a common ancestor, or through multiple horizontal gene transfer events. Each point on the graph is plotted based on the score of its single highest scoring hit in each of the target databases. In many cases, the target database provides many hits with scores that are nearly as high as the score for the single best hit. So the identity of the protein with the single best hit is only one representative of the group of all proteins with hits that exhibit closely related scores. The BLASTP bitscore is one of several metrics that can provide an indication of the degree of similarity between two proteins. No pairwise sequence metric can definitively establish an evolutionary relationship between two protein sequences, but many, including bitscore, can give clues regarding protein similarities. The similarities can be collectively examined to gauge general trends in the similarity between proteomes, and individual similarities might be suggestive of evolutionary relationships between proteins, which may then be followed up using more rigorous methods of investigation, such as phylogenetic analyses, to assess the nature and likelihood of possible evolutionary relationships between individual proteins.

Phylogenetic analyses

Phylogenetic analyses were conducted on a small number of poxvirus proteins suggested by taxonomic group plots as having potentially interesting evolutionary histories. Each poxvirus protein was aligned with the protein providing its best BLASTP score as depicted in the plot, along with similar sequences from representative taxa. Sequences were aligned using the CLUSTALW algorithm (Thompson et al., 1994) implemented in MEGA version 4 (Tamura et al., 2007). Consensus phylogenetic trees were constructed by the Maximum Parsimony method using MEGA version 4, by the Maximum Likelihood method using Garli 0.96 (Zwickl, 2006), and by Bayesian inference using MrBayes 3.12 (Ronquist and Huelsenbeck, 2003).

Results

Our initial analysis was performed using as a query set, all proteins predicted to be encoded by all species and isolates in both the chordopoxvirus and entomopoxvirus subfamilies of the Poxviridae. This query set was used to probe three protein databases: all proteins encoded by eukaryotes, all proteins encoded by bacteria, and all virus-encoded proteins except those encoded by poxviruses. The results from the viral protein database were plotted against results from eukaryote proteins (Fig. 1A) and against results from bacterial proteins (Fig. 1B), and results from the eukaryote and bacterial protein databases were plotted against each other (Fig. 1C). Overall, the resulting plots show that chordopoxvirus proteins tend to exhibit greater similarity to eukaryotic proteins than to bacterial or viral proteins, suggesting that many poxvirus proteins may share a common evolutionary origin with proteins of their eukaryotic hosts. The plots in Fig. 1 distinguish between chordopoxvirus and entomopoxvirus subsets of poxvirus proteins. Although entomopoxviruses share several of the host-like genes present in chordopoxviruses, entomopoxvirus proteins do not show the same general skew towards greater similarity with eukaryotic proteins in comparison to other viral proteins. This could be due to the relative shortage of insect sequences in GenBank or to a bias of entomopoxviruses towards trading genes with other insect viruses over acquiring them from hosts.
Fig. 1

(A) Best BLASTP scores of poxvirus proteins against all eukaryote proteins vs. against proteins of all viruses except poxviruses. Inset shows regions described in the text. (B) Best BLASTP scores of poxvirus proteins against all bacteria proteins vs. against proteins of all viruses except poxviruses. (C) Best BLASTP scores of poxvirus proteins against all eukaryote proteins vs. against all bacteria proteins. Plotted chordopoxvirus proteins (6443 points) are represented by black squares, and plotted entomopoxvirus proteins (278 points) are represented by red squares. Notable points mentioned in the text are circled. Points circled in blue are proteins of an avian retrovirus integrated into the genome of fowlpox virus. The cluster of points circled in purple are the large subunit of ribonucleotide reductase.

(A) Best BLASTP scores of poxvirus proteins against all eukaryote proteins vs. against proteins of all viruses except poxviruses. Inset shows regions described in the text. (B) Best BLASTP scores of poxvirus proteins against all bacteria proteins vs. against proteins of all viruses except poxviruses. (C) Best BLASTP scores of poxvirus proteins against all eukaryote proteins vs. against all bacteria proteins. Plotted chordopoxvirus proteins (6443 points) are represented by black squares, and plotted entomopoxvirus proteins (278 points) are represented by red squares. Notable points mentioned in the text are circled. Points circled in blue are proteins of an avian retrovirus integrated into the genome of fowlpox virus. The cluster of points circled in purple are the large subunit of ribonucleotide reductase.

Poxvirus proteins: eukaryote vs. virus axes

The prominent very high scoring proteins that skew towards the virus protein axis in both plots (Fig. 1A and B) are encoded by the copy of the avian retrovirus, reticuloendotheliosis virus, which has integrated into the genome of fowlpox virus, of the Avipoxvirus genus (Hertig et al., 1997). These proteins score very high against the virus database because they are identical to those encoded by reticuloendotheliosis virus. Their closest cousins among eukaryote-encoded proteins, also providing high blast bitscores, are those coded for by endogenous retroviruses of pig, koala and possum. Many proteins lie in clusters, which are nearly always made up of orthologous proteins from various poxvirus species. Because of slight sequence variations between orthologs, the proteins in a cluster get slightly different scores against the target database proteins, but still close enough to form an orthologous cluster. One example is the cluster of ribonucleotide reductase large subunit (RNR1) poxvirus orthologs that is circled in Fig. 1A–C. The proteins in Fig. 1A segregate into six categories based on their location on the plot: (A) along, or very close to the virus axis; (B) in the region between the diagonal and the virus axis; (C) on the diagonal between the two axes; (D) in the region between the diagonal and the eukaryote axis; (E) along or close to the eukaryote axis; and (F) proteins that fall near the origin and therefore do not exhibit significant sequence similarity to any proteins from other virus families or from eukaryotic species. Each category has its own range of values for the ratio of virus similarity to eukaryote similarity. For example, proteins in category A have recognizable sequence similarity to proteins of other viruses, as compared to their insignificant levels of similarity to proteins of eukaryotes, while category E is just the inverse, with a high eukaryote-to-virus sequence similarity ratio. Category C proteins have relatively equal levels of sequence similarity to proteins of both viruses and eukaryotes, and regions B and D on either side of the diagonal have recognizable similarity to proteins in both the eukaryote and virus databases, but get a higher score to one database than to the other. Poxvirus proteins plotted in the same region may have similar scores or similar score ratios, but they are not necessarily similar to one another in any other way, either by sequence similarity, by sequence length, by distribution among poxviruses, or by the species in either searched database which provide their closest match. Region F contains the majority of points, with 77% of points. A breakdown by numbers and percentages of points in each region of the plots in Fig. 1 is shown in Table 1 . A table of poxvirus proteins present in each region of Fig. 1A is available as supplemental Table S2. Table S2 identifies the taxonomic subset of poxviruses that encode each protein, the eukaryotic and/or virus species that exhibit the best scores to the poxvirus protein(s) and what is known about the function of the protein. This approach to evolutionary classification of poxvirus proteins is similar to that used to classify proteins of molluscum contagiosum virus (Senkevich et al., 1997).
Table 1

Number and percentage of poxvirus points in each region of Fig. 1.

RegionNo. of points% of points
Fig. 1A: X axis eukaryote proteins, Y axis proteins of all viruses except poxviruses
 A, near Y axis6572.3%
 B, between Y axis and diagonal650.2%
 C, near diagonal1,6575.7%
 D, between diagonal and X axis1,3774.8%
 E, near X axis2,96510.3%
 F, near origin22,19076.8%



Fig. 1B: X axis bacteria proteins, Y axis proteins of all viruses except poxviruses
 A, near Y axis1,6955.8%
 B, between Y axis and diagonal6192.1%
 C, near diagonal1,0653.7%
 D, between diagonal and X axis6222.1%
 E, near X axis1,3074.5%
 F, near origin23,78381.8%



Fig. 1C: X axis eukaryote proteins, Y axis bacteria proteins
 A, near Y axis2580.9%
 B, between Y axis and diagonal450.2%
 C, near diagonal1,0003.4%
 D, between diagonal and X axis1,8826.5%
 E, near X axis3,37111.6%
 F, near origin22,53577.5%
Number and percentage of poxvirus points in each region of Fig. 1. The following sections outline representative poxvirus proteins from each category identified in Fig. 1A, identifying and discussing them in terms of their similarity to proteins from other virus families and/or eukaryotic species, their degree of distribution among poxvirus species, and their general category of function or putative function.

Region A: points near the virus axis

Poxvirus proteins that fall in this region of the plot have significant levels of sequence similarity to proteins of viruses in other virus families, but have no similarity to proteins of eukaryotes. For each poxvirus protein and its high scoring non-poxvirus protein or proteins, this high level of similarity could be due to a shared evolutionary origin, or to convergent evolution of proteins serving the same role in viruses with similar evolutionary niches. The highest scoring chordopoxvirus proteins along the virus axis are the large group of homologues of the variola virus protein B22R, whose high scores against the virus database result from a single possible relative of this protein present in cyprinid herpesvirus 3 (CyHV-3), a recently discovered member of the family Alloherpesviridae which is notable for having several genes with unexpected high levels of similarity to poxvirus genes (Ilouzea et al., 2006). B22R is present in every chordopoxvirus genus except parapoxvirus, and is the largest protein encoded by poxviruses. While its function is still unknown, it is predicted to contain carboxyl-terminal transmembrane domains and cysteine residues which may mediate disulfide bond formation (Tulman et al., 2006). The position of this protein in a sparsely populated area of the plot, and its potential for relationship to a protein in a herpesvirus makes it a good candidate for further investigation by phylogenetic analysis. The consensus tree of the high scoring sequence from CyHV-3 and representative poxvirus sequences in Fig. 2A shows that a horizontal transfer event may have occurred between virus predecessors of crocodile poxvirus and CyHV-3.
Fig. 2

Phylogenetic reconstructions to investigate evolutionary histories of three poxvirus proteins appearing in different regions of the plot in Fig. 1A. All pictured trees were constructed by the method of Bayesian inference using MrBayes. The resulting topology for each tree agrees exactly with topology produced from the same alignment by the Maximum Likelihood method using Garli, and either agrees exactly or is very similar to topology produced by the Maximum Parsimony method using MEGA. MrBayes simulations for all three alignments were run with the GTR nucleotide substitution model and gamma distributed rate variation with an estimated proportion of invariable sites. The legend below each tree shows the scale for branch lengths as measured in expected nucleotide substitutions per site. The number to the right of each taxon name is the protein GI number for that sequence. (A) Variola virus B22R (plotted in region A near the virus axis) is a large surface glycoprotein and appears outside the poxvirus family only in the carp herpesvirus CyHV-3. (B) The interleukin-10 inhibitory cytokine (plotted on the diagonal) is evidently of eukaryote origin but has several apparent homologs in diverse virus genomes, potentially acquired in distinct gene transfer events. (C) Monoglyceride lipase (plotted in region E near the eukaryote axis) is an enzyme which may facilitate use of cellular fatty acids, and may have been acquired from a fish or reptilian host by a poxvirus ancestral to the orthopoxvirus and yatapoxvirus genera.

Phylogenetic reconstructions to investigate evolutionary histories of three poxvirus proteins appearing in different regions of the plot in Fig. 1A. All pictured trees were constructed by the method of Bayesian inference using MrBayes. The resulting topology for each tree agrees exactly with topology produced from the same alignment by the Maximum Likelihood method using Garli, and either agrees exactly or is very similar to topology produced by the Maximum Parsimony method using MEGA. MrBayes simulations for all three alignments were run with the GTR nucleotide substitution model and gamma distributed rate variation with an estimated proportion of invariable sites. The legend below each tree shows the scale for branch lengths as measured in expected nucleotide substitutions per site. The number to the right of each taxon name is the protein GI number for that sequence. (A) Variola virus B22R (plotted in region A near the virus axis) is a large surface glycoprotein and appears outside the poxvirus family only in the carp herpesvirus CyHV-3. (B) The interleukin-10 inhibitory cytokine (plotted on the diagonal) is evidently of eukaryote origin but has several apparent homologs in diverse virus genomes, potentially acquired in distinct gene transfer events. (C) Monoglyceride lipase (plotted in region E near the eukaryote axis) is an enzyme which may facilitate use of cellular fatty acids, and may have been acquired from a fish or reptilian host by a poxvirus ancestral to the orthopoxvirus and yatapoxvirus genera. Nucleoside triphosphatase I (NPH-1) transcription termination factor is the only protein appearing along the virus axis that is encoded by both entomopoxviruses and chordopoxviruses. This protein is found in most chordopoxvirus genera as well as in Melanoplus sanguinipes entomopoxvirus (MSEV), and all versions get best scores against viruses of the NCLDV group. Among proteins along the virus axis with scores above 100 (approximate E values less than 10−22), there are 11 groups of orthologous proteins encoded by the entomopoxviruses, and 5 orthologous groups of proteins encoded by the chordopoxviruses. The highest scoring points include entomopoxvirus DNA and RNA repair enzymes, RNA ligase, and NAD+-dependent DNA ligase. Some entomopoxvirus proteins plotted in this region get their highest scores against proteins of viruses in the NCLDV group, but equally as many get high scores against proteins of viruses in the family Baculoviridae, where several, including the Fusolin/gp37 protein, and the Methionine-threonine-glycine (MTG) motif gene family member appear to enhance virus infectivity of the insect host (Dall et al., 2001). While many of the entomopoxvirus proteins plotted in this region score very high against proteins in other viruses, they are of unknown function, and contain no characterized domains.

Region B: points between the virus axis and the diagonal

Proteins plotted in this region have relatively high sequence similarity to proteins of other viruses as compared to their levels of similarity to eukaryote proteins. These poxvirus proteins may have a shared evolutionary origin with both virus and eukaryote ancestors, with greater similarity between the virus homologs due to similar evolutionary selection pressure and functional constraints on the virus genes, in contrast to the selection pressure on the eukaryotic versions of the protein. Poxvirus proteins in this category may share only one or a few protein domains with similar eukaryotic proteins, while best hits with proteins from other virus families exhibit similarity across the entire protein sequence. Besides the proteins encoded by the reticuloendotheliosis virus integrated into fowlpox virus, the only points with scores above 100 which fall into region B, between the virus axis and the diagonal are encoded by members of the species Canarypox virus. CNPV153 has closest match with the viral replication protein, Rep, of members of the family Circoviridae, and the CNPV227 N1R/p28-like protein has closest match to acanthamoeba polyphaga mimivirus, of the family Mimiviridae.

Region C: points near the diagonal

Region C, surrounding the diagonal, contains proteins whose sequences are globally conserved throughout most DNA viruses and eukaryotes, but it also contains proteins which get a high score against sequences present in only one or a few members of the eukaryote or virus kingdom. Poxvirus proteins plotted in this region find best scores in the virus kingdom among possibly distantly related viruses, i.e. members of the NCLDV, as well as among species of other families including Herpesviridae and Adenoviridae. Many of these proteins are universally highly conserved, function in the synthesis and maintenance of DNA and RNA, and are present in many members of both the poxvirus family and the virus family in which the highest score is obtained, as well as in most eukaryotes. The ultimate origin of these proteins is uncertain, and their entries into the virus lineages may have occurred concurrently with the inception of the first ancestors of these viruses, or at many different times during the evolution of the different virus families. Other proteins plotted in this region are apparently of eukaryote origin, have functions involving immune response and intracellular processes, and seem likely to have been transferred horizontally from hosts into the corresponding virus families. The highest scoring proteins along the diagonal are the large and small subunits of ribonucleotide reductase (RNR) (class 1A), an enzyme that controls the cellular concentration of deoxyribonucleotides. Although there are three classes of RNR, only class 1, subclass A is found in eukaryote-infecting viruses. RNR class 1 is made up of large (RNR1) and small (RNR2) subunits, with two of each subunit required to associate into a heterotetramer to form a functioning enzyme (Stubbe, 1990). Both subunits are very well conserved in all major taxonomic groups in which RNR type I appears: eukaryotes, eubacteria, bacteriophages and eukaryotic viruses. The large subunit of RNR (RNR1) is present in orthopoxviruses and suipoxviruses, while the small subunit of RNR (RNR2) is present in most chordopoxviruses. For both subunits, the percent identities between queries and highest scoring hits are between 80% and 90% percent similarity, with such high levels of sequence conservation likely due to the stringent structural requirements the enzyme must maintain in order to function (Torrents et al., 2002). Although many chordopoxvirus species encode only the small subunit, it is probably functioning in association with host-encoded RNR1, based on the finding that even RNR subunits from vastly different species can associate to form heterotetramers (Hamann et al., 1998). In addition to the very high scoring RNR proteins, many other enzymes involved in nucleotide synthesis and metabolism are high on the diagonal, including deoxyuridine-triphosphatase (dUTPase), thymidine kinase (TK), thymidylate kinase (ThyK), deoxycytidine kinase, and the one example of thymidylate synthase present in poxviruses. All these enzymes catalyze steps in pyrimidine metabolism, in particular converting cellular pools of RNA components into nucleotides for synthesis of DNA. Also high on the diagonal are DNA polymerase, alpha and beta subunits of RNA polymerase, and DNA photolyase, a DNA repair enzyme well conserved in all branches of life, but notably missing from placental mammals. The poxvirus proteins in this category are widely, some even ubiquitously, distributed among poxviruses, and have most similar viral proteins outside poxviruses in a wide variety of double stranded DNA viruses, including members of the postulated NCLDV group of viruses such as the phycodnaviruses, iridoviruses and mimiviruses, as well as in viruses outside this group, such as adenoviruses and herpesviruses. Eukaryotic best hits come from an even wider range, spanning everything from fungi and plants to vertebrates and invertebrate animals. These types of proteins fulfill basic needs of DNA viruses and all organisms with DNA genomes, and both their omnipresence in nature and the high levels of sequence conservation can be confounding factors in attempts to phylogenetically trace their individual evolutionary lineages. Many poxvirus proteins plotted on the diagonal have limited distribution among poxviruses and have best virus hits almost exclusively in putatively unrelated viruses, such as members of the baculovirus and herpesvirus families. Proteins in this category, which probably participate in downregulation of the host immune response, include interleukin-10 (IL-10) proteins and complement-control proteins. This category also includes semaphorins and c-type lectin-like proteins whose functions in poxviruses are unknown, but similar proteins in other organisms have roles in immunological pathways. Poxvirus encoded apoptosis-inhibiting proteins and copper/zinc superoxide dismutase protect infected cells against programmed cell death. These virally encoded proteins find highest scores among eukaryotes which seem likely to be hosts or closely related to hosts of the respective viruses, which make the viral proteins seem likely to be the products of independent horizontal gene transfer events from hosts. Although actual assessments of such potential gene transfers may be provided only by further analysis of each gene group, notably by phylogenetic inference, the locations of the points on these plots and the identities of the highest scoring proteins on each axis suggest candidates for study, and provide clues as to which proteins may yield the most interesting results. A good candidate for further study is IL-10, presumably of eukaryotic origin, but with several apparently homologous proteins among poxviruses and herpesviruses. A phylogenetic reconstruction of several viral and host IL-10 sequences is provided in Fig. 2B. Analysis of the phylogenetic relationship between these proteins suggests the possibility of several independent IL-10 HGT events between hosts and infecting viruses. Three HGT events are suggested into different lineages of herpesviruses, and two separate HGT events are suggested for poxviruses, with one each into the capripoxvirus and parapoxvirus lineages. It is notable that for many of these HGT events, the most closely related eukaryote IL-10 protein to a specific virus IL-10 protein is between the particular host species and the virus that infects that host. A few orthologous groups of proteins plotted in this region have functions unrelated to DNA/RNA/nucleotide synthesis and have closest viral hits in viruses of the NCLDV group. The eukaryote species providing the highest scores to these proteins seem unlikely to be hosts of the respective poxviruses. 3-Beta-hydroxysteroid dehydrogenase proteins are widely distributed among poxvirus genera, are likely used to suppress the host inflammatory response, and find most similar virus proteins in fish-infecting iridoviruses. Orthopoxvirus and entomopoxvirus species encode a protein called vaccinia-related ser/thr kinase, which is widely distributed in the animal kingdom and seems to participate in regulation of cell cycle (Kang et al., 2008) and has closest virus relatives in iridovirus species. A few proteins with very limited poxvirus distribution have unknown functions and highest pairwise similarity to proteins of NCLDV member species. The ultimate evolutionary origins of these proteins are unknown.

Region D: points midway between the diagonal and the eukaryote axis

This region of the plot is one of the most densely populated, with many poxvirus proteins that show significant hits to eukaryotic proteins and lower scores to homologs in other viruses. As with region C, region D contains proteins whose high scores seem due to universal sequence conservation, as well as proteins of presumably eukaryote origin, whose high scores on both axes most likely reflect historical transfer of these genes by separate routes into poxviruses and other virus families. Several poxvirus protein families have points in both regions C and D, including the vaccinia-related kinase family, the c-type lectin-like proteins, the TK enzymes, and ankyrin repeat proteins. As with their orthologs in region C, best virus matches for these are found both among NCLDV and non-NCLDV DNA viruses. All these are encoded by viruses in many poxvirus genera, and get best eukaryote hits among a variety of animals. The best scoring protein sequences in region D of the plot are the ATP-dependent DNA ligases encoded by several poxvirus genera. These all have higher pairwise identity to proteins of various mammals than to their best virus hits, which are all among putatively unrelated nucleopolyhedrovirus (NPV) species, a group of viruses in the baculovirus family. This is the only DNA-related enzyme unique to this region of the plot. Two additional region D proteins with wide distribution among poxvirus genera may have functions modulating host immune response. These are G protein-coupled receptors (GPCR) with significant similarity to known CC chemokine receptors, and proteins in the serpin superfamily of proteinase inhibitors, which are implicated in the regulation of tumor progression, of inflammation, and of cell death (Silverman et al., 2001, Viswanathan et al., 2009). Various mammals provide the best eukaryotic blast scores for most of these sequences, but some avipoxvirus proteins score best to a chicken protein. Herpesviruses provide best virus scores to most of the GPCR proteins, while mimivirus gives the best score fort all the serpins, as it has the only known viral serpin outside the poxvirus family. A third set of proteins, the soluble tumor necrosis factor receptor (TNFR) II homologs, and has slightly less widespread poxvirus distribution. These putatively protect infected cells from TNF-mediated cell death, and get highest scores to proteins of various mammals, and to viruses of the herpesvirus and iridovirus families. Proteins with very limited distribution among poxvirus genera include a protein similar to eukaryotic initiation factor-4a (eIF-4a) and a protein possibly functioning as an oligoribonuclease, both encoded by diachasmimorpha longicaudata entomopoxvirus, the first known symbiotic entomopoxvirus, which infects a parasitic wasp. The best eukaryotic scores for these proteins come from potentially host-like species and both find best virus scores against NCLDV members. A possible dual specificity protein phosphatase, encoded by canarypox virus, and a protein similar to human MHC Class I, encoded by squirrel poxvirus, are plotted in this region for blast scores against proteins of vertebrates and non-NCLDV viruses. MHC Class I-like proteins encoded by poxviruses of several other genera are plotted very close to the origin due to low pairwise sequence identity to their best matches on both axes. MSEV and squirrel poxvirus each encode a sequence of unknown function, and which, although they do not share high identity with one another, may both be chromosome segregation ATPases.

Region E: points near the eukaryote axis

Region E contains poxvirus proteins that get notable scores against eukaryotic proteins and essentially insignificant scores against viral proteins. Poxvirus proteins that appear in this region are most likely of eukaryotic origin and have been transferred into poxviruses or ancestors of poxviruses, but have not been transferred or at least not maintained in sequenced viruses of any other present day virus family. Members of the poxvirus family may be the only virus species carrying these eukaryotic genes simply because poxviruses are more effective at capturing or maintaining host genes than other viruses. Alternatively, many of these genes may be absent from other viruses since they confer little or no selective advantage to these viruses, but do confer selective advantage to poxviruses due to unique aspects of their biology. Poxvirus proteins plotted in this region include enzymes involved in lipid and carbohydrate metabolism, nucleotide metabolism, protection against oxidative damage, and intracellular processes including signaling, cell cycle control and apoptosis. The few proteins in this region which have wide distribution among poxvirus genera are kelch proteins and tyrosine protein kinase-like proteins. These proteins have unknown functions, and get best scores to proteins in a variety of vertebrates. Orthologous groups of proteins from avipoxviruses appear more often in this region than proteins from any other genus. Several of these proteins have unknown function, but the functional characterizations of the others span the whole range of functions attributed to proteins of region E. Each orthologous group of avipoxvirus proteins gets best scores against a variety of eukaryotes, mostly vertebrates. Again, assessments of potential horizontal gene transfers may be provided only by detailed phylogenetic analysis of each gene group, but the wide range of vertebrates providing highest scores for each orthologous group is notable, and preliminary phylogenetic analyses (data not shown) may indicate that, although they score very well against vertebrate proteins, many of these avipoxvirus proteins may have begun diverging from the original host-acquired proteins in the ancient past. Glutathione peroxidase protects against oxidative damage, and is the only avipoxvirus protein in region E that is also encoded by another poxvirus genus. Molluscum contagiosum virus encodes an ortholog of glutathione peroxidase that gets its highest blastp score against a similar protein in macaque, while the avipoxvirus sequences get highest scores against insect versions of the protein. As with the avipoxvirus proteins, many of these proteins may have been transferred into the poxvirus lineage in the relatively distant past, from early vertebrates. The phylogenetic tree of the enzyme monoglyceride lipase (Fig. 2C), which appears in this region of the plot, provides evidence that the origin of the poxvirus homolog may represent a more ancient gene transfer into a poxvirus ancestor from an unknown host. Many orthologous groups of proteins in this region have best blast scores scattered over a wide range of vertebrates, rather than among a narrowly defined group of species related to a potential HGT source. However, with only two exceptions both encoded by avipoxvirus species, all proteins find best scores against vertebrates, rather than against the wide variety of metazoa which provide the best scores for many of the potentially more universally conserved proteins plotted closer to the diagonal.

Region F: points near the origin

Approximately 77% of poxvirus proteins fall very close to the origin of this plot. These include genes that may be unique to the poxvirus family, as well as genes that in poxviruses have primary sequences too divergent to achieve high blastp scores against potentially orthologous proteins outside poxviruses. Examples of the former, poxvirus-specific genes include a DNA-binding phosphoprotein (Cop-F17R) and a structural protein (Cop-A12L). Examples of the latter, sequence-diverged genes, are a putative ATPase (Cop-A32L) and a capsid protein (Cop-D13L), both postulated to have orthologs in all members of NCLDV and included in the originally proposed core NCLDV genes (Iyer et al., 2001).

Poxviruses: bacteria vs. virus axes

In addition to the comparison of poxvirus proteins to proteins of eukaryotes and other viruses, we also compared the similarity of poxvirus proteins to proteins of bacteria and other viruses (Fig. 1B). For almost all poxvirus proteins, bacteria provide lower pairwise scores than eukaryotes. Notably, most of the large groups of proteins that lie on and below the diagonal in Fig. 1A skew in Fig. 1B towards the virus axis due to the absence of similar proteins in the bacterial kingdom. With the exception of one entomopoxvirus protein, all proteins between the virus axis and the diagonal got higher scores against eukaryotes (Fig. 1A) than they get against bacteria. The exception is NAD+-dependent DNA ligase, encoded by MSEV and amsacta moorei entomopoxvirus (AMEV), which gets slightly higher scores against a sulfur-oxidizing bacterium and a fish-infecting mycoplasma than against its eukaryote best hits in amoeba. The unique status of this point on the plot marks it as a potentially interesting candidate for additional investigation. Preliminary analyses (data not shown) indicate that while apparent homologs of this gene are found predominately in bacterial genomes, a few are also found among species of bacteriophage and NCLDV, indicating a potential for interesting horizontal gene transfer events. These and all other such suggested relationships must of course be rigorously tested by phylogenetic analysis to provide the most reliable assessment of gene transfer pathways. As in Fig. 1A, the diagonal in Fig. 1B contains several proteins highly conserved throughout nature. Fig. 1B also contains several proteins that in Fig. 1A were below the diagonal, showing high similarity to eukaryote proteins, but with scores against bacteria proteins more comparable to other virus proteins thus shifting them to the diagonal in Fig. 1B. Nearly all points below the diagonal in Fig. 1B exhibit high bitscores against both bacterial and eukaryote proteins, although the eukaryote scores are usually higher. Among these proteins, all proteins with significant scores against virus proteins have mimivirus proteins as their best virus scores—possibly not surprising considering the many bacteria-like features of the mimivirus genome. Poxvirus proteins plotted near the bacterial axis have similar scores with their best eukaryote protein hits. This region contains more avipoxvirus genes than genes from any other poxvirus genus. The only proteins in this region to get better bacterial than eukaryote scores come from the entomopoxvirus subfamily. There are the two different leucine-rich repeat (LRR) proteins encoded by AMEV, which get moderately good scores against eukaryotes yeast and plants, but get somewhat better scores against both a gram-negative anaerobic bacterium and a symbiotic green sulfur bacterium.

Poxvirus proteome subset

Although individual poxviruses usually contain more than 150 genes, only 49 of these are present in all of the fully sequenced poxviruses, with larger subsets being shared among members of each genus (Lefkowitz et al., 2006). In poxvirus genomes, the conserved “core” genes are involved in key functions such as replication, transcription and virion assembly, and tend to cluster in the central region of the linear genome, while genes that are unique to specific genera or species are distributed towards the two ends of the genome. Many of these peripheral genes encode proteins that manipulate host immune response and cellular processes, including apoptosis, antigen presentation and recognition, interferon functions and immune signaling processes. Cowpox virus strain Gri-90 has one of the largest genomes among orthopoxviruses, and contains essentially all genes found in other members of the genus. For this reason, it serves well as an archetypical orthopoxvirus genome for the purpose of orthopoxvirus gene analysis. All proteins of this strain were analyzed by taxonomic group plots, to compare the relationships of core and non-core protein subsets with eukaryotes and with viruses outside the poxvirus family (Fig. 3 ). In Fig. 3A, proteins were classified according to genomic location, as located centrally (red points) or non-centrally (black points), where the central region of the genome is defined as all genes from G13L to A47L. In Fig. 3B, proteins were classified according to the number of poxvirus species with conserved orthologs, with red points representing the most widely conserved proteins among poxviruses, and proteins of most limited distribution in black.
Fig. 3

All proteins of cowpox strain GRI-90 were analyzed by taxonomic group plots, to compare the relationships of core and non-core protein subsets with proteins of eukaryotes and with proteins of viruses outside the poxvirus family. Panel (A) represents proteins classified according to genomic locus, as non-centrally located (black squares, 99 points) or centrally located (red squares, 115 points). Panel (B) represents proteins classified according to the number of poxvirus species with conserved orthologs, with genes in only 1–10 species in black (20 points), genes in 11–20 species in purple (59 points), genes in 21–30 species in blue (28 points), genes in 31–35 species in green (21 points), and genes in 36–40 species in red (86 points).

All proteins of cowpox strain GRI-90 were analyzed by taxonomic group plots, to compare the relationships of core and non-core protein subsets with proteins of eukaryotes and with proteins of viruses outside the poxvirus family. Panel (A) represents proteins classified according to genomic locus, as non-centrally located (black squares, 99 points) or centrally located (red squares, 115 points). Panel (B) represents proteins classified according to the number of poxvirus species with conserved orthologs, with genes in only 1–10 species in black (20 points), genes in 11–20 species in purple (59 points), genes in 21–30 species in blue (28 points), genes in 31–35 species in green (21 points), and genes in 36–40 species in red (86 points). Results show that the diagonal contains universally conserved as well as species-specific genes (Fig. 3B), and contains proteins with both central and peripheral locations (Fig. 3A). However, the proteins that lie to the eukaryote side of the diagonal are predominantly non-centrally located and appear in a very limited number of species. Presence of these genes in only one or a few genera or species strongly suggests the genes were acquired by the cowpox virus lineage subsequent to its divergence (or the divergence of the most recent orthopoxvirus ancestor) from the other poxvirus genera. High scores with eukaryotic proteins may also indicate relatively recent transfer of the genes from eukaryotes, and/or strong selection for sequence identity with host proteins. The sparsely populated area near the virus axis has only proteins widely conserved among poxviruses, and these are almost exclusively centrally located, with the one exception being the poxvirus B22R protein. B22R is a surface glycoprotein that is conserved in every chordopoxvirus genus, and as mentioned above, has only one possible homolog outside the poxvirus family, in CyHV-3. A genome map of cowpox virus strain Gri-90 (Shchelkunov et al., 1998) (GenBank accession no. X94355) in Fig. 4 depicts all cowpox virus genes color coded according to the degree of similarity of each cowpox virus protein to its best hit when compared against all virus (non-poxvirus) or all eukaryotic proteins. Genes and their descriptions are provided in Table 2 . Genes are labeled by their restriction fragment name and are colored according to the highest blastp bitscore obtained by the encoded poxvirus protein when searched against the respective taxonomy database. Bitscores are normalized by dividing by the highest possible bitscore the query protein could achieve, i.e. the bitscore it receives when compared to itself. Therefore the highest possible score for each comparison is 1. The map demonstrates the higher levels of similarity poxvirus proteins have to eukaryote proteins in comparison to virus proteins outside the poxvirus family. In addition, it is apparent that with only a few exceptions, poxvirus proteins with high levels of sequence identity to proteins of other organisms tend to lie towards the edges of the linear genome. Exceptions include S2R: thymidine kinase, L4L: ribonucleotide reductase large subunit, R2L: glutaredoxin 1, and E8L: carbonic anhydrase (virion protein).
Fig. 4

A genome map of cowpox strain Gri-90 is color coded (see legend) according to the degree of similarity of each cowpox protein to its best hit when compared against all virus (non-pox) or eukaryote proteins.

Table 2

Similarity of each cowpox virus Gri-90 protein to the best blastp hit in the eukaryote and virus (non-pox) protein datasets.

RegionCPXV proteinProtein descriptionEukaryote with best scoreEuk. BitscoreVirus with best scoreVirus Bitscore
AB22RSurface glycoproteinStrongylocentrotus purpuratus51Cyprinid herpesvirus 3156
E11LNPH-I/Helicase, virionCiona intestinalis60Acanthamoeba polyphaga mimivirus135



BJ6RTopoisomerase type ILeishmania donovani infantum64Acanthamoeba polyphaga mimivirus97
E6RMorph, VETF-s (early transcription factor small)Kluyveromyces lactis67Lymphocystis disease virus 199
E5RNTPase, DNA replicationPichia stipitis CBS 605441Acanthamoeba polyphaga mimivirus60



CA41RSemaphorin/CD100 antigenHomo sapiens135Ovine herpesvirus 2146
B4RComplement control/CD46/EEVPan troglodytes89Macaca mulatta rhadinovirus 1757791
D9LC-type lectinRattus norvegicus86Rat cytomegalovirus86
O4RRNA pol (RPO147)Mus musculus201Acanthamoeba polyphaga mimivirus186
A51RThymidylate kinaseAedes aegypti167Chilo iridescent virus151
G4LRibonucleotide Reductase small subunitDanio rerio506Lymantria dispar nucleopolyhedrovirus457
F9LDNA-directed DNA polymeraseTetrahymena thermophila SB210113Human herpesvirus 7102
A47LHydroxysteroid dehydrogenaseBos taurus279Rana grylio virus 9506247
A42RLectin homologHomo sapiens62African swine fever virus54
C17LComplement binding (secreted)Bos taurus190Macaca mulatta rhadinovirus 17577166
J1LTyr/Ser phosphataseGallus gallus69Chilo iridescent virus61
C3LAnkyrinTrichomonas vaginalis G388Acanthamoeba polyphaga mimivirus76
A25RRNA pol 132 (RPO132)Aspergillus niger239Aedes taeniorhynchus iridescent virus207
G2LDeoxyUTP pyrophosphatase (dUTPase)Macaca mulatta164Spodoptera litura granulovirus138
L4LRibonucleotide reductase large subunitMus musculus1226Spodoptera litura nucleopolyhedrovirus1011
B1RSer/Thr kinaseDanio rerio246Chilo iridescent virus194



DB11RSer/Thr kinaseBos taurus154Chilo iridescent virus120
K1RAnkyrinTrichomonas vaginalis G384Acanthamoeba polyphaga mimivirus62
B18RAnkyrinTrichomonas vaginalis G379Paramecium bursaria Chlorella virus 158
B3RAnkyrinTrichomonas vaginalis G3112Acanthamoeba polyphaga mimivirus82
C7RUbiquitin Ligase/host defense modulatorHomo sapiens71Rock bream iridovirus52
M1LAnkyrin/NFkB inhibTrichomonas vaginalis G371Paramecium bursaria Chlorella virus 151
L8RRNA helicase/NPH-IICaenorhabditis elegans70Acanthamoeba polyphaga mimivirus50
C1LAnkyrinStrongylocentrotus purpuratus64Acanthamoeba polyphaga mimivirus45
A56RTNF receptor (CrmC)Pan troglodytes108Grouper iridovirus74
D14LAnkyrinStrongylocentrotus purpuratus87Acanthamoeba polyphaga mimivirus59
B16RAnkyrinTrichomonas vaginalis G391Acanthamoeba polyphaga mimivirus62
C11LAnkyrinTrichomonas vaginalis G368Acanthamoeba polyphaga mimivirus45
P1LAnkyrinTrichomonas vaginalis G394Acanthamoeba polyphaga mimivirus61
K3RTNF-a receptor/CD27 cysteine-rich regionRattus rattus136Singapore grouper iridovirus87
K2RTNF receptor (CrmD)Canis familiaris120Singapore grouper iridovirus72
S2RThymidine kinaseHomo sapiens248Cyprinid herpesvirus 3147
D2L, I4RTNF-α receptor II (CrmB)Bos taurus149Grouper iridovirus87
M2LProteinase inhibitor I4, serpinMonodelphis domestica160Acanthamoeba polyphaga mimivirus92
D3L, I3RAnkyrinTrichomonas vaginalis G397Acanthamoeba polyphaga mimivirus55
B12RSerpinMonodelphis domestica196Acanthamoeba polyphaga mimivirus102
D13LUnknownMus musculus84Lymphocystis disease virus—isolate China43
D7LKelch-likeRattus norvegicus71Clanis bilineata nucleopolyhedrosis virus34
A26LA type inclusion proteinTrichomonas vaginalis G3134Gryllus bimaculatus nudivirus64
O1RPoly(A) polymerase-small (VP39)Paramecium tetraurelia66Vibrio phage CTX31
B20RSerpinBos taurus202Acanthamoeba polyphaga mimivirus94
C5REpidermal growth factorRattus norvegicus63Crimean-Congo hemorrhagic fever virus29
A53RDNA ligaseCanis familiaris622Lymantria dispar nucleopolyhedrovirus274
C18LKelch-likeCanis familiaris111Pseudomonas phage phiEL48
A44RProfilin homologHomo sapiens72Bacteriophage phi-MhaA1-PHL10130
B19RKelch-like (EV-M-167)Drosophila pseudoobscura107Acanthamoeba polyphaga mimivirus45
A57RKelch-likeMacaca mulatta202Human papillomavirus type 6871
D11LKelch-likeCanis familiaris138Pseudomonas phage phiKZ43



ER2LGlutaredoxin 1Rattus norvegicus105Ectocarpus siliculosus virus33
B9RKelch-likeMonodelphis domestica115Pseudomonas phage phiKZ35
G3LKelch-likeMus musculus119Pseudomonas phage phiKZ35
A40LCD47-likeBos taurus119Ranid herpesvirus 234
G13LPhospholipase EEVCanis familiaris118Heliothis zea virus 132
B14RIL-1 beta receptorRattus norvegicus144Enterobacteria phage RB6935
E8LCarbonic anhydrase/VirionHomo sapiens143Acanthamoeba polyphaga mimivirus32
B2RSchlafenMus musculus231Choristoneura fumiferana MNPV47
A59RGuanylate kinaseMus musculus202Sapovirus SaKaeo-15/Thailand35
M5LPutative monoglyceride lipaseRattus norvegicus280Paramecium bursaria Chlorella virus 142
T1RNMDA receptor-like proteinBos taurus256Chimpanzee cytomegalovirus32
M4LNicking-joining enzymeRattus norvegicus368Bombyx mori nuclear polyhedrosis virus32



FE10RmutT motif/NPH-PPH/RNA levels regulatorTetrahymena thermophila SB21040Aedes taeniorhynchus iridescent virus53
A19RDNA Helicase, transcriptionAshbya gossypii ATCC 1089548Ectocarpus siliculosus virus59
D10LCPV-B-012Rattus norvegicus49Rat cytomegalovirus51
E1RLarge capping enzymeTrichomonas vaginalis G358Acanthamoeba polyphaga mimivirus52
D4L, I2RAnkyrinAedes aegypti59Acanthamoeba polyphaga mimivirus46
A48RSuperoxide dismutase-likeLasius niger57Mamestra configurata nucleopolyhedrovirus B44
C9LAnkyrin/host rangeTrichomonas vaginalis G359Acanthamoeba polyphaga mimivirus45
D8LAnkyrinTrichomonas vaginalis G359Ectocarpus siliculosus virus42
A22RDNA processivity factorPlasmodium falciparum 3D752Bacteriophage 8535
C15LUnknownMonodelphis domestica55Acanthamoeba polyphaga mimivirus36
F3LIFN resistance/PKR inhibitor (Z-DNA binding)Rattus norvegicus52Paramecium bursaria Chlorella virus 133
B17RIFN-alpha/beta receptorPan troglodytes56Listeria phage A11832
B7RIFN-gamma receptorCanis familiaris57Choristoneura occidentalis granulovirus31
C10LUnknownTrypanosoma rangeli20Enterobacteria phage K1-519
A39RUnknownEntamoeba histolytica HM-1:IMSS24Rabies virus23
H6RRNA polDebaryomyces hansenii CBS76727Acanthamoeba polyphaga mimivirus24
N2RUnknownEntamoeba histolytica HM-1:IMSS27Pseudomonas phage D325
A14LVirion maturationPichia stipitis CBS 605429Equid herpesvirus 226
A31LVirion morphogenesisParamecium tetraurelia29Plum pox virus25
NULLUnknownMusca domestica29Rice tungro bacilliform virus25
G8LCytoplasmic proteinAspergillus oryzae28Cryptophlebia leucotreta granulovirus27
G14LUnknownCosmospora coccinea29Acanthamoeba polyphaga mimivirus27
B10RUnknownParamecium tetraurelia29Tomato chlorosis virus27
A46RUnknownPlasmodium falciparum 3D730Human immunodeficiency virus 126
G17RDNA-binding phosphoproteinPlasmodium vivax30Murid herpesvirus 126
NULLUnknownDictyostelium discoideum AX431Influenza A virus (A/seal/Massachusetts/1/80(H7N7))27
L5LIMV protein VP13Mustela vison32Staphylococcus phage Twort28
A13LStructural proteinCryptococcus neoformans var. neoformans B-3501A32Bacteriophage phBC6A5128
A10LMembrane proteinPichia stipitis CBS 605434Influenza A virus (A/Hong Kong/481/97(H5N1))26
A3LThioredoxin-likeCaenorhabditis elegans31Impatiens necrotic spot virus30
A18LIMV MP PO4Plasmodium vivax35Mycoreovirus 327
L2LUnknownAspergillus niger30Cyanophage phage S-PM231
A15LIMV PO4 MPTetrahymena thermophila SB21032Gryllus bimaculatus nudivirus30
C2LMPV-Z-N3RXenopus tropicalis32Avian infectious bronchitis virus30
N5REntry and fusion IMV proteinCryptosporidium parvum Iowa II33Chrysodeixis chalcites nucleopolyhedrovirus29
G15LUnknown conservedParamecium tetraurelia34Influenza A Virus (A/Fujian/555/2003(H3N2))28
B13RUnknownAspergillus nidulans FGSC A434American plum line pattern virus29
G7LUnknownPlasmodium yoelii yoelii37Lactococcus lactis bacteriophage Q3026
F10RDisulfide bond formationPlasmodium berghei31African swine fever virus32
H2LUnknownDrosophila melanogaster34Plutella xylostella multiple nucleopolyhedrovirus30
A16LUnknownPlasmodium berghei34Rice stripe virus30
B5RUnknownLeishmania braziliensis34Little cherry virus 129
J2REntry and cell–cell fusionCanis familiaris34Acanthamoeba polyphaga mimivirus29
E7RRNA pol 18(RPO18)Tetrahymena thermophila SB21034Glypta fumiferanae ichnovirus30
A5LCore proteinMonodelphis domestica34Hibiscus latent Fort Pierce virus30
G6LUnknownPlasmodium chabaudi35Bacteriophage 933W29
D6LAlpha-amanitin sensitivityPlasmodium falciparum 3D734Bacillus thuringiensis phage MZTP0231
A36RUnknownSchizosaccharomyces pombe34Acanthamoeba polyphaga mimivirus30
G16LUnknownTrichomonas vaginalis G334Maize dwarf mosaic virus30
H7RUnknownTetrahymena thermophila SB21035Streptococcus thermophilus bacteriophage Sfi1929
A32RUnknownDictyostelium discoideum35Avian infectious bronchitis virus30
M6RUnknownAspergillus terreus NIH262434Feline calicivirus31
A34REEV GlycoproteinCaenorhabditis elegans35Feline leukemia virus (strain Sarma)30
F7RSoluble/Myristyl EEVPlasmodium chabaudi35Chilo iridescent virus29
J5RVLTF-4 (late transcription factor 4)Canis familiaris35Measles virus30
H9RVLTF-1Apis mellifera35Acanthamoeba polyphaga mimivirus31
P2LNFkB inhArabidopsis thaliana35Lymphocystis disease virus 131
B21RUnknownParamecium tetraurelia36Human adenovirus type 1330
B8RVirulence factorBabesia bovis34Simian immunodeficiency virus32
A50LUnknownCandida albicans SC531435Paramecium bursaria Chlorella virus 131
H4LGlutaredoxin 2Gibberella zeae PH-137Ecotropis obliqua NPV30
L6LTelomere-binding proteinTrypanosoma cruzi35Staphylococcus phage Twort32
Q1LVirokine/NFkB inh/Str resemblence to apoptotic regDictyostelium discoideum34Neodiprion abietis nucleopolyhedrovirus33
A37RIEV-specificTetrahymena thermophila SB21035Influenza A virus (A/Chicken/NY/29878/91 (H2N2))32
C12LUnknownTheileria parva34Acanthamoeba polyphaga mimivirus34
N1RMyristylated MP IMVTetrahymena thermophila SB21035Lymphocystis disease virus 133
A23RHolliday junction resolvaseRattus norvegicus35Trichoplusia ni ascovirus 2c32
G11LUnknownEntamoeba histolytica HM-1:IMSS35Human enterovirus 9432
F11LVirion core proteinDictyostelium discoideum AX435Acanthamoeba polyphaga mimivirus32
C4LUnknownTetrahymena thermophila SB21037Acanthamoeba polyphaga mimivirus31
C8LIL-18 BPMacaca mulatta37Mamestra configurata nucleopolyhedrovirus B31
D12LTNF receptor (CrmB)Candida albicans SC531435Ilesha virus33
A1LVLTF-2 (late transcription factor 2)Mus musculus36KI polyomavirus Stockholm 6032
B6RVirulence, ER residentPlasmodium berghei36Human papillomavirus type 5032
S1RVirion morphEntamoeba histolytica38Cherry chlorotic rusty spot associated totiviral-like dsRNA 330
A20LUnknownMus musculus37Mycobacteriophage Halo32
A29LIMV MP/virus entryPlasmodium berghei35Porcine epidemic diarrhea virus33
A21LEntry and cell–cell FusionMedicago truncatula39Human immunodeficiency virus type 130
A55RIntracellular TLR and IL-1 signaling inhibitorCaenorhabditis briggsae34Bacteriophage 2638A35
E2LVirion coreRhipicephalus evertsi34Bovine enteric calicivirus34
J3LIMV heparin binding surface proteinNeosartorya fischeri NRRL 18137Enterobacteria phage JS9832
J7RUnknownTheileria annulata37Rachiplusia ou multiple nucleopolyhedrovirus32
H10REntry-fusion complex proteinPlasmodium falciparum 3D738Epiphyas postvittana nucleopolyhedrovirus32
C14LUnknownPlasmodium falciparum 3D737Bluetongue virus 2232
H8LVirion assembly proteinParamecium tetraurelia37Leucania separata nuclear polyhedrosis virus32
A30LRNA pol 35(RPO35)Trichomonas vaginalis G337Fiji disease virus33
A6RRNA pol 19 (RPO19)Trichomonas vaginalis G338Emiliania huxleyi virus 8632
E9RmutT motif/NTP-PPHTetrahymena thermophila SB21038Chilo iridescent virus32
A43LVirulence/secretedPlasmodium falciparum 3D737Clostridium phage c-st34
F5RVirosome componentMonodelphis domestica38Citrus tristeza virus32
D5L, I1RUnknownPlasmodium chabaudi35Lymphocystis disease virus 135
A38RUnknownTheileria annulata38Murid herpesvirus 432
A28LFusion proteinTribolium castaneum40Lymphocystis disease virus—isolate China31
A54RUnknownPlasmodium berghei38Taura syndrome virus33
A12RViral membrane formationParamecium tetraurelia38Staphylococcus aureus prophage phiPV8333
D1L, I5RChemokine binding proteinDictyostelium discoideum AX439Lactobacillus plantarum bacteriophage LP6532
E4RUracil-DNA glycosylasePlasmodium berghei35Gallid herpesvirus 137
O2RRNA pol (RPO22)Entamoeba histolytica HM-1:IMSS41Choristoneura fumiferana MNPV30
F8RER-localized MPHordeum vulgare38Oryctes rhinoceros virus34
A9RVITF-3 34kda subunitParamecium tetraurelia39Adoxophyes orana granulovirus34
N3LInternal virion proteinPlasmodium falciparum 3D740Adeno-associated virus32
G12LIEV associatedDanio rerio37Bat coronavirus (BtCoV/133/2005)35
G1LApoptosis inhibitor (mitochondrial-associated)Plasmodium falciparum 3D739Spodoptera litura granulovirus34
L3LDNA-binding phosphoproteinTetrahymena thermophila SB21040Acanthamoeba polyphaga mimivirus32
L7LVirion core proteasePlasmodium falciparum 3D739Lactococcus phage Q5434
E3RVirion corePlasmodium falciparum 3D738Agrotis segetum granulovirus35
G9LDisulfide bond formationGallus gallus40Human immunodeficiency virus 133
A17LMyristylated entry/cell–cell fusion proteinDanio rerio40Lymphocystis disease virus—isolate China33
O3LUnknown MPDictyostelium discoideum AX434Lymphocystis disease virus 139
F6RUnknownTetrahymena thermophila SB21039Human immunodeficiency virus 135
A24RVITF-3 45kda subunitOryza sativa (japonica cultivar-group)40Autographa californica nucleopolyhedrovirus34
A27LP4c precursorPlasmodium yoelii yoelii41Bacteriophage RM 37832
C19LUnknownTetrahymena thermophila SB21039Bacteriophage 6635
C6LIL-1 receptor antagonistPlasmodium yoelii yoelii42Human papillomavirus type 14D32
G10LSer/Thr kinase MorphPlasmodium vivax41Acanthamoeba polyphaga mimivirus34
Q2LAlpha-amanitin sensitivityTrichomonas vaginalis G341Xestia c-nigrum granulovirus33
G5L36 kDa major membrane proteinDanio rerio39Cyanophage phage S-PM236
A45RMembrane glycoprotein-class IPlasmodium falciparum 3D737Maize dwarf mosaic virus38
A52RPutative Phosphotransferase/anion transport proteinPlasmodium chabaudi40Chilo iridescent virus35
E12LSmall capping enzymeTetrahymena thermophila SB21039Porcine rotavirus37
N4RCore package/transcriptionPlasmodium falciparum41Staphylococcusphage CNPH8234
L1LDNA-binding proteinStrongylocentrotus purpuratus42Helicoverpa armigera nuclear polyhedrosis virus34
A49RIL-1 signaling inhibitorCryptosporidium parvum Iowa II45Aedes taeniorhynchus iridescent virus30
C13LHost range virulence factorPlasmodium falciparum 3D741Plutella xylostella granulovirus35
H3RVLTF (late transcription elongation factor)Trichomonas vaginalis G341Chilo iridescent virus35
H5RUnknownCaenorhabditis elegans41Acanthamoeba polyphaga mimivirus36
B15LUnknownDictyostelium discoideum AX442Tomato leaf curl Madagascar virus35
E13LTrimeric virion coat protein (rifampicin res)Bigelowiella natans42Neodiprion sertifer nucleopolyhedrovirus35
F1LPoly (A) polymerase-large (VP55)Strongylocentrotus purpuratus42Staphylococcus phage 18735
H1LPredicted metallo-proteaseDictyostelium discoideum39Acanthamoeba polyphaga mimivirus39
A11LP4a precursorTetraodon nigroviridis43Acanthamoeba polyphaga mimivirus35
C16LIL-1 receptor antagonistPlasmodium falciparum 3D742Acanthamoeba polyphaga mimivirus37
A35RC-type lectin-like EEV proteinCaenorhabditis briggsae44African swine fever virus35
A33LATPase/DNA packaging proteinTrichomonas vaginalis G343Cotesia congregata bracovirus36
A58RHemagglutininAnas platyrhynchos48Heliothis zea virus 133
F4LRNA pol (RPO30)Homo sapiens41African swine fever virus41
J4LRAP94 (RNA pol assoc protein)Plasmodium berghei45Trichoplusia ni SNPV37
F2LUnknownEntamoeba histolytica HM-1:IMSS44Staphylococcus aureus phage phiP6839
R1LUnknownPichia stipitis CBS 605441Gryllus bimaculatus nudivirus42
M3LIFN resistance/eIF2 alpha-like PKR inhibitorAnopheles gambiae str. PEST40Silurus glanis ranavirus44
A2LVLTF-3 (late transcription factor 3)Paramecium tetraurelia48Acanthamoeba polyphaga mimivirus37
A7LVirion morphogenesisPlasmodium reichenowi48Acanthamoeba polyphaga mimivirus40
A8LVETF-L (early transcription factor large)Plasmodium yoelii yoelii45Acanthamoeba polyphaga mimivirus44
A4LP4b precursorTetrahymena thermophila SB21040Acanthamoeba polyphaga mimivirus49
A genome map of cowpox strain Gri-90 is color coded (see legend) according to the degree of similarity of each cowpox protein to its best hit when compared against all virus (non-pox) or eukaryote proteins. Similarity of each cowpox virus Gri-90 protein to the best blastp hit in the eukaryote and virus (non-pox) protein datasets.

Discussion and conclusions

Protein coding genes of poxviruses have been the subject of much research. Poxvirus immunomodulatory genes, those both with and without host homologs, have been extensively examined (Finlay and McFadden, 2006, Iyer et al., 2006, McFadden and Murphy, 2000, Monier et al., 2007, Seet et al., 2003, Stanford et al., 2007) as have the gene content and gene families present in poxvirus species, and evolutionary relationships based on phylogenies of those genes (Bratke and McLysaght, 2008, Gubser et al., 2004, Iyer et al., 2001, Iyer et al., 2006, Lefkowitz et al., 2006, McLysaght et al., 2003, Upton et al., 2003, Xing et al., 2006). It is apparent that many genes have entered poxvirus genomes via horizontal transfer both from their hosts and also possibly from other viruses. From an evolutionary perspective, the genes poxviruses share with other viruses have been examined most notably in the context of exploring the hypothesis that the poxvirus family may share a common ancestor with several other families of large DNA viruses (the NCLDV). This hypothesis is based largely on the set of similar proteins these viruses share (at a sequence and/or functional level), which may have served as a “core” set of NCLDV genes. Poxviruses also code for genes with significant sequence similarity to genes from non-NCLDV virus family members, including virulence genes shared by entomopoxviruses, baculoviruses and iridoviruses (Dall et al., 2001, Means et al., 2007), host-interaction genes present in poxviruses and herpesviruses (Afonso et al., 2000, Iyer et al., 2006, McFadden and Murphy, 2000), and other poxvirus proteins with notable levels of similarity to genes of a recently discovered fish herpesvirus (Ilouze et al., 2006). The potential for horizontal gene transfer into poxviruses has been examined using several methods, including phylogenetic reconstructions, gene synteny analysis, and anomalous base composition. Phylogenetic reconstructions of gene families with members in other viruses and their hosts have suggested that multiple horizontal gene transfer (HGT) events have taken place into poxvirus genomes from other viruses (Dall et al., 2001) and from their eukaryotic hosts (Bratke and McLysaght, 2008, Hughes, 2002, Hughes and Friedman, 2005, Monier et al., 2007). Anomalous base composition (DaSilva and Upton, 2005, Monier et al., 2007), and gene synteny analysis (Bratke and McLysaght, 2008, McLysaght et al., 2003) have found evidence for HGT from hosts to poxviruses, including multiple HGT events for some genes. All methods of analysis conclude that the presence of many genes is best explained by HGT, although the process may not be frequent and recent (Lefkowitz et al., 2006, Monier et al., 2007), and some genes with noted similarity to genes of other organisms are proposed to not have been obtained via HGT (Hughes and Friedman, 2005, Iyer et al., 2001). The goals of our current analysis were to develop a method of measuring and visualizing the similarities of all proteins expressed by virus isolates belonging to the entire poxvirus family to various taxonomically distinct sets of proteins from other organisms. This analysis was designed to detect overall trends in gene similarity and to detect individual genes that may be of interest due to anomalous characteristics with regard to such levels of similarity. Each individual protein may then be further investigated with regard to its function, distribution in poxviruses and other organisms, and via phylogenetic analysis, to determine its most likely evolutionary history. Proteins identified as interesting candidates for follow-up research by this method may be further studied using more traditional phylogenetic methods as illustrated by our initial phylogenetic analyses of proteins in Fig. 2. Overall trends in sequence similarity of different subsets of poxvirus proteins, as well as information about individual proteins implicated by our analysis may contribute valuable information about the evolution of poxviruses and the mechanisms of host pathogenesis. Overall, analysis by taxonomic group plots shows that chordopoxvirus proteins tend to exhibit greater similarity to eukaryotic proteins than to bacterial or viral proteins, suggesting that many poxvirus proteins may share a common evolutionary origin derived from proteins of their eukaryotic hosts. Although entomopoxviruses also contain host-like genes, both with and without homologs in chordopoxviruses, entomopoxvirus proteins do not show the same general skew towards similarity to eukaryotic proteins. However, entomopoxviruses encode quite a few proteins with notably greater similarity to proteins of other viruses than to bacterial or eukaryotic proteins. The relatively small sampling of insect proteins available in GenBank could partly account for the low scores of these proteins to the eukaryote database, with insects being represented by 799,971 proteins and 210 complete genomes, compared to a vertebrate collection of 1,787,682 proteins and 1559 complete genomes. However, with only 3 exceptions, all chordopoxvirus proteins which achieve similarly high scores to proteins of other viruses are proteins with sequences universally conserved throughout nature, such as ribonucleotide reductase, DNA photolyase and RNA polymerase. For viruses of both the entomopoxvirus and chordopoxvirus subfamilies, the most similar virus proteins outside the poxvirus family are found both among members of the postulated NCLDV group of viruses and among non-NCLDV members, with viruses of the families Baculoviridae, Herpesviridae and Iridoviridae most represented. Inspection of the individual proteins represented on the plots reveals that many of the proteins are universally highly conserved. These function in the synthesis and maintenance of DNA and RNA, and are present in many virus species, as well as in most eukaryotes. All the poxvirus enzymes that convert cellular pools of nucleotides for RNA synthesis into deoxyribonucleotides for synthesis of DNA fall either on the diagonal or just below it. The ultimate origin of these proteins is uncertain, but those with greater similarity to eukaryote proteins may have been transferred more recently into the poxvirus lineage than into the other virus families in which they appear, or these proteins may have been constrained for functional purposes towards high sequence identity with host proteins. Many other proteins highlighted by this analysis are apparently of eukaryote origin, and fall either on the diagonal, just below it, or near the eukaryote axis, depending on their degree of similarity to proteins presumably transferred into viruses outside the poxvirus family. These have functions involving immune response and intracellular processes, and seem likely to have been transferred horizontally from hosts into poxviruses as well as into the families of other, non-poxvirus viruses. The functions of these proteins are presumed to be advantageous to the biology of viruses in all families where these proteins appear. Proteins near the eukaryotic axis in Fig. 1A are only present in viruses of the family Poxviridae. The majority of these proteins are involved in the manipulation of intracellular processes, including redox state, protein signaling cascades, and lipid and carbohydrate metabolism, as well as involved in the manipulation of the extracellular environment. Some of these proteins are of unknown function. The fact that these eukaryotic-like proteins are found only among viruses in the poxvirus family may be informative about what cellular processes and signaling cascades are unique to poxvirus infections. Finally, there are many proteins that are seemingly unique to poxviruses, with no significant sequence similarity to known proteins among other viruses, eukaryotes or bacteria. Together these results give us a picture of the many different subsets of proteins present in poxviruses, and allow us to draw some conclusions about each subset based on where else in nature proteins of these types appear. Investigation of the similarities and origins of particular proteins may yield further insights into poxvirus evolution and pathogenesis. For example, the fact that the poxvirus versions of universally highly conserved enzymes such as RNR have significantly more sequence similarity to RNR of eukaryotes than to those of bacteria or other viruses may imply a need for interoperability of the poxvirus enzymes with host proteins. Another example is the presence of different clusters of poxvirus TK sequences, where TK encoded by entomopoxviruses and avipoxviruses cluster together on the plot in a different location from the cluster of TK proteins encoded by poxviruses in other genera, agreeing with previously published suggestions that the TK enzymes of avipoxvirus, entomopoxvirus and the other chordopoxvirus genera may have different origins (Bratke and McLysaght, 2008, Koonin and Senkevich, 1992). Finally, by using taxonomic group plots to study the proteome of cowpox virus, we show that the most host-like genes tend to lie at the ends of the linear genome and have the most limited distributions among poxvirus species. More explicit conclusions about individual proteins, including gene origins, relationships to proteins of other organisms, and details of potential horizontal gene transfer events, will require additional, more extensive analyses at the level of each individual gene. Such investigations will require phylogenetic reconstruction of individual protein families utilizing sequences obtained from accurate annotations of poxvirus genomes, with particular attention to providing an accurate gene prediction for each genome and to the presence or absence of particular genes in each genome. In conclusion, using taxonomic group plots to analyze proteins of poxviruses confirms the presence of many eukaryotic-like proteins in the genomes of poxvirus species, underscoring the importance of the contribution of host gene capture in the evolution of these viruses. These results also provide an overview of the functional significance of many of the genes poxviruses share with their hosts, and expose which host genes are captured uniquely by poxviruses and which are captured by other virus families as well. Information yielded by more comprehensive phylogenetic analysis of poxvirus genes to genes of their hosts and other viruses, will illustrate details of molecular mechanisms of poxvirus adaptation and survival throughout the history of the virus family, giving a richer picture of the evolution of this once devastating and still dangerous group of viral pathogens.
  50 in total

Review 1.  Insect-virus relationships: sifting by informatics.

Authors:  D Dall; T Luque; D O'Reilly
Journal:  Bioessays       Date:  2001-02       Impact factor: 4.345

2.  Genome-based phylogeny of poxvirus.

Authors:  Ke Xing; Riqiang Deng; Jinwen Wang; Jinghua Feng; Mingsong Huang; Xunzhang Wang
Journal:  Intervirology       Date:  2006-01-11       Impact factor: 1.763

Review 3.  Immunopathogenesis of poxvirus infections: forecasting the impending storm.

Authors:  Marianne M Stanford; Grant McFadden; Gunasegaran Karupiah; Geeta Chaudhri
Journal:  Immunol Cell Biol       Date:  2007-01-16       Impact factor: 5.126

Review 4.  Ribonucleotide reductases.

Authors:  J Stubbe
Journal:  Adv Enzymol Relat Areas Mol Biol       Date:  1990

5.  CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice.

Authors:  J D Thompson; D G Higgins; T J Gibson
Journal:  Nucleic Acids Res       Date:  1994-11-11       Impact factor: 16.971

6.  A theory of modular evolution for bacteriophages.

Authors:  D Botstein
Journal:  Ann N Y Acad Sci       Date:  1980       Impact factor: 5.691

7.  The genome of fowlpox virus.

Authors:  C L Afonso; E R Tulman; Z Lu; L Zsak; G F Kutish; D L Rock
Journal:  J Virol       Date:  2000-04       Impact factor: 5.103

8.  Fowlpox virus encodes nonessential homologs of cellular alpha-SNAP, PC-1, and an orphan human homolog of a secreted nematode protein.

Authors:  S M Laidlaw; M A Anwar; W Thomas; P Green; K Shaw; M A Skinner
Journal:  J Virol       Date:  1998-08       Impact factor: 5.103

9.  VRK1 phosphorylates CREB and mediates CCND1 expression.

Authors:  Tae-Hong Kang; Do-Young Park; Wanil Kim; Kyong-Tai Kim
Journal:  J Cell Sci       Date:  2008-08-19       Impact factor: 5.285

10.  Identification of multiple independent horizontal gene transfers into poxviruses using a comparative genomics approach.

Authors:  Kirsten A Bratke; Aoife McLysaght
Journal:  BMC Evol Biol       Date:  2008-02-27       Impact factor: 3.260

View more
  28 in total

1.  Viral insulin-like peptides activate human insulin and IGF-1 receptor signaling: A paradigm shift for host-microbe interactions.

Authors:  Emrah Altindis; Weikang Cai; Masaji Sakaguchi; Fa Zhang; Wang GuoXiao; Fa Liu; Pierre De Meyts; Vasily Gelfanov; Hui Pan; Richard DiMarchi; C Ronald Kahn
Journal:  Proc Natl Acad Sci U S A       Date:  2018-02-21       Impact factor: 11.205

Review 2.  Mechanisms of immunomodulation by mammalian and viral decoy receptors: insights from structures.

Authors:  Jan Felix; Savvas N Savvides
Journal:  Nat Rev Immunol       Date:  2016-12-28       Impact factor: 53.106

Review 3.  The evolutionary biology of poxviruses.

Authors:  Austin L Hughes; Stephanie Irausquin; Robert Friedman
Journal:  Infect Genet Evol       Date:  2009-10-13       Impact factor: 3.342

4.  Use of host-like peptide motifs in viral proteins is a prevalent strategy in host-virus interactions.

Authors:  Tzachi Hagai; Ariel Azia; M Madan Babu; Raul Andino
Journal:  Cell Rep       Date:  2014-05-29       Impact factor: 9.423

5.  Orthopoxvirus genome evolution: the role of gene loss.

Authors:  Robert Curtis Hendrickson; Chunlin Wang; Eneida L Hatcher; Elliot J Lefkowitz
Journal:  Viruses       Date:  2010-09-15       Impact factor: 5.818

6.  Genes in the terminal regions of orthopoxvirus genomes experience adaptive molecular evolution.

Authors:  David J Esteban; Anne P Hutchinson
Journal:  BMC Genomics       Date:  2011-05-23       Impact factor: 3.969

7.  Identification of nucleotide-level changes impacting gene content and genome evolution in orthopoxviruses.

Authors:  Eneida L Hatcher; Robert Curtis Hendrickson; Elliot J Lefkowitz
Journal:  J Virol       Date:  2014-09-17       Impact factor: 6.549

8.  Domain analysis of symbionts and hosts (DASH) in a genome-wide survey of pathogenic human viruses.

Authors:  Mileidy W Gonzalez; John L Spouge
Journal:  BMC Res Notes       Date:  2013-05-24

9.  Horizontal transfer and the evolution of host-pathogen interactions.

Authors:  Elena de la Casa-Esperón
Journal:  Int J Evol Biol       Date:  2012-11-26

10.  Genome variability and gene content in chordopoxviruses: dependence on microsatellites.

Authors:  Eneida L Hatcher; Chunlin Wang; Elliot J Lefkowitz
Journal:  Viruses       Date:  2015-04-22       Impact factor: 5.048

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.