| Literature DB >> 34253028 |
Tatiana G Senkevich1, Natalya Yutin2, Yuri I Wolf2, Eugene V Koonin2, Bernard Moss1.
Abstract
The survival of viruses depends on their ability to resist host defenses and, of all animal virus families, the poxviruses have the most antidefense genes. Orthopoxviruses (ORPV), a genus within the subfamily Chordopoxvirinae, infect diverse mammals and include one of the most devastating human pathogens, the now eradicated smallpox virus. ORPV encode ∼200 genes, of which roughly half are directly involved in virus genome replication and expression as well as virion morphogenesis. The remaining ∼100 "accessory" genes are responsible for virus-host interactions, particularly counter-defense of innate immunity. Complete sequences are currently available for several hundred ORPV genomes isolated from a variety of mammalian hosts, providing a rich resource for comparative genomics and reconstruction of ORPV evolution. To identify the provenance and evolutionary trends of the ORPV accessory genes, we constructed clusters including the orthologs of these genes from all chordopoxviruses. Most of the accessory genes were captured in three major waves early in chordopoxvirus evolution, prior to the divergence of ORPV and the sister genus Centapoxvirus from their common ancestor. The capture of these genes from the host was followed by extensive gene duplication, yielding several paralogous gene families. In addition, nine genes were gained during the evolution of ORPV themselves. In contrast, nearly every accessory gene was lost, some on multiple, independent occasions in numerous lineages of ORPV, so that no ORPV retains them all. A variety of functional interactions could be inferred from examination of pairs of ORPV accessory genes that were either often or rarely lost concurrently. IMPORTANCE Orthopoxviruses (ORPV) include smallpox (variola) virus, one of the most devastating human pathogens, and vaccinia virus, comprising the vaccine used for smallpox eradication. Among roughly 200 ORPV genes, about half are essential for genome replication and expression as well as virion morphogenesis, whereas the remaining half consists of accessory genes counteracting the host immune response. We reannotated the accessory genes of ORPV, predicting the functions of uncharacterized genes, and reconstructed the history of their gain and loss during the evolution of ORPV. Most of the accessory genes were acquired in three major waves antedating the origin of ORPV from chordopoxviruses. The evolution of ORPV themselves was dominated by gene loss, with numerous genes lost at the base of each major group of ORPV. Examination of pairs of ORPV accessory genes that were either often or rarely lost concurrently during ORPV evolution allows prediction of different types of functional interactions.Entities:
Keywords: Adaptive mutations; Gene gain; Gene loss; Host range; Innate immunity; Poxvirus; Poxvirus evolution; Virus evolution; Virus phylogeny
Mesh:
Substances:
Year: 2021 PMID: 34253028 PMCID: PMC8406176 DOI: 10.1128/mBio.01495-21
Source DB: PubMed Journal: mBio Impact factor: 7.867
ORPV genes implicated in virus-host interaction (103 genes)
| OPG | Protein domain architecture, localization, function | Cop |
|---|---|---|
|
| PIE domain, chemokine binding protein | C23L |
|
| TNF-α receptor homolog and PIE domain, chemokine binding protein, CrmB | C22L |
|
| ANK and PRANC domains, binds to the NF-κB p105 subunit and to SCP1 | C19L-C21L |
|
| ANK and PRANC domains | C17L-C18L |
|
| Bcl-2 domain, host range factor | C16L |
|
| Bcl-2 domain | |
|
| PIE domain | |
|
| BTB domain, no kelch domain | |
|
| ANK and PRANC domains | |
|
| C-type lectin domain, putative decoy ligand for NK cell inhibitory receptor | |
|
| Type II transmembrane, TAP inhibitor, inhibits peptide loading on MHC I and antigen presentation | |
|
| BTB and Kelch domains | |
|
| PIE domain, SCP-2 | |
|
| TNF receptor vCD30 homolog, binds CD153, prevents CD30/CD153 interaction | |
|
| ANK and PRANC domains, binds to cullin 2 and inhibits NF-κB activation | |
|
| ANK and PRANC domains | |
|
| MHC class I homolog (OMCP), secreted, blocks NKG2D receptor | |
|
| ANK and PRANC domains | |
|
| Paralog of OPG27, VACV C7 protein | |
|
| Epidermal growth factor homolog, EGFR-dependent MAPK/ERK1/2 activation | C11R |
|
| N-term prolyl hydroxylase fold, C-term IL-1 receptor antagonist, antagonizes DNA-PK by binding Ku | C10L |
|
| KilA-N and RING domains, E3 ubiquitin ligase, blocks apoptosis, virulence factor | |
|
| IL-18 binding protein homolog, secreted, blocks IL-18 receptor binding site | |
|
| ANK and PRANC domains, host range, CP77 | |
|
| Intrinsically disordered | |
|
| ANK and PRANC domains, targets IFITs for degradation, IFN resistance | C9L |
|
| PIE domain, secreted | C8L |
|
| Host range, inhibits type I IFN production and signaling, paralog of OPG18 | C7L |
|
| Transmembrane domain | |
|
| Bcl-2 domain, inhibits type I IFN production and signaling, targets HDAC5 for degradation | C6L |
|
| BTB domain without Kelch domain | C5L |
|
| Inactive prolyl hydroxylase, induction of hypoxic response, inhibits NF-kB signaling, binds Ku | C4L |
|
| Complement control protein-like (CCP), prevents complement activation, secreted | C3L |
|
| BTB and Kelch domains, inhibits inflammation, reduces immunopathology | C2L |
|
| Bcl-2 domain | C1L |
|
| Bcl-2 domain, inhibits NF-kB activation and apoptosis | N1L |
|
| Bcl-2 domain, IRF3 activation inhibitor, α-amanitin target | N2L |
|
| ANK domain (no PRANC domain), inhibits intrinsic apoptosis at the level of the apoptosome | M1L |
|
| PIE domain, blocks SD28-mediated T cell activation, secreted | M2L |
|
| ANK domain, no PRANC domain, NF-kB inhibitor, host range | K1L |
|
| Mimic of eIF2α, pseudo-substrate for PKR , PKR inhibitor, IFN-resistance | K3L |
|
| Phospholipase-D homolog | K4L |
|
| Monoglyceride lipase homolog | K5L, K6L |
|
| Bcl-2 domain, binds DDX3, blocks IFN-β promoter induction | K7R |
|
| Bcl-2 domain, localized to mitochondria, blocks mitochondrial apoptosis | F1L |
|
| BTB and Kelch domains, contributes to virulence, larger lesion size | F3L |
|
| Immunoglobulin domain, type I membrane protein, affects plaque morphology | F5L |
|
| No detectable domains | F6L |
|
| No detectable domains | F7L |
|
| No detectable domains, non-essential for morphogenesis and actin rearrangement in VACV | F8L |
|
| RhoA-mDia signaling inhibitor, promotes cell motility | F11L |
|
| Predicted all-β domain, highly conserved in most chordopoxviruses | F15L |
|
| Non-functional serine recombinase | F16L |
|
| Z-DNA binding domain, dsRNA-binding, PKR inhibitor, IFN resistance | E3L |
|
| 2 DNA-binding BEN domains, cGAS and IFN type 1 inhibitor, virulence factor | E5R |
|
| C-terminal TM, activates of ERK1/2 signaling pathway, increases virulence | O1L |
|
| NLPc/P60 superfamily protein, predicted hydrolase, non-essential in cell culture | G6R |
|
| US22 family, SUKH superfamily, predicted dsRNA binding | A31R |
|
| MHC class II antigen presentation inhibitor, virulence factor | A35R |
|
| Inactive prolyl hydroxylase, induction of hypoxic response?, paralog of C10, C4 proteins | A37R |
|
| No detectable domains, uncharacterized | |
|
| CD47 homolog, integral membrane protein, promotes Ca2+ influx into infected cells | A38L |
|
| Semaphorin homolog, secreted, blocks semaphorin cell surface receptor | A39R |
|
| Lectin homolog, type II TM, cell surface | A40R |
|
| PIE domain, secreted chemokine binding protein, interferes with chemokine-GAG interaction | A41L |
|
| Profilin homolog, interacts with A-type inclusion protein and tropomyosin | A42R |
|
| Type I membrane glycoprotein, localizes to Golgi and plasma membranes | A43R |
|
| Inhibitor of host translation initiation, suppresses innate and adaptive immunity | A43.5R |
|
| Hydroxysteroid dehydrogenase, inflammatory response inhibitor | A44L |
|
| Bcl-2 domain, inhibits TLR- and IL-1β mediated NF-kB activation | A46R |
|
| Gasdermin homolog, pyroptosis inhibitor? Immunoprevalent | A47L |
|
| Bcl-2 domain, disordered 17 amino acid N-terminus mimics IκBα, inhibits NF-kB activation | A49R |
|
| Distant homolog of OPG202, VACV B17 protein | A51R |
|
| Bcl-2 domain, NF-kB pathway inhibitor, blocks TLR signaling | A52R |
|
| TNF-receptor homolog, CrmC, secreted, prevents TNF binding to cell receptor | A53R |
|
| BTB and Kelch domains, strong interaction with Cullin 3, could sequester Cullin 3 | A55R |
|
| Poxin-Schlafen fusion protein, blocks DNA sensing and prevents IFN induction | B2R-B3R |
|
| Poxin, 2’-3’-cGAMP nuclease, blocks DNA sensing and IFN induction | B2 |
|
| ANK and PRANC domains, affects ECTV spread in mouse organs | B4R |
|
| HEAT/ARM repeats, C-terminal transmembrane domain | B6R |
|
| PIE domain, ER resident, SCP-3, influences lesion size in mice | B7R |
|
| IFN-γ receptor mimic, secreted, sequesters IFN-γ | B8R |
|
| Surface glycoprotein type 1 TM domain, absent in CPXV Gri and BRI, present in CPXV GER91-3 | |
|
| PIE, ER localized, retains MHC I in ER, prevents antigen presentation | B9R |
|
| BTB and Kelch domains | B10R |
|
| Disordered | B11R |
|
| Serine/threonine kinase homolog, nonfunctional | B12R |
|
| Serpin (SPI2), CrmA, caspase-1 and 8 inhibitor, apoptosis (pyroptosis) inhibitor | B13R, B14R |
|
| Bcl-2, NF-kB signaling inhibitor, prevents phosphorylation of IkBα, regulates MAPK/AP-1 activation | B15R |
|
| 3 immunoglobulin domains, IL-1 receptor homolog, blocks IL-1β, prevents fever in mice | B16R |
|
| Distant homolog of OPG181, VACV A51 protein | B17L |
|
| ANK and PRANC domains, binds to SCF ubiquitin ligase complex | B18R |
|
| 3 immunoglobulin domains, IFN type-1 decoy receptor, secreted, attaches to GAGs, inhibits IFN | B19R |
|
| ANK and PRANC domains | B20R |
|
| BTB and Kelch domains interacts with C3RL complex | |
|
| No detectable domains, uncharacterized | C11.5R |
|
| Serpin (SPI1), host range factor, apoptosis inhibitor | C12L |
|
| PIE domain, SCP-1 | C13L, C14L |
|
| Surface glycoprotein, type I transmembrane domain, T cell response suppressor, virulence factor | |
|
| ANK and PRANC domains | C15L/B21R |
|
| TNF receptor homolog and PIE domain, CrmD | |
|
| TNF receptor homolog, secreted, CrmE, blocks TNF binding to cell receptors | |
|
| N-methyl D-aspartate receptor homolog, Golgi localized, vGAAP, suppresses apoptosis |
Genes that are present on both ends of the Cop genome in internal terminal repeat.Abbreviations: Cop, Copenhagen; IFN, interferon; TM, transmembrane; EV, enveloped virus.
FIG 2Gain of ORPV genes at different stages of evolution of chordopoxviruses. Each circle associated with a tree branch shows the node number from which the branch emits (see Table S5A) and the number of genes inferred to have been gained at the branch. The callouts (blue boxes) indicate the inferred points of entry of the ancestors of the major families of paralogous accessory genes into the evolving poxvirus genomes. The chordopoxvirus genera are indicated to the right of the tree.
FIG 1Distribution of ORPV accessory genes among chordopoxviruses. (A) Number of ORPV accessory genes (OPG) represented in each chordopoxvirus genus. (B) Distribution of accessory genes among the major lineages of ORPV. (C) Representation of accessory genes in ORPV genomes. Each bar represents one of the 106 accessory OPGs.
FIG 3Phylogenetic tree of ORPV and gene loss in ORPV evolution. The large branches are collapsed and shown with triangles. Each box associated with a tree branch shows the node number from which the branch emits (see Fig. S2) and the number of genes inferred to have been lost at the branch (numbers in bold). The four colored boxes show major clades with many genes lost at the stem. For branches without associated boxes, no or one gain was inferred. The red shape shows the high-loss clade of ORPV.
FIG 4Estimated rates of gene loss in the major clades of ORPV. The plot shows the loss rate in the stem of the respective clade and each of the branches within the clade. The gene loss rates were estimated as the number of genes lost per 1% nucleotide distance from the common ancestor of the respective group. Whiskers show the 90% confidence interval (5th and 95th percentile of the loss rates across 1000 bootstrap samples of genes).
FIG 5Gene content evolution tree of ORPV derived from the patterns of gene presence-absence. The Unweighted Pair Group Method with Arithmetic mean (UPGMA) dendrogram was constructed from the matrix of distances between OPG presence-absence patterns, and therefore, reflects the overlap between the sets of lost genes (Table S3A).
FIG 6Distribution of inferred gene losses along the ORPV genomes. Each bar shows the number of times a given OPG was lost during the evolution of the ORPV. OPG153 (A26L), a gene with an anomalously high loss rate for its position away from the genome end, is indicated.
FIG 7Concurrent gene loss in ORPV. (A) Heatmap of gene losses. The all against all map of OPG is shown. The intensity of the red color is proportional to the frequency of the concurrent loss of the respective OPG. (B) Network of gene losses. Shown is a sub-network of the complete network of concurrent gene loses that includes 53 genes, with edges connecting genes with at least 3 independent concurrent losses. The color goes from blue (3 shared losses) to red (13 shared losses). The network was visualized using Cytoscape v.3.8.2.
Loss of the genes of the four dominant large families of paralogs in the major clades of ORPV
| ORPV branch | Total lost genes | BTB-Kelch (8) | Ank-PRANC (14) | Bcl-2 (12) | PIE (11) |
|---|---|---|---|---|---|
| MPXV | 37 | 6 | 5 | 3 | 1 |
| VARV | 37 | 6 | 7 | 2 | 2 |
| VACV | 18 | 1 | 6 | 1 | 2 |
| ECTV | 35 | 1 | 7 | 6 | 3 |
The total number of members in each paralogous family is indicated in parentheses.
FIG 8Distribution of different gene disruption events along the ORPV genomes. The distributions of relative frequencies of 4 types of gene disruption events along the ORPV genomes are shown. The events are color-coded as shown beneath the plot: D, decaying (deletion); F, frameshift; N, in-frame stop nonsense mutation; X, complete absence (large deletion). Relative frequencies of disruption events in each gene were averaged in an 11-gene sliding windows.
FIG 9Routes of gene disruption in ORPV. The gene disruption events for 6 OPGs are mapped onto the phylogenetic tree of the ORPV (Fig. S2). Magenta, decaying (within gene deletion); blue, frameshift; red, missing (large deletion).
FIG 10Unique evolutionary scenarios for disrupted genes in ORPV. The inferred evolutionary events are shown for three OPGs with unusual histories. OPG188: fusion of Poxin with Schlafen followed by disruption of Schlafen, leaving the intact Poxin, or disruption of the entire fused gene. OPG177, Gasdermin homolog, duplication in branch leading to the common ancestor of ORPV and Centapoxviruses followed by the loss of one paralog in Old World ORPV. Poxvirus X denotes the common ancestor of Centapoxviruses and ORPV. Shapes with pale gray coloring and red cross denotes gene disruption. OPG010, OPG010a: exaptation of a disrupted remnant of an endoplsmic reticulum protein containing a lectin domain for a function as a TAP inhibitor (OPG10a); OPG10a evolved convergently in several groups of ORPV (see the text).