| Literature DB >> 23701946 |
Natalya Yutin1, Didier Raoult, Eugene V Koonin.
Abstract
BACKGROUND: Recent advances of genomics and metagenomics reveal remarkable diversity of viruses and other selfish genetic elements. In particular, giant viruses have been shown to possess their own mobilomes that include virophages, small viruses that parasitize on giant viruses of the Mimiviridae family, and transpovirons, distinct linear plasmids. One of the virophages known as the Mavirus, a parasite of the giant Cafeteria roenbergensis virus, shares several genes with large eukaryotic self-replicating transposon of the Polinton (Maverick) family, and it has been proposed that the polintons evolved from a Mavirus-like ancestor.Entities:
Mesh:
Year: 2013 PMID: 23701946 PMCID: PMC3671162 DOI: 10.1186/1743-422X-10-158
Source DB: PubMed Journal: Virol J ISSN: 1743-422X Impact factor: 4.099
Figure 1Comparison of genome architectures of the virophages, polintons, some viruses, and transpovirons. Homologous genes are marked by same colors. Different hatching patterns are used to mark non-orthologous primase-helicase, integrase, and lipase. Homologous regions are shaded. Reference sequences were extracted from GenBank (D. fasciculatum, GI:328871053; P. palladium, GI: 281202948; T. castaneum, GI:58197573; A. pisum, GI:156713484; Mimivirus lentille transpoviron Lentille, GI: 374110342; C. congregate bracovirus, GI: 326937614 Minute virus of mice, GI:9626993; Bovine adenovirus A, GI:52801677; Bacteriophage Bam35, GI:38640293; and Bacteriophage PRD1, GI:159192286) and Repbase (Polinton-1_CB and Polinton-1_TV [15,20]). The genome organization of Tlr1was adopted from ref. [18]. T. castaneum and A. pisum genome fragments are shown as in [21]. PLA2 stands for the phospholipase A2 domain of parvovirus capsid protein. Other color key abbreviations are the same as throughout the text.
Evolutionary provenance of the genes of the three well-characterized virophages
| V9, OLV7, MV16 | C5-family cysteine protease | Protease, probably involved in capsid protein maturation | Only distantly related to other proteases from NCLDV, adenoviruses, eukaryotes and some bacteria | No obvious homologs |
| V3, OLV4, MV15 | P-loop ATPase, FtsK-like family | Packaging ATPase | Only distantly related to other ATPases of the FtsK-like family: NCLDV, adenoviruses, diverse phages, bacteria and archaea (DNA pumping during cell division and conjugation) | Abundant moderately conserved homologs |
| V20, OLV9, MV18 | Predicted distorted jelly-roll domain | Major capsid protein | No homologs beyond virophages | None |
| V18-19, OLV8, MV17 | No detectable domains | Minor capsid protein | No homologs beyond virophages | None |
| V14, V4, MV06 (C-terminal), OLV1 (C-terminal), | C2H2 Zn-ribbon; N-terminal GIY-YIG endonuclease domain in MV06 and OLV1 | Unknown | Homologs in transpovirons (closest to V14, Zn-ribbon only), | Moderately conserved homologs, mostly containing GIY-YIG nuclease domain |
| V13 (C-terminal), OLV25 (C-terminal), MV01 | S3H helicase; N-terminal TVpol in V13 and OLV25 | primase-helicase | Sputnik helicase is most similar to bacterial and bacteriophage homologs; the MV01 helicase is most similar to the NCLDV homolog; the OLV helicase is most similar to homologs from bacteriophages and polintons | Numerous conserved homologs including proteins with both TVpol and helicase domains |
| V13 (N-terminal), OLV25 (N-terminal) | TVpol | Primase and DNA polymerase ([ | Related to | Numerous conserved homologs including proteins with both TVpol and helicase domains |
| V21, OLV5 | No detectable domains | Unknown | No other homologs | None |
| V6(part), V7(part) OLV13 (part), OLV19 (part), OLV20 (part) | Collagen-like repeats | Adsorbtion on host virus? | V6 is highly similar to mimiviruses, OLV13 - to bacteria; OLV19 has regions similar to OLPV, | Abundant homologs mostly containing collagen domain |
| OLV1 (N-terminal), OLV24 and MV06 (N- terminal | GIY-YIG endonuclease, fused to C2H2 Zn-ribbon in OLV1 and MV06 | Unknown | Close homologs in | Moderately conserved homologs |
| OLV12 (C- terminal), MV13 (C- terminal) | Lipase 3 domain | Unknown | Homologs in all cellular organisms; Mavirus closest homolog is a | Few moderately conserved homologs for each of the proteins |
| V10, MV02 | Integrase | | Mavirus interase is related to Polintons, Sputnik - to archaeal and bacterial proviruses | Very few homologs |
| V17 | Transposase, DNA-binding domain | DNA-binding protein | Closest homologs in transpovirons | Numerous moderately conserved homologs |
| V16 | No detectable domains | Unknown | Homologs in moumouvirus: mv_L1152 | none |
| V12 | No detectable domains | Unknown | Highly conserved homologs in Mimiviridae | none |
| V10 | XerD family integrase | Integrase | Closest homologs in archaeal proviruses | Only distant integrases |
| OLV23 | N6 A-specific methylase | DNA methylase | Numerous bacterial homolog | Numerous homologs |
| OLV16, OLV21 | Proline-rich, mucien –like repeats | Unknown (adsorption on virus host?) | Similar repeats in bacteria and eukaryotes | Numerous similar repeats |
| OLV18, OLV19 | Phage Tail Collar Domain | Unknown (adsorption on virus host?) | Closely related to a family of OLPV proteins | Numerous close homologs |
| OLV2 | Uncharacterized domain | Unknown | Homologs in many phycodnaviruses and in Tlr1 element (6Fp) | Abundant homologs with wide range of similarity including very close ones |
| OLV22 | Uncharacterized domain | Unknown | Highly similar to OLPV2, GI:322510937 | A few close homologs |
| OLV12(N-terminal) | Uncharacterized domain fused to Lipase 3 | Unknown | Highly similar to Chloroviruses | Numerous close homologs |
| MV20 | FNIP repeats | Unknown | Closely related homologs in mimiviruses | Numerous moderately similar homologs |
| MV04 | C2H2 Zn finger | Unknown | No close homologs | None |
| MV02 | RVE family integrase | Integration of Mavirus genome into the virus host genome? | Numerous homologs, closest in Polintons | Numerous moderately similar homologs |
| MV19, M09 | S74 family peptidase (C-terminal), N-terminal glycosylase (?); MV09 has only the N-terminal domain | Unknown | Numerous homologs in phages and bacteria (prophages?); homologs in Marseillevirus, Lausannevirus, | Numerous moderately similar homologs |
| MV13 | Lipase (a/b hydrolase superfamily) | Unknown | Homologs in all cellular organisms, closest in plants | Several moderately similar homologs |
| MV03 | B family DNA polymerase | Genome replication | Homologs in all cellular organisms and numerous viruses, the closest homologs in Polintons | No close homologs |
Figure 2Phylogenetic trees of conserved virophage proteins. A, maturation protease. B, packaging ATPase. Branches with bootstrap support (expected-likelihood weights) less than 0.5 were collapsed. Sequences marked with an asterisks (*) were taken from Repbase [20]. For other sequences, the species name abbreviation and the GenBank identification numbers are indicated; env stands for “marine metagenome.” Species abbreviations: Acaca, Acanthamoeba castellanii mamavirus; Acapo, Acanthamoeba polyphaga mimivirus; Afrsw, African swine fever virus; Amsmo, Amsacta moorei entomopoxvirus 'L'; Caebr, Caenorhabditis brenneri; Caere, Caenorhabditis remanei; Crovi, Crocodilepox virus; Dicfa, Dictyostelium fasciculatum; Fowvi, Fowlpox virus isolate HP-438/Munich; Megch, Megavirus chiliensis; Melsa, Melanoplus sanguinipes entomopoxvirus; MouMo, Moumouvirus Monve; Orfvi, Orf virus; Popla, Polysphondylium pallidum PN500; Tanvi, Tanapox virus; uncvi, uncultured virus; Vacvi, Vaccinia virus Tian Tan. Taxa abbreviations: Ea, Amoebozoa; El, Opisthokonta; n2, mimiviruses; u1, Chordopoxvirinae; u2, Entomopoxvirinae. Color code: Red, virophages; blue, (predicted) polintons and related elements; light brown, NCLDV; gray, unassigned environmental sequences.
Figure 3Phylogenetic trees of virophage S3H helicase: A, helicase domain; B, TVpol domain. Branches with bootstrap support (expected-likelihood weights) less than 0.5 were collapsed. TVpol domain genome contexts are shown by same color scheme as in Figure 1. TVpol domain reference sequences (marked with two asterisks) were taken from [22]. For other sequences, the species name abbreviation and the GenBank identification numbers are indicated; env stands for “marine metagenome.” Species abbreviations: Afrsw, African swine fever virus; Dicfa, Dictyostelium fasciculatum; Ectsi, Ectocarpus siliculosus virus 1; Marse, Marseillevirus; Micpu, Micromonas pusilla virus PL1; Monbr, Monosiga brevicollis MX1; Mycph, Mycobacterium phage; Parbu, Paramecium bursaria Chlorella virus NY2A; Physo, Phytophthora sojae; Popla, Polysphondylium pallidum PN500; Steph, Stenotrophomonas phage S1. Taxa abbreviations: E8, stramenopiles; Ea, Amoebozoa; El, Opisthokonta; f3, Siphoviridae; q1, Chlorovirus; q3, Phaeovirus; q4, Prasinovirus. The color code is as in Figure 2.
Figure 4Phylogenetic tree of the GIY-YIG endonuclease. Branches with bootstrap support (expected-likelihood weights) less than 0.5 were collapsed. The species name abbreviation and the GenBank identification numbers are indicated; env stands for “marine metagenome.” Species abbreviations: Aciph, Acinetobacter phage 133; Bacam, Bacillus amyloliquefaciens subsp. plantarum YAU B9601-Y2; Invir, Invertebrate iridescent virus 6; Phagl, Phaeocystis globosa virus 12T; Phyin, Phytophthora infestans T30-4; Physo, Phytophthora sojae; Psyto, Psychroflexus torquis ATCC 700755; Taxa abbreviations: Bb, Bacteriodetes/Chlorobi group; Bf, Firmicutes; l2, Iridovirus. The color code is as in Figure 2.
Figure 5Two alternative evolutionary scenarios for the virophages. A, Mavirus/polinton-like ancestor. B, Sputnik/OLV-like ancestor. The genes are denoted as in Figure 1.
Figure 6Phylogenetic trees of Mavirus genes shared with Polintons but not with other virophages. A, B family DNA polymerase. B, Catalytic domain of RVE integrase. Branches with the bootstrap support (expected-likelihood weights) less than 0.5 were collapsed. Sequences marked with an astericks (*) were taken from Repbase [20]. For other sequences, the species name abbreviation and the GenBank identification numbers are indicated; env stands for “marine metagenome.” Species abbreviations: Acypi, Acyrthosiphon pisum; Caeel, Caenorhabditis elegans; Caere, Caenorhabditis remanei; Capow, Capsaspora owczarzaki ATCC 30864; Danre, Danio rerio; Debha, Debaryomyces hansenii; Dicfa, Dictyostelium fasciculatum; Klula, Kluyveromyces lactis; Lackl, Lachancea kluyveri; Nasvi, Nasonia vitripennis; Polpa, Polysphondylium pallidum PN500; Sacko, Saccoglossus kowalevskii; Strpu, Strongylocentrotus purpuratus; Trica, Tribolium castaneum; Xentr, Xenopus (Silurana) tropicalis. Taxa abbreviations: Ea, Amoebozoa; El, Opisthokonta. The color code is as in Figure 2.
Figure 7Phylogenetic trees of Tlr1 proteins shared with transpovirons or virophages. A, Superfamily 1 helicase. B, Tlr1 6F protein. Branches with the bootstrap support (expected-likelihood weights) less than 0.5 were collapsed. Genome contexts are shown by same color scheme as on Figure 1. Sequences marked with an astericks (*) were taken from Repbase [20]. For other sequences, the species name abbreviation and the GenBank identification numbers are indicated; tr stands for transpoviron; env stands for environmental (metagenomic) sequence. Species abbreviations: Acaca, Acanthamoeba castellanii mamavirus; Acatu, Acanthocystis turfacea Chlorella virus 1; Batsp, Bathycoccus sp. RCC1105 virus BpV2; Dicdi, Dictyostelium discoideum AX4; Ectsi, Ectocarpus siliculosus; Emihu, Emiliania huxleyi virus 84; Entdi, Entamoeba dispar SAW760; Guith, Guillardia theta; Homsa, Homo sapiens; Hydma, Hydra magnipapillata; Invir, Invertebrate iridescent virus 6; Lacph, Lactococcus phage 949; Megch, Megavirus chiliensis; Megco, Megavirus courdo7; Micpu, Micromonas pusilla virus PL1; Micsp, Micromonas sp. RCC1109 virus MpV1; Mimle, Mimivirus lentille; MouMo, Moumouvirus Monve; OrgLa, Organic Lake phycodnavirus 1; Ostlu, Ostreococcus lucimarinus virus OlV1; Ostta, Ostreococcus tauri virus RT-2011; Ostvi, Ostreococcus virus OsV5; Parbu, Paramecium bursaria Chlorella virus NY2A; Phagl, Phaeocystis globosa virus 12T; Polpa, Polysphondylium pallidum PN500; Wisir, Wiseana iridescent virus. Taxa abbreviations: E8, stramenopiles; Ea, Amoebozoa; El, Opisthokonta; f3, Siphoviridae; l2, Iridovirus; n2, mimiviruses; q0, unassigned Phycodnaviridae; q1, Chlorovirus; q2, Coccolithovirus; q4, Prasinovirus. The color code is as in Figure 2.
Figure 8The virophage-polinton evolutionary network. Specific groups of bacteriophages that are involved in the network connections: Tectiviridae (PolB); Caudovirales (tailed bacteriophages: S3H and GIY-YIG); cyanophages (MV19 peptidase). Specific groups of NCLDV that are involved in the network connections: Irido-, Mimi-, Pox-, Marseilleviruses (Mavirus S3H helicase); Marseillevirus (OLV S3H helicase and MV19 peptidase); Phaeocystis globosa virus and Invertebrate iridescent virus 6 (GIY-YIG); Phycodnaviridae (Tlr 6F); Pox- and Asfarviridae (ATPase), and Mimiviridae (MV20 FNIP repeats).