
Mimivirus relatives in the Sargasso sea.

Elodie Ghedin, Jean-Michel Claverie.

Abstract

The discovery and genome analysis of Acanthamoeba polyphaga Mimivirus, the largest known DNA virus, challenged much of the accepted dogma regarding viruses. Its particle size (>400 nm), genome length (1.2 million bp) and huge gene repertoire (911 protein-coding genes) all blur the established boundaries between viruses and the smallest parasitic cellular organisms. Phylogenetic analyses also suggested that the Mimivirus lineage could have emerged before the three established domains of cellular organisms became distinct, triggering a debate that can only be resolved by generating and analyzing more data. The next step is therefore to seek evidence that Mimivirus is not the only representative of its kind and to determine where to look for new Mimiviridae. An exhaustive similarity search of all predicted Mimivirus proteins against all publicly available sequences identified many of their closest homologues among the Sargasso Sea environmental sequences. Subsequent phylogenetic analyses suggested that unknown large viruses, evolutionarily closer to Mimivirus than to any presently characterized species, exist in abundance in the Sargasso Sea. Their isolation and genome sequencing could prove invaluable in understanding the origin and diversity of large DNA viruses, and could shed light on the role they may have played in the emergence of eukaryotes.

Year:  2005        PMID: 16105173      PMCID: PMC1215527          DOI: 10.1186/1743-422X-2-62

Source DB:  PubMed          Journal:  Virol J        ISSN: 1743-422X            Impact factor:   4.099


Introduction

The discovery and genome sequence analysis of Mimivirus [1,2], the largest of the Nucleo-cytoplasmic Large DNA Viruses (NCLDV), challenged much of the accepted dogma regarding viruses. Its particle size (>400 nm), genome length (1.2 million bp) and extensive gene repertoire (911 protein-coding genes) all blur the established boundaries between viruses and the smallest parasitic cellular organisms such as Mycoplasma or Nanoarchaea [2]. In the universal tree of life, the Mimivirus lineage appears to define a new branch, predating the emergence of all established eukaryotic kingdoms [2]. Although this result is compatible with various hypotheses implicating ancestral DNA viruses in the emergence of eukaryotes [3-5], it requires confirmation from additional data. An urgent task is thus to establish that Mimivirus is not the sole representative of its kind (i.e. a viral counterpart to the platypus) and to provide some rational guidance as to where to begin the search for potential new Mimiviridae. Mimivirus was serendipitously discovered within Acanthamoeba polyphaga, a ubiquitous free-living amoeba prevalent in aquatic environments. Phylogenetic analysis of the most conserved genes common to all NCLDVs [6] positions Mimivirus as an independent lineage, roughly equidistant from the Phycodnaviridae (algal viruses) and Iridoviridae (predominantly fish viruses). Given the ecological affinity of these virus families for the marine environment, we examined the sequence data set gathered through environmental microbial DNA sampling in the Sargasso Sea [7] to look for possible Mimivirus relatives.

Results

By comparing Mimivirus ORFs to the Sargasso Sea sequence data set and to all other publicly available sequences, 138 (15%) of the 911 Mimivirus ORFs were found to exhibit their closest match (Blastp E-values ranging from 10^-74 to 10^-4 [8]) to environmental sequences (see Additional file 1). Even before the discovery of Mimivirus, increasingly complex large double-stranded DNA viruses had been isolated, in particular from unicellular algae. The genome analysis of these Phycodnaviruses revealed a variety of genes encoding enzymes from totally unexpected metabolic pathways [9]. Mimivirus added more unexpected genes (such as translation system components [2]) to this list. As the gene repertoire of these large viruses and the gene content of cellular organisms become increasingly comparable, we have to be cautious in the interpretation of environmental/metagenomic sequence data. To focus our study on environmental organisms most likely to be viruses, we limited further analyses to Mimivirus homologues that are members of the NCLDV core gene sets [2,6]. These core genes are subdivided into four classes from the most (class I) to the least (class IV) evolutionarily conserved [6]. Seven of the 10 Mimivirus class I core genes (L206 to R400) have their closest homologues in the Sargasso Sea data. This is also the case for 3 of the 7 class II core genes (R450-R313), 3 of the 13 class III core genes (R429-L364) and 7 of the 16 class IV core genes (L4-R301) (Table 1). Overall, 20 of the 46 Mimivirus core genes (43%) have their closest homologues in the Sargasso Sea data set. To further assess the viral nature of these unknown microbes, we studied the phylogenetic relationships between the corresponding Mimivirus proteins, their Sargasso Sea homologues, and the closest homologues in other NCLDVs (see Materials and Methods).
Figure 1a–c shows three independent phylogenetic trees computed using the MEGA3 software [10] for Mimivirus ORFs R449 (unknown function), R429 (unknown function) and L437 (putative virion packaging ATPase). Figure 1a shows that the closest environmental R449 homologues cluster with Mimivirus separately from the known phycodnaviruses, while other Sargasso Sea homologues cluster in a way suggesting the presence of a new clade distinct from Phycodnaviridae. The trees based on R429 and L437 (Fig. 1b,c) similarly suggest the presence of close Mimivirus relatives not belonging to the Phycodnaviridae or Iridoviridae clades.
Table 1

Matching Status of Mimivirus core genes (type 1 to 4).

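| ORF# | Definition | Best score in nr | Best score in DNA viruses | Best score in Sargasso Sea | Status | Reciprocal best match |
|---|---|---|---|---|---|---|
| L206 | Helicase III / VV D5 | 167-virus | 167 | 214 | Best ENV | YES |
| R322 | DNA pol (B family) extein | 207 | 167 | 238 | Best ENV | YES |
| L437 | A32 virion packaging ATPase | 169-virus | 169 | 191 | Best ENV | YES |
| L396 | VV A18 helicase | 200-virus | 200 | 187 | - | |
| L425 | Capsid protein | 119-virus | 117 | 142 | Best ENV | complex |
| R439 | Capsid protein | 164-virus | 159 | 173 | Best ENV | complex |
| R441 | Capsid protein | 137-virus | 147 | 209 | Best ENV | complex |
| R596 | E10R-Thiol oxidoreductase | 104-virus | 105 | 119 | Best ENV | YES |
| R350 | VV D6R – helicase | 170-virus | 170 | 102 | - | |
| R400 | F10L – prot. Kinase | 86-virus | 86 | 58 | - | |
| R450 | A1L-transcr factor | 52-virus | 47 | 65 | Best ENV | |
| R339 | TFII-transcr. factor | 62 | 42 | 66 | Best ENV | |
| L524 | MuT-like NTP PP-hydrolase | 40 | 38 | 39 | - | |
| L323 | Myristoylated virion prot. A | 43 | 42 | 40 | - | |
| R493 | PCNA | 92 | 87 | 154 | Best ENV | YES |
| L312 | Small Ribonucl. reduct | 341 | 338 | 310 | - | |
| R313 | Large Ribonucl. reduct | 766 | 741 | 740 | - | |
| R429 | PBCV1-A494R-like | 152-virus | 152 | 216 | Best ENV | YES |
| L37 | BroA, KilA-N | 123-virus | 124 | 65 | - | |
| R382 | mRNA-capping enz. | 86 | 78 | 166 | Best ENV | YES |
| L244 | RNA pol. sub 2 (Rbp2) | 727 | 416 | 508 | - | |
| R501 | RNA pol. sub.1 (Rpb1) | 805 | 415 | 520 | - | |
| R195 | ESV128-Glutaredoxin | 50 | 39 | 49 | - | |
| R622 | S/Y phosphatase | 75 | 73 | 65 | - | |
| R311 | CIV193R BIR domain | 68 | 44 | 51 | - | |
| L65 | Virion memb. prot | 44 | 44 | - | - | |
| R480 | Topoisomerase II | 902 | 717 | 367 | - | |
| L221 | Topoisomerase I bacterial | 528 | 35 | 516 | - | |
| R194 | Topoisomerase I pox-like | 188 | 100 | 145 | - | |
| L364 | SW1/SNF2 helicase | 70-virus | 70 | 72 | Best ENV | YES |
| L4 | N1R/P28 DNA binding prot | 123-virus | 124 | 72 | - | |
| L540 | Pre-mRNA helicase – splicing | 256 | 136 | 214 | - | |
| L235 | RNA pol subunit 5 | 69 | 38 | 50 | - | |
| R354 | Lambda-type exonuclease | 69-virus | 69 | 154 | Best ENV | YES |
| R343 | RNAse III | 129 | 112 | 131 | Best ENV | YES |
| R141 | GDP mannose 4,6-dehydratase | 294 | 68 | 252 | - | |
| L258 | Thymidine kinase | 151 | 140 | 124 | - | |
| L271 | Ankyrin repeats (66 paralogs) | 179 | 152 | 192 | Best ENV | complex |
| R325 | Metal-dependent hydrolase | 69-virus | 69 | 105 | Best ENV | YES |
| L477 | Cathepsin B | 226 | 43 | 47 | - | |
| R497 | Thymidylate synthase | 278 | 242 | 217 | - | |
| R449 | Uncharacterized prot. | 69-virus | 69 | 129 | Best ENV | YES |
| R303 | NAD-dependent DNA ligase | 270-virus | 270 | 228 | - | |
| L805 | MACRO domain | 36 | 33 | - | - | |
| R571 | Patatin-like phospholipase | 105 | 80 | 122 | Best ENV | YES |
| R301 | Uncharacterized prot. | 48-virus | 48 | 65 | Best ENV | YES |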
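
The per-class counts quoted in the Results can be re-derived from the scores in Table 1. The sketch below is illustrative only. It assumes that the "Best ENV" status means the best Sargasso Sea score exceeds the best nr score, a rule inferred from the table rather than stated by the authors. The names CORE_GENES, is_best_env and best_env_counts are introduced here, and the class boundaries follow the ORF ranges given in the text.

```python
# Scores transcribed from Table 1: (ORF, best score in nr, best score in Sargasso Sea).
# None marks a row with no reported Sargasso Sea score.
CORE_GENES = {
    "I": [("L206", 167, 214), ("R322", 207, 238), ("L437", 169, 191),
          ("L396", 200, 187), ("L425", 119, 142), ("R439", 164, 173),
          ("R441", 137, 209), ("R596", 104, 119), ("R350", 170, 102),
          ("R400", 86, 58)],
    "II": [("R450", 52, 65), ("R339", 62, 66), ("L524", 40, 39),
           ("L323", 43, 40), ("R493", 92, 154), ("L312", 341, 310),
           ("R313", 766, 740)],
    "III": [("R429", 152, 216), ("L37", 123, 65), ("R382", 86, 166),
            ("L244", 727, 508), ("R501", 805, 520), ("R195", 50, 49),
            ("R622", 75, 65), ("R311", 68, 51), ("L65", 44, None),
            ("R480", 902, 367), ("L221", 528, 516), ("R194", 188, 145),
            ("L364", 70, 72)],
    "IV": [("L4", 123, 72), ("L540", 256, 214), ("L235", 69, 50),
           ("R354", 69, 154), ("R343", 129, 131), ("R141", 294, 252),
           ("L258", 151, 124), ("L271", 179, 192), ("R325", 69, 105),
           ("L477", 226, 47), ("R497", 278, 217), ("R449", 69, 129),
           ("R303", 270, 228), ("L805", 36, None), ("R571", 105, 122),
           ("R301", 48, 65)],
}


def is_best_env(nr_score, env_score):
    """Assumed rule (inferred from Table 1): the environmental hit outscores the nr hit."""
    return env_score is not None and env_score > nr_score


def best_env_counts(core_genes):
    """Return {class: (number of Best ENV genes, number of genes)}."""
    return {
        cls: (sum(is_best_env(nr, env) for _, nr, env in rows), len(rows))
        for cls, rows in core_genes.items()
    }


counts = best_env_counts(CORE_GENES)
n_env = sum(hit for hit, _ in counts.values())
n_all = sum(total for _, total in counts.values())

for cls, (hit, total) in counts.items():
    print(f"Class {cls}: {hit} of {total} core genes best match Sargasso Sea data")
print(f"Overall: {n_env} of {n_all} ({100 * n_env / n_all:.0f}%)")
```

Under this assumed rule the tally reproduces the 7 of 10, 3 of 7, 3 of 13 and 7 of 16 counts given in the text, and the overall figure of 43%.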
Figure 1

Phylogenetic evidence of uncharacterized Mimivirus relatives. (a) Neighbor-joining (NJ) clustering (see Materials and Methods) of Mimivirus R449 ORF with its best matching (≈35% identical residues) environmental homologues (noted Sargasso1 to Sargasso6 according to their decreasing similarity) and closest viral orthologues (28% identical). (b) NJ clustering of Mimivirus R429 ORF with its best matching (≈50% identical) environmental homologues (noted Sargasso1 to Sargasso5) and closest viral orthologues (35% identical). (c) NJ clustering of Mimivirus putative virion packaging ATPase L437 with its best matching (≈45% identity) environmental homologues (Sargasso1 and Sargasso2) and closest viral orthologues (34% identical). Abbreviations: Phyco: Phycodnavirus; PBCV: Paramecium bursaria chlorella virus 1; EsV: Ectocarpus siliculosus virus; FsV: Feldmannia sp. virus; HaV: Heterosigma akashiwo virus; Irido: Iridovirus; LCDV: Lymphocystis disease virus 1; Frog: Frog virus 3; Amby: Ambystoma tigrinum stebbensi virus; Rana: Rana tigrina ranavirus; Chilo: Chilo iridescent virus. Bootstrap values larger than 50% are shown. Branches with lower values were condensed.

Another piece of evidence substantiating the existence of an unknown Mimivirus relative in the Sargasso Sea is the discovery of contigs, built from these data, that contain multiple genes highly similar to Mimivirus genes. A spectacular case is illustrated in Figure 2. Here, a 4.5 kb scaffold (see Materials and Methods) exhibits 4 putative ORFs. When compared to the whole nr database, each of them has a distinct Mimivirus ORF as its best match: thiol oxidoreductase R368 (29% identical, E-value < 10^-9), NTPase-like L377 (25% identical, E-value < 10^-20), unknown function L375 (34% identical, E-value < 10^-30), and DNA repair enzyme L687 (40% identical, E-value < 10^-62). Moreover, the gene order is conserved for three of them (R368, L375, L377). Such colinearity is rarely observed between viral genomes except for members of the same family. Unfortunately, the sequences of these genes are not conserved enough to allow the construction of informative phylogenetic trees that would include other NCLDV orthologues.
Figure 2

Organization of four Mimivirus ORF best matching homologues in a 4.5 kb environmental sequence scaffold (approximately to scale). The three colinear Mimivirus homologues are in green. Unmatched ORF extremities are indicated by dots. The two diagonal lines indicate where the two contigs are joined on the scaffold.

As of today, genes encoding capsid proteins are among the most unequivocal genes of viral origin. Except for cases of integrated proviral genomes, no cellular homologues of viral capsid proteins have ever been found. During our study, the closest homologues of Mimivirus capsid proteins were found to be capsid protein genes of environmental origin. For example, Mimivirus capsid protein R441 was found to be 48.5% identical to an unknown environmental sequence, whereas it is only 36.2% identical to the major capsid protein Vp49 of Chlorella virus CVG-1, its best match among known viruses (Figure 3). As the environmental capsid protein sequence also shares 44.5% identical residues with the CVG-1 Vp49, the corresponding uncharacterized virus appears to lie at a roughly equal evolutionary distance from the Mimiviridae and the Phycodnaviridae.
Figure 3

Partial 3-way alignment (N-terminus region) of Mimivirus capsid protein (R441) with its best matching homologues in the NR and Environmental sequence databases. The Mimivirus R441 protein shares 83/229 (36.2%) identical residues (colored in red or blue) with the major capsid protein Vp49 of Chlorella virus CVG-1 and 111/229 (48.5%) identical residues (indicated in red or green) with the N-terminus of a capsid protein from an unknown large virus sampled from the Sargasso Sea (Accession: EAD00518). On the other hand, the CVG-1 Vp49 and the Sargasso Sea sequence share 44.5% identical residues. By comparison, the CVG-1 Vp49 protein shares 72% identical residues with PBCV-1 Vp54, its best matching homologue among known phycodnaviruses.
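
The identity percentages in the Figure 3 legend follow directly from the residue counts it quotes. As a minimal sketch, the helper below recomputes them and also shows one way such counts could be obtained from a pairwise alignment. The function names, the toy sequences and the choice to skip gapped columns are assumptions made for illustration; the legend does not say how gaps were treated.

```python
def percent_identity(identical, aligned_length):
    """Percent identity from a count of identical residues and an alignment length."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    return 100.0 * identical / aligned_length


def count_identities(aligned_a, aligned_b, gap="-"):
    """Return (identical, compared) over columns where neither sequence has a gap.

    Skipping gapped columns is an assumption of this sketch.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    identical = 0
    compared = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == gap or b == gap:
            continue
        compared += 1
        if a == b:
            identical += 1
    return identical, compared


# Counts quoted in the Figure 3 legend.
r441_vs_cvg1 = percent_identity(83, 229)
r441_vs_sargasso = percent_identity(111, 229)
print(f"R441 vs CVG-1 Vp49: {r441_vs_cvg1:.1f}%")
print(f"R441 vs Sargasso Sea capsid: {r441_vs_sargasso:.1f}%")

# Toy aligned fragments (hypothetical, not taken from the paper).
toy_identical, toy_compared = count_identities("MKV-LT", "MRVALT")
print(f"Toy alignment: {toy_identical}/{toy_compared} identical")
```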


Discussion

Our results predict that DNA viruses of 0.1 to 0.8 microns in size, evolutionarily closer to Mimivirus than to any presently characterized species, exist in the Sargasso Sea. These viruses are abundant enough to have been collected by environmental sampling. It should be noted that a similar approach attempting to find relatives of two other unique NCLDVs, the African swine fever virus (the unique member of Asfarviridae) and the White spot syndrome virus, a major shrimp pathogen (the sole Nimaviridae), failed to provide convincing results (Claverie, data not shown). The identification of numerous Mimivirus-like sequences in the Sargasso Sea data is thus not simply the result of a large number of sequences being compared, but truly suggests that viruses from this clade are specifically abundant in the sampled marine environment. It is actually expected that many novel viruses will be encountered in natural waters, in which they constitute the most abundant microorganisms [11,12]. There might be as many as 10 billion virus particles per litre of ocean surface waters [13]. Interestingly, the specialized literature abounds with descriptions of large virus-like particles associated with algae (e.g. [14-16]) or various marine protists [17,18]. With the exception of Phycodnaviruses [19-21], the genomic characterization of these viruses has not been attempted. Guided by the results presented here, their isolation and genome sequencing could prove invaluable in understanding the diversity of DNA viruses and the role they may have played in the evolution of eukaryotes.

Materials and methods

The protocols used to collect Sargasso Sea environmental micro-organisms and generate DNA sequences from these samples have been described elsewhere [7]. The data analyzed here correspond to "bacteria-sized" organisms that have passed through 3 μm filters and been retained by 0.8 μm to 0.1 μm filters. Mimivirus-like particles (0.8–0.4 μm) belong in this range. Database similarity searches were performed using the Blast suite of programs [8] (default options) as implemented on the web server and as implemented at The Institute for Genomic Research. Final similarity searches were performed on the non-redundant peptide sequence databases (nr) and environmental data (env-nr) downloaded from the National Center for Biotechnology Information ftp server on March 14, 2005. To avoid missing potential better matches with annotated virus ORFs, all Mimivirus ORFs exhibiting a best match (blosum62 scoring scheme) in env-nr were also searched against all DNA virus genomes using TblastN (peptide query against translated nucleotide sequence). The comprehensive list of Mimivirus ORFs exhibiting a best match in the env-nr database is given in Additional file 1. Phylogenetic analyses were conducted using MEGA version 3.0 [10] (options: Neighbor joining, 250 pseudo-replicates, and gaps handled by pairwise deletion). Tree branches were condensed for bootstrap values <50%. Only Mimivirus ORFs with best matching homologues in DNA viruses and belonging to the nucleo-cytoplasmic large DNA virus core gene set [2,6] were analyzed in detail. These ORFs (and matching status) are listed in Table 1. Phylogenetic analyses were limited to viral homologues and environmental sequences exhibiting a reciprocal best match relationship with the corresponding Mimivirus ORF (putative orthologues) (YES in the rightmost column). The three cases (red lines in Table 1) exhibiting the best bootstrap values are shown in Figure 1.
Cases of complex relationships, for instance due to the presence of many paralogues (e.g. capsid proteins), are also indicated. Such cases of non-reciprocal best matches are frequent (i.e. the closest homologue of a Mimivirus ORF is an environmental sequence, but the latter sequence exhibits a better match with a different ORF in the nr database). Two environmental sampling contigs – contig IBEA_CTG_1979672 (AACY01022731, GI:44566181) and contig IBEA_CTG_1979673 (AACY01022732, GI:44566179) – are linked in a 4,465 bp scaffold (scaffold IBEA_SCF = 2208413) found to contain four ORFs with strong matches to Mimivirus peptides (R368, L377, L375, and L687). The three colinear ORFs (R368, L377, L375) are found on one contig while the orthologue to Mimivirus ORF L687 is found in the second contig. It is conceivable that the lack of colinearity for this fourth ORF is due to an assembly error.
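
The reciprocal best match criterion described above can be sketched in a few lines. This is a generic illustration of the idea and not the authors' pipeline: the hit tables, sequence identifiers and scores are all hypothetical, and in practice they would be parsed from Blast output.

```python
def best_hit(hits):
    """Return the subject id with the highest score from a list of (subject_id, score)."""
    if not hits:
        return None
    return max(hits, key=lambda hit: hit[1])[0]


def is_reciprocal_best_match(query, forward_hits, reverse_hits):
    """True if the best hit of `query` has `query` as its own best hit.

    forward_hits maps each query ORF to its hits in the target database;
    reverse_hits maps each target sequence to its hits in the search back.
    """
    subject = best_hit(forward_hits.get(query, []))
    if subject is None:
        return False
    return best_hit(reverse_hits.get(subject, [])) == query


# Hypothetical hit tables (identifiers and scores invented for illustration).
forward = {
    "ORF_1": [("env_A", 130), ("virus_X", 70)],
    "ORF_2": [("env_B", 210), ("virus_Y", 150)],
}
reverse = {
    "env_A": [("ORF_1", 130), ("other_protein", 80)],
    # env_B matches a paralogue of ORF_2 better than ORF_2 itself,
    # the kind of non-reciprocal case the text calls complex.
    "env_B": [("ORF_2_paralogue", 220), ("ORF_2", 210)],
}

for orf in ("ORF_1", "ORF_2"):
    status = "YES" if is_reciprocal_best_match(orf, forward, reverse) else "not reciprocal"
    print(orf, status)
```

Only pairs passing such a test would be treated as putative orthologues; the paralogue case shows why capsid proteins and ankyrin repeats end up flagged as complex rather than YES.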

Additional file 1

List of Mimivirus ORFs exhibiting a best match in the env-nr database
  19 in total

1.  A hypothesis for DNA viruses as the origin of eukaryotic replication proteins.

Authors:  L P Villarreal; V R DeFilippis
Journal:  J Virol       Date:  2000-08       Impact factor: 5.103

2.  Common origin of four diverse families of large eukaryotic DNA viruses.

Authors:  L M Iyer; L Aravind; E V Koonin
Journal:  J Virol       Date:  2001-12       Impact factor: 5.103

3.  Poxviruses and the origin of the eukaryotic nucleus.

Authors:  M Takemura
Journal:  J Mol Evol       Date:  2001-05       Impact factor: 2.395

4.  Viral eukaryogenesis: was the ancestor of the nucleus a complex DNA virus?

Authors:  P J Bell
Journal:  J Mol Evol       Date:  2001-09       Impact factor: 2.395

5.  A giant virus in amoebae.

Authors:  Bernard La Scola; Stéphane Audic; Catherine Robert; Liang Jungang; Xavier de Lamballerie; Michel Drancourt; Richard Birtles; Jean-Michel Claverie; Didier Raoult
Journal:  Science       Date:  2003-03-28       Impact factor: 47.728

6.  The 1.2-megabase genome sequence of Mimivirus.

Authors:  Didier Raoult; Stéphane Audic; Catherine Robert; Chantal Abergel; Patricia Renesto; Hiroyuki Ogata; Bernard La Scola; Marie Suzan; Jean-Michel Claverie
Journal:  Science       Date:  2004-10-14       Impact factor: 47.728

Review 7.  Viruses and viruslike particles of eukaryotic algae.

Authors:  J L Van Etten; L C Lane; R H Meints
Journal:  Microbiol Rev       Date:  1991-12

8.  Complete genome sequence and lytic phase transcription profile of a Coccolithovirus.

Authors:  William H Wilson; Declan C Schroeder; Michael J Allen; Matthew T G Holden; Julian Parkhill; Bart G Barrell; Carol Churcher; Nancy Hamlin; Karen Mungall; Halina Norbertczak; Michael A Quail; Claire Price; Ester Rabbinowitsch; Danielle Walker; Marie Craigon; Douglas Roy; Peter Ghazal
Journal:  Science       Date:  2005-08-12       Impact factor: 47.728

Review 9.  Phycodnaviridae--large DNA algal viruses.

Authors:  J L Van Etten; M V Graves; D G Müller; W Boland; N Delaroque
Journal:  Arch Virol       Date:  2002-08       Impact factor: 2.574

Review 10.  Unusual life style of giant chlorella viruses.

Authors:  James L Van Etten
Journal:  Annu Rev Genet       Date:  2003       Impact factor: 16.830
