| Literature DB >> 16105173 |
Elodie Ghedin1, Jean-Michel Claverie.
Abstract
The discovery and genome analysis of Acanthamoeba polyphaga Mimivirus, the largest known DNA virus, challenged much of the accepted dogma regarding viruses. Its particle size (>400 nm), genome length (1.2 million bp) and huge gene repertoire (911 protein coding genes) all contribute to blur the established boundaries between viruses and the smallest parasitic cellular organisms. Phylogenetic analyses also suggested that the Mimivirus lineage could have emerged prior to the individualization of cellular organisms from the three established domains, triggering a debate that can only be resolved by generating and analyzing more data. The next step is then to seek some evidence that Mimivirus is not the only representative of its kind and determine where to look for new Mimiviridae. An exhaustive similarity search of all Mimivirus predicted proteins against all publicly available sequences identified many of their closest homologues among the Sargasso Sea environmental sequences. Subsequent phylogenetic analyses suggested that unknown large viruses evolutionarily closer to Mimivirus than to any presently characterized species exist in abundance in the Sargasso Sea. Their isolation and genome sequencing could prove invaluable in understanding the origin and diversity of large DNA viruses, and shed some light on the role they eventually played in the emergence of eukaryotes.Entities:
Mesh:
Substances:
Year: 2005 PMID: 16105173 PMCID: PMC1215527 DOI: 10.1186/1743-422X-2-62
Source DB: PubMed Journal: Virol J ISSN: 1743-422X Impact factor: 4.099
Matching Status of Mimivirus core genes (type 1 to 4).
| ORF# | Definition | Best score in nr | Best score in DNA viruses | Best score in Sargasso Sea | Status | Reciprocal Best match |
| L206 | Helicase III / VV D5 | 167-virus | 167 | 214 | Best ENV | YES |
| R322 | DNA pol (B family) extein | 207 | 167 | 238 | Best ENV | YES |
| L396 | VV A18 helicase | 200-virus | 200 | 187 | - | |
| L425 | Capsid protein | 119-virus | 117 | 142 | Best ENV | complex |
| R439 | Capsid protein | 164-virus | 159 | 173 | Best ENV | complex |
| R441 | Capsid protein | 137-virus | 147 | 209 | Best ENV | complex |
| R596 | E10R-Thiol oxidoreductase | 104-virus | 105 | 119 | Best ENV | YES |
| R350 | VV D6R – helicase | 170-virus | 170 | 102 | - | |
| R400 | F10L – prot. Kinase | 86-virus | 86 | 58 | - | |
| R450 | A1L-transcr factor | 52-virus | 47 | 65 | Best ENV | |
| R339 | TFII-transcr. factor | 62 | 42 | 66 | Best ENV | |
| L524 | MuT-like NTP PP-hydrolase | 40 | 38 | 39 | - | |
| L323 | Myristoylated virion prot. A | 43 | 42 | 40 | - | |
| R493 | PCNA | 92 | 87 | 154 | Best ENV | YES |
| L312 | Small Ribonucl. reduct | 341 | 338 | 310 | - | |
| R313 | Large Ribonucl. reduct | 766 | 741 | 740 | - | |
| L37 | BroA, KilA-N | 123-virus | 124 | 65 | - | |
| R382 | mRNA-capping enz. | 86 | 78 | 166 | Best ENV | YES |
| L244 | RNA pol. sub 2 (Rbp2) | 727 | 416 | 508 | - | |
| R501 | RNA pol. sub.1 (Rpb1) | 805 | 415 | 520 | - | |
| R195 | ESV128-Glutaredoxin | 50 | 39 | 49 | - | |
| R622 | S/Y phosphatase | 75 | 73 | 65 | - | |
| R311 | CIV193R BIR domain | 68 | 44 | 51 | - | |
| L65 | Virion memb. prot | 44 | 44 | - | - | |
| R480 | Topoisomerase II | 902 | 717 | 367 | - | |
| L221 | Topoisomerase I bacterial | 528 | 35 | 516 | - | |
| R194 | Topoisomerase I pox-like | 188 | 100 | 145 | - | |
| L364 | SW1/SNF2 helicase | 70-virus | 70 | 72 | Best ENV | YES |
| L4 | N1R/P28 DNA binding prot | 123-virus | 124 | 72 | - | |
| L540 | Pre-mRNA helicase – splicing | 256 | 136 | 214 | - | |
| L235 | RNA pol subunit5 | 69 | 38 | 50 | - | |
| R354 | Lambda-type exonuclease | 69-virus | 69 | 154 | Best ENV | YES |
| R343 | RNAse III | 129 | 112 | 131 | Best ENV | YES |
| R141 | GDP mannose 4,6-dehydratase | 294 | 68 | 252 | - | |
| L258 | Thymidine kinase | 151 | 140 | 124 | - | |
| L271 | Ankyrin repeats (66 paralogs) | 179 | 152 | 192 | Best ENV | complex |
| R325 | Metal-dependent hydrolase | 69-virus | 69 | 105 | Best ENV | YES |
| L477 | Cathepsin B | 226 | 43 | 47 | - | |
| R497 | Thymidylate synthase | 278 | 242 | 217 | - | |
| R303 | NAD-dependent DNA ligase | 270-virus | 270 | 228 | - | |
| L805 | MACRO domain | 36 | 33 | - | - | |
| R571 | Patatin-like phospholipase | 105 | 80 | 122 | Best ENV | YES |
| R301 | Uncharacterized prot. | 48-virus | 48 | 65 | Best ENV | YES |
Figure 1Phylogenetic evidence of uncharacterized Mimivirus relatives. (a) Neighbor-joining (NJ) clustering (see Materials and Methods) of Mimivirus R449 ORF with its best matching (≈35% identical residues) environmental homologues (noted Sargasso1 to Sargasso6 according to their decreasing similarity) and closest viral orthologues (28% identical). (b) NJ clustering of Mimivirus R429 ORF with its best matching (≈50% identical) environmental homologues (noted Sargasso1 to Sargasso5) and closest viral orthologues (35% identical). (c) NJ clustering of Mimivirus putative virion packaging ATPase L437 with its best matching (≈45% identity) environmental homologues (Sargasso1 and Sargasso2) and closest viral orthologues (34% identical). Abbreviations: Phyco: Phycodnavirus; PBCV: Paramecium bursaria chlorella virus 1; EsV: Ectocarpus siliculosus virus; FsV: Feldmannia sp. virus; HaV: Heterosigma akashiwo virus; Irido: Iridovirus; LCDV: Lymphocystis disease virus 1; Frog: Frog virus 3; Amby: Ambystoma tigrinum stebbensi virus; Rana: Rana tigrina ranavirus; Chilo: Chilo iridescent virus. Bootstrap values larger than 50% are shown. Branches with lower values were condensed.
Figure 2Organization of four Mimivirus ORF best matching homologues in a 4.5 kb environmental sequence scaffold (approximately to scale). The three colinear Mimivirus homologues are in green. Unmatched ORF extremities are indicated by dots. The two diagonal lines indicate where the two contigs are joined on the scaffold.
Figure 3Partial 3-way alignment (N-terminus region) of Mimivirus capsid protein (R441) with it best matching homologues in the NR and Environmental sequence databases. The Mimivirus R441 protein shares 83/229 (36.2%) identical residues (colored in red or blue) with the major capsid protein Vp49 of Chlorella virus CVG-1 and 111/229 (48.5%) identical residues (indicated in red or green) with the N-terminus of a capsid protein from an unknown large virus sampled from the Sargasso Sea (Accession: EAD00518). On the other hand, the CVG-1 Vp49 and the Sargasso Sea sequence share 44.5% identical residues. By comparison, the CVG-1 Vp49 protein share 72% of identical residue with PBCV-1 Vp54, its best matching homologue among known phycodnaviruses.