| Literature DB >> 21375749 |
Matthieu Legendre1, Sébastien Santini, Alain Rico, Chantal Abergel, Jean-Michel Claverie.
Abstract
BACKGROUND: Mimivirus, a giant dsDNA virus infecting Acanthamoeba, is the prototype of the mimiviridae family, the latest addition to the family of the nucleocytoplasmic large DNA viruses (NCLDVs). Its 1.2 Mb-genome was initially predicted to encode 917 genes. A subsequent RNA-Seq analysis precisely mapped many transcript boundaries and identified 75 new genes.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21375749 PMCID: PMC3058096 DOI: 10.1186/1743-422X-8-99
Source DB: PubMed Journal: Virol J ISSN: 1743-422X Impact factor: 4.099
Figure 1Flow chart of the Mimivirus genome correction pipeline. The upper panel illustrates the correction procedure and the lower panel the annotation method. Colors are used for clarity: datasets are in purple, genomes are in green, sequence manipulations (mapping, duplicate removal, or modifications) are in yellow, computation steps are in blue and genes in red. The upper left graph represents the decrease in substitutions (in red) and indels (in black) identified during the iterative genome correction process, together with the increase in the total number of reads (in green) mapped to genome.
| Genomic position | Gene | Gene annotation | Codon (SNP position in bold) | Reference allele | Reference allele coverage (%) | Second allele | Second allele coverage (%) | Reference encoded AA | Second allele encoded AA |
|---|---|---|---|---|---|---|---|---|---|
| 2746 | L1c | Uncharacterized probable non-coding RNA gene | - | C | 86.6 | T | 13.4 | - | - |
| 5402 | L3 | Uncharacterized protein | G | 78.0 | A | 22.0 | E | K | |
| 9911 | L6 | Uncharacterized protein | GT | A | 74.2 | G | 25.8 | V | V |
| 22248 | R13 | Uncharacterized protein | TA | T | 83.7 | G | 16.3 | Y | * |
| 28580 | L18 | Putative sel1-like repeat-containing protein | A | 76.9 | T | 23.1 | I | F | |
| 47300 | L37 | Putative KilA-N domain-containing protein | A | 86.3 | G | 13.7 | I | V | |
| 54207 | L42 | Putative ankyrin repeat protein | A | 63.8 | G | 36.2 | L | V | |
| 97232 | L77b | Uncharacterized protein | - | C | 88.3 | T | 11.7 | A | V |
| 166952 | R135 | Putative GMC-type oxidoreductase | GA | T | 87.0 | C | 13.0 | D | D |
| 322426 | L254 | Heat shock protein 70 homolog | AT | T | 88.9 | A | 11.1 | I | I |
| 328586 | R260 | DnaJ-like protein | T | 81.3 | G | 18.8 | F | V | |
| 329434 | R261 | Uncharacterized protein | CA | A | 85.7 | C | 14.3 | Q | H |
| 399891 | R313 | Ribonucleoside-diphosphate reductase large subunit | A | 83.9 | C | 16.1 | I | L | |
| 440978 | R343 | Probable ribonuclease 3 | T | 89.9 | A | 10.1 | W | R | |
| 483113 | R367 | Uncharacterized protein | A | A | 86.2 | T | 13.8 | K | I |
| 504876 | - | - | - | T | 88.0 | G | 12.0 | - | - |
| 601715 | L454 | Uncharacterized protein | A | T | 87.3 | C | 12.7 | I | T |
| 649432 | L485 | Uncharacterized protein | GA | A | 88.9 | C | 11.1 | E | D |
| 655506 | L490 | Uncharacterized protein | A | G | 85.1 | T | 14.9 | T | I |
| 734179 | R547 | Uncharacterized protein | A | 84.7 | C | 15.3 | N | H | |
| 736530 | R549b | Uncharacterized probable non-coding RNA gene | - | T | 88.0 | C | 12.0 | - | - |
| 787617 | L594 | Uncharacterized protein | A | A | 73.5 | C | 26.5 | K | T |
| 918583 | R699 | Uncharacterized protein | AA | A | 87.5 | C | 12.5 | K | N |
| 939044 | R714 | Uncharacterized protein | T | T | 69.0 | G | 31.0 | F | C |
| 962204 | R735 | Uncharacterized protein | CA | A | 82.3 | C | 17.7 | Q | H |
| 1069573 | R822 | Uncharacterized protein | A | 89.2 | G | 10.8 | I | V | |
| 1170156 | R903 | Putative ankyrin repeat protein | T | T | 89.1 | G | 10.9 | F | C |
Figure 2Discovery of a component of the Mimivirus transcription apparatus. A) Mimivirus genome browser (URL: http://www.igs.cnrs-mrs.fr/mimivirus/) screenshot showing the newly discovered component of the transcription apparatus (R357b) in its genomic context. Three informative tracks are displayed: the protein coding genes, the late gene expression signals, and the gene expression data from the SOLiD™ RNA-seq experiment. Transcriptome data is shown at each genomic position (for each of the 9 samples) going from white (not expressed) to red (highly expressed) in the forward strand, and white to blue (highly expressed) in the reverse strand. B) Protein sequence alignment of the Mimivirus R357b gene and the most similar homologous sequences from the giant virus CroV and the two archea Methanocella paludicola and Ferroplasma acidarmanus.