| Literature DB >> 25197475 |
Benoît Piégu1, Sébastien Guizard1, Tan Yeping2, Corinne Cruaud, Arnault Couloux, Dennis K Bideshi3, Brian A Federici2, Yves Bigot1.
Abstract
Members of the family Iridoviridae are animal viruses that infect only invertebrates and poikilothermic vertebrates. The invertebrate iridoviruses 22 (IIV22) and 25 (IIV25) were originally isolated from a single sample of blackfly larva (Simulium spp., order Diptera) collected from the Ystwyth river near Aberystwyth, Wales. Recently, the genomes of IIV22 (197.7 kbp) and IIV25 (204.8 kbp) were sequenced and reported. Here, we describe the complete genome sequence of IIV22A, a variant that was isolated from the same pool of virions collected from the blackfly larva from which the IIV22 virion genome originated. The IIV22A genome, 196.5 kbp, is smaller than IIV22. Nevertheless, it contains 7 supplementary putative ORFs. Its analysis enables evaluation of the degree of genomic polymorphisms within an IIV isolate. Despite the occurrence of this IIV variant with IIV22 and IIV25 in a single blackfly larva and the features of their DNA polymerase, we found no evidence of lateral genetic transfers between the genomes of these two IIV species.Entities:
Keywords: MITE; dsDNA virus; insect; intein; invertebrate; iridoviridae; iridovirus
Year: 2014 PMID: 25197475 PMCID: PMC4149023 DOI: 10.4056/sigs.5059132
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Figure 1Transmission electron micrograph of IIV22A virions. Bar = 100 nanometers.
Genome sequencing project information
| | | |
|---|---|---|
| MIGS-31 | Finishing quality | Finished (>99%) |
| Number of contigs | 1 | |
| Assembly size | 196455-bp | |
| Assembly coverage | 61 × | |
| Total number of read used | 31811 | |
| MIGS-29 | Sequencing plate-form | 454 |
| MIGS-30 | Assemblers | Newbler version 2.3 Post-release-11.19.2009 |
| Gene calling method | Annotation protocol [ | |
| EMBL ID | HF920634 |
Genome statistics of IIV22A
| | | |
|---|---|---|
| Genome size (bp) | 196,455 | 100.00 |
| DNA G+C content (bp) | 55,204 | 28.01 |
| DNA coding region (bp) | 171,852 | 87.5 |
| Fossil genes | 4 | 100 |
| Total genes (putatively functional) | 174 | 100 |
| Protein coding genes with function prediction | 66 | 37.9 |
| Protein coding genes with orthologs in databases | 171 | 98.3 |
| Family of gene paralogs | 2 | - |
| Genes in families of paralogous genes | 24 | 13.7 |
| Non coding regions over 200 bp in length | 6,455 (11 segments) | 5.4 |
aThe total is based on either the size of the genome in base pairs or the total number of protein coding genes in the annotated genome, where applicable.
Figure 2Graphical circular map of the 196,455 bp IIV22A genome. The outer scale is numbered clockwise in bp. Circles 1 and 2 (from outside to inside) denote the coding DNA sequences (CDSs). Forward strands are in red and reverse strands in blue. The grey box in circle 3 represents the two regions that are found in inverse orientation in IIV9. Green boxes in circle 4 represents ORF-free region with a size over 200 bp. The box in orange in circle 5 represents a fossil gene. Circle 6 represents the local variations of G+C content along the genome sequence.
Figure 3Nucleotide sequence of IIV22-MITE halves found in the IIV22A genome between nucleotides 86230 to 88050 (a) and 144843 to 144662 (b). IIV22-MITE ITR at both ends are highlighted in black and typed in white. In a, the CDS082 is in italics, its start and stop codons are boxed. In b, TTAA at the insertion site is in bold type.