| Literature DB >> 18070355 |
Adam Monier1, Jean-Michel Claverie, Hiroyuki Ogata.
Abstract
BACKGROUND: DNA viruses have a wide range of genome sizes (5 kb up to 1.2 Mb, compared to 0.16 Mb to 1.5 Mb for obligate parasitic bacteria) that do not correlate with their virulence or the taxonomic distribution of their hosts. The reasons for such large variation are unclear. According to the traditional view of viruses as gifted "gene pickpockets", large viral genome sizes could originate from numerous gene acquisitions from their hosts. We investigated this hypothesis by studying 67 large DNA viruses with genome sizes larger than 150 kb, including the recently characterized giant mimivirus. Given that horizontally transferred DNA often have anomalous nucleotide compositions differing from the rest of the genome, we conducted a detailed analysis of the inter- and intra-genome compositional properties of these viruses. We then interpreted their compositional heterogeneity in terms of possible causes, including strand asymmetry, gene function/expression, and horizontal transfer.Entities:
Mesh:
Year: 2007 PMID: 18070355 PMCID: PMC2211322 DOI: 10.1186/1471-2164-8-456
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Proportions of cA genes in the 67 large DNA viruses.
| Caudovirales | Enterobacteria phage RB43 | 180500 | 43.2 | 177 | 15 | 8.47 | - |
| Enterobacteria phage T4 | 168903 | 35.3 | 175 | 8 | 4.57 | - | |
| 211215 | 49.33 | 178 | 18 | 10.11 | - | ||
| 280334 | 36.83 | 278 | 27 | 9.71 | - | ||
| 156102 | 64.77 | 147 | 27 | 18.37 | - | ||
| Enterobacteria phage RB69 | 167560 | 37.66 | 160 | 11 | 6.88 | * | |
| Enterobacteria phage RB49 | 164018 | 40.44 | 160 | 12 | 7.5 | - | |
| 244835 | 42.6 | 256 | 18 | 7.03 | - | ||
| 173591 | 43.88 | 173 | 7 | 4.05 | - | ||
| 233234 | 42.78 | 228 | 13 | 5.7 | - | ||
| Bacteriophage S-PM2 | 196280 | 37.82 | 135 | 28 | 20.74 | * | |
| Cyanophage P-SSM2 | 252401 | 35.51 | 212 | 33 | 15.57 | * | |
| Cyanophage P-SSM4 | 178249 | 36.74 | 121 | 20 | 16.53 | ** | |
| 172963 | 43.91 | 173 | 10 | 5.78 | - | ||
| Bacteriophage c-st | 185683 | 26.28 | 151 | 8 | 5.3 | - | |
| Alphaherpesvirinae | Cercopithecine herpesvirus 16 | 156487 | 76.09 | 71 | 18 | 25.35 | - |
| Equid herpesvirus 1 | 150224 | 56.67 | 79 | 12 | 15.19 | - | |
| Cercopithecine herpesvirus 1 | 156789 | 74.46 | 71 | 23 | 32.39 | - | |
| Human herpesvirus 2 | 154746 | 70.39 | 72 | 21 | 29.17 | * | |
| Human herpesvirus 1 | 152261 | 68.28 | 72 | 24 | 33.33 | - | |
| Psittacid herpesvirus 1 | 163025 | 60.95 | 71 | 8 | 11.27 | - | |
| Gallid herpesvirus 2 | 174077 | 43.89 | 106 | 19 | 17.92 | * | |
| Gallid herpesvirus 3 | 164270 | 53.61 | 92 | 19 | 20.65 | *** | |
| Cercopithecine herpesvirus 2 | 150715 | 75.97 | 71 | 22 | 30.99 | * | |
| Meleagrid herpesvirus 1 | 159160 | 47.56 | 85 | 8 | 9.41 | *** | |
| Betaherpesvirinae | Pongine herpesvirus 4 | 241087 | 61.7 | 155 | 67 | 43.23 | *** |
| Human herpesvirus 6B | 162114 | 42.77 | 91 | 6 | 6.59 | - | |
| Murid herpesvirus 1 | 230278 | 58.73 | 156 | 52 | 33.33 | *** | |
| Human herpesvirus 5 strain AD169 | 230287 | 57.19 | 145 | 53 | 36.55 | *** | |
| Human herpesvirus 6 | 159321 | 42.44 | 107 | 11 | 10.28 | - | |
| Human herpesvirus 7 | 153080 | 36.22 | 78 | 8 | 10.26 | - | |
| Cercopithecine herpesvirus 8 | 221454 | 49.14 | 223 | 80 | 35.87 | - | |
| Murid herpesvirus 2 | 230138 | 61.01 | 165 | 87 | 52.73 | *** | |
| Human herpesvirus 5 strain Merlin | 235645 | 57.48 | 153 | 56 | 36.6 | *** | |
| Tupaiid herpesvirus 1 | 195859 | 66.61 | 144 | 59 | 40.97 | *** | |
| Gammaherpesvirinae | Human herpesvirus 4 | 171823 | 59.5 | 90 | 37 | 41.11 | * |
| Equid herpesvirus 2 | 184427 | 57.5 | 74 | 28 | 37.84 | - | |
| Cercopithecine herpesvirus 15 | 171096 | 61.94 | 77 | 22 | 28.57 | * | |
| Herpesvirinae Uncl. | Ostreid herpesvirus 1 | 207439 | 38.73 | 121 | 7 | 5.79 | - |
| Chordopoxvirinae | Monkeypox virus | 196858 | 33.09 | 162 | 3 | 1.85 | - |
| Camelpox virus | 205719 | 33.17 | 171 | 6 | 3.51 | ** | |
| Cowpox virus | 224499 | 33.4 | 189 | 2 | 1.06 | - | |
| Myxoma virus | 161773 | 43.56 | 147 | 5 | 3.4 | - | |
| Rabbit fibroma virus | 159857 | 39.53 | 138 | 4 | 2.9 | - | |
| Ectromelia virus | 209771 | 33.18 | 154 | 6 | 3.9 | - | |
| Variola virus | 185578 | 32.73 | 152 | 6 | 3.95 | - | |
| Molluscum contagiosum virus | 190289 | 63.36 | 139 | 43 | 30.94 | *** | |
| Canarypox virus | 359853 | 30.37 | 291 | 14 | 4.81 | - | |
| Fowlpox virus | 288539 | 30.89 | 230 | 12 | 5.22 | - | |
| Lumpy skin disease virus | 150773 | 25.91 | 136 | 4 | 2.94 | - | |
| Vaccinia virus | 194711 | 33.34 | 179 | 3 | 1.68 | - | |
| Entomopoxvirinae | 236120 | 18.27 | 208 | 15 | 7.21 | - | |
| 232392 | 17.78 | 218 | 16 | 7.34 | - | ||
| Poxvirinae Uncl. | Mule deer poxvirus | 166259 | 26.16 | 148 | 2 | 1.35 | - |
| Asfarviridae | African swine fever virus | 170101 | 38.95 | 129 | 22 | 17.05 | ** |
| Baculovirviridae | 155060 | 41.68 | 146 | 7 | 4.79 | - | |
| 158482 | 40.04 | 146 | 7 | 4.79 | - | ||
| 161046 | 57.47 | 133 | 34 | 25.56 | - | ||
| 178733 | 40.68 | 145 | 8 | 5.52 | - | ||
| Iridoviridae | Lymphocystis disease virus | 186250 | 27.23 | 127 | 7 | 5.51 | - |
| Invertebrate iridescent virus 6 | 212482 | 28.63 | 178 | 15 | 8.43 | - | |
| Mimiviridae | 1181404 | 27.96 | 910 | 83 | 9.12 | - | |
| Phycodnaviridae | 407339 | 40.18 | 416 | 44 | 10.58 | - | |
| 330743 | 39.97 | 426 | 117 | 27.46 | - | ||
| 335593 | 51.73 | 209 | 25 | 11.96 | - | ||
| Nimaviridae | Shrimp white spot syndrome virus | 305107 | 41.01 | 245 | 96 | 39.18 | - |
| Unclassified Virus | 228089 | 41.86 | 104 | 25 | 24.04 | * |
a. '-' no significance; * p ≤ 0.05; ** p ≤ 0.01; *** p ≤ 0.001.
Figure 1Difference in genomic signature distance between real and randomized data for all possible pairs of 67 LDVs. The inset figure shows the genomic signature distances between all possible pairs of 67 LDVs for both real and randomized data.
Figure 2Comparison of G+C content between LDVs and their hosts.
Figure 3Difference in genomic signature distance between real and randomized data for LDV-host pairs. The inset figure shows the genomic signature distances between the LDV-host pairs for both real and randomized data.
Figure 4Proportions of the cA genes detected in the 67 LDV genomes, compared with the genome sizes.
Figure 5Gene organization and the locations of the cA genes for herpesviruses. The phylogenetic reconstruction (on the left) is based on the DNA polymerase catalytic subunit sequences. Red and pink dots (on the right) correspond to the cA genes and the remaining genes, respectively. Green lines show orthologous gene relationships defined by BLAST reciprocal best hit.
Figure 6Functional categories of the cA genes detected in the 67 LDV genomes.