| Literature DB >> 25912716 |
Eneida L Hatcher1, Chunlin Wang2, Elliot J Lefkowitz3.
Abstract
To investigate gene loss in poxviruses belonging to the Chordopoxvirinae subfamily, we assessed the gene content of representative members of the subfamily, and determined whether individual genes present in each genome were intact, truncated, or fragmented. When nonintact genes were identified, the early stop mutations (ESMs) leading to gene truncation or fragmentation were analyzed. Of all the ESMs present in these poxvirus genomes, over 65% co-localized with microsatellites-simple sequence nucleotide repeats. On average, microsatellites comprise 24% of the nucleotide sequence of these poxvirus genomes. These simple repeats have been shown to exhibit high rates of variation, and represent a target for poxvirus protein variation, gene truncation, and reductive evolution.Entities:
Mesh:
Substances:
Year: 2015 PMID: 25912716 PMCID: PMC4411693 DOI: 10.3390/v7042126
Source DB: PubMed Journal: Viruses ISSN: 1999-4915 Impact factor: 5.048
Selected chordopoxvirus genomes used in these analyses.
| Genus | Abbreviation | NCBI strain | Accession no. | Intact genes | Truncated genes | Fragmented genes | Length (bp) |
|---|---|---|---|---|---|---|---|
| Avipoxvirus | |||||||
| FWPV | Fowlpox virus strain Iowa | NC_002188 | 231 | 5 | 13 | 288,539 | |
| CNPV | Canarypox virus strain ATCC VR111 | NC_005309 | 316 | 2 | 4 | 359,853 | |
| Capripoxvirus | |||||||
| SPPV | Sheeppox virus strain TU-V02127 | NC_004002 | 150 | 1 | 8 | 149,955 | |
| GTPV | Goatpox virus strain Pellor | NC_004003 | 150 | 2 | 7 | 149,599 | |
| LSDV | Lumpy skin disease virus strain Neethling 2490 | NC_003027 | 159 | 0 | 0 | 150,773 | |
| Cervidpoxvirus | |||||||
| DPV | Deerpox virus W-848-83 | NC_006966 | 172 | 0 | 1 | 166,259 | |
| Crocodylidpoxvirus | |||||||
| CRV | Crocodilepox virus strain Zimbabwe | NC_008030 | 175 | 0 | 0 | 190,054 | |
| Leporipoxvirus | |||||||
| MYXV | Myxoma virus strain Lausanne | NC_001132 | 166 | 0 | 1 | 161,773 | |
| RFV | Rabbit fibroma virus strain Kasza | NC_001266 | 161 | 0 | 4 | 159,857 | |
| Molluscipoxvirus | |||||||
| MOCV | Molluscum contagiosum virus strain subtype 1 | NC_001731 | 159 | 0 | 0 | 190,289 | |
| Orthopoxvirus | |||||||
| CPXV-Ger | Cowpox virus strain Germany 91-3 | DQ437593 | 208 | 3 | 3 | 228,250 | |
| CPXV-BR | Cowpox virus strain Brighton Red | AF482758 | 206 | 3 | 1 | 224,499 | |
| CPXV-Gri | Cowpox virus strain GRI-90 | X94355 | 209 | 3 | 0 | 223,666 | |
| HSPV | Horsepox virus strain MNR-76 | DQ792504 | 181 | 22 | 4 | 212,633 | |
| RPXV | Rabbitpox virus strain Utrecht | AY484669 | 179 | 13 | 4 | 197,731 | |
| VACV-WR | Vaccinia virus strain WR (Western Reserve) | AY243312 | 178 | 12 | 3 | 194,711 | |
| MPXV-WR | Monkeypox virus strain MPXV-WRAIR7-61; Walter Reed 267 | AY603973 | 175 | 7 | 14 | 199,195 | |
| MPXV-ZAI | Monkeypox virus strain Zaire-96-I-16 | AF380138 | 176 | 7 | 14 | 196,858 | |
| ECTV | Ectromelia virus strain Moscow | AF012825 | 172 | 21 | 7 | 209,771 | |
| TATV | Taterapox virus strain Dahomey 1968 | DQ437594 | 163 | 26 | 7 | 198,050 | |
| CMLV | Camelpox virus strain M-96 from Kazakhstan | AF438165 | 174 | 14 | 8 | 205,719 | |
| VARV-BRZ | Variola virus strain Brazil 1966 (v66-39 Sao Paulo) | DQ441419 | 162 | 18 | 12 | 188,062 | |
| VARV-SLE | Variola virus strain Sierra Leone 1969 (V68-258) | DQ441437 | 162 | 17 | 11 | 187,014 | |
| VARV-SAF | Variola virus strain South Africa 1965 (103 T'vaal, Nelspruit) | DQ441436 | 162 | 17 | 11 | 185,881 | |
| VARV-KWT | Variola virus strain Kuwait 1967 (K1629) | DQ441433 | 162 | 17 | 11 | 185,853 | |
| Parapoxvirus | |||||||
| ORFV | Orf virus strain OV-SA00 | NC_005336 | 134 | 0 | 0 | 139,962 | |
| BPSV | Bovine papular stomatitis virus strain BV-AR02 | NC_005337 | 136 | 0 | 0 | 134,431 | |
| Suipoxvirus | |||||||
| SWPV | Swinepox virus strain Nebraska 17077-99 | NC_003389 | 151 | 0 | 0 | 146,454 | |
| Yatapoxvirus | |||||||
| TANV | Tanapox virus strain Kenya | NC_009888 | 151 | 2 | 0 | 144,565 | |
| YMTV | Yaba monkey tumor virus strain Amano | NC_005179 | 139 | 2 | 0 | 134,721 | |
| Unassigned | |||||||
| Yoka poxvirus (not classified) | YKPV | Yoka poxvirus | NC_015960.1 | 164 | 12 | 5 | 175,699 |
| SQPV | Squirrel poxvirus strain Red squirrel UK | NC_022563.1 | 142 | 0 | 0 | 148,803 |
Figure 1Pairwise nucleotide % identities for the conserved central region of the poxvirus isolates used in this study. Values below the diagonal included gaps when calculating the pairwise identity, whereas values above the diagonal had all gapped columns removed before analysis. The conserved central region spans the genomic sequences from Cop-F9L (CPXV-Ger position 52,398) to Cop-A34R (CPXV-Ger position 160,094). Darker shading reflects higher levels of identity. *FWPV and CNPV, contain a genome inversion in this region, and therefore their sequences were rearranged in order to align them with the other viruses.
Comparative genomic content of early stop mutationss and microsatellites.
| Virus Genome | ESM Count | Non-intact genes | Non-intact genes/1000 bp | ESMs/non-intact gene | ESMs overlapping microsatellites | % of ESMs overlapping microsatellites | Genome microsatellite content (%) | GC% of genome | GC% of microsatellites |
|---|---|---|---|---|---|---|---|---|---|
| FWPV | 53 | 18 | 0.062 | 2.9 | 15 | 28.3 | 24.0 | 31 | 15 |
| CNPV | 22 | 6 | 0.017 | 3.7 | 9 | 40.9 | 23.5 | 30 | 22 |
| SPPV | 107 | 9 | 0.060 | 11.8 | 66 | 61.7 | 31.7 | 25 | 9 |
| GTPV | 82 | 9 | 0.060 | 9.1 | 56 | 68.3 | 31.7 | 25 | 10 |
| LSDV | 0 | 0 | 0 | - | 0 | - | 31.1 | 26 | 10 |
| DPV | 2 | 1 | 0.006 | 0.0 | 1 | 50.0 | 29.7 | 26 | 10 |
| CRV | 0 | 0 | 0 | - | 0 | - | 22.2 | 62 | 66 |
| MYXV | 2 | 1 | 0.006 | 1.0 | 1 | 50.0 | 20.8 | 44 | 35 |
| RFV | 44 | 4 | 0.025 | 11.0 | 11 | 25.0 | 21.1 | 40 | 28 |
| MOCV | 0 | 0 | 0 | - | 0 | - | 22.5 | 63 | 70 |
| CPXV-Ger | 11 | 6 | 0.026 | 1.8 | 7 | 63.6 | 22.5 | 34 | 20 |
| CPXV-BR | 3 | 4 | 0.018 | 0.8 | 1 | 33.3 | 22.9 | 33 | 20 |
| CPXV-Gri | 2 | 3 | 0.013 | 0.7 | 1 | 50.0 | 22.5 | 34 | 20 |
| HSPV | 49 | 26 | 0.122 | 1.9 | 29 | 59.2 | 22.6 | 33 | 19 |
| RPXV | 44 | 17 | 0.086 | 2.6 | 36 | 81.8 | 22.5 | 34 | 19 |
| VACV-WR | 41 | 15 | 0.077 | 2.7 | 31 | 75.6 | 22.8 | 33 | 19 |
| MPXV-WR | 104 | 21 | 0.105 | 5.0 | 66 | 63.5 | 22.7 | 33 | 19 |
| MPXV-ZAI | 95 | 21 | 0.107 | 4.5 | 56 | 58.9 | 22.6 | 33 | 19 |
| ECTV | 73 | 28 | 0.133 | 2.6 | 37 | 50.7 | 22.5 | 33 | 19 |
| TATV | 87 | 33 | 0.167 | 2.6 | 55 | 63.2 | 22.7 | 33 | 19 |
| CMLV | 65 | 22 | 0.107 | 3.0 | 37 | 56.9 | 22.7 | 33 | 19 |
| VARV-BRZ | 92 | 30 | 0.160 | 3.1 | 60 | 65.2 | 22.6 | 33 | 18 |
| VARV-SLE | 100 | 28 | 0.150 | 3.6 | 67 | 67.0 | 22.6 | 33 | 18 |
| VARV-SAF | 93 | 28 | 0.151 | 3.3 | 58 | 62.4 | 22.6 | 33 | 18 |
| VARV-KWT | 93 | 28 | 0.151 | 3.3 | 59 | 63.4 | 22.7 | 33 | 18 |
| ORFV | 0 | 0 | 0 | - | 0 | - | 22.8 | 64 | 68 |
| BPSV | 0 | 0 | 0 | - | 0 | - | 12.2 | 65 | 76 |
| SWPV | 0 | 0 | 0 | - | 0 | - | 27.1 | 28 | 12 |
| TANV | 4 | 2 | 0.014 | 1.0 | 1 | 25.0 | 30.2 | 27 | 10 |
| YMTV | 4 | 2 | 0.015 | 1.5 | 3 | 75.0 | 28.0 | 30 | 12 |
| YKPV | 47 | 17 | 0.097 | 2.8 | 21 | 44.7 | 28.8 * | 26 | 11 |
| SQPV | 0 | 0 | 0 | - | 0 | 0.0 | 23.5 | 67 | 73 |
| Total | 1140 | 379 | 0.060* | 3.4 * | 784 | 68.8 * | 24.1 | 37 * | 26 * |
* These figures are averages, not totals.
Figure 2Example of an early stop mutation (ESM) overlapping a microsatellite. For each isolate, a portion of the coding sequence for the thymidylate kinase gene (Cop-A48R) is shown. The top row is the nucleotide sequence, the amino acid translation is the middle row, and a bar is shown to indicate the gene ORF. The green bar represents the intact gene in CPXV-Ger which continues off the right of the figure, and the grey bar represents the homologous truncated gene in TATV. The pink bars highlight microsatellites consisting of a monomeric repeat. An ESM in TATV in the form of a 2 bp deletion is shown by the red bar.
Figure 3Presence of early stop mutations at microsatellites is independent of repeat content of the genome. (a) The number of ESMs which overlap or do not overlap microsatellites. Only isolates with non-intact genes and ESMs are shown; (b) The percentage of ESMs which overlap microsatellites is shown, along with the percentage of each genome which consists of microsatellites; (c) No relationship could be identified between the percentages of ESMs which overlap microsatellites and the percentage of each genome which consists of microsatellites.
Description of the microsatellite sequences identified in Poxviridae genomes. Comparative abundance of repeats is shown. The high GC viruses are shown at the bottom of the table.
| Isolate | Total number | Total number of 1, 2, 3, 4, 5, 6 bp | RA | RA of 1, 2, 3, 4, 5, 6 bp | Total length of microsatellites | RD | RD of 1, 2, 3, 4, 5, 6 bp | RA in noncoding regions | RA in coding regions |
|---|---|---|---|---|---|---|---|---|---|
| FWPV | 12063 | 5896, 894, 4110, 838, 217, 108 | 42 | 20, 3, 14, 3, 1, 0a | 69359 | 240 | 93, 20, 89, 26, 8, 5 | 49 | 40 |
| CNPV | 14465 | 6687, 1100, 5214, 999, 310, 147 | 40 | 19, 3, 14, 3, 1, 0a | 84508 | 235 | 85, 20, 91, 25, 9, 5 | 47 | 39 |
| SPPV | 8283 | 4964, 435, 2207, 461, 143, 73 | 55 | 33, 3, 15, 3, 1, 0a | 47574 | 317 | 162, 19, 93, 28, 10, 6 | 85 | 53 |
| GTPV | 8214 | 4863, 431, 2209, 481, 154, 76 | 55 | 33, 3, 15, 3, 1, 1 | 47406 | 317 | 159, 19, 92, 29, 11, 7 | 91 | 52 |
| LSDV | 8181 | 4888, 430, 2173, 463, 149, 78 | 54 | 32, 3, 14, 3, 1, 1 | 46886 | 311 | 157, 19, 91, 28, 10, 7 | 129 | 51 |
| DPV | 8620 | 4780, 583, 2490, 510, 154, 103 | 52 | 29, 4, 15, 3, 1, 1 | 49369 | 297 | 134, 23, 94, 28, 10, 8 | 81 | 49 |
| MYXV | 5780 | 2599, 373, 2174, 464, 123, 47 | 36 | 16, 2, 13, 3, 1, 0a | 33652 | 208 | 73, 15, 83, 26, 8, 4 | 88 | 33 |
| RFV | 5781 | 2683, 423, 2018, 469, 130, 58 | 36 | 17, 3, 13, 3, 1, 0a | 33800 | 211 | 76, 17, 78, 25, 9, 6 | 62 | 34 |
| CPXV-Ger | 8648 | 3650, 704, 3307, 698, 173, 116 | 38 | 16, 3, 14, 3, 1, 1 | 51415 | 225 | 73, 20, 91, 27, 8, 7 | 62 | 35 |
| CPXV-BR | 8656 | 3758, 716, 3178, 685, 172, 147 | 39 | 17, 3, 14, 3, 1, 1 | 51443 | 229 | 76, 20, 88, 27, 8, 9 | 57 | 36 |
| CPXV-Gri | 8466 | 3613, 691, 3202, 668, 178, 114 | 38 | 16, 3, 14, 3, 1, 1 | 50251 | 225 | 73, 20, 89, 27, 8, 7 | 53 | 36 |
| HSPV | 8205 | 3662, 632, 3012, 643, 158, 98 | 39 | 17, 3, 14, 3, 1, 0a | 48109 | 226 | 79, 19, 88, 27, 8, 6 | 45 | 37 |
| RPXV | 7570 | 3370, 584, 2766, 596, 154, 100 | 38 | 17, 3, 14, 3, 1, 1 | 44447 | 225 | 78, 19, 87, 27, 8, 7 | 44 | 37 |
| VACV-WR | 7586 | 3506, 588, 2664, 574, 162, 92 | 39 | 18, 3, 14, 3, 1, 0a | 44323 | 228 | 83, 19, 85, 26, 9, 6 | 45 | 37 |
| MPXV-WR | 7645 | 3281, 643, 2854, 626, 155, 86 | 38 | 16, 3, 14, 3, 1, 0a | 45187 | 227 | 75, 20, 89, 28, 8, 6 | 44 | 37 |
| MPXV-ZAI | 7540 | 3262, 617, 2830, 589, 157, 85 | 38 | 17, 3, 14, 3, 1, 0a | 44559 | 226 | 75, 20, 90, 26, 9, 6 | 46 | 37 |
| ECTV | 8018 | 3519, 667, 2942, 651, 153, 86 | 38 | 17, 3, 14, 3, 1, 0a | 47220 | 225 | 76, 20, 87, 28, 8, 6 | 45 | 36 |
| TATV | 7559 | 3273, 582, 2880, 575, 157, 92 | 38 | 17, 3, 15, 3, 1, 0a | 44943 | 227 | 75, 19, 91, 25, 9, 8 | 42 | 37 |
| CMLV | 7900 | 3446, 644, 2945, 607, 159, 99 | 38 | 17, 3, 14, 3, 1, 0a | 46688 | 227 | 77, 20, 89, 26, 8, 8 | 44 | 37 |
| VARV-BRZ | 7305 | 3294, 558, 2665, 564, 146, 78 | 39 | 18, 3, 14, 3, 1, 0a | 42513 | 226 | 80, 19, 88, 26, 8, 5 | 43 | 38 |
| VARV-SLE | 7241 | 3237, 554, 2667, 561, 145, 77 | 39 | 17, 3, 14, 3, 1, 0a | 42251 | 226 | 79, 19, 89, 26, 8, 5 | 43 | 37 |
| VARV-SAF | 7214 | 3234, 553, 2650, 556, 143, 78 | 39 | 17, 3, 14, 3, 1, 0a | 42078 | 226 | 79, 19, 89, 26, 8, 5 | 43 | 37 |
| VARV-KWT | 7228 | 3238, 553, 2658, 557, 144, 78 | 39 | 17, 3, 14, 3, 1, 0a | 42140 | 227 | 79, 19, 89, 26, 8, 5 | 43 | 38 |
| SWPV | 6642 | 3001, 672, 2296, 467, 205, 74 | 45 | 20, 5, 16, 3, 1, 1 | 39737 | 271 | 96, 31, 98, 29, 16, 7 | 68 | 44 |
| TANV | 7746 | 4835, 371, 1929, 415, 117, 79 | 54 | 33, 3, 13, 3, 1, 1 | 43730 | 302 | 162, 17, 83, 26, 8, 7 | 67 | 52 |
| YMTV | 6703 | 4184, 322, 1668, 379, 93, 57 | 50 | 31, 2, 12, 3, 1, 0a | 37689 | 280 | 150, 15, 77, 25, 7, 5 | 63 | 48 |
| YKPV | 8517 | 3902, 821, 2973, 554, 180, 87 | 48 | 22, 5, 17, 3, 1, 0a | 50540 | 288 | 103, 31, 107, 29, 11, 7 | 60 | 47 |
| CRV | 6626 | 1711, 640, 3533, 460, 139, 145 | 35 | 9, 3, 19, 2, 1, 1 | 42209 | 222 | 40, 24, 119, 22, 8, 10 | 62 | 34 |
| MOCV | 6456 | 1475, 1274, 2960, 454, 130, 163 | 34 | 8, 7, 16, 2, 1, 1 | 42852 | 225 | 34, 52, 98, 21, 7, 12 | 41 | 32 |
| ORFV | 4959 | 1123, 748, 2540, 335, 94, 119 | 35 | 8, 5, 18, 2, 1, 1 | 31940 | 228 | 35, 38, 116, 22, 7, 11 | 30 | 36 |
| BSPV | 2404 | 1123, 522, 240, 318, 80, 121 | 18 | 8, 4, 2, 2, 1, 1 | 16437 | 122 | 36, 27, 20, 22, 7, 11 | 34 | 17 |
| SQPV | 5565 | 1423, 393, 3159, 337, 85, 168 | 37 | 10, 3, 21, 2, 1, 1 | 34911 | 235 | 41, 18, 135, 20, 6, 14 | 56 | 36 |
The RA of 6bp microsatelites is closer to 0 than to 1, but is not equal to 0.
Figure 4Relative abundance and density of microsatellites. RA for (a) all microsatellites and for (b) each motif length measured (1 bp to 6 bp) for each isolate. RD for (c) all microsatellites and for (d) each motif length measured for each isolate. The high GC viruses are shown at the right end of each figure.
Figure 5Distribution of microsatellites across genomes. The number of microsatellites is shown for bins of 2000 bp. The number of bins reflects the length of each genome. The isolates shown are (a) CNPV; (b) CPXV-Ger; (c) GTPV; (d) ORFV; and (e) VARV-BRZ.
Figure 6Genome length is not correlated with microsatellite abundance or density. (a) RA and (b) RD are shown for each isolate, plotted versus genomic length.
Figure 7Correlation of microsatellite GC content and genome GC content.