| Literature DB >> 18367473 |
Karin Hoelzer1, Laura A Shackelton, Colin R Parrish.
Abstract
Nucleotide composition varies greatly among DNA viruses of animals, yet the evolutionary pressures and biological mechanisms driving these patterns are unclear. One of the most striking discrepancies lies in the frequency of CpG (the dinucleotide CG, linked by a phosphate group), which is underrepresented in most small DNA viruses (those with genomes below 10 kb) but not in larger DNA viruses. Cytosine methylation might be partially responsible, but research on this topic has focused on a few virus groups. For several viruses that integrate their genome into the host genome, the methylation status during this stage has been studied extensively, and the relationship between methylation and viral-induced tumor formation has been examined carefully. However, for actively replicating viruses--particularly small DNA viruses--the methylation status of CpG motifs is rarely known and the effects on the viral life cycle are obscure. In vertebrate host genomes, most cytosines at CpG sites are methylated, which in vertebrates acts to regulate gene expression and facilitates the recognition of unmethylated, potentially pathogen-associated DNA. Here we briefly introduce cytosine methylation before reviewing what is currently known about CpG methylation in DNA viruses.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18367473 PMCID: PMC2396429 DOI: 10.1093/nar/gkn121
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Overview of vertebrate DNA methyltransferases and their functions
| DNMT | Functional role | Reference |
|---|---|---|
| De-novo DNMTs | ||
| DNMT 3a | Embryonic development, methylation of CpG sites, meiosis induction in sperm(?) | ( |
| DNMT 3b | Embryonic development, spermatogenesis(?) | ( |
| DNMT 3L | Maternal genomic imprinting, silencing of retrotransposons in spermatogonial stem cells | ( |
| Maintenance DNMTs | ||
| DNMT 1 | Cellular maintenance methylation, maintenance of imprinting, silencing of mobile elements during genomic demethylation, contribution to histone deacetylases | ( |
| DNMT 2 | Unclear. Methylation in | ( |
The enzymatic functions attributed to known DNMTs are summarized here, and some key references are provided. Some simplifications were made for the purpose of clarity, and the reader is referred to specific reviews of DNMTs, indicated in the text, for more detail. Cases where the functional role has been proposed, but not yet established conclusively, are indicated by question marks.
Frequency of TLR9 stimulatory and non-stimulatory/inhibitory sequences
GpC and CpG contents, as well as individual nucleotide and motif frequencies, were determined from the publicly available reference sequences (RefSeq) in Genbank (MatLab script available from the authors upon request). Sequence length, frequencies and percentages of individual nucleotides, GpC and CpG content, as well as the frequencies of potentially stimulating and non-stimulating motifs, are provided. Potentially stimulatory sequences are those described by Rankin et al. (120); values represent the total number of the respective motif present in the genome. The number in parentheses indicates the number of motifs expected in the sequence, given the individual nucleotide composition. This was calculated using the following formula: E(UVWXYZ) = 10−12* sequence length*(%U*%V*%W*%X*%Y*%Z), where E(UVWXYZ) = expected value of the motif UVWXYZ; U,V,W,X,Y,Z = nucleotides of which the motif consists, and %U = % of viral sequence consisting of base U. Expected values were rounded to the nearest integer. Likely non-stimulatory and potentially inhibitory sequences are those described by Krieg; values again represent the total number of the respective motif present in the genome, and expected values were calculated as describe above. ‘Total stimulatory’ or ‘total non-stimulatory’ values represents the sum of all putatively stimulatory or non-stimulatory motifs in the sequence. ‘Ratio of stimulatory/non-stimulatory sequences’ represent the fraction of the total number of stimulatory divided by the total number of non-stimulatory sequences. ‘CpG’ or ‘GpC’ represents the total number of the respective dinucleotide CpG or GpC in the viral DNA sequence. ‘CpG/GpC’ represents fraction of the number of CpG motifs divided by the number of GpC motifs in the viral DNA. ‘RefSeq’ number represent GeneBank accession number. ‘#’ no RefSeq FPV sequence available.
Figure 1.Comparison of DNA virus infection pathways. Major intracellular trafficking routes and characteristics of replication are shown for the DNA virus families discussed. The importance of methylation and immune recognition is indicated. Simplifications and generalizations were made for the purpose of clarity.
Overview of GC content, CpG frequency and methylation status of small and large DNA viruses
| Virus | Genome size | GC frequency | CpG content | Methylation status during active replication | Methylation status during latency | Host species | Effect on host methylation | |||
|---|---|---|---|---|---|---|---|---|---|---|
| Replicating | Reference | Integrated | Episomal | Reference | ||||||
| Large dsDNA viruses | ||||||||||
| Adenoviridae | 28–45 | 0.3–0.65 | 0.5–1.13 | Un/hypomethylated | ( | Methylated | – | ( | Mammal, bird | DNMT upregulation |
| Alpha-herpesvirinae | 130–150 | 0.4–0.71 | 0.9–1.17 | Un/hypomethylated | ( | – | Un/hypo- mehtylated | ( | Mammal, bird | DNMT upregulation |
| Beta-herpesvirinae | 140–240 | 0.4–0.67 | 1.0–1.25 | Unknown | N/A | – | Unknown | N/A | Mammal | Unknown |
| Gamma-herpesvirinae | 110–185 | 0.3–0.61 | 0.3–0.66 | Un/hypomethylated | ( | – | Methylated | ( | Mammal | DNMT upregulation |
| ( | ||||||||||
| Ranid herpesvirus | 220–230 | 0.5–0.55 | 0.8–0.95 | Methylated | ( | – | Unknown | N/A | Amphibian | Viral 5-cytosine methyltransferases? |
| Iridoviridae | 140–383 | 0.2–0.56 | 0.5–0.84 | Methylated | ( | – | – | – | Amphibian, fish | Viral 5-cytosine methyltransferases? |
| Poxviridae | 130–375 | 0.2–0.64 | 0.8–1.23 | Unknown | N/A | – | – | – | Mammal bird invertebrate | Unknown |
| Small dsDNA viruses | ||||||||||
| Papilloma-viridae | 7–8 | 0.4–0.54 | 0.1–0.57 | Partially methylated | ( | Methylated | – | ( | Mammal | DNMT upregulation |
| Polyoma-viridae | 5 | 0.4–0.48 | 0.05–0.78 | Un/hypomethylated | ( | Methylated | – | ( | Mammal, bird | DNMT upregulation |
| Small ssDNA viruses | ||||||||||
| Autonomous Parvoviridae | 4–6 | 0.3–0.5 | 0.3–0.71 | Unknown | N/A | – | – | – | Mammal | Unknown |
| Dependo-virinae | 4–6 | 0.4–0.58 | 0.6–1.03 | Unknown | N/A | – | – | – | Mammal | Unknown |
| Circoviridae | 2 | 0.5–0.57 | 0.4–0.87 | Unknown | N/A | – | – | – | Mammal, bird | Unknown |
| Anellovirus | 4 | 0.5 | 0.67 | Unknown | N/A | – | – | – | Human | Unknown |
The GC content, CpG content and, where known, methylation status is shown for viral families/subfamilies. Where applicable, a distinction is made between active replication and latency. If latent, the state of the genome (i.e. integrated or episomal) is specified. Preferred host species are indicated along with any known effect the virus has on host cell methylation. Where relevant, representative references are provided. Dash indicates this form is not known to occur for this virus.
aGC and CpG contents as determined by authors [here or in (83)] according to methods described therein); GC content represents the relative frequency of G and C in the sequence. CpG content represents the observed divided by the expected frequency of the dinucleotide CpG in the sequence.
bReference refers to a representative study, which focuses on one member of the virus family.
cUnpublished results, analysis done by authors as described above (83).