| Literature DB >> 16092962 |
Patric Jern1, Göran O Sperber, Jonas Blomberg.
Abstract
BACKGROUND: Endogenous retroviral sequences (ERVs) are integral parts of most eukaryotic genomes and vastly outnumber exogenous retroviruses (XRVs). ERVs with a relatively complete structure were retrieved from the genetic archives of humans and chickens, diametrically opposite representatives of vertebrate retroviruses (over 3300 proviruses), and analyzed, using a bioinformatic program, RetroTector, developed by us. This rich source of proviral information, accumulated in a local database, and a collection of XRV sequences from the literature, allowed the reconstruction of a Pol based phylogenetic tree, more extensive than previously possible. The aim was to find traits useful for classification and evolutionary studies of retroviruses. Some of these traits have been used by others, but they are here tested in a wider context than before.Entities:
Mesh:
Substances:
Year: 2005 PMID: 16092962 PMCID: PMC1224870 DOI: 10.1186/1742-4690-2-50
Source DB: PubMed Journal: Retrovirology ISSN: 1742-4690 Impact factor: 4.602
Detected ERV structural traits in the human and chicken genomes
| - | - | |||||||||||
| gg01 | alpha | 34 | 1 | 1 | 2 | 1 | 0 | 0 | 0 | |||
| gg01 | alpha-beta | 67 | 2 | 0 | 3 | 1 | 21 | 0 | 0 | |||
| gg01 and hg16 | beta | II | 582 | 18 | 14 | 22 | 27 | 363 | 68 | 0 | ||
| gg01 and hg16 | gamma | I | 2069 | 14 | 13 | 32 | 43 | 0 | 0 | 264 | ||
| gg01 and hg16 | delta | (-)6 | (-) | (-) | (-) | (-) | (-) | (-) | (-) | (-) | (-) | |
| gg01 and hg16 | epsilon | n.d.7 | n.d. | n.d. | n.d. | n.d. | n.d. | n.d. | n.d. | n.d. | n.d. | |
| gg01 and hg16 | lenti | (-) | (-) | (-) | (-) | (-) | (-) | (-) | (-) | (-) | (-) | |
| gg01 and hg16 | spuma-like | III | 193 | n.d. | n.d. | n.d. | n.d. | n.d. | n.d. | n.d. | n.d. | n.d. |
1 Human genome version 16 (hg16) and Chicken genome (gg01). Alpha and alpha-beta retroviruses were only detected in the chicken genome
2 Detected ERVs from the databank collection, with RetroTector© score >300 and recognized Pol puteins.
3 Predicted frameshift (f.s.) translation strategy between the respective putein ORFs in elements with score >1000, and no f.s. in the near 3'-end.
4 Detected dUTPase N-terminal of protease domain (dUTPasePro) in elements with score >300.
5 Detected C-terminal Pro (G-patch) and Pol (GPY/F) motifs elements with score >300.
6 (-): no delta or lentiviral ERVs were detected in the genomes.
7 (n.d.): not determined. The scarcity of epsilon like elements [19] and the highly mutated nature of both epsilon and spuma-like elements, in the human genome did not provide sufficient information to conduct a thorough analysis.
Figure 1Representative unrooted Pol neighbor joining (NJ) dendrogram. Unrooted Pol neighbor joining (NJ) dendrogram (500 bootstraps consensus) of the seven retroviral genera: alpha-, beta-, gamma-, delta-, epsilon-, lenti- and spuma-like retroviruses. The somewhat more loosely defined (endogenous) retroviral classes are indicated in the periphery. The various host species are indicated with symbols next to each taxonomic unit. The novel sequences are named according to their chromosomal positions within respective genomes. (hg15 and 16: Human genome; gg01: Chicken genome and pt01: chimpanzee genome). The two pt01 sequences were unique to chimpanzee and not found in humans [Jern et al. submitted].
Figure 2Structural traits projected onto the Pol dendrogram. The pol dendrograms in panels A to D are all derived from figure 1. A. The number of recognized Gag NC zinc finger motifs within respective genera. Detections of two NC zinc fingers are marked in light grey for all genera to the right from deltaretroviruses to betaretroviruses, and also for the gammaretroviral HERV-H. The remaining gammaretroviral elements (dark grey) had one NC zinc finger B. Presence of dUTPase is highlighted in grey. The non-primate lentiviral dUTPasePolA (dark grey) is found within Pol and the dUTPasePro (light grey) are found N-terminal of Pro. The dUTPasePro appears to occasionally have been lost, indicated by the two uncolored intermediate chicken and python ERVs. dUTPasePolB. of foamy viruses is not indicated. C. Nucleotide biases may be useful in demarcating retroviral groups locally and the most obvious found are here highlighted. For more detail see refs [31, 40]. D. Genera with detected Pol C-terminal GPY/F motifs are marked light grey and Pro C-terminal G-patch marked in dark grey (exclusively in betaretroviruses). Some betaretroviruses missed a G-Patch and are therefore unmarked.
Figure 3Structural traits summary. Simplified view of the different genotypic traits suggested for retroviral phylogeny inference. The branch for Gypsy and Copia represent an imagined midpoint reference in the tree. The number of NC zinc fingers, presence of dUTPase (dUTPasePolB is not indicated), known accessory genes, C-terminal Pro (G-patch) and Pol (GPY/F) motifs are shown. Nucleotide bias was defined to 25 ± 5 %. (↑) shifted upwards; (↓) shifted downwards; (≈) uncertain bias. Exploration of the LTR lengths of the different groups as detected by RetroTector© are shown as boxplots. In addition, the translational strategy may be used in the phylogeny to separate the gammaretroviruses (including class I ERVs) from spuma-like elements (class III ERVs), deltaretroviruses, lentiviruses, alpharetroviruses and the betaretroviruses (class II ERVs) with respective intermediate groups. The Gypsy and Copia are not included in the translational strategy analysis.