| Literature DB >> 25499090 |
Amanda Y Chong1, Kenji K Kojima2, Jerzy Jurka3, David A Ray4,5,6, Arian F A Smit7, Sally R Isberg8,9, Jaime Gongora10.
Abstract
BACKGROUND: Crocodilians are thought to be hosts to a diverse and divergent complement of endogenous retroviruses (ERVs) but a comprehensive investigation is yet to be performed. The recent sequencing of three crocodilian genomes provides an opportunity for a more detailed and accurate representation of the ERV diversity that is present in these species. Here we investigate the diversity, distribution and evolution of ERVs from the genomes of three key crocodilian species, and outline the key processes driving crocodilian ERV proliferation and evolution.Entities:
Mesh:
Year: 2014 PMID: 25499090 PMCID: PMC4299795 DOI: 10.1186/s12977-014-0071-2
Source DB: PubMed Journal: Retrovirology ISSN: 1742-4690 Impact factor: 4.602
Figure 1Classification and likely relationships between the CrocERV groups and ERVs from other species. Maximum likelihood phylogenies were created from the RT domain of the crocodilian consensus sequences and a selection of sequences deposited in Repbase and published sequences. Part (a) is the entire RT tree, while (b) and (c) are expanded versions of ERV3 and 4, and ERV1 respectively. The complete version of (a) including sequence IDs is presented as Additional file 6: Figure S2. Symbols represent the taxa from which the sequences were derived. The numbers of CrocERV groups are shown outside of corresponding consensus sequences. Major ERV and retroviral groups, and the Gypsy elements, are indicated by brackets. Numbers within the phylogeny indicate aLRT values greater than 90%. The scale bar indicates branch length.
Figure 2Presence and absence of dUTPase and is variable between lineages within ERV3. Maximum likelihood phylogenies were created from the RT domain of Crocodilian ERV3 and ERV3 sequences deposited in Repbase. The ERV3 lineage encoding dUTPase is shaded in grey. Elements encoding a recognisable env are indicated by boxes. Symbols represent the taxa from which the sequences were derived. The numbers of CrocERV groups are shown outside of corresponding consensus sequences. Major retroviral groups are indicated brackets. Numbers within the phylogeny indicate aLRT values greater than 90%. The scale bar indicates branch length.
Typical characteristics of ERV classes
|
|
|
|
| |
|---|---|---|---|---|
| TSD length | 4 bp | 6 bp | 5 bp | 5 bp |
| Zinc-finger motifs | 1-2 | 2 | Absent | Absent |
| GPF/Y motif or equivalent | Present | Present | Absent | Absent |
| dUTPase | Absent | Proab | Polb | Absent |
aNon-primate lentiviral ERVs encode dUTPase within pol.
bSome lineages may have lost dUTPase [9].
Figure 3Reconstructed KIT-ligand proteins from CrocERV29 and a simplified diagram of the provirus. Maximum Likelihood phylogenies and sequence alignments were created from the reconstructed KIT-ligand proteins encoded by one lineage within CrocERV29. ERV IDs are provided within the tree and the alignment (see also Additional file 7). Four letter sequence names indicate the crocodilian species (‘Amis’ , A. mississippiensis; ‘Asin’ , A. sinensis; ‘Cpor’ , C. porosus; ‘Ggan’ , G. gangeticus). Shaded columns indicate the positions of indels within the alignment. Numbers within the tree indicate statistical support for the branches and the scale bars indicate branch length. Proviral structure is not to scale.
Figure 4Reconstructed nectin3 proteins from CrocERV31 and a simplified diagram of the provirus. Maximum Likelihood phylogenies and sequence alignments were created from the reconstructed nectin3 proteins encoded by one lineage within CrocERV31. ERV IDs are provided within the tree and the alignment (see also Additional file 8). ‘Amis’ stands for A. mississippiensis and ‘Asin’ for A. sinensis. Shaded columns indicate the positions of indels within the alignment. Numbers within the tree indicate statistical support for the branches and the scale bars indicate branch length. Proviral structure is not to scale.
Estimated ERV content based on retroviral chains, and a comparison with previous estimates and other species
|
|
|
|
|
|
|---|---|---|---|---|
|
| American alligator | 0.25% | 1.88% | [ |
|
| Saltwater crocodile | 0.26% | 1.63% | [ |
|
| Gharial | 0.14% | 1.22% | [ |
|
| Green anole | 3.00% | [ | |
|
| European cattle | 0.36% | 4.29% | [ |
|
| Dog | 0.15% | [ | |
|
| Zebrafish | 0.80% | [ | |
|
| Chicken | 0.20% | 2.90% | [ |
|
| Human | 0.80% | 8.29% | [ |
|
| Opossum | 2.00% | 10.64% | [ |
|
| Mouse | 2.00% | 9.22% | [ |
|
| Western clawed frog | 0.12% | [ |