| Literature DB >> 29484202 |
Yusuke Inoue1, Masahiko Kumagai1, Xianbo Zhang2, Tomonori Saga1, Deshou Wang2, Akihiko Koga3, Hiroyuki Takeda1.
Abstract
BACKGROUND: Endogenous viral elements play important roles in eukaryotic evolution by giving rise to genetic novelties. Herpesviruses are a large family of DNA viruses, most of which do not have the ability to endogenize into host genomes. Recently, we identified a novel type of endogenous herpesvirus, which we named "Teratorn", from the medaka (Oryzias latipes) genome, in which the herpesvirus is fused with a piggyBac-like DNA transposon, forming a novel mobile element. Teratorn is a unique herpesvirus that retains its viral genes intact and has acquired the endogenized lifestyle by hijacking the transposon system. However, it is unclear how this novel element evolved in the teleost lineage and whether fusion of two mobile elements is a general phenomenon in vertebrates.Entities:
Keywords: Endogenous viral elements; Herpesvirus; Teleosts; Transposon; piggyBac
Year: 2018 PMID: 29484202 PMCID: PMC5822658 DOI: 10.1186/s40851-018-0089-8
Source DB: PubMed Journal: Zoological Lett ISSN: 2056-306X Impact factor: 2.836
Fig. 1Teratorn-like viruses are widely distributed in teleost fish genomes (a) Result of a tblastn search for 13 herpesvirus core genes of medaka Teratorn against publicly available genome data of teleost fish species. Species that appear to contain Teratorn-like virus (> 8 of the 13 herpesvirus core genes; E-value < 10− 3) are highlighted in blue. The phylogenetic tree was constructed by Bayesian inference, based on the concatenated nucleotide sequence of 17 host genes (Betancur-R. R. et al., 2013 [39]). Species in which Teratorn-like viruses are adjacent to a piggyBac-like transposase gene are marked by magenta squares (solid, fusion for all copies; outlined, co-localization for only some of the copies). b Maximum-likelihood tree based on the concatenated amino acid sequences of five herpesvirus genes (major capsid protein, capsid triplex protein, DNA helicase, DNA polymerase and DNA packaging terminase) from Teratorn-like viruses in teleosts and exogenous alloherpesvirus species. Teratorn-like viruses are depicted in blue. Elements that harbor a piggyBac-like transposase gene are marked by magenta squares. For the designated numbers on the right (1, 2, 3), see Fig. 4c and main text. The scale bars represent the number of substitutions per site
tblastn of alloherpesvirus core genes of medaka Teratorn against teleost genomes
| Species | pol | hel | pri | mcp | tri | mem | ter | pro | 37 | 54 | 56 | 60 | 64 | No. of genes |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
| ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | 13 |
|
| ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | 13 |
|
| ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | 13 |
|
| ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | 13 |
|
| ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | 13 |
|
| ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | 13 |
|
| ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | 13 |
|
| ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | 13 |
|
| ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | 13 |
|
| ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | 13 |
|
| ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | 13 |
|
| ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | 13 |
|
| ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | 13 |
|
| ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | 13 |
|
| ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | 13 |
|
| ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | 13 |
|
| ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | 12 | |
|
| ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | 11 | ||
|
| ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | 11 | ||
|
| ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | 10 | |||
|
| ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | 9 | ||||
|
| ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | 8 | |||||
|
| ◎ | ◎ | ◎ | ◎ | ◎ | ◎ | 6 | |||||||
|
| ◎ | ◎ | ◎ | ◎ | ◎ | 5 | ||||||||
|
| ◎ | ◎ | ◎ | ◎ | ◎ | 5 | ||||||||
|
| ◎ | ◎ | ◎ | ◎ | ◎ | 5 | ||||||||
|
| ◎ | ◎ | ◎ | ◎ | 4 | |||||||||
|
| ◎ | ◎ | ◎ | ◎ | 4 |
Abbreviations: pol DNA polymerase, hel DNA helicase, pri primase mcp major capsid protein, tri capsid triplex protein, mem membrane protein, ter DNA packaging terminse, pro capsid maturation protease; 37, ORF37 of Ictalurid herpesvirus 1; 54, ORF54; 56, ORF56; 60, ORF60; 64, ORF64
Characteristics of Teratorn-like viruses within teleost genomes
| Species | Intactness of ORFs | Copy No. /haploid | Validity of Genomic integration | Terminal sequences | Subtypes | |
|---|---|---|---|---|---|---|
|
| intact | ~ 8 | integrated (contig-mediated link between | Some | TIRs of | |
|
| degraded | – | integrated (ORF degradation) | – | Unidentified (degradation of | |
|
| intact | 1–5 | integrated (contig-mediated link between | Some | Unidentified (small number of blast hits) | |
|
| partially degraded | ~ 8 | integrated (contig-mediated link between | – | Unidentified (small number of blast hits) | |
|
| intact | – | unkonwn | – | Unidentified (small number of blast hits) | |
|
| unknown (contigs too short) | – | unknown | – | Unidentified (contigs too short) | |
|
| intact | ~ 25, ~ 5 | integrated (BAC sequencing) | Fused | TIRs of | 2 subtypes |
|
| intact | 1–2, ~ 4 | likely integrated (scaffold-mediated link between | some | Unidentified (gaps between | 2 subtypes |
|
| intact | 1–2 | unknown | Consistent link between | Unidentified (small number of blast hits) | 2 subtypes? |
|
| intact | – | unknown | – | Unidentified (contigs too short) | 2 subtypes |
|
| intact | – | likely integrated (scaffold-mediated link between | – | Unidentified (small number of blast hits) | |
|
| partially degraded | ~ 12, ~ 2 | integrated (fosmid sequencing, scaffold-mediated link between | Consistent link between | Subtype 1: TIRs of | 2 subtypes |
|
| intact | ~ 2 | likely integrated (scaffold-mediated link between | – | Unidentified (small number of blast hits) | |
|
| intact | ~ 1 | unknown | – | Unidentified (small number of blast hits) | |
|
| unknown (contigs too short) | – | unknown | – | Unidentified (contigs too short) | at least 2 subtypes? |
|
| unknown (contigs too short) | – | unknown | – | Unidentified (contigs too short) | |
|
| intact | – | likely integrated(scaffold-mediated link between | – | Unidentified (small number of blast hits) | |
|
| degraded | – | integrated (ORF degradation) | some | Unidentified (degradation of | |
|
| unknown (contigs too short) | – | unknown | – | Unidentified (contigs too short) | |
|
| intact | ~ 18 | likely integrated (scaffold-mediated link between | Consistent link between | Unidentified (gaps between | |
|
| intact (subtype2), degraded (subtype1) | ~ 0.5, ~ 2 | integrated (contig-mediated link between | Consistent link between | Unidentified (small number of blast hits) | 2 subtypes |
|
| degraded | – | integrated (contig-mediated link between | Consistent link between | Unidentified (small number of blast hits) | 2 subtypes |
Fig. 4Phylogenetic comparison between piggyBac-like elements and herpesvirus-like genes (a) A Maximum-likelihood tree based on the nucleotide sequences of piggyBac-like transposase genes inside Teratorn-like viruses is shown. General time reversible model was used as the substitution model, and evolutionary rate differences among sites was modeled by discrete gamma distribution. A total of 1518 positions were used in the final dataset. b A maximum-likelihood tree based on the concatenated nucleotide sequences of five herpesvirus genes (DNA polymerase, DNA helicase, terminase, major capsid protein, membrane glycoprotein) is shown. General time reversible model was used as the substitution model, and evolutionary rate differences among sites was modeled by discrete gamma distribution. A total of 13,683 positions were used in the final dataset. c A maximum-likelihood tree based on the amino acid sequences of all annotated piggyBac-like transposase genes in teleosts is shown. Expansion of the clade surrounded by the blue square is shown on the right. Magenta indicates piggyBac-like transposase genes inside Teratorn-like viruses. JTT model was used as substitution model, and evolutionary rate differences among sites were not modeled. The bar represents the number of substitutions pre site. A total of 324 positions were used in the final dataset. Note the existence of three clusters containing piggyBac-like elements fused with Teratorn-like viruses, the phylogeny of which is highly correlated with that of Teratorn-like viruses (see Fig. 1b). Sequences used for the phylogenetic tree in (a) are marked by magenta circles. The scale bars represent the number of substitutions per site
Fig. 2Teratorn-like viruses encode a series of herpesvirus genes a A homology plot of Teratorn-like virus sequences compared to medaka Teratorn subtype 1, visualized by VISTA. Blue and white regions indicate coding and non-coding regions, respectively. Colored boxes in the two bars above the histograms indicate the positions of forward- and reverse-oriented predicted genes of medaka Teratorn subtype 1 (magenta, piggyBac-like transposase gene; blue, herpesvirus-like genes; orange, cellular homologs; gray, unannotated genes). All sequences displayed in this plot are the same as those in (b). b Structures of Teratorn-like viruses in several teleost fish species. Conserved synteny blocks are depicted by arrows of the same color. Magenta arrows indicate the piggyBac-like transposase gene. Sources of each sequence are described on the right. These sequences were (1) extracted from contigs or scaffolds (M. mola, M. albus, S. salar, O. kisutch, N. brichardi, N. furzeri, A. citrinellus, A. limnaeus and A. polyacanthus), or (2) reconstructed by conjugating multiple contigs (subtype 2 Teratorn of O. latipes, L. crocea and O. niloticus) or BAC sequencing (subtype 1 Teratorn of O. latipes)
Fig. 3Teratorn-like viruses exist in multiple copies in host genomes Estimated copy number of Teratorn-like viruses in the genomes of some teleost fishes. Copy number was estimated by mapping of Illumina short read data against reference genome data, calculating the coverage at each nucleotide position, and then dividing the average coverage value of herpesvirus core genes by the average coverage of the coding region of host genes
Fig. 5Model: piggyBac-herpesvirus fusion resulting from virus-mediated transfer of transposons Fusion of piggyBac-like elements and Teratorn-like viruses could have originated from virus-mediated transfer of transposons across teleost fish species. Since herpesviruses invade the nucleus of host cells to establish latent infection, transfer of transposons between the host genome and virus DNA is theoretically possible. In some cases, rearrangement might occur within the virus genome to form a genuine piggyBac-herpesvirus fusion, making it possible for the virus to integrate and propagate in the host genome