| Literature DB >> 28039384 |
Jamie E Henzy1, Robert J Gifford2, Christopher P Kenaley1, Welkin E Johnson1.
Abstract
We have identified a retroviral envelope gene with a complete, intact open reading frame (ORF) in 20 species of spiny-rayed fishes (Acanthomorpha). The taxonomic distribution of the gene, "percomORF", indicates insertion into the ancestral lineage >110 Ma, making it the oldest known conserved gene of viral origin in a vertebrate genome. Underscoring its ancient provenence, percomORF exists as an isolated ORF within the intron of a widely conserved host gene, with no discernible proviral sequence nearby. Despite its remarkable age, percomORF retains canonical features of a retroviral glycoprotein, and tests for selection strongly suggest cooption for a host function. Retroviral envelope genes have been coopted for a role in placentogenesis by numerous lineages of mammals, including eutherians and marsupials, representing a variety of placental structures. Therefore percomORF's presence within the group Percomorpha-unique among spiny-finned fishes in having evolved placentation and live birth-is especially intriguing.Entities:
Keywords: endogenous retrovirus; evolution.; viral gene cooption
Mesh:
Substances:
Year: 2017 PMID: 28039384 PMCID: PMC5939848 DOI: 10.1093/molbev/msw262
Source DB: PubMed Journal: Mol Biol Evol ISSN: 0737-4038 Impact factor: 16.240
FTop, genomic position of percomORF. The orientation of the coding regions for auxilin exons and percomORF, and the varying lengths of the ORF-flanking intron sequences are shown along the top. Below is a graph representing the average pairwise percent nucleotide identity of each region, generated using GENEIOUS v. 6.0.4, available from http://www.geneious.com. Dark green indicates 100% identity across all sequences at a given site; green, >30%, identity; red, <30% identity. Height of graph at each position conveys the fraction of sequences with an identical nucleotide at that position. Bottom, conserved features of PercomORF. SU and TM subunits are indicated at the bottom of the figure; green vertical lines represent cysteines, with approximate positions of the CXXC motifs in SU, and the CX6CC motif in TM indicated; fusion peptide (fp), immunosuppressive domain (ISD), and transmembrane region (tm) are shown as patterned boxes; SP, signal peptide; Y, predicted N-glycan site, with parentheses at partially conserved site, and asterisk marking the “heptad stutter”; ct, cytoplasmic tail. The blue bar above TM represents the partial sequence from the stickleback that was included in a study on filoviruses in mammalian genomes (Belyi et al. 2010).
FEstimated age of percomORF. Time-calibrated phylogeny scaled to the geological time scale, adapted from Near et al. (2012) and based on 9 nuclear genes and 36 fossil age constraints. Orders in which percomORF or an empty site was found are indicated by red boxes and black boxes, respectively. Lineages that are part of the Ovalentaria clade are indicated. The S, Pg, and N (top) signify the Silurian, Paleogene, and Neogene geological periods.
FRelationship of PercomORF to other gammaretroviral-like Env proteins. A neighbor-joining tree was generated (midpoint-rooted) using GENEIOUS v. 6.0.4, based on an amino acid alignment spanning the TM coding region, excluding the cytoplasmic tail. PercomORF sequences (collapsed into a clade represented by the gray triangle) cluster with the group of gammaretroviral-like Env proteins that carry a “heptad stutter” in the N-terminal heptad repeat. The lower clade consists of gammaretroviral sequences that do not include the stutter. All of the sequences besides PercomORF occur in the context of proviruses.