Identification and evolution of avian endogenous foamy viruses.

Yicong Chen1,2,3, Xiaoman Wei1,2,3, Guojie Zhang4,5,6,7, Edward C Holmes8, Jie Cui1,2.   

Abstract

A history of long-term co-divergence means that foamy viruses (family Retroviridae) provide an ideal framework for understanding virus-host evolution over extended time periods. Endogenous foamy viruses (EndFVs) are rare, and to date have only been described in a limited number of mammalian, amphibian, reptilian and fish genomes. By screening 414 avian genomes we identified EndFVs in two bird species: the Maguari Stork (Ciconia maguari) and the Oriental Stork (Ciconia boyciana). Analyses of phylogenetic relationships, genome structures and flanking sequences revealed a single origin of EndFVs in Ciconia species. In addition, the marked incongruence between the virus and host phylogenies suggested that this integration event occurred independently in birds. In sum, by providing evidence that birds can be infected with foamy viruses, we fill the last major gap in the taxonomic distribution of foamy viruses and their animal hosts.
© The Author(s) 2019. Published by Oxford University Press.

Keywords:  birds; cross-species transmission; endogenous foamy viruses; incongruence

Year:  2019        PMID: 31777663      PMCID: PMC6875641          DOI: 10.1093/ve/vez049

Source DB:  PubMed          Journal:  Virus Evol        ISSN: 2057-1577


1. Introduction

Retroviruses (family Retroviridae) are viruses of substantial medical and economic significance, as some are associated with severe infectious disease or are oncogenic (Hayward et al. 2015; Aiewsakun and Katzourakis 2017; Xu et al. 2018). Retroviruses are also of evolutionary importance as they have occasionally invaded the host germ line, leading to the generation of endogenous retroviruses (ERVs) and hence genomic ‘fossils’ (Stoye 2012; Johnson 2015, 2019). ERVs are widely distributed in vertebrates (Hayward et al. 2013; Cui et al. 2014; Hayward et al. 2015; Xu et al. 2018) and provide important insights into the origin and long-term evolution of retroviruses. However, some complex retroviruses, such as lenti-, delta- and spumaviruses, appear only rarely as endogenized forms. As well as leaving a litany of endogenous copies in host genomes, foamy viruses are of particular importance because they exhibit a history of long-term co-divergence with their vertebrate hosts (Switzer et al. 2005). Endogenous foamy viruses (EndFVs) were first discovered in sloths (Katzourakis et al. 2009), and then found in several primate genomes (Han and Worobey 2012b, 2014; Katzourakis et al. 2014). The subsequent discovery of EndFV and EndFV-like copies in fish genomes indicated that foamy viruses may have a deep evolutionary history within the vertebrates (Han and Worobey 2012a; Ruboyianes and Worobey 2016; Aiewsakun and Katzourakis 2017). Recently, three novel EndFVs were identified in reptile genomes, although there is disagreement over their origins, with some suggesting virus-host co-divergence over many millions of years (Wei et al. 2019), and others favoring cross-species transmission events (Aiewsakun et al. 2019; Wei et al. 2019). To date, no EndFVs have been identified in avian genomes.

2. Materials and methods

2.1 Genome screening and viral genome structure identification

All 147 avian genomes available in GenBank as of June 2019 (Supplementary Table S1) and 267 genomes from the ‘Bird 10K’ program were screened for EndFVs using the TBLASTN algorithm (Altschul et al. 1990) and the protein sequences of representative exogenous foamy viruses, EndFVs and endogenous foamy-like viruses (Supplementary Table S2). Significant hits were filtered using thresholds of 35% sequence identity over a 30% region and an e-value of 0.0001 (Supplementary Table S3). Viral hits within large scaffolds (>20 kb) were assumed to represent bona fide ERVs. We then extended the flanking sequence of these hits to identify the viral long terminal repeats (LTRs) using BLASTN (Altschul et al. 1990), LTR Finder (Xu and Wang 2007) and LTRharvest (Ellinghaus et al. 2008). EndFVs were identified in the genomes of the Maguari Stork (Ciconia maguari) and the Oriental Stork (Ciconia boyciana). In accordance with the nomenclature proposed for ERVs (Gifford et al. 2018), these were termed ‘ERV-Spuma.n-Cma’ and ‘ERV-Spuma.n-Cbo’, respectively (in which n represents the number of the viral sequence extracted from the host genome) (Supplementary Table S4). Putative genome structures and conserved EndFV domains were identified using BLASTP, CD-search (Marchler-Bauer and Bryant 2004; Marchler-Bauer et al. 2017) and ORFfinder (https://www.ncbi.nlm.nih.gov/orffinder/) at NCBI.
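The hit-filtering step described above can be sketched in Python as follows. This is only an illustration and not the authors' pipeline: it assumes the TBLASTN results were exported as tab-separated text with the columns qseqid, sseqid, pident, length, evalue and qlen, and it interprets "over a 30% region" as an alignment covering at least 30% of the query length, which is an assumption. The function name filter_hits and the example query and scaffold names are hypothetical.

```python
import csv
import io

# Assumed column order of the tab-separated BLAST output (an assumption,
# not taken from the paper).
FIELDS = ["qseqid", "sseqid", "pident", "length", "evalue", "qlen"]


def filter_hits(tsv_text, min_identity=35.0, min_coverage=0.30, max_evalue=1e-4):
    """Return the rows that pass the identity, coverage and e-value thresholds."""
    kept = []
    reader = csv.DictReader(io.StringIO(tsv_text), fieldnames=FIELDS, delimiter="\t")
    for row in reader:
        identity = float(row["pident"])
        # Coverage is taken here as alignment length / query length (assumption).
        coverage = int(row["length"]) / int(row["qlen"])
        evalue = float(row["evalue"])
        if identity >= min_identity and coverage >= min_coverage and evalue <= max_evalue:
            kept.append(row)
    return kept


# Hypothetical example hits; none of these values come from the paper.
rows = [
    ("SFV_Pol", "scaffold_A", "41.2", "600", "2e-30", "1143"),  # passes all filters
    ("SFV_Pol", "scaffold_B", "33.0", "600", "1e-20", "1143"),  # identity too low
    ("SFV_Pol", "scaffold_C", "50.0", "120", "1e-08", "1143"),  # coverage too low
    ("SFV_Pol", "scaffold_D", "45.0", "500", "0.01", "1143"),   # e-value too high
]
example = "\n".join("\t".join(r) for r in rows)

kept = filter_hits(example)
print([r["sseqid"] for r in kept])
```

Keeping the three thresholds as keyword arguments makes it easy to test how sensitive the hit list is to each cutoff.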

2.2 Molecular dating

ERV integration time can be approximately estimated using the relation T = (D/R)/2, in which T is the integration time (in years, reported here in millions of years), D is the number of nucleotide differences per site between the paired LTRs, and R is the genomic substitution rate (nucleotide substitutions per site, per year). We used the previously estimated neutral nucleotide substitution rate for birds (1.9 × 10^-9 nucleotide substitutions per site, per year; Zhang et al. 2014). Two full-length ERV-Spuma-Cma elements, each containing a pair of intact LTRs, were used to estimate integration time in this manner (Supplementary Table S5). We excluded ERV-Spuma.1-Cbo from this dating exercise due to its defective 5’ LTR.
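As a minimal sketch of this calculation, the Python snippet below computes D as the uncorrected proportion of differing sites (p-distance) between two pre-aligned LTRs and then applies T = (D/R)/2. Using a p-distance that ignores gapped columns is an assumption, since the distance measure is not specified in the text; the function names and the toy value of D are hypothetical and are not data from the study.

```python
BIRD_RATE = 1.9e-9  # substitutions per site per year (Zhang et al. 2014)


def p_distance(ltr5, ltr3):
    """Proportion of differing sites between two aligned LTRs, skipping gap columns."""
    if len(ltr5) != len(ltr3):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    differences = 0
    for a, b in zip(ltr5.upper(), ltr3.upper()):
        if a == "-" or b == "-":
            continue  # ignore columns containing a gap in either LTR
        compared += 1
        if a != b:
            differences += 1
    if compared == 0:
        raise ValueError("no ungapped sites to compare")
    return differences / compared


def integration_time_years(d, rate=BIRD_RATE):
    """T = (D / R) / 2; the factor of 2 reflects both LTRs diverging independently."""
    return (d / rate) / 2


# Toy alignment (hypothetical): 1 difference over 9 ungapped sites.
print(p_distance("ACGTAC-TAC", "ACGTACGTAT"))

# Hypothetical divergence of 0.0038 differences per site gives about 1 million years.
print(integration_time_years(0.0038) / 1e6, "million years")
```

Because R is expressed per year, the function returns years; dividing by 10^6 gives the value in millions of years.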

2.3 Phylogenetic analysis

To describe the evolutionary relationship of EndFVs to other representative retroviruses, sequences of the Pol (Supplementary Data Set S1) and concatenated Gag-Pol-Env proteins (Supplementary Data Set S2) were aligned using MAFFT 7.222 (Katoh and Standley 2013) and confirmed manually in MEGA X (Kumar et al. 2018). Phylogenetic trees of these data were then inferred using the maximum likelihood (ML) method in PhyML 3.1 (Guindon et al. 2010), incorporating 100 bootstrap replicates to assess node robustness. The best-fit LG + Γ + I + F model of amino acid substitution was selected for both the Pol and concatenated Gag-Pol-Env protein data sets using ProtTest (Abascal et al. 2005).

2.4 Recombination analysis

To test for recombination in these data, we: (1) compared target site duplications (TSDs) flanking the ERVs, as it has previously been shown that ERVs not flanked by the same TSDs likely arose by provirus recombination (Hughes and Coffin 2001), and (2) screened for recombination in the pol and gag-pol-env nucleotide sequences of mammalian, tuatara, amphibian, lobe-finned fish and avian FVs/EndFVs using the Recombination Detection Program 4 (Martin et al. 2015).
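The TSD comparison in step (1) can be illustrated with a short Python sketch. It assumes the provirus coordinates on the scaffold are already known (0-based, end-exclusive) and that the TSD length is supplied by the user; the default of 4 bp, the function names and the toy scaffold are all hypothetical and are not taken from the paper.

```python
def flanking_tsds(scaffold, start, end, tsd_len=4):
    """Return the k-mers immediately upstream and downstream of scaffold[start:end]."""
    if start < tsd_len or end + tsd_len > len(scaffold):
        raise ValueError("not enough flanking sequence on the scaffold")
    upstream = scaffold[start - tsd_len:start].upper()
    downstream = scaffold[end:end + tsd_len].upper()
    return upstream, downstream


def has_matching_tsd(scaffold, start, end, tsd_len=4):
    """True if the provirus is flanked by identical direct repeats of length tsd_len."""
    upstream, downstream = flanking_tsds(scaffold, start, end, tsd_len)
    return upstream == downstream


# Toy scaffold (hypothetical): flank + TSD + "provirus" + TSD + flank.
toy = "GGGG" + "ACGT" + "TGTTTTTTCA" + "ACGT" + "CCCC"
print(flanking_tsds(toy, 8, 18))     # the two 4-mers flanking the element
print(has_matching_tsd(toy, 8, 18))  # identical repeats on both sides
```

An element whose two flanking repeats differ would, following the reasoning cited above, be a candidate product of provirus recombination; the same helper can also be used to check whether elements in two species share the same TSD.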

3. Results and discussion

3.1 Discovery and characterization of endogenous foamy viral elements in avian genomes

To identify potential foamy (-like) viral elements in birds, we collected 147 available bird genomes from GenBank (Supplementary Table S1) and 267 genomes from the ‘Bird 10K’ project (Zhang et al. 2015) and performed in silico TBLASTN screening, using the amino acid sequences of representative retroviruses as queries (Supplementary Table S2). This genomic mining identified sixteen significant hits in the Maguari Stork and twelve in the Oriental Stork (Supplementary Table S3). We designated these ERV-Spuma.n-Cma and ERV-Spuma.n-Cbo, respectively (Gifford et al. 2018) (Supplementary Tables S3 and S4). We considered hits within large scaffolds (>20 kb in length) to represent bona fide ERVs. We then extended the flanking sequences of these EndFVs on both sides to search for LTRs, as these define the boundary of the viral elements. Through this analysis we discovered two full-length EndFVs in the Maguari Stork genome and one in the Oriental Stork genome. The low copy number of EndFVs found in both bird species accords with the observation that avian genomes generally harbor small numbers of endogenous viruses (Cui et al. 2014). To further elucidate the relationship between these novel avian EndFVs and other retroviruses, Pol protein sequences (>500 amino acid residues in length) were used in a phylogenetic analysis (Fig. 1). Our ML phylogenetic tree revealed that the EndFVs discovered in birds formed a close and well-supported monophyletic group within the foamy virus clade, compatible with the idea that these avian EndFVs might have a single origin. Notably, however, because they were most closely related to the EndFVs found in mammals rather than to those found in reptiles, the phylogenetic position of the avian EndFVs described here was incongruent with that of the host phylogeny (although the node associated with the tuatara EndFV was relatively poorly supported) (Fig. 2). This, and the overall rarity of EndFVs in birds, suggests that these avian EndFVs have an independent origin in birds and were not acquired through virus-host co-divergence, implying that a non-avian retrovirus jumped into birds at some point during evolutionary history. No evidence for recombination was found in these data.
Figure 1.

Phylogenetic tree of retroviruses and endogenous retroviruses, including the EndFVs found in avian genomes. The tree was inferred using amino acid sequences of the Pol gene, and rooted using the remaining retroviral taxa (excluding the spumaviruses). The newly identified viral elements are labeled in red. *Indicates the EndFV found in the Maguari Stork genome, while †denotes the EndFV from the Oriental Stork genome. The scale bar indicates the number of amino acid changes per site.

Figure 2.

Evolutionary history of foamy viruses (left) and their vertebrate hosts (right). Associations between foamy viruses and their hosts are indicated by connecting lines. The avian EndFV and its host are labeled in red. Note that the avian EndFVs are more closely related to mammalian foamy viruses than the reptilian EndFV. Scale bars indicate the number of amino acid changes per site in the viruses and the host divergence times (MYA).


3.2 Genomic structure characterization

By searching for conserved domains against the Conserved Domains Database (www.ncbi.nlm.nih.gov/Structure/cdd), we identified three typical foamy conserved domains in the three full-length avian EndFVs: (1) the Spuma virus Gag domain (pfam03276; Winkler et al. 1997), (2) the Spuma aspartic protease (A9) domain (pfam03539) that is present in all mammalian foamy virus Pol proteins (Aiewsakun and Katzourakis 2017), and (3) the foamy virus envelope protein domain (pfam03408) (Han and Worobey 2012a; Wei et al. 2019; Supplementary Fig. S2). Furthermore, we identified an open reading frame 1 (ORF1) as an accessory gene in all three full-length avian EndFV genomes. These ORF1 sequences exhibited high nucleotide similarity (>88%) with each other, although not to any known accessory genes from foamy viruses (and the ORF1 in ERV-Spuma.1-Cma was split in two, due to two insertions and a nonsense mutation; Supplementary Fig. S1). Interestingly, the internal promoter of this ORF was located between the 3’ end of the env gene and the 5’ start of the ORF1 gene. This is inconsistent with exogenous foamy viruses, whose internal promoters are located toward the 3’ end of the env gene (Campbell et al. 1994; Lochelt et al. 1995). In summary, the genomes of the new EndFVs documented here contained a pair of LTRs, although with no sequence similarity to other EndFV LTRs, and exhibited a typical spuma virus structure, with three main ORFs (gag, pol and env) and one putative additional accessory gene, ORF1 (Fig. 3; Supplementary Fig. S1).
Figure 3.

Genomic organization of exogenous foamy viruses (acc: NC_001364) and avian EndFVs. LTR, long terminal repeat; Pro, protease; RT, reverse transcriptase; RH, ribonuclease H; IN, integrase.


3.3 Vertical transmission of bird EndFVs

Surprisingly, ERV-Spuma.2-Cma and ERV-Spuma.1-Cbo shared 98% nucleotide sequence identity and contained the same deletion in the pol gene (Supplementary Fig. S3). By comparing the flanking sequences of these two EndFVs using BLASTN, we discovered that the two scaffolds containing these EndFVs (BDFF02011124.1 in the Oriental Stork and scaffold3222 in the Maguari Stork) shared 99% coverage with 98.66% sequence identity (e-value = 0.0). Furthermore, ERV insertion duplicates the target DNA fragment, resulting in TSDs that differ among ERV insertions (Hughes and Coffin 2001). We were able to identify the same TSD flanking these two avian EndFVs (Supplementary Fig. S3), confirming that they have been vertically inherited. As such, we can infer that the most recent common ancestor of the Maguari Stork and Oriental Stork, estimated to have existed between 3.2 and 13.2 million years ago (Jetz et al. 2012), carried these EndFVs. However, neither EndFVs nor any solo LTRs were present in other bird species from the same order (Pelecaniformes), including the little egret (Egretta garzetta), crested ibis (Nipponia nippon), and yellow-throated sandgrouse (Pterocles gutturalis). Clearly, the study of additional genomes sampled across the avian phylogeny is merited.

3.4 Estimation of ERV insertion times

To approximately estimate the insertion time of EndFVs in the two birds, we used the LTR-divergence method, which combines the degree of sequence divergence between the 5’ and 3’ LTRs with a known host nucleotide substitution rate (Dangel et al. 1995; Johnson and Coffin 1999). Accordingly, the two ERV-Spuma-Cma elements with intact LTR pairs were selected for time estimation (Supplementary Table S5). This analysis revealed insertion times of 3.15 and 13.95 MYA (million years ago), close to the estimated time of the common ancestor of the Maguari Stork and Oriental Stork (Jetz et al. 2012). However, the presence of multiple premature stop codons suggests the invasion was ancient, and all estimates of integration times should be treated with caution (Kijima and Innan 2010). Clearly, the discovery of additional EndFVs will shed more light on the time-scale of these retroviral integration events. In sum, we describe the presence and evolution of two novel avian EndFVs. This discovery fills the last major gap in our understanding of the taxonomic distribution of the foamy viruses and helps reveal the evolutionary interactions between retroviruses and their hosts over extended time periods.

Supplementary data

Supplementary data are available at Virus Evolution online.
References

1.  Constructing primate phylogenies from ancient retrovirus sequences.

Authors:  W E Johnson; J M Coffin
Journal:  Proc Natl Acad Sci U S A       Date:  1999-08-31       Impact factor: 11.205

2.  CD-Search: protein domain annotations on the fly.

Authors:  Aron Marchler-Bauer; Stephen H Bryant
Journal:  Nucleic Acids Res       Date:  2004-07-01       Impact factor: 16.971

Review 3.  Origins and evolutionary consequences of ancient endogenous retroviruses.

Authors:  Welkin E Johnson
Journal:  Nat Rev Microbiol       Date:  2019-06       Impact factor: 60.633

4.  MEGA X: Molecular Evolutionary Genetics Analysis across Computing Platforms.

Authors:  Sudhir Kumar; Glen Stecher; Michael Li; Christina Knyaz; Koichiro Tamura
Journal:  Mol Biol Evol       Date:  2018-06-01       Impact factor: 16.240

5.  MAFFT multiple sequence alignment software version 7: improvements in performance and usability.

Authors:  Kazutaka Katoh; Daron M Standley
Journal:  Mol Biol Evol       Date:  2013-01-16       Impact factor: 16.240

6.  On the estimation of the insertion time of LTR retrotransposable elements.

Authors:  T E Kijima; Hideki Innan
Journal:  Mol Biol Evol       Date:  2009-12-02       Impact factor: 16.240

7.  An endogenous foamy virus in the aye-aye (Daubentonia madagascariensis).

Authors:  Guan-Zhu Han; Michael Worobey
Journal:  J Virol       Date:  2012-05-09       Impact factor: 5.103

8.  Marine origin of retroviruses in the early Palaeozoic Era.

Authors:  Pakorn Aiewsakun; Aris Katzourakis
Journal:  Nat Commun       Date:  2017-01-10       Impact factor: 14.919

Review 9.  Nomenclature for endogenous retrovirus (ERV) loci.

Authors:  Robert J Gifford; Jonas Blomberg; John M Coffin; Hung Fan; Thierry Heidmann; Jens Mayer; Jonathan Stoye; Michael Tristem; Welkin E Johnson
Journal:  Retrovirology       Date:  2018-08-28       Impact factor: 4.602

10.  Discovery of prosimian and afrotherian foamy viruses and potential cross species transmissions amidst stable and ancient mammalian co-evolution.

Authors:  Aris Katzourakis; Pakorn Aiewsakun; Hongwei Jia; Nathan D Wolfe; Matthew LeBreton; Anne D Yoder; William M Switzer
Journal:  Retrovirology       Date:  2014-08-04       Impact factor: 4.602
