| Literature DB >> 31191479 |
Peter J Flynn1,2, Corrie S Moreau2,3.
Abstract
Endogenous viral elements (EVEs) can play a significant role in the evolution of their hosts and have been identified in animals, plants, and fungi. Additionally, EVEs potentially provide an important snapshot of the evolutionary frequency of viral infection. The purpose of this study is to take a comparative host-centered approach to EVE discovery in ant genomes to better understand the relationship of EVEs to their ant hosts. Using a comprehensive bioinformatic pipeline, we screened all nineteen published ant genomes for EVEs. Once the EVEs were identified, we assessed their phylogenetic relationships to other closely related exogenous viruses. A diverse group of EVEs were discovered in all screened ant host genomes and in many cases are similar to previously identified exogenous viruses. EVEs similar to ssRNA viral proteins are the most common viral lineage throughout the ant hosts, which is potentially due to more chronic infection or more effective endogenization of certain ssRNA viruses in ants. In addition, both EVEs similar to viral glycoproteins and retrovirus-derived proteins are also abundant throughout ant genomes, suggesting their tendency to endogenize. Several of these newly discovered EVEs are found to be potentially functional within the genome. The discovery and analysis of EVEs is essential in beginning to understand viral-ant interactions over evolutionary time.Entities:
Keywords: Formicidae; comparative genome biology; endogenous viral elements; microbes; viral diversity
Year: 2019 PMID: 31191479 PMCID: PMC6540820 DOI: 10.3389/fmicb.2019.01139
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
Summary table of ant genomes which includes information on the species, subfamily, accession number from www.NCBI.nlm.nih.gov, total genome length in megabases, nesting habitat (arboreal/ground), diet (fungus, generalist, predatory, and herbivore), and number of EVE hits recovered.
| Ant species | Ant subfamily | Accession number | Total | Nesting habitat | Diet | No. putative |
|---|---|---|---|---|---|---|
| length | EVEs | |||||
| (Mb) | Recovered | |||||
| GCF_000204515.1 | 295.945 | Ground/arboreal | Fungus | 23 | ||
| GCF_000143395.1 | 317.672 | Ground | Fungus | 11 | ||
| GCF_001594045.1 | 291.258 | Ground | Fungus | 10 | ||
| GCF_000147175.1 | 232.685 | Arboreal | Generalist | 11 | ||
| GCF_001594065.1 | 300.317 | Ground | Fungus | 68 | ||
| GCF_001313825.1 | 259.666 | Ground | Predatory | 13 | ||
| GCF_000147195.1 | 294.466 | Ground | Predatory | 17 | ||
| GCA_001045655.1 | 236.236 | Ground | Generalist | 2 | ||
| GCF_000217595.1 | 219.501 | Ground/arboreal | Generalist | 19 | ||
| GCF_000980195.1 | 257.977 | Ground | Generalist | 13 | ||
| GCF_000611835.1 | 212.826 | Ground | Predatory | 11 | ||
| GCF_000187915.1 | 235.646 | Ground | Herbivore | 42 | ||
| GCF_002006095.1 | 282.776 | Arboreal | Generalist | 52 | ||
| GCF_000188075.2 | 396.025 | Ground | Generalist | 18 | ||
| GCF_001594075.1 | 369.438 | Ground | Fungus | 39 | ||
| GCF_001594115.1 | 291.747 | Ground | Fungus | 26 | ||
| GCF_001594055.1 | 267.973 | Ground | Fungus | 15 | ||
| GCF_000949405.1 | 287.901 | Ground | Predatory | 31 | ||
| GCF_000956235.1 | 324.12 | Ground/arboreal | Generalist | 13 | ||
FIGURE 1Phylogeny of ants with sequenced genomes used in this study colored by subfamily (left), ant genome size in mb (middle), and number of EVE hits per genome (right). Ant EVE hit numbers are colored by viral protein to which they were found to be most similar in the blastx analysis.
Genome quality assessment of factors from the clipped and pre-clipped genomes as correlated with the number of EVEs per genome.
| Genome type | Factor | ||
|---|---|---|---|
| Pre-clipped | Genome length | +0.211 | 0.386 |
| No. contigs | -0.105 | 0.669 | |
| Contig N50 | +0.415 | 0.078 | |
| No. scaffolds | -0.160 | 0.515 | |
| Scaffold N50 | -0.177 | 0.467 | |
| Clipped | Genome length | +0.358 | 0.132 |
| No. contigs | +0.125 | 0.611 | |
| Contig N50 | -0.101 | 0.680 | |
| No. scaffolds | -0.124 | 0.613 | |
| Scaffold N50 | +0.091 | 0.712 | |
Summary table of viral phylogeny information.
| Virus | Virus clade | Protein | No. of | No. of | Supplementary Figure |
|---|---|---|---|---|---|
| genome | EVEs | exogenous | |||
| structure | proteins used | ||||
| ssRNA | Bunya-Arena | Nucleoprotein | 18 | 18 | |
| RdRP | 3 | 12 | |||
| Hepe-Virga | RdRP | 3 | 15 | ||
| Mono-Chu | Glycoprotein | 227 | 17 | ||
| Nucleoprotein | 2 | 12 | |||
| RdRP | 24 | 30 | |||
| Narna-Levi | RdRP | 2 | 15 | ||
| Partiti-Picobirna | Capsid | 5 | 7 | ||
| RdRP | 4 | 16 | |||
| Qinvirus | RdRP | 3 | 10 | ||
| Toti-Chryso | Coat | 12 | 12 | ||
| RdRP | 1 | 15 | |||
| ssDNA | Circoviridae | rep-associated protein | 2 | 10 | |
| Parvoviridae | VP1 | 3 | 10 | ||
| Non-structural protein 1 | 8 | 18 | |||
| Non-structural protein 2 | 1 | 12 | |||
| dsDNA | Baculoviridae | Bro-a | 1 | 12 | |
| PIF-1 | 8 | 18 | |||
| PIF-2 | 7 | 14 | |||
| PIF-3 | 1 | 12 | |||
| Poxviridae | Tryptophan | 2 | 6 | ||
| RNA-polymerase RP0147 | 1 | 7 | |||
| Polydnaviridae | Pox A32 | 3 | 6 | ||
| ssRNA(RT) | Metaviridae | ORF B | 93 | NA | NA |
FIGURE 2(A) Glycoprotein Mono-Chu Phylogeny. The phylogeny was obtained from maximum likelihood analysis of the glycoprotein multiple amino acid alignment, including both ant EVE sequences and closely related exogenous viruses. The best fit protein substitution model was JTT + G + F. ML non-parametric bootstrap values >70 (1000 replicates) are indicated at each node. Scale bar indicates evolutionary distance in substitutions per site. The 224 ant EVEs which comprise their own clade have been collapsed and are represented by the red triangle. The tips are colored by associated host. The gray bars on the right are based on distinct clades of EVE hits. (B) Expanded phylogeny representing the clade of 224 ant EVEs which was collapsed in panel (A). ML non-parametric bootstrap values >70 (1000 replicates) are indicated at each node. Scale bar indicates evolutionary distance in substitutions per site. The gray bars on the right are based on distinct clades of EVE hits.
Potentially functional stop codons.
| EVE | Similar virus | Similar protein | EVE | Protein |
|---|---|---|---|---|
| length (AA) | length (AA) | |||
| Wuchang cockroach virus 3 | Glycoprotein | 586 | 659 | |
| Hubei narna-like virus 19 | RdRP | 750 | 737 | |
| Densovirus SC1065 | VP1 | 215 | 288 | |
| Wuhan mosquito virus 8 | Glycoprotein | 582 | 653 | |
| Cyclovirus PK5034 | Rep-associated | 236 | 277 | |
| Densovirus SC1065 | VP1 | 281 | 288 | |
| Wuchang cockroach virus 3 | Glycoprotein | 641 | 659 | |
| Wuhan insect virus 16 | Nucleoprotein | 294 | 327 | |
| Wuhan mosquito virus 8 | Glycoprotein | 624 | 653 | |
| Wuhan mosquito virus 8 | Glycoprotein | 582 | 653 | |
| Wuchang cockroach virus 3 | Glycoprotein | 605 | 659 | |
| Wuhan mosquito virus 8 | Glycoprotein | 612 | 653 | |
| Hubei partiti-like virus 11 | Capsid | 412 | 475 | |
| Wuhan insect virus 16 | Nucleoprotein | 327 | 327 | |
| Wuhan insect virus 16 | Nucleoprotein | 326 | 327 | |
| Wuhan mosquito virus 8 | Glycoprotein | 595 | 653 | |
FIGURE 3Tanglegram with ant phylogeny on left and glycoprotein Mono-Chu viral phylogeny on right. The exogenous viruses included in the viral phylogeny were left out because their hosts are not included in the ant phylogeny. Colors of the association are randomly prescribed for each of the ant species.