| Literature DB >> 33086653 |
Justine Charon1, Vanessa Rossetto Marcelino1,2, Richard Wetherbee3, Heroen Verbruggen3, Edward C Holmes1.
Abstract
Our knowledge of the diversity and evolution of the virosphere will likely increase dramatically with the study of microbial eukaryotes, including the microalgae within which few RNA viruses have been documented. By combining total RNA sequencing with sequence and structural-based homology detection, we identified 18 novel RNA viruses in cultured samples from two major groups of microbial algae: the chlorophytes and the chlorarachniophytes. Most of the RNA viruses identified in the green algae class Ulvophyceae were related to the Tombusviridae and Amalgaviridae viral families commonly associated with land plants. This suggests that the evolutionary history of these viruses extends to divergence events between algae and land plants. Seven Ostreobium sp-associated viruses exhibited sequence similarity to the mitoviruses most commonly found in fungi, compatible with horizontal virus transfer between algae and fungi. We also document, for the first time, RNA viruses associated with chlorarachniophytes, including the first negative-sense (bunya-like) RNA virus in microalgae, as well as a distant homolog of the plant virus Virgaviridae, potentially signifying viral inheritance from the secondary chloroplast endosymbiosis that marked the origin of the chlorarachniophytes. More broadly, these data suggest that the scarcity of RNA viruses in algae results from limited investigation rather than their absence.Entities:
Keywords: RNA virus metatranscriptomics; RNA-dependent RNA polymerase; algae viruses; evolution; phylogeny; protist viruses
Mesh:
Substances:
Year: 2020 PMID: 33086653 PMCID: PMC7594059 DOI: 10.3390/v12101180
Source DB: PubMed Journal: Viruses ISSN: 1999-4915 Impact factor: 5.048
Sample and library description.
| Library | Species | Class/Family |
|---|---|---|
| ALG_1 |
| Ulvophyceae/Kraftionemaceae |
|
| Mamiellophyceae/Dolichomastigaceae | |
|
| Mamiellophyceae/Dolichomastigaceae | |
| ALG_2 |
| Ulvophyceae/Bryopsidales |
| ALG_3 |
| Chlorarachniophyceae/Chlorarachnion |
|
| Chlorarachniophyceae/Lotharella |
Figure 1The enormous diversity of algae contrasts with their poorly characterized viromes. (A) Representation of algae supergroups among the diversity of eukaryotes (latest eukaryotic classification retrieved from [29]). The phylogenetic tree was adapted from [56]. Pictures illustrate some of the samples used in this study and corresponding clades are marked with “*”. (B) Pictures of algae cultures used in this study. (C) The current extent of the microalgae virosphere. The viral sequence counts for each virus class (DNA or RNA, single-stranded or double-stranded) were retrieved from VirusHostdb [16] according to 11 major eukaryotic algae lineages. Microalgal lineages investigated in this study are highlighted in bold.
BLASTx results of the newly described virus-like sequences against nr database.
| New Virus | Length nt | PE Read Count | BLASTx Hit | %ID | e-Value | BLASTx Hit | BLASTx Hit Taxonomy |
|---|---|---|---|---|---|---|---|
| Amalga-like boulavirus ( | 1440 | 4313 | BAA25883 | 31 | 4.6 × 1023 | BDRM | |
| Amalga-like chassivirus | 3399 | 1503 | BAA25883 | 28 | 1.8 × 1038 | BDRM | |
| Amalga-like chaucrivirus | 4036 | 16,934 | BAA25883 | 33 | 3.3 × 10103 | BDRM | |
| Amalga-like dominovirus | 4011 | 2996 | BAA25883 | 33 | 5.1 × 1088 | BDRM | |
| Amalga-like lacheneauvirus ( | 3254 | 934 | BAA25883 | 27 | 3.5 × 1039 | BDRM | |
| Partiti-like alassinovirus | 3658 | 5135 | BAB63954 | 29 | 1.6 × 1048 | BDRC | |
| Partiti-like lacotivirus | 3273 | 92,840 | BAB63954 | 29 | 2.5 × 1045 | BDRC | |
| Partiti-like adriusvirus | 4252 | 4833 | BAB63954 | 23 | 5.7 × 1018 | BDRC | |
| Mito-like babylonusvirus | 2942 | 9029 | APG77166 | 38 | 9 × 1042 | Shahe narna-like virus 6 | Unclassified RNA virus (ssRNA) |
| Mito-like albercanusvirus | 2791 | 5294 | APG77166 | 39 | 7.2 × 1041 | Shahe narna-like virus 6 | Unclassified RNA virus (ssRNA) |
| Mito-like spartanusvirus | 2684 | 15,388 | ASM94099 | 38 | 2 × 1032 | Barns Ness serrated wrack narna-like virus 3 | |
| Mito-like laruketanusvirus | 2928 | 14,185 | APG77166.1 | 36 | 5.8 × 1033 | Shahe narna-like virus 6 | Unclassified RNA virus (ssRNA) |
| Mito-like bobnusvirus | 2773 | 2621 | APG77166 | 34 | 6.1 × 1040 | Shahe narna-like virus 6 | Unclassified RNA virus |
| Mito-like picolinusvirus | 2714 | 8792 | YP 00228433 | 34 | 4.1 × 1033 | Botrytis cinerea mitovirus 1 | |
| Mito-like bionusvirus | 3260 | 7529 | AXY40442 | 27 | 5.7 × 1013 | Rhizophagus diaphanum mitovirus 1 | |
| Tombus-like chagrupourvirus ( | 3835 | 5418 | YP 009336735 | 36 | 5.1 × 1045 | Hubei tombus-like virus 12 | Unclassified RNA virus |
| Virga-like bellevillovirus | 2313 | 229 | AMO03254 | 29 | 4.4 × 1044 | Boutonnet virus | Unclassified ssRNA virus (ssRNA) |
| Bunya-like bridouvirus | 2818 | 208 | APG79310 | 30 | 9.1 × 1068 | Shahe bunya-like virus 1 | Unclassified RNA virus |
BDRM: Bryopsis mitochondria-associated dsRNA; BDRC: Bryopsis cinicola chloroplast-associated dsRNAs; * likely viral RdRp mis-annotated as a host protein.
Figure 2Abundance of unknown and RNA virus-like contigs detected in the algal libraries. (A) Percentage of nonassigned contigs. For clarity, numbers are normalized as the percentage of total contigs (actual contig numbers are indicated in bold). Blue: number of contigs showing strong sequence similarity to the nr database (e-value < 10−05); light grey: contigs showing weak sequence similarity to the nr database (e-values 10−05 to 10−03); middle-dark grey: contigs with no sequence similarity detected by BLASTx/BLASTp but predicted to encode one or more ORFs longer than 200 amino acids (600 nt); dark grey: genomic ‘dark matter’ - contigs without any signal detected or any long ORFs encoded. (B) Total number of RNA virus reads per total number of non-rRNA reads in each library. (C) Distribution of RNA virus diversity in the three libraries and percentage of RNA virus reads associated with each viral super-clade. The host range is represented for each viral clade.
Results of the VOGdb and PFAM HMM analysis. Light pink: phage-like sequences. Grey: nonviral sequences; Light blue: DNA virus-like sequences; Orange: RNA virus-like sequences. Abund: expected read counts estimated using the RSEM program.
|
|
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|---|---|
| ALG_2_DN19089_c0_g1_i1_len4252 | ORF_1 | 4717 | VOG03062 | 1.00 × 1012 | REFSEQ hypothetical protein | Bacteriophage | - | - | - |
| ALG_2_DN19089_c0_g1_i1_len4252 | ORF_1 | 4717 | PF00680.20 | 2.70 × 105 | RNA dependent RNA polymerase | RdRP-1 | - | - | - |
| ALG_2_DN19250_c2_g3_i5_len1869 | ORF_1 | 743.91 | VOG23558 | 1.00 × 104 | REFSEQ hypothetical protein |
| - | - | - |
| ALG_2_DN18568_c0_g1_i1_len2977 | ORF_1 | 511 | VOG10478 | 1.30 × 106 | sp|Q05224|VG18 BPML5 Gene 18 protein | Bacteriophage | - | - | - |
| ALG_2_DN19013_c0_g1_i2_len1689 | ORF_2 | 224.31 | VOG22975 | 1.40 × 104 | REFSEQ carboxylesterase |
| - | - | - |
| ALG_2_DN19410_c0_g2_i6_len1950 | ORF_1 | 183.94 | VOG12013 | 5.20 × 104 | sp|P03778|Y06 BPT7 Protein 0.6B | Viruses | PF16752.5 | 1.50 × 104 | Tubulin-specific chaperone C |
| ALG_2_DN18226_c0_g1_i1_len1259 | ORF_1 | 157.61 | VOG09820 | 2.90 × 104 | REFSEQ hypothetical protein | - | - | - | |
| ALG_2_DN18993_c2_g2_i2_len2532 | ORF_1 | 151.17 | VOG08344 | 8.50 × 108 | REFSEQ hypothetical protein | Bacteriophage | PF13424.6 | 3.80 × 10132 | Tetratricopeptide repeat |
| ALG_3_DN34624_c0_g1_i1_len2077 | ORF_2 | 146 | PF02123.16 | 8.50 × 105 | Viral RNA-directed RNA-polymerase | RdRP-4 | - | - | - |
| ALG_2_DN18744_c0_g1_i3_len2080 | ORF_1 | 134.99 | VOG10472 | 4.10 × 104 | REFSEQ hypothetical protein |
| - | - | - |
| ALG_2_DN19214_c2_g1_i7_len1432 | ORF_2 | 118 | VOG06927 | 6.90 × 104 | REFSEQ hypothetical protein | Bacteriophage | - | - | - |
| ALG_3_DN25592_c0_g1_i1_len1043 | ORF_1 | 115 | VOG01256 | 4.40 × 104 | sp|Q9QU29|ORF3 TTVB1 Uncharacterized ORF3 protein | dsDNA viruses | - | - | - |
| ALG_2_DN18451_c0_g1_i4_len2061 | ORF_4 | 109.48 | VOG17696 | 7.10 × 104 | REFSEQ hypothetical protein | Bacteriophage | PF16058.5 | 1.80 × 1017 | Mucin-like |
| ALG_2_DN18451_c0_g1_i4_len2061 | ORF_4 | 109.48 | VOG17696 | 7.10 × 104 | REFSEQ_hypothetical_protein | Bacteriophage | PF16058.5 | 1.10 × 107 | Mucin-like |
| ALG_2_DN18732_c0_g2_i2_len2454 | ORF_2 | 99.65 | VOG09815 | 1.70 × 1015 | REFSEQ hypothetical protein | - | - | - | |
| ALG_2_DN18957_c0_g1_i1_len1408 | ORF_2 | 97.98 | VOG02199 | 8.40 × 104 | sp|Q5UR09|YR648 MIMIV Uncharacterized protein R648 |
| PF06156.13 | 4.70 × 104 | Initiation control protein YabA |
| ALG_2_DN18957_c0_g1_i2_len1657 | ORF_2 | 89.04 | VOG02199 | 3.50 × 104 | sp|Q5UR09|YR648 MIMIV Uncharacterized protein R648 |
| PF06156.13 | 2.20 × 104 | Initiation control protein YabA |
| ALG_2_DN19250_c2_g3_i3_len2203 | ORF_1 | 71.48 | VOG23558 | 7.80 × 105 | REFSEQ hypothetical protein | - | - | - | |
| ALG_2_DN19410_c0_g2_i7_len1634 | ORF_1 | 49.51 | VOG12013 | 9.90 × 104 | sp|P03778|Y06 BPT7 Protein 0.6B | Bacteriophage | - | - | - |
| ALG_2_DN11543_c0_g1_i1_len842 | ORF_1 | 42 | VOG20356 | 3.80 × 104 | REFSEQ hypothetical protein | - | - | - | |
| ALG_2_DN19463_c5_g2_i1_len765 | ORF_1 | 37.7 | VOG24589 | 2.60 × 105 | REFSEQ hypothetical protein | - | - | - | |
| ALG_2_DN19174_c0_g2_i18_len938 | ORF_1 | 36.39 | VOG10625 | 9.70 × 104 | sp|Q05293|VG78 BPML5 Gene 78 protein | Bacteriophage | - | - | - |
| ALG_2_DN41289_c0_g1_i1_len750 | ORF_1 | 36 | VOG06662 | 9.50 × 105 | REFSEQ Cupin | Bacteriophage | - | - | - |
| ALG_2_DN22182_c0_g1_i1_len820 | ORF_1 | 35 | VOG02199 | 1.50 × 104 | sp|Q5UR09|YR648 MIMIV Uncharacterized protein R648 |
| PF06156.13 | 7.20 × 104 | Initiation control protein YabA |
| ALG_2_DN44027_c0_g1_i1_len815 | ORF_1 | 33.98 | VOG21678 | 3.70 × 104 | REFSEQ hypothetical protein | PF08614.11 | 6.00 × 105 | Autophagy protein 1(ATG16) | |
| ALG_2_DN594_c0_g2_i1_len711 | ORF_1 | 28 | PF17501.2 | 2.80 × 104 | Viral RNA-directed RNA polymerase | Viral_RdRp_C | - | - | - |
| ALG_2_DN14271_c0_g1_i1_len772 | ORF_1 | 25.12 | VOG18617 | 2.20 × 104 | REFSEQ hypothetical protein | PF13855.6 | 1.60 × 1021 | Leucine rich repeat | |
| ALG_2_DN18993_c2_g2_i1_len2178 | ORF_1 | 21.18 | VOG08344 | 4.70 × 107 | REFSEQ hypothetical protein | Bacteriophage | PF13374.6 | 1.40 × 10126 | 1Tetratricopeptide repeat |
| ALG_2_DN44027_c0_g1_i2_len783 | ORF_1 | 19.02 | VOG21678 | 3.60 × 104 | REFSEQ hypothetical protein | PF08614.11 | 7.70 × 105 | Autophagy protein 1(ATG16) | |
| ALG_2_DN19463_c5_g2_i6_len928 | ORF_1 | 14.34 | VOG24589 | 6.40 × 105 | REFSEQ hypothetical protein | - | - | - | |
| ALG_2_DN19463_c5_g2_i4_len863 | ORF_1 | 4.36 | VOG24589 | 5.30 × 105 | REFSEQ hypothetical protein | - | - | - |
Figure 3RdRp phylogeny of the newly identified chlorophyte viruses among the Amalgaviridae, Partitiviridae, Picobirnaviridae and Hypoviridae. Sequences identified in this study are labeled in red. Unclassified sequences from [65] are highlighted in grey. For clarity, some families and genera have been collapsed. Left, phylogenetic tree estimated using IQ-Tree with bootstrap replicates and SH-aLTR set to 1000 (values in parenthesis). Right, genomic organization of both viral genomes identified in this study (red) and representative species of each major family and genus used in the phylogeny (Cryphonectria hypovirus 2—Hypoviridae; Chicken picobirnavirus—Picobirnaviridae; Southern tomato virus—Amalgaviridae; Cryptosporidium parvum virus 1—Cryspovirus; Discula destructiva virus 1—Gammapartitivirus; Figure cryptic virus—Deltapartitivirus; Ceratocystis resinifera partitivirus—Betapartitivirus; White clover cryptic virus 1—Alphapartitivirus. The tree is mid-pointed rooted and branch lengths are scaled according to the number of amino acid substitutions per site.
Figure 4Genome pairwise identity distributions of the new algal viral sequences. The level of pairwise identity between the newly identified viruses and existing members of each viral family are represented in red. (A) Intergenus (grey) and intra-genus (purple) identity levels within the Amalgaviridae. (B) Intergenus (grey) and intra-genus (yellow) identity levels within the Narnaviridae. (C) Intergenus (grey) and intra-genus (green) identity levels within the Tombusviridae.
Figure 5Genomic organization of the Partitiviridae, Totiviridae and Amalgaviridae. Possible evolutionary scenarios for the BDRC-like contigs observed in Ostreobium sp.
Figure 6Phylogeny of the Narnaviridae-Botourmiaviridae group based on the RdRp. Newly discovered viruses from Ostreobium sp. are highlighted in red. RdRp sequences from unassigned RNA virus retrieved from [65] are marked in grey. Left, phylogenetic tree estimated using IQ-Tree with bootstrap replicates and SH-aLTR set to 1000 (values in parenthesis). Right, genomic organization of both viral genomes identified in this study (red) and representative species of each major family and genus used in the phylogeny (Cassava virus C—Botourmiaviridae; Saccharomyces 23S RNA narnavirus—Narnavirus genus; Chenopodium quinoa mitovirus 1 Mitovirus genus). Annotations of Cassava virus C coding sequences: RdRp (Segment I); Putative movement protein (Segment II); Coat protein (Segment III). Branch lengths are scaled according to the number of amino acid substitutions per site.
Figure 7Phylogeny of the Tombusviridae RdRp. The tombus-like sequence identified in this study is labeled in red. Unclassified sequences from [65] are highlighted in grey. For clarity, some families and genera have been collapsed. Left, phylogenetic tree estimated using IQ-Tree with bootstrap replicates and SH-aLTR set to 1000 (values in parenthesis). Right, genomic organizations of the new viruses as well as their closest homologs and representative species from each family/genus as follows: Black beetle virus (Nodaviridae); Carrot mottle virus (Dianthovirus); Carnation ringspot virus (Dianthovirus); Beet black scorch virus (Betanecrovirus); Cucumber leaf spot virus (Aureusvirus); Maize necrotic streak virus (Tombusvirus); Carnation mottle virus (Alphacarmovirus). The tree is mid-pointed rooted and branch lengths are scaled according to the number of amino acid substitutions per site.
Figure 8Phylogeny of the Hepe-Virga group RdRp. The hepe-virga-like sequence identified in this study is labeled in red. Unclassified sequences from [65] are highlighted in grey. For clarity, some families or genera have been collapsed. Left, RdRp-based phylogenetic tree obtained using IQ-tree with bootstrap replicates and SH-aLTR set to 1000 (values in parenthesis). Right, genomic organizations of the new viruses as well as closest homologs and representative species from each family/genus as follows: Orthohepevirus A (Hepeviridae); Poinsettia mosaic virus (order Tymovirales); Wheat stripe mosaic virus (Benyviridae); Diatom colony associated dsRNA virus 15 (Endornaviridae); Cabassou virus (Togaviridae); Apple mosaic virus (Bromoviridae); Mint virus 1 (Closteroviridae); Cucumber mottle virus (Virgaviridae). The tree is mid-pointed rooted and branch lengths are scaled according to the number of amino acid substitutions per site.
Figure 9Phylogeny of the Bunyavirales RdRp. The bunya-like sequence identified in this study is labeled in red. Unclassified sequences from [65] are highlighted in grey. For clarity, some families and genera have been collapsed. Left, phylogenetic tree estimated using IQ-Tree with bootstrap replicates and SH-aLTR set to 1000 (values in parenthesis). Right, genomic organizations of the new viruses as well as their closest homologs and representative species from each family/genus as follows: Melon chlorotic spot virus (Phenuiviridae); Yogue virus (Nairoviridae); Latino mammarenavirus (Arenaviridae); Seattle orthophasmavirus (Phasmaviridae); Melon yellow spot virus (Tospoviridae); Fig mosaic emaravirus (Fimoviridae); Tataguine orthobunyavirus (Peribunyaviridae). The tree is mid-pointed rooted and branch lengths are scaled according to the number of amino acid substitutions per site.