| Literature DB >> 32075325 |
Magda Lewandowska1, Yael Hazan1, Yehu Moran1.
Abstract
The role of viruses in forming a stable holobiont has been the subject of extensive research in recent years. However, many emerging model organisms still lack any data on the composition of the associated viral communities. Here, we re-analyzed seven publicly available transcriptome datasets of the starlet sea anemone Nematostella vectensis, the most commonly used anthozoan lab model, and searched for viral sequences. We applied a straightforward, yet powerful approach of de novo assembly followed by homology-based virus identification and a multi-step, thorough taxonomic validation. The comparison of different lab populations of N. vectensis revealed the existence of the core virome composed of 21 viral sequences, present in all adult datasets. Unexpectedly, we observed an almost complete lack of viruses in the samples from the early developmental stages, which together with the identification of the viruses shared with the major source of the food in the lab, the brine shrimp Artemia salina, shed new light on the course of viral species acquisition in N. vectensis. Our study provides an initial, yet comprehensive insight into N. vectensis virome and sets the first foundation for the functional studies of viruses and antiviral systems in this lab model cnidarian.Entities:
Keywords: Cnidaria; Nematostella vectensis; RNA-seq; viruses
Mesh:
Year: 2020 PMID: 32075325 PMCID: PMC7077227 DOI: 10.3390/v12020218
Source DB: PubMed Journal: Viruses ISSN: 1999-4915 Impact factor: 5.048
Summary of sequence data used for virus identification in RNA-seq studies. Retained reads refer to read counts after quality filtering, trimming, and removal of the Nematostella genome, transfer, mitochondrial and cytoplasmic rRNA, as well as ERCC spike-ins. Identified final viral reads were filtered from the total number of de novo assembled contigs and cleared from duplicates.
| Reference | No. of Samples | Raw Read Pairs | Retained Reads | Total Number of Contigs | Identified Viral Sequences |
|---|---|---|---|---|---|
| Babonis et al. 2016 † [ | 9 | 364,726,242 | 18,134,716 | 99,241 | 76 |
| Tulin et al. 2013 a [ | 6 | 112,159,243 | 5,251,525 | 7631 | 6 |
| Oren et al. 2015 a [ | 13 | 105,403,849 | 6,805,643 | 10,968 | 12 |
| Shwaiger et al. 2014 a
| 8 | 532,867,635 | 75,383,380 | 40,225 | 50 |
| Shwaiger et al. 2014 b
| 2 | 155,212,236 | 23,904,647 | 6470 | 62 |
| Fidler et al. 2014 a [ | 1 | 95,331,053 | 6,988,326 | 9902 | 25 |
| Warner et al. 2018 a [ | 15 | 542,474,332 | 35,925,820 | 24,934 | 73 |
| All datasets merged | 54 | 1,908,174,590 | 172,394,057 | 155,322 | 94 |
a polyA selected RNA-seq library; b rRNA depleted RNA-seq library; † No information available on mRNA enrichment strategy.
Figure 1Taxonomic classification of N. vectensis viral sequences. Distribution of viral families (a) and groups (b) in the sequence assembly from all merged datasets and (c) relative abundance of viral families within each studied dataset. Sequences representing fragments of the same viral species were collapsed into one entry.
Figure 2Relative abundance of viral sequences in each analyzed dataset presented as log2 of transcript per thousand (TPM/1000). Sequences representing fragments of the same viral species were collapsed into one entry.
Figure 3Venn diagram of core viral sequences for all non-embryonic datasets [31,33,34,35,36] revealed by remapping filtered reads to the viral contigs assembled from the merged dataset. For simplicity, only numbers of contigs common to all datasets and specific to each dataset are shown. Values in brackets indicate the total number of different viral contigs to which reads from each dataset were successfully remapped.
Figure 4The number of reads mapped to Nematostella viral sequences assembled from all merged datasets presented as Reads Per Million (RPM) (a) and the fraction of total reads which mapped to A. salina viruses (b).
Summary of remapping results including the total number of reads mapped to assembled viral contigs from the merged dataset and number of reads mapped to putative Artemia salina viruses; SDs—number of standard deviations from mean calculated on the number of reads normalized to sequencing depth.
| Babonis et al. [ | Tulin et al. [ | Oren et al. [ | Schwaiger et al. | Schwaiger et al. | Fidler et al. [ | Warner et al. [ | |
|---|---|---|---|---|---|---|---|
| All reads aligned to viruses | 9544 | 388 | 19,676 | 74,721 | 140,782 | 2429 | 28,732 |
| SDs from mean | −0.5133 | −0.5837 | −0.0156 | −0.1597 | 2.218 | −0.5154 | −0.4302 |
| Reads aligned to | 1362 | 63 | 721 | 1783 | 127,633 | 1116 | 537 |
| SDs from mean | −0.3805 | −0.3908 | −0.3705 | −0.3818 | 2.2676 | −0.3547 | −0.3894 |