| Literature DB >> 27259971 |
François Olivier Hébert1, Stephan Grambauer2, Iain Barber2, Christian R Landry3, Nadia Aubin-Horth3.
Abstract
BACKGROUND: Schistocephalus solidus is a well-established model organism for studying the complex life cycle of cestodes and the mechanisms underlying host-parasite interactions. However, very few large-scale genetic resources for this species are available. We have sequenced and de novo-assembled the transcriptome of S. solidus using tissues from whole worms at three key developmental states - non-infective plerocercoid, infective plerocercoid and adult plerocercoid - to provide a resource for studying the evolution of complex life cycles and, more specifically, how parasites modulate their interactions with their hosts during development.Entities:
Keywords: Cestode; Flatworm; Gasterosteus aculeatus; Parasite; RNA-seq; Schistocephalus solidus; Threespine stickleback; Transcriptome; de novo assembly
Mesh:
Year: 2016 PMID: 27259971 PMCID: PMC4891850 DOI: 10.1186/s13742-016-0128-3
Source DB: PubMed Journal: Gigascience ISSN: 2047-217X Impact factor: 6.524
Schistocephalus solidus specimens from three different developmental stages used to generate the de novo reference transcriptomeb
| Sample ID | Mass (mg) | Stagea | Library size (no. raw reads) | Platform | Ave. read quality (PHRED score) |
|---|---|---|---|---|---|
| ssol.cltw.NI.05 | 1.7 | NI | 95.7 M | HiSeq 2000 | 37 |
| ssol.cltw. NI.08 | 1.5 | NI | 57.6 M | HiSeq 2000 | 37 |
| ssol.cltw. NI.12 | 5.2 | NI | 29.5 M | HiSeq 2000 | 37 |
| ssol.cltw. NI.13 | 1.8 | NI | 69.6 M | HiSeq 2000 | 37 |
| ssol. |
|
|
| HiSeq 2000 |
|
| ssol.cltw. NI.22 | 7.7 | NI | 49.1 M | HiSeq 2000 | 37 |
| ssol.cltw. NI.26 | 13.1 | NI | 53.7 M | HiSeq 2000 | 37 |
| ssol.cltw. NI.63 | 102 | I | 37.5 M | HiSeq 2000 | 37 |
| ssol.cltw. NI.67 | 90.1 | I | 42.3 M | HiSeq 2000 | 37 |
| ssol.cltw. I.98–1 | 101.2 | I | 36.8 M | HiSeq 2000 | 37 |
|
|
|
|
| HiSeq 2000 |
|
|
|
|
|
|
|
|
| ssol.cltw.A.03 | 321 | A | 61.2 M | HiSeq 2000 | 37 |
|
|
|
|
| HiSeq 2000 |
|
| ssol.cltw.A.12 | 356 | A | 51.7 M | HiSeq 2000 | 37 |
a NI non-infective, I infective, A adult
bWorms in bold were selected as the three representative samples to be used to perform the initial raw de novo assembly. All 14 HiSeq libraries were then used to eliminate redundancy in the final dataset and increase assembly quality, while sample cltw.a-01 (in italic) was used to perform preliminary optimization tests on assembly parameters
Fig. 1De novo assembly method used in the construction of a reference transcriptome for Schistocephalus solidus. Sequencing libraries from three developmental stages of S. solidus, non-infective plerocercoid, infective plerocercoid and adult, were trimmed (PHRED > 2, read length > 60 nucl.), concatenated and assembled de novo (1 library per stage). Next, the three libraries initially used to produce the de novo assembly, in addition to 11 libraries spanning the same three developmental stages (non-infective plerocercoid = 6 libraries, infective plerocercoid = 3 libraries, adult = 2 libraries) were aligned back to the de novo assembly. corset was used on the resulting alignment to eliminate redundancy by creating clusters of similar sequences, called ‘unigenes’. Transcripts were finally annotated through the Trinotate pipeline and transcripts poorly supported by protein-coding evidence were discarded, along with transcripts showing a low average coverage, i.e. CPM < 10 in 50 % of the samples in one group. The final transcriptome contains 24,765 transcripts accounting for 10,285 unigenes, of which 77 % could be annotated
De novo assembly and annotations metrics for the transcriptome of the cestode Schistocephalus solidus
|
| |
| Assembled bases | 195 089 904 |
| Assembled transcripts | 293 731 |
| Unigenes – Unfiltered | 115 318 |
| Unigenes – Expression filter | 12 291 |
| Unigenes – Expression & ORF filters (transcripts) | 10 285 (24 765) |
| Average transcripts per filtered unigene | 2.41 |
| Sum of filtered transcripts (Mbp) | 367.83 |
| Average length (bp) – Filtered transcripts (min - max : median) | 2 684 (174–25 376 : 2168) |
| Annotation | |
| Unigenes with protein name | 7 920 (77 %) |
| Unigenes with Gene Ontology (% of unigenes) | 7 323 (71 %) |
| Proteins with complete ORF (% of unigenes) | 20 335 (82 %) |
| Unigenes/transcripts with KEGG ID (% of unigenes) | 4 270/7 798 (35 %) |
Fig. 2Phylogenetic relationships between Schistocephalus solidus (Schistocephalidae) and seven other parasitic worm species. These other species include five cestodes from the Cyclophyllidea and Pseudophyllidea orders and two trematodes (outgroups). More specifically, species from the cestode phylum include Hymenolepis microstoma (rodent tapeworm), Taenia solium (pork tapeworm), Echinococcus multilocularis (fox tapeworm), Echinococcus granulosus (dog tapeworm) and Spirometra erinaceieuropaei (responsible for the sparganosis infection), while the trematode outgroups are represented by Schistosoma mansoni (responsible for diseases such as schistosomiatis) and Clonorchis sinensis (Chinese liver fluke). Bootstrap values were all 100. Total number of single-copy orthologs used to produce the phylogenetic tree = 4 520 (distributed in 565 core orthologous groups). The number of orthologs shared with S. solidus is defined as the number of amino acid sequences in a given species that are part of an orthologous group identified by orthoMCL that also contains sequences from S. solidus