| Literature DB >> 35794144 |
Ilenia Urso1, Alberto Biscontin1, Davide Corso1, Cristiano Bertolucci2,3, Chiara Romualdi1, Cristiano De Pittà1, Bettina Meyer4,5,6, Gabriele Sales7.
Abstract
The krill species Euphausia superba plays a critical role in the food chain of the Antarctic ecosystem. Significant changes in climate conditions observed in the Antarctic Peninsula region in the last decades have already altered the distribution of krill and its reproductive dynamics. A deeper understanding of the adaptation capabilities of this species is urgently needed. The availability of a large body of RNA-seq assays allowed us to extend the current knowledge of the krill transcriptome. Our study covered the entire developmental process providing information of central relevance for ecological studies. Here we identified a series of genes involved in different steps of the krill moulting cycle, in the reproductive process and in sexual maturation in accordance with what was already described in previous works. Furthermore, the new transcriptome highlighted the presence of differentially expressed genes previously unknown, playing important roles in cuticle development as well as in energy storage during the krill life cycle. The discovery of new opsin sequences, specifically rhabdomeric opsins, one onychopsin, and one non-visual arthropsin, expands our knowledge of the krill opsin repertoire. We have collected all these results into the KrillDB2 database, a resource combining the latest annotation of the krill transcriptome with a series of analyses targeting genes relevant to krill physiology. KrillDB2 provides in a single resource a comprehensive catalog of krill genes; an atlas of their expression profiles over all RNA-seq datasets publicly available; a study of differential expression across multiple conditions. Finally, it provides initial indications about the expression of microRNA precursors, whose contribution to krill physiology has never been reported before.Entities:
Mesh:
Substances:
Year: 2022 PMID: 35794144 PMCID: PMC9259678 DOI: 10.1038/s41598-022-15320-5
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
Figure 1Transcriptome quality assessment results. Results of the first assembly filtering in terms of total number of transcripts.
Quality measures computed at each assembly step, from the independent de novo assembly algorithms (a), after the first filtering process (b) and finally comparing the quality of the EvidentialGene meta-assembly and the final krill transcriptome after the redundancy filter (c).
| Trinity | BinPacker | rnaSPAdes | IDBA-tran | TransABySS | |
|---|---|---|---|---|---|
| # Transcripts | 671.837 | 288.476 | 503.293 | 400.75 | 389.351 |
| %GC | 35.56 | 34.66 | 35.39 | 34.77 | 35.07 |
| Median contig length | 368 | 938 | 347 | 336 | 332 |
| N50 | 1.14 | 2.213 | 1.533 | 553 | 730 |
| # Bases | 470,615,830 | 413,787,317 | 392,082,747 | 198,618,480 | 218,288,366 |
| # Transcripts | 353.34 | 203.274 | 228.038 | 125.886 | 195.764 |
| %GC | 35.77 | 34.80 | 35.61 | 35.18 | 35.17 |
| Median contig length | 452 | 1074 | 762 | 352 | 426 |
| N50 | 1.455 | 2.317 | 1.958 | 670 | 1.1 |
| # Bases | 301,600,820 | 321,478,538 | 280,807,245 | 69,975,046 | 144,053,299 |
BUSCO assessment results on independent de novo assemblies from RNA-seq stranded library.
| Complete (%) | Fragmented (%) | Missing (%) | |
|---|---|---|---|
| Trinity | 94.4 | 1.3 | 4.3 |
| BinPacker | 94.1 | 0.9 | 5 |
| IDBA-tran | 77.9 | 11.5 | 10.6 |
| Trans-AbySS | 93.4 | 1.6 | 5 |
| rnaSPAdes | 94.6 | 0.8 | 4.6 |
| Trinity | 93.3 | 1.7 | 5 |
| BinPacker | 92.9 | 1.7 | 5.4 |
| IDBA-tran | 80.9 | 9.0 | 10.1 |
| Trans-AbySS | 92.1 | 1.7 | 6.2 |
| rnaSPAdes | 93.1 | 1.5 | 5.4 |
| EvidentialGene | 92.3 | 0.6 | 4.1 |
| KrillDB2 | 93.2 | 0.6 | 6.2 |
(a), RNA-seq unstranded library (b) and on EvidentialGene transcriptome compared to krill transcriptome after last filter (c): the EvidentialGene transcriptome was characterized by 95.3% Complete sequences, 0.6% Fragmented and 4.1% Missing sequences. The same analysis on the final krill transcriptome reconstruction produced 93.5% Complete transcripts, 0.7% Fragmented and 5.8% Missing sequences.
Quality statistics of the previously released krill transcriptomes compared to the newly assembled KrillDB2. GenBank accession GFCS00000000.1 refers to the SuperbaSe krill transcriptome reference[19].
| GFCS00000000.1 | KrillDB | KrillDB2 | |
|---|---|---|---|
| #Total Transcript | 484.08 | 133.965 | 151.464 |
| Median contig length | 439 | 683 | 1.155 |
| N50 | 1.071 | 1.294 | 2.759 |
| BUSCO—complete | 827 (81.6%) | 536 (52.9%) | 947 (93.5%) |
Figure 2Phylogenetic relationships of Euphausia superba opsins shown as circular cladogram. Colored dots indicate krill opsins: red, previously cloned opsins; green, novel identified opsins. The spectral sensitivities of rhabdomeric opsin clades were inferred from the curated invertebrate-only opsin dataset proposed by DeLeo & Bracken‐Grissom, 2020. Represented opsin classes: LWS, long-wavelenght-sensitive; LSM, long/middle-wavelenght-sensitive; MWS, middle-wavelenght-sensitive; SWS/UV, short/UV-wavelenght-sensitive; ONY, onychopsins; MEL, melanopsins; PER, peropsin; ART, arthropsin. Rectangular phylogram is reported in Fig. S1 (Supplementary Material).
List of contrasts computed with total number of differentially expressed genes and numbers of up- and downregulated genes.
| Reference condition | Alternative condition | Sample group | # Total | # Upregulated | # Downregulated |
|---|---|---|---|---|---|
| Summer | Winter | Group 2 | 1195 | 1078 | 117 |
| Male | Femae | Group 2 | 14 | 7 | 7 |
| Male/summer | Female/Winter | Group 2 | 12 | 6 | 6 |
| South Georgia | Lazarev Sea | Group 2 | 79 | 26 | 53 |
| South Georgia | Bransfield Strait-South Orkney | Group 2 | 28 | 6 | 22 |
| Lazarev sea | Bransfield Strait-South Orkney | Group 2 | 17 | 13 | 4 |
| South Georgia/male | Bransfield Strait-South Orkney/Female | Group 2 | 10 | 6 | 4 |
| South Georgia/male | Lazarev Sea/Male | Group 2 | 19 | 8 | 11 |
| South Georgia/summer | Bransfield Strait-South Orkney/Winter | Group 2 | 75 | 66 | 9 |
| Lazarev Sea/summer | Bransfield Strait-South Orkney/Winter | Group 2 | 359 | 173 | 186 |
| South Georgia/summer | Lazarev Sea/Summer | Group 2 | 188 | 150 | 38 |
| Lazarev Sea/male | Bransfield Strait-South Orkney/Female | Group 2 | 20 | 10 | 10 |
List of biologically relevant DEGs identified, starting from those already described by Höring et al.[35].
| Process | Gene | KrillDB2 Gene |
|---|---|---|
| Development of cuticle (moult cycle) | Peritrophin | ESG063925 |
| Chitooligosaccharidolytic beta-N- acetylglucosaminidase | ESG040750 | |
| Carbohydrate sulfotransferase 11 | ESG043538 | |
| Immune response | Laccase | ESG048485 |
| Leucine rich repeat only protein 2 | ESG048485 | |
| Embryogenesis | Blastula protease 10 | ESG045350 |
| Development and reproduction | Aldehyde dehydrogenase family 8 | ESG043319 |
| Retinoid-inducible serine carboxypeptidase | ESG040940 | |
| Dehydrogenase/reductase SDR family member 11 | ESG048936 | |
| Reproduction | Vitellogenin | ESG035720 |
| Hematopoietic prostaglandin D synthase | ESG056241 | |
| Carboxylic ester hydrolase | ESG040590 | |
| Adiponectin receptor protein | ESG049090 | |
| Type I iodothyronine deiodinase | ESG061750 | |
| Metalloendopeptidase activity | Neprilysin 1 | ESG037511 |
| Steroid metabolism | Inactive hydroxysteroid dehydrogenase-like protein 1 | ESG050201 |
| Short-chain dehydrogenase/ reductase family 42E member 1 | ESG041089 | |
| Lipid metabolism | Epoxide hydrolase | ESG048309 |
| Enoyl-CoA isomerase | ESG051749 | |
| Long-chain-fatty-acid–CoA ligase | ESG040433 | |
| Glucose metabolic process | ||
| Cell cycle | ||
| Circadian clock | ||
| Photoreception |
Genes that were already found to be differentially expressed in the work by Höring are reported in black, while newly DEGs identified by our analysis are reported in Bold.
Figure 3Blast search section. The new search box for sequence searches (a) with an example of a BLAST search (highlighted in yellow) and the corresponding results (b). By clicking on each target identifier, the user will be redirected to that specific transcript page, where new sections have been added, as shown in Fig. 6.
Figure 6Additional sections in gene and transcript pages. The new sections in the gene-centric page show a table listing the orthologous sequences with their belonging species and the identity score (a), a visualization of the gene structure as estimated by Lace software (d) and a boxplot coming from Expression Atlas analyses (c). Both Orthology and Expression sections are integrated also in the transcript-centric page. When a transcript is annotated as a putative microRNA, a “Predicted Hairpin” section displays a visualization of the hairpin predicted secondary structure and tables showing the alignment length, the HHMMiR score and the list of mature microRNAs matching (b).
Figure 4Differential Expression section. The new section collecting all differentially expressed genes tables (a) with an example of the corresponding result for a selected contrast (b).
Figure 5New search engine of KrillDB2. Example of the results of a full-text search on KrillDB2.
Figure 7Workflow of the assembly process, annotation, database re-design and downstream analyses.