| Literature DB >> 33266516 |
Chiara Lauritano1, Vittoria Roncalli2, Luca Ambrosino3, Matthew C Cieslak4, Adrianna Ianora1.
Abstract
Antarctic waters are the largest almost untapped diversified resource of our planet. Molecular resources for Antarctic organisms are very limited and mostly represented by sequences used for species genotyping. In this study, we present the first transcriptome for the copepod Rhincalanus gigas, one of the predominant zooplankton species of Antarctic waters. This transcriptome represents also the first molecular resource for an eucalanoid copepod. The transcriptome is of high quality and completeness. The presence of three predicted genes encoding antifreeze proteins and gene duplication within the glutathione metabolism pathway are suggested as possible adaptations to cope with this harsh environment. The R. gigas transcriptome represents a powerful new resource for investigating the molecular basis associated with polar biological processes and ecology.Entities:
Keywords: Antarctic waters; South Shetland Trench; copepod; transcriptome
Year: 2020 PMID: 33266516 PMCID: PMC7700397 DOI: 10.3390/biology9110410
Source DB: PubMed Journal: Biology (Basel) ISSN: 2079-7737
Summary of assembly statistics for Rhincalanus gigas reference transcriptome. De novo assembly was generated by combining two RNA-Seq library using Trinity software. Mapping percentage refers to the average of the two individuals’ libraries mapped back to the reference transcriptome. BUSCO analysis was performed using the Arthropoda dataset consisting of 2675 single-copy orthologs.
| Trinity Transcripts ( | 78,285 |
| Trinity predicted genes ( | 31,851 |
| Minimum length (bp) | 301 |
| Maximum length (bp) | 10,033 |
| Average contig length (bp) | 877 |
| GC content (%) | 40.52 |
| N50 (bp) | 1143 |
| N25 (bp) | 2089 |
| N75 (bp) | 612 |
|
| |
| Overall mapping (%) | 81.8 |
| Mapping >1 time (%) | 44 |
|
| |
| Complete (%) | 70 |
| Fragmented (%) | 12 |
| Missing (%) | 22 |
Figure 1Rhincalanus gigas reference transcriptome annotation against Gene Ontology (GO) database. Pie chart showing the percentage of annotated transcripts within each GO term belonging to the Gene Ontology biological process (BP) category.
Comparison of Rhincalanus gigas transcriptome with other copepod de novo assemblies. R. gigas (current study) assembly and annotation statistics were compared with those from Calanus finmarchicus [25], Neocalanus flemingeri [26] and Labidocera madurae [27]. For each assembly, information on sequencing, assembly, and functional annotation steps are provided.
|
|
|
|
| |
|---|---|---|---|---|
| Sequencing | Illumina NextSeq | Illumina HiSeq | Illumina NextSeq | Illumina NextSeq |
| NCBI BioProject | PRJNA639356 | PRJNA236528 | PRJNA324453 | PRJNA324849 |
|
| ||||
| Transcripts (n) | 78,285 | 206,041 | 140,841 | 211,002 |
| Minimum length (bp) | 301 | 301 | 301 | 301 |
| Maximum length (bp) | 10,033 | 23,068 | 24,981 | 23,836 |
| N50 | 1143 | 1418 | 1452 | 1184 |
| Overall self-mapping (%) | 82 | 89 | 92 | 90.8 |
|
| ||||
| Transcripts with coding region (n) | 61,983 | np | 108,092 | 72,391 |
| Transcripts with BLAST hits (n) | 34,238 | 28,616 | 62,126 | 62,980 |
| Transcripts with GO terms (n) | 24,426 | 10,334 | 59,544 | 60,097 |
|
| ||||
| Complete (%) | 70 | 79 | 79 | 76 |
| Fragmented (%) | 12 | 8 | 6 | 11 |
| Missing (%) | 22 | 12 | 15 | 12 |
NCBI, National Center for Biotechnology Information; np, not provided in the study.
Top 10 KEGG pathways in the Rhincalanus gigas transcriptome. For each pathway, name, entry, and class are reported according to the BRITE hierarchy in the KEGG database (https://www.genome.jp/kegg/brite.html). For each pathway, the number of transcripts annotated as enzymes is listed.
| Pathway Name | Entry | Class | Transcripts |
|---|---|---|---|
| Purine metabolism | 00230 | Nucleotide metabolism (metabolism) | 210 |
| Cysteine and methionine metabolism | 00270 | Amino-acid metabolism (metabolism) | 128 |
| Pyrimidine metabolism | 00240 | Nucleotide metabolism (metabolism) | 121 |
| Glutathione metabolism | 00480 | Other amino-acid metabolism (metabolism) | 116 |
| Starch and sucrose metabolism | 00500 | Carbohydrate metabolism (metabolism) | 115 |
| mTOR signaling pathway | 04150 | Signal transduction (environmental information processing) | 115 |
| Glycolysis/Gluconeogenesis | 00010 | Carbohydrate metabolism (metabolism) | 106 |
| Aminoacyl transfer RNA (tRNA) biosynthesis | 00970 | Translation (genetic information processing) | 100 |
| Amino sugar and nucleotide sugar metabolism | 00520 | Carbohydrate metabolism (metabolism) | 99 |
| PI3K/Akt signaling pathway | 04151 | Signal transduction (environmental information processing) | 97 |
Figure 2Alignment of three antifreeze protein (AFP) sequences predicted from the R. gigas de novo transcriptome. Three genes were predicted (I–III) on the basis of automated annotation. (A) Alignment of the first two sequences (R. gigas AFP I and II) with their top BLAST hit, AFP type II from Hemitripterus americanus (P05140). (B) Alignment of the third R. gigas predicted gene, AFP III, with its top BLAST hit AFP type II from Osmerus mordax (Q01758). The R. gigas AFP III consisted of multiple isoforms (i1, i3, i5, and i13). In yellow, the C-type lectin carbohydrate-recognition domain typical of AFP type II is highlighted.