| Literature DB >> 33235200 |
Laura Payton1,2, Céline Noirot3, Claire Hoede3, Lukas Hüppe4,5,6, Kim Last7, David Wilcockson8, Elizaveta A Ershova9,10, Sophie Valière11, Bettina Meyer4,5,6.
Abstract
The zooplankter Calanus finmarchicus is a member of the so-called "Calanus Complex", a group of copepods that constitutes a key element of the Arctic polar marine ecosystem, providing a crucial link between primary production and higher trophic levels. Climate change induces the shift of C. finmarchicus to higher latitudes with currently unknown impacts on its endogenous timing. Here we generated a daily transcriptome of C. finmarchicus at two high Arctic stations, during the more extreme time of Midnight Sun, the summer solstice. While the southern station (74.5 °N) was sea ice-free, the northern one (82.5 °N) was sea ice-covered. The mRNAs of the 42 samples have been sequenced with an average of 126 ± 5 million reads (mean ± SE) per sample, and aligned to the reference transcriptome. We detail the quality assessment of the datasets and the complete annotation procedure, providing the possibility to investigate daily gene expression of this ecologically important species at high Arctic latitudes, and to compare gene expression according to latitude and sea ice-coverage.Entities:
Mesh:
Substances:
Year: 2020 PMID: 33235200 PMCID: PMC7686379 DOI: 10.1038/s41597-020-00751-4
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 6.444
Fig. 1Overview of the experimental workflow used to generate the transcriptomic data output (all times noted in local time (UTC + 2)). Sample details are available in Supplementary Table 1.
Summary of sampling and sequencing strategy. Details are available in Supplementary Table 1.
| Station | Number of timepoint | Sampling frequency | Total duration of sampling | Number of replicate per timepoint | Total number of samples | Sequencing strategy | Reads | Platform |
|---|---|---|---|---|---|---|---|---|
| “North” | 7 | 4 h | 24 h | 3 | 21 | RNA-seq | paired-end 2 × 150 pb | Illumina NovaSeq |
| “South” | 7 | 4 h | 24 h | 3 | 21 | RNA-seq | paired-end 2 × 150 pb | Illumina NovaSeq |
List of available datasets related to the study (NCBI Bioproject PRJNA628886[40] and figshare collection 5127704[41]).
| DOI | Availability | File Name | Description | File format |
|---|---|---|---|---|
| — | SRR11748365.fastq.gz to SRR11748406.fastq.gz | Raw RNA sequencing data | fastq | |
| 10.6084/m9.figshare.c.5127704 | 10.6084/m9.figshare.c.5127704 | PRJNA628886_raw_quantification_206K.tsv.gz | Raw count matrix of the 206,012 contigs | tsv |
| 76k_ids_list.txt | List of identifiers corresponding to the 76,550 contigs after filtering | txt | ||
| PRJNA628886_quantification_downsampled_76k.tsv.gz | Down-sampling normalized quantification matrix | tsv | ||
| PRJNA628886_quantification_RLE_76k.tsv.gz | RLE normalized quantification matrix | tsv | ||
| Interprot_annot_206K.gff3 | Gff3 InterProScan annotation of the 206,012 contigs | gff3 | ||
| Interprot_annot_206K.tsv | Tabulated InterProScan annotation of the 206,012 contigs | tsv | ||
| diamond_annotation_206k.tsv | Tabulated DIAMOND annotation matrix of the 206,012 contigs (NR,swissprot,trembl) | tsv | ||
| diamond_annotation_76k.tsv | Tabulated DIAMOND annotation matrix of the 76,550 contigs (NR,swissprot,trembl) | tsv |
Details are available in Supplementary Table 1.
Fig. 2Mapping statistics of the 42 samples against the set of 206,012 contigs. Number of million (M) reads: paired, single mate, and unmapped. Details are available in Supplementary Table 3.
Fig. 3Principal component analysis of the 42 samples based on expression levels of the whole transcriptome (206,012 contigs). “Groups” grouping the 3 replicates per time and per station.
Average number (million, mean ± SE) of alignments per station and percentage of alignments discarded by filtering contigs with very low expression.
| Station | Average number of aligned reads on full dataset (206,012 contigs) | Average number of aligned reads after filtering (76,550 contigs) | Percentage of aligned reads discarded by filtering |
|---|---|---|---|
| “North” | 117.9 ± 5.4 | 116.3 ± 5.3 | 1.32 ± 0.04% |
| “South” | 129.9 ± 9.3 | 128.3 ± 9.2 | 1.27 ± 0.05% |
Details are available in Supplementary Table 4.
Number of contigs annotated with DIAMOND against NR, TREMBL and Swissprot and number of unique hits in the target database.
| Dataset | Database | Number of contigs | Number of unique hit in target database |
|---|---|---|---|
| 206,012 contigs | NR | 32,637 | 16,165 |
| Trembl | 28,845 | 15,518 | |
| Swissprot | 11,668 | 5,486 | |
| With an annotation in at least one database | 36,274 | — | |
| 76,550 contigs | NR | 21,171 | 11,045 |
| Trembl | 17,988 | 10,051 | |
| Swissprot | 8,456 | 4,343 | |
| With an annotation in at least one database | 22,527 | — |
Number of contigs with an InterProScan annotation and details on main features.
| Dataset | Main programs | Extracted GO | ||
|---|---|---|---|---|
| PFAM | TMHMM | getorf | ||
| 206,012 contigs | 63,661 | 123,542 | 192,688 | 65,924 |
| 76,550 contigs | 34,558 | 48,682 | 73,272 | 33,057 |
Details are available in Supplementary Table 5.
| Measurement(s) | transcriptome • sequence feature annotation |
| Technology Type(s) | RNA sequencing • sequence annotation |
| Factor Type(s) | location • time |
| Sample Characteristic - Organism | Calanus finmarchicus |
| Sample Characteristic - Environment | polar |
| Sample Characteristic - Location | Arctic Ocean |