| Literature DB >> 35228860 |
Laura Payton1,2, Céline Noirot3, Kim S Last4, Jordan Grigor4, Lukas Hüppe1,2,5,6, David V P Conway7, Mona Dannemeyer2, Amandine Suin8, Bettina Meyer1,2,6.
Abstract
The copepod Calanus finmarchicus (Crustacea, Copepoda) is a key zooplanktonic species with a crucial position in the North Atlantic food web and significant contributor to ocean carbon flux. Like many other high latitude animals, it has evolved a programmed arrested development called diapause to cope with long periods of limited food supply, while growth and reproduction are timed to take advantage of seasonal peaks in primary production. However, anthropogenic warming is inducing changes in the expected timing of phytoplankton blooms, suggesting phenological mismatches with negative consequences for the N. Atlantic ecosystem. While diapause mechanisms are mainly studied in terrestrial arthropods, specifically on laboratory model species, such as the fruit fly Drosophila, the molecular investigations of annual rhythms in wild marine species remain fragmentary. Here we performed a rigorous year-long monthly sampling campaign of C. finmarchicus in a Scottish Loch (UK; 56.45°N, 5.18°W) to generate an annual transcriptome. The mRNA of 36 samples (monthly triplicate of 25 individuals) have been deeply sequenced with an average depth of 137 ± 4 million reads (mean ± SE) per sample, aligned to the reference transcriptome, and filtered. We detail the quality assessment of the datasets and provide a high-quality resource for the investigation of wild annual transcriptomic rhythms (35,357 components) in a key diapausing zooplanktonic species.Entities:
Keywords: annual rhythms; copepods; diapause; transcriptome; wildlife; zooplankton
Year: 2022 PMID: 35228860 PMCID: PMC8861585 DOI: 10.1002/ece3.8605
Source DB: PubMed Journal: Ecol Evol ISSN: 2045-7758 Impact factor: 2.912
FIGURE 1Overview of the experimental workflow used to generate the transcriptomic data output. Sample details are available in Tables S1 and S2
Summary of sampling and sequencing strategy. Details are available in Tables S1 and S2
| Number of time points | Sampling frequency | Total duration of sampling | Number of replicates per time point | Total number of samples | Sequencing strategy | Reads | Platform | Volume |
|---|---|---|---|---|---|---|---|---|
| 12 | 1 month | 1 year | 3 | 36 | RNA‐seq | Paired‐end 2 × 150 pb | Illumina NovaSeq | 1 lane S4 |
Mapping results on the reference transcriptome (96,090 comps) before and after filtering. Details are available in Table S3
| Average number of aligned reads (million, mean ± SE) | Number of comps | |
|---|---|---|
| Mapping results before filtering | 102 ± 3 | 96,090 |
| Mapping results after filtering | 99 ± 3 | 35,357 |
Number (and percentage) of annotated comps in the working transcriptome (35,357 comps), using annotation works from Bioproject PRJNA628886 (Payton et al., 2020) and Bioproject PRJNA236528 (Lenz et al., 2014)
| Dataset | Annotation source | Database | Annotated comps | Comps with at least one annotation |
|---|---|---|---|---|
| Working transcriptome (35,357 comps) | Bioproject PRJNA628886 | NR | 9779 (27.7%) | 22,643 (64.0%) |
| Trembl | 7868 (22.3%) | |||
| SwissProt | 3803 (10.8%) | |||
| Bioproject PRJNA236528 | Non‐redundant database | 21,938 (62.0%) | ||
| Bioproject PRJNA628886 | InterProScan (GO) | 15,726 (44.5%) | 15,726 (44.5%) |
FIGURE 2Counts distribution. Counts (expressed as pseudo‐counts, i.e. (counts +1) at the log2 scale) distribution: (a) before filtering the outlier component (96,090 “comps”), (b) after filtering the outlier component (96,089 “comps”), (c) after filtering components with less than 1 CPM in all samples (35,357 “comps”), and (d) after down‐sampling normalization (35,357 “comps”)
FIGURE 3Correlation between replicates before and after filtering the outlier component (comp92_c0_seq1; GAXK01170082.1). Data are available in Table S4