| Literature DB >> 30094004 |
Benjamin J Hunt1, Özge Özkaya1, Nathaniel J Davies1, Edward Gaten1, Paul Seear2, Charalambos P Kyriacou1, Geraint Tarling2, Ezio Rosato1.
Abstract
Antarctic krill (Euphausia superba) is a crucial component of the Southern Ocean ecosystem, acting as the major link between primary production and higher trophic levels with an annual predator demand of up to 470 million tonnes. It also acts as an ecosystem engineer, affecting carbon sequestration and recycling iron and nitrogen, and has increasing importance as a commercial product in the aquaculture and health industries. Here we describe the creation of a de novo assembled head transcriptome for E. superba. As an example of its potential as a molecular resource, we relate its exploitation in identifying and characterizing numerous genes related to the circadian clock in E. superba, including the major components of the central feedback loop. We have made the transcriptome openly accessible for a wider audience of ecologists, molecular biologists, evolutionary geneticists, and others in a user-friendly format at SuperbaSE, hosted at http://www.krill.le.ac.uk.Entities:
Keywords: Antarctic; circadian; crustacean; database; krill; transcriptome
Year: 2017 PMID: 30094004 PMCID: PMC6077532 DOI: 10.1002/ece3.3168
Source DB: PubMed Journal: Ecol Evol ISSN: 2045-7758 Impact factor: 2.912
Core circadian genes cloned and Sanger sequenced for Euphausia superba. Es‐cryptochrome 2 has been previously reported as Escry (Mazzotta et al., 2010)
| Gene name | Accession | Length (nt/aa) | Top NCBI nr hit | ||||||
|---|---|---|---|---|---|---|---|---|---|
| mRNA | CDS | Peptide | Accession | Name | Species |
| Identity (%) | ||
|
|
| 2,017 | 1,992 | 664 |
|
|
| 0 | 73 |
|
|
| 4,044 | 4,032 | 1,344 |
|
|
| 0 | 79 |
|
|
| 1,644 | 1,599 | 533 |
|
|
| 0 | 53 |
|
|
| 1,720 | 1,635 | 545 |
|
|
| 0 | 100 |
|
|
| 4,061 | 3,783 | 1,261 |
|
|
| 0 | 67 |
|
|
| 4,222 | 3,933 | 1,311 |
|
|
| 0 | 75 |
Comparison of de novo assembly quality using TransRate contig and read‐mapping metrics. Bridger, SOAPdenovo‐Trans, and Trans‐ABySS assemblies are multi‐k‐mer assemblies, see Methods for details. See Smith‐Unna et al. (2016) and http://hibberdlab.com/transrate/metrics.html for further details regarding TransRate metrics. Read‐mapping metrics were not considered appropriate for the coding assembly, which is a selective dataset
| Metric | Bridger | SOAPdenovo‐Trans | Trans‐ABySS | Trinity | Total assembly | Coding assembly |
|---|---|---|---|---|---|---|
| No. of contigs | 513,499 | 282,833 | 318,436 | 157,274 | 484,080 | 147,450 |
| Shortest | 189 | 200 | 200 | 201 | 200 | 297 |
| Longest | 27,543 | 17,265 | 37,689 | 11,148 | 34,468 | 34,468 |
| Mean length | 780 | 445 | 573 | 503 | 726 | 1,121 |
| No. with ORF | 138,937 | 51,336 | 72,597 | 31,566 | 135,969 | 100,823 |
| ORFs % | 27% | 18% | 23% | 20% | 28% | 81% |
| N50 | 1,218 | 480 | 738 | 602 | 1,071 | 1,518 |
| % of reads mapped | 97% | 93% | 92% | 82% | 98% | – |
| Good mappings | 30,880,238 | 28,292,720 | 28,688,483 | 22,012,681 | 31,833,986 | – |
| % good mappings | 88% | 81% | 82% | 63% | 91% | – |
| Potential bridges | 43,048 | 46,708 | 38,398 | 38,011 | 34,120 | – |
| % uncovered bases | 38% | 11% | 15% | 4% | 23% | – |
| % contigs with uncovered bases | 86% | 71% | 77% | 44% | 80% | – |
| % contigs uncovered | 45% | 16% | 18% | 4% | 29% | – |
| % contigs low‐covered | 93% | 75% | 80% | 76% | 89% | – |
| Transrate score | 0.13 | 0.26 | 0.25 | 0.21 | 0.42 | – |
Figure 1Functional characterization of SuperbaSE coding assembly, in comparison with KrillDB (also E. superba) and D. pulex mRNA transcripts. Chart depicts percentage of contigs/genes mapping to high‐level GO terms retrieved through BLAST annotation with SwissProt
Figure 2Es‐BMAL1 (accession KX238952) aligned with fragments mined from the total assembly. Yellow highlights represent sequence data not found in the total assembly
Figure 3Es‐CLOCK (accession KX238953) aligned with fragments mined from the total assembly. Yellow highlights represent sequence data not found in the total assembly. Red text highlights residues inconsistent between the confirmed PCR sequence and transcriptome sequences
Figure 4Es‐CRY1 (accession KX238951) aligned with fragments mined from the total assembly. Yellow highlights represent sequence data not found in the total assembly. Red text highlights residues inconsistent between the confirmed PCR sequence and transcriptome sequences
Figure 5Es‐CRY2 (accession CAQ86665, Mazzotta et al. (2010)) aligned with contig mined from the total assembly
Figure 6Es‐PERIOD (accession KX238955) aligned with contig mined from the total assembly. Red text highlights residues inconsistent between the confirmed PCR sequence and transcriptome sequence. Black arrows denote the point at which sequence data from 3′ RACE extension ends—beyond this the transcriptome was necessary to complete the sequence
Figure 7Es‐TIMELESS (accession KX238954) aligned with contig mined from the total assembly. Yellow highlights represent sequence data not found in the total assembly. Red text highlights residues inconsistent between the confirmed PCR sequence and transcriptome sequence. Black arrows denote the point at which sequence data from 3′ RACE extension ends—beyond this the transcriptome was necessary to complete the sequence
Transcriptome mining: clock‐related query proteins and E. superba output contigs
| Query Protein | Accession |
| Length | Protein name | Length | |
|---|---|---|---|---|---|---|
| Protein name | Contig ID |
| ||||
| CASEIN KINASE II α |
| ES.21_comp3643_seq1 | 0 | 2556 | Es‐CKIIα | 350 |
| CASEIN KINASE II β |
| ES.194593 | 4E‐119 | 1276 | Es‐CKIIβ | 219 |
| CLOCKWORK ORANGE |
| ES.21_comp15386_seq1 | 2E‐26 | 2301 | Es‐CWO | 707 |
| CTRIP |
| ES.31_comp9752_seq0 | 1E‐167 | 8148 | Es‐CTRIP | 2152 |
| DOUBLETIME |
| ES.25_comp4349_seq1 | 0 | 2480 | Es‐DBT | 345 |
| JETLAG |
| ES.21_comp16370_seq0 | 1E‐15 | 1901 | – | – |
| LARK |
| ES.k31.S5232406 | 6E‐59 | 1199 | Es‐LARK | 326 |
| NEJIRE |
| ES.31_comp5882_seq0 | 0 | 6924 | Es‐NEJ | 2072 |
| NEMO |
| k21.S7956922 | 0 | 2038 | Es‐NEMO | 456 |
| PAR DOMAIN PROTEIN 1ε |
| ES.k41.J3781389 | 8E‐05 | 1835 | Es‐PDP1 | 489 |
| PIGMENT DISPERSING FACTOR |
| ES.k51.S2746663 | 8E‐06 | 387 | Es‐PDH1 | 74 |
| ES.c68106_g1_i1 | 3E‐05 | 531 | Es‐PDH2 | 79 | ||
| PIGMENT DISPERSING FACTOR RECEPTOR |
| ES.21_comp12084_seq0 | 3E‐91 | 1588 | Es‐PDHR | 295 |
| PROTEIN PHOSPHATASE 1 |
| ES.k51.J2636217 | 0 | 1870 | Es‐PP1 | 329 |
| PROTEIN PHOSPHATASE 2A ‐ MICROTUBULE STAR |
| ES.23_comp1445_seq0 | 0 | 2263 | Es‐MTS | 309 |
| PROTEIN PHOSPHATASE 2A ‐ TWINS |
| ES.21_comp997_seq2 | 0 | 3375 | Es‐TWS | 455 |
| PROTEIN PHOSPHATASE 2A ‐ WIDERBORST |
| ES.104836 | 0 | 4190 | Es‐WBT | 461 |
| REV‐ERBα |
| ES.25_comp2884_seq0 | 1E‐45 | 3602 | Es‐E75 | 800 |
| SHAGGY |
| ES.27_comp3209_seq0 | 0 | 2222 | Es‐SGG | 415 |
| SUPERNUMERARY LIMBS |
| ES.25_comp34876_seq0 | 0 | 1843 | Es‐SLIMB | 612 |
| TAKEOUT |
| ES.21_comp3588_seq0 | 1E‐11 | 1090 | Es‐TAKEOUT | 247 |
| TIMELESS (TIMEOUT) | Q9R1X4 | ES.23_comp37221_seq0 | 2E‐82 | 934 | Es‐TIMEOUT | 1363 |
| ES.29_comp20079_seq0 | 4E‐94 | 3067 | ||||
| VRILLE |
| ES.25_comp4277_seq0 | 3E‐43 | 2021 | Es‐VRI | 474 |
E. superba PDH also reported by Toullec et al. (2013).
Query protein used is a Mus musculus orthlog. All others from Drosophila melanogaster.
Es‐NEJIRE, Es‐NEMO and Es‐SLIMB are 3′ partial fragments.
Es‐TIMEOUT full protein sequence determined through PCR using fragments listed.
Results of blastp analyses of putative E. superba clock‐associated proteins against D. melanogaster protein database (Flybase) and NCBI nonredundant (nr) protein database
| Query | Top Flybase hit | Top NCBI nr hit | |||||
|---|---|---|---|---|---|---|---|
| Flybase No. | Associated gene name |
| Accession | Name | Species |
| |
| Es‐CKIIα | FBpp0070041 |
| 1E‐162 |
| Casein kinase II subunit alpha |
| 0 |
| Es‐CKIIβ | FBpp0300427 |
| 6E‐114 |
| Casein kinase II subunit beta |
| 5E‐146 |
| Es‐CTRIP | FBpp0310477 |
| 1E‐173 |
| ULF (TRIP12) |
| 0 |
| Es‐CWO | FBpp0081723 |
| 8E‐27 |
| Hairy/enhancer‐of‐split related with YRPW motif protein 2 |
| 2E‐61 |
| Es‐DBT | FBpp0306615 |
| 7E‐157 |
| Casein kinase 1 epsilon |
| 0 |
| Es‐E75 | FBpp0074915 |
| 2E‐143 |
| Ecdysteroid receptor E75 |
| 0 |
| Es‐LARK | FBpp0076555 |
| 2E‐55 |
| lark, isoform A |
| 2E‐62 |
| Es‐NEJ | FBpp0305701 |
| 0 |
| CREB‐binding protein |
| 0 |
| Es‐NEMO | FBpp0076474 |
| 0 |
| Nemo, isoform E |
| 0 |
| Es‐PDP1 | FBpp0076495 |
| 2E‐35 |
| Hepatic leukemia factor |
| 2E‐39 |
| Es‐PDH1 | FBpp0084396 |
| 3E‐05 |
| PDH related peptide precursor 79 |
| 9E‐27 |
| Es‐PDHR | FBpp0309084 |
| 9E‐97 |
| Neuropeptide GPCR B2 |
| 1E‐140 |
| Es‐PP1 | FBpp0306442 |
| 4E‐178 |
| PP1‐alpha catalytic subunit |
| 0 |
| Es‐MTS | FBpp0310063 |
| 2E‐176 |
| PP2A catalytic subunit alpha |
| 0 |
| Es‐TWS | FBpp0081671 |
| 0 |
| PP2A regulatory subunit B |
| 0 |
| Es‐WBT | FBpp0084575 |
| 0 |
| PP2A regulatory subunit epsilon |
| 0 |
| Es‐TAKEOUT | FBpp0297106 |
| 1E‐32 |
| Circadian clock‐controlled protein precursor |
| 1.00E‐47 |
| Es‐TIMEOUT | FBpp0082180 |
| 0 |
| Timeout |
| 0 |
| Es‐SGG | FBpp0070450 |
| 0 |
| Shaggy |
| 0 |
| Es‐SLIMB | FBpp0306059 |
| 0 |
| F‐box/WD repeat‐containing protein 1A |
| 0 |
| Es‐VRI | FBpp0309715 |
| 4E‐42 |
| Vrille |
| 3E‐68 |
Fragment. Es‐PDH2 is not shown as it is a minor variant of Es‐PDH1.
Preprohormone query proteins with E. superba output contigs and subsequent blastp analysis against NCBI nonredundant protein database
| Peptide family (subfamily) | Output from total assembly using Accession as query | Output from NCBI NR using total assembly contig as query | |||||
|---|---|---|---|---|---|---|---|
| Accession | Contig | Size (nt) |
| Accession | Description |
| |
| Allatostatin A |
| ES.27_comp22909_seq0 | 1239 | 4E‐19 |
| allatostatin precursor protein | 1E‐60 |
| Allatostatin B |
| ES.27_comp14162_seq0 | 710 | 2E‐09 |
| B‐type preproallatostatin I | 3E‐15 |
| Allatostatin C |
| ES.23_comp57987_seq0 | 757 | 4E‐05 |
| Allatostatin‐cc | 5E‐13 |
| Allatotropin |
| – | – | – | – | – | – |
| Bursicon alpha |
| ES.21_comp33157_seq0 | 552 | 2E‐60 |
| Bursicon alpha subunit | 4E‐72 |
| Bursicon beta |
| ES.21_comp28222_seq0 | 721 | 7E‐37 |
| Bursicon beta subunit | 9E‐63 |
| CHHamide |
| ES.25_comp21895_seq1 | 829 | 6E‐04 |
| CCHamide‐like precursor | 3E‐08 |
| Corazonin |
| ES.29_comp12038_seq1 | 365 | 2E‐04 |
| Corazonin precursor‐related peptide | 2E‐06 |
| Crustacean cardioactive peptide |
| ES.27_comp31900_seq0 | 685 | 2E‐02 |
| Crustacean cardioactive peptide | 4E‐33 |
| Crustacean hyperglycemic hormone |
| ES.k41.R3771434 | 691 | 5E‐24 |
| Crustacean hyperglycemic hormone precursor | 8E‐27 |
| Calcitonin‐like diuretic hormone |
| ES.c66980_g1_i1 | 608 | 1E‐47 |
| Prepro‐calcitonin‐like diuretic hormone | 1E‐53 |
| Corticotropin‐releasing factor‐like diuretic hormone |
| – | – | – | – | – | – |
| Ecdysis‐triggering hormone |
| – | – | – | – | – | – |
| Eclosion hormone |
| ES.21_comp1319_seq1 | 536 | 3E‐10 |
| Eclosion hormone 2 | 2E‐16 |
| FMRFamide‐like peptide (myosuppressin) |
| ES.265178 | 343 | 2E‐34 |
| Myosuppressin‐like neuropeptide precursor | 9E‐36 |
| FMRFamide‐like peptide (neuropeptide F) |
| ES.23_comp4135_seq0 | 700 | 3E‐26 |
| Preproneuropeptide F I | 5E‐27 |
| FMRFamide‐like peptide (short neuropeptide F) |
| ES.k61.S126078 | 378 | 4E‐08 |
| Short neuropeptide F precursor | 2E‐13 |
| FMRFamide‐like peptide (sulfakinin) |
| – | – | – | – | – | – |
| FMRFamide‐like peptide (other) |
| ES.63534 | 521 | 6E‐06 |
| FMRFamide‐like peptide precursor | 1E‐08 |
| Insulin‐like peptide |
| – | – | – | – | – | – |
| Inotocin |
| ES.c73008_g1_i1 | 538 | 3E‐17 |
| Arginine vasotocin precursor | 2E‐24 |
| Leucokinin |
| – | – | – | – | – | – |
| Neuroparsin |
| ES.31_comp27103_seq0 | 522 | 1E‐15 |
| Neuroparsin‐A precursor | 4E‐16 |
| Orcokinin |
| ES.k51.J2644345 | 765 | 2E‐59 |
| Orcokinin‐like peptide 4 Precursor | 1E‐77 |
| Periviscerokinin/pyrokinin |
| – | – | – | – | – | – |
| Proctolin |
| – | – | – | – | – | – |
| Red pigment concentrating hormone |
| ES.c39108_g1_i1 | 620 | 9E‐18 |
| Red pigment concentrating hormone | 4E‐17 |
| Adipokinetic hormone |
| – | – | – | – | – | – |
| RYamide |
| – | – | – | – | – | – |
| SIFamide |
| ES.23_comp6805_seq0 | 935 | 7E‐25 |
| SIFamide precursor | 8E‐24 |
| Tachykinin‐related peptide |
| ES.c73859_g1_i1 | 1289 | 6E‐37 |
| Preprotachykinin B | 1E‐37 |
Candidates for these genes also reported by Toullec et al. (2013).
Comparison of circadian‐related contigs from the total assembly of SuperbaSE and equivalents found in KrillDB. Longest accepted candidate contig shown for each assembly. “Agreement” shows percentage of identical residues in the largest HSP in each search result alignment
| Gene | Total assembly contig | Length | KrillDB contig | Length | Agreement (%) |
|---|---|---|---|---|---|
|
| ES.21_comp39465_seq0 | 1,150 | ESS133965 | 303 | 100 |
|
| ES.21_comp24303_seq0 | 1,903 | ESS034516 | 1,795 | 100 |
|
| ES.23_comp19428_seq0 | 4,476 | ESS133963 | 2,339 | 99 |
|
| ES.23_comp16712_seq0 | 3,415 | ESS040526 | 4,047 | 99 |
|
| ES.23_comp21133_seq0 | 1,100 | ESS023688 | 1,819 | 99 |
|
| ES.154212 | 2,146 | ESS118473 | 1,855 | 100 |
|
| ES.21_comp3643_seq1 | 2,556 | ESS071624 | 1,825 | 91 |
|
| ES.194593 | 1,276 | ESS051147 | 805 | 97 |
|
| ES.21_comp15386_seq1 | 2,301 | ESS049809 | 2,111 | 95 |
|
| ES.31_comp9752_seq0 | 8,148 | ESS002511 | 2,454 | 100 |
|
| ES.25_comp4349_seq1 | 2,480 | ESS096454 | 1,925 | 91 |
|
| ES.k31.S5232406 | 1,199 | ESS098132 | 1,058 | 70 |
|
| ES.31_comp5882_seq0 | 6,924 | ESS048098 | 2,180 | 98 |
|
| k21.S7956922 | 2,038 | ESS021045 | 2,238 | 100 |
|
| ES.k41.J3781389 | 1,835 | ESS017186 | 465 | 75 |
|
| ES.k51.S2746663 | 387 | ESS069399 | 362 | 93 |
|
| ES.21_comp12084_seq0 | 1,588 | ESS048516 | 1,557 | 100 |
|
| ES.k51.J2636217 | 1,870 | ESS073678 | 2,246 | 99 |
|
| ES.23_comp1445_seq0 | 2,263 | ESS030735 | 1,334 | 58 |
|
| ES.21_comp997_seq2 | 3,375 | ESS126945 | 2,979 | 100 |
|
| ES.104836 | 4,190 | ESS023456 | 3,762 | 72 |
|
| ES.25_comp2884_seq0 | 3,602 | ESS094384 | 3,620 | 99 |
|
| ES.27_comp3209_seq0 | 2,222 | ESS074789 | 1,350 | 96 |
|
| ES.25_comp34876_seq0 | 1,843 | ESS133964 | 1,350 | 100 |
|
| ES.21_comp3588_seq0 | 1,090 | – | – | – |
|
| ES.23_comp37221_seq0 | 3,067 | ESS130300 | 2,078 | 99 |
|
| ES.25_comp4277_seq0 | 2,021 | ESS123361 | 1,522 | 100 |
These contigs were rejected as equivalent to the query contigs on visual inspection.
Figure 8Sequence and annotation data found in SuperbaSE for circadian gene Es‐period
Comparison of published E. superba de novo assemblies. “No. of reads” shows raw read count of new sequencing based on the samples described in this table, either as reported or obtained from NCBI SRA
| Clark et al. ( | De Pittà et al. ( | Martins et al. ( | Meyer et al. ( | Sales et al. ( | SuperbaSE (coding assembly) | |
|---|---|---|---|---|---|---|
| Sampling vicinity | South Orkney Islands | Ross Sea | Antarctic Peninsula | East Antarctica/Lazarev Sea | Indian Ocean | NW of South Georgia |
| Tissue type | Adult, whole | Adult, head, abdomen, thoracopods | Adult, head and eye stalks | Adult, whole/head, aquarium/wild | Larval, whole, aquarium‐bred | Adult, head |
| Experimental conditions/library type | Pooled time series | Pooled time series | Control/food deprivation/UV‐B stress | Pooled seasonal samples | Control/1000/2000 μatm pCO2 | Pooled time series |
| Sequencing technology | GS FLX Titanium (454) | GS FLX Titanium (454) | GS FLX Titanium (454) | GS FLX Titanium (454) | Illumina Genome Analyzer IIx | Illumina HiSeq 2000 |
| No. of reads | 943,817 | 96,803 | 377,442 | 1,771,572 | 368,287,158 | 69,837,314 |
| No. of reads used in assembly | 699,248 | 711,148 | 306,182 | 2,608,911 | 33,175,931 | 69,837,314 |
| No. of assembled contigs | 22,177 | 32,217 | 26,415 | 58,581 | 133,962 | 147,450 |
| Smallest contig | 137 | 300 | – | 300 | 200 | 200 |
| Largest contig | 8,515 | 8,558 | – | 11,127 | 14,396 | 34,468 |
| Average contig size | 492 | 890 | 593 | 691 | 964 | 1,121 |
| N50 | – | – | 666 | 716 | 1,294 | 1,518 |
| No. of reported BLAST annotations | 5,563 | 11,230 | 10,501 | 15,347 | 90,121 | 92,201 |
| BLAST annotation source | GenBank nonredundant | NCBI nonredundant, UniProtKB | NCBI nonredundant (arthropoda) | NCBI nucleotide, UniProtKB | NCBI nucleotide, UniProtKB/TREMBL | SwissProt, UniProt arthropoda |
| Notes | – | Final assembly includes publicly available ESTs and reads from Clark et al. ( | – | Final assembly includes reads from Clark et al. ( | Reads were subject to digital normalization before assembly. Final assembly includes fragments from Meyer et al. ( | – |