| Literature DB >> 26076827 |
Bas Verbruggen1, Lisa K Bickley2, Eduarda M Santos3, Charles R Tyler4, Grant D Stentiford5, Kelly S Bateman6, Ronny van Aerle7.
Abstract
BACKGROUND: The European shore crab, Carcinus maenas, is used widely in biomonitoring, ecotoxicology and for studies into host-pathogen interactions. It is also an important invasive species in numerous global locations. However, the genomic resources for this organism are still sparse, limiting research progress in these fields. To address this resource shortfall we produced a C. maenas transcriptome, enabled by the progress in next-generation sequencing technologies, and applied this to assemble information on the innate immune system in this species.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26076827 PMCID: PMC4469326 DOI: 10.1186/s12864-015-1667-1
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Number of read pairs obtained for each crab tissue before and after removal of adapter sequences and quality filtering
| Tissue sample | Number of read pairs | Number of clean read pairs |
|---|---|---|
| Eggs | 9,337,648 | 6,614,044 |
| Epidermis | 11,929,821 | 8,302,718 |
| Eye | 13,463,765 | 9,430,381 |
| Gill | 10,110,102 | 7,234,304 |
| Haemolymph | 10,611,241 | 7,233,253 |
| Heart | 9,657,081 | 6,717,788 |
| Hepatopancreas | 9,216,408 | 6,471,110 |
| Intestine | 8,685,232 | 5,765,077 |
| Muscle | 17,251,355 | 11,749,555 |
| Nerve | 14,278,257 | 9,670,912 |
| Ovary | 11,125,170 | 7,869,190 |
| Testis | 13,197,599 | 9,189,430 |
| Total | 138,863,679 | 96,247,762 |
Transcriptome statistics
| Description | Value |
|---|---|
| Number of loci | 153,669 |
| Number of transcripts | 212,427 |
| Maximum transcript length (bp) | 24,848 |
| Minimal transcript length (bp) | 201 |
| Mean transcript length (bp) | 992 |
| Standard deviation (bp) | 1363 |
| Median transcript length (bp) | 380 |
| Total length (bp) | 196,966,469 |
| N50 (bp) | 2,102 |
Number of annotated transcripts and open reading frames (identified by TransDecoder) using different annotation methods and sequence databases
| Input | Annotation method | Number of annotated transcripts |
|---|---|---|
| All transcripts | BLASTx – NCBI nr protein | 62,804 (29.6 %) |
| All transcripts | BLASTn – | 19,891 (9.4 %) |
| All transcripts | BLAST2GO | 8,091 (3.8 %) |
| All transcripts | TransDecoder ORF finder | 58,383 (27.5 %) |
| All transcripts | KEGG | 30,352 (14.3 %) |
| Open reading frames | BLASTp – UniProt/SwissProt | 41,108 (70.4 %) |
| Open reading frames | Pfam | 37,776 (67.4 %) |
| Open reading frames | SignalP | 4,132 (1.9 %) |
| Open reading frames | TmHMM | 0 (0.0 %) |
Identification, sequence similarity and Gene Ontology annotation statistics of peptide sequences in the transcriptome
| Description | Number of sequences | Percentage of sequences (%) | |
|---|---|---|---|
| Transcripts | 212,427 | ||
| TransDecoder peptides | 58,383 | ||
| Peptides with Swissprot/Uniprot annotation | 41,108 | 70.41 | |
| GO annotated transcripts | 53,766 | 25.31 | |
| GO annotated peptides | 19,423 | 47.23 | |
| GO tree | GO | Count | % |
| Cellular Component | cytoplasm | 2,122 | 10.9 |
| nucleus | 1,955 | 10.1 | |
| plasma membrane | 1,272 | 6.6 | |
| membrane | 1,213 | 6.3 | |
| cytosol | 1,195 | 6.2 | |
| Molecular Function | protein binding | 2,577 | 13.3 |
| binding | 1,071 | 5.5 | |
| ATP binding | 755 | 3.9 | |
| metal ion binding | 569 | 2.9 | |
| protein homodimerization activity | 485 | 2.5 | |
| Biological Process | cellular process | 597 | 3.1 |
| regulation of cellular process | 539 | 2.8 | |
| primary metabolic process | 446 | 2.3 | |
| response to stimulus | 419 | 2.2 | |
| transport | 416 | 2.1 |
Fig. 1Taxonomic classifications of Carcinus maenas transcripts. Partially collapsed phylogenetic tree produced by MEGAN4. Numbers illustrate the number of transcripts representing each taxa. Within the metazoan taxon, the pancrustacea represented the largest taxonomic group
Differentially expressed transcripts in specific tissues
| Tissue | Differentially expressed transcripts |
|---|---|
| eggs | 1,605 |
| epidermis | 1,339 |
| eye | 1,312 |
| gill | 1,223 |
| Haemolymph | 2,008 |
| heart | 1,226 |
| hepatopancreas | 2,741 |
| intestine | 1,519 |
| muscle | 2,200 |
| nerve | 1,989 |
| ovary | 1,751 |
| testis | 1,391 |
Top 5 most enriched Gene Ontology terms in specific tissues
| Tissue | GO-ID | Term |
| FDR |
|---|---|---|---|---|
| Eggs | GO:0042302 | structural constituent of cuticle | 9.86e-11 | 1.07e-6 |
| Eggs | GO:0003677 | DNA binding | 5.25e-7 | 2.86e-3 |
| Eggs | GO:0006260 | DNA replication | 2.78e-6 | 1.01e-2 |
| Eggs | GO:0006261 | DNA-dependent DNA replication | 5.76e-6 | 1.57e-2 |
| Eggs | GO:0001708 | cell fate specification | 1.06e-5 | 2.30e-2 |
| Epidermis | GO:0018298 | protein-chromophore linkage | 3.85e-7 | 2.56e-3 |
| Epidermis | GO:0015772 | oligosaccharide transport | 7.05e-7 | 2.56e-3 |
| Epidermis | GO:0015766 | disaccharide transport | 7.05e-7 | 2.56e-3 |
| Eye | GO:0003008 | system process | 7.00e-12 | 7.61e-8 |
| Eye | GO:0050877 | neurological system process | 2.36e-11 | 1.28e-7 |
| Eye | GO:0022834 | ligand-gated channel activity | 9.64e-9 | 2.62e-5 |
| Eye | GO:0015276 | ligand-gated ion channel activity | 9.64e-9 | 2.62e-5 |
| Eye | GO:0070011 | peptidase activity, acting on L-amino acid peptides | 1.57e-8 | 3.42e-5 |
| Gill | GO:0070160 | occluding junction | 1.71e-6 | 4.20e-3 |
| Gill | GO:0005344 | oxygen transporter activity | 1.84e-6 | 4.20e-3 |
| Gill | GO:0015671 | oxygen transport | 1.84e-6 | 4.20e-3 |
| Gill | GO:0015669 | gas transport | 1.84e-6 | 4.20e-3 |
| Gill | GO:0005923 | tight junction | 2.65e-6 | 4.20e-3 |
| Haemolymph | GO:0001525 | angiogenesis | 5.76e-11 | 6.27e-7 |
| Haemolymph | GO:0048514 | blood vessel morphogenesis | 1.80e-9 | 9.79e-6 |
| Haemolymph | GO:0001568 | blood vessel development | 1.25e-8 | 4.54e-5 |
| Haemolymph | GO:0001944 | vasculature development | 3.97e-8 | 1.08e-4 |
| Haemolymph | GO:0009653 | anatomical structure morphogenesis | 1.46e-7 | 1.65e-4 |
| Heart | GO:0016328 | lateral plasma membrane | 1.78e-7 | 1.94e-3 |
| Heart | GO:0006768 | biotin metabolic process | 1.28e-6 | 3.04e-3 |
| Heart | GO:0004736 | pyruvate carboxylase activity | 1.28e-6 | 3.04e-3 |
| Heart | GO:0005344 | oxygen transporter activity | 1.67e-6 | 3.04e-3 |
| Heart | GO:0015671 | oxygen transport | 1.67e-6 | 3.04e-3 |
| Hepatopancreas | GO:0016491 | oxidoreductase activity | 6.35e-16 | 6.91e-12 |
| Hepatopancreas | GO:0003824 | catalytic activity | 2.87e-11 | 1.56e-7 |
| Hepatopancreas | GO:0044710 | single-organism metabolic process | 2.06e-10 | 7.47e-7 |
| Hepatopancreas | GO:0005576 | extracellular region | 5.68e-10 | 1.20e-6 |
| Hepatopancreas | GO:0005764 | lysosome | 6.61e-10 | 1.20e-6 |
| Intestine | GO:0016337 | cell-cell adhesion | 5.72e-9 | 6.22e-5 |
| Intestine | GO:0005548 | phospholipid transporter activity | 7.29e-8 | 3.97e-4 |
| Intestine | GO:0006022 | aminoglycan metabolic process | 1.56e-7 | 4.06e-4 |
| Intestine | GO:0015917 | aminophospholipid transport | 2.09e-7 | 4.06e-4 |
| Intestine | GO:0004012 | phospholipid-translocating ATPase activity | 2.09e-7 | 4.06e-4 |
| Muscle | GO:0042383 | sarcolemma | 1.93e-11 | 2.10e-7 |
| Muscle | GO:0031674 | I band | 7.82e-11 | 4.25e-7 |
| Muscle | GO:0006811 | ion transport | 2.87e-10 | 1.04e-6 |
| Muscle | GO:0030018 | Z disc | 1.94e-9 | 5.29e-6 |
| Muscle | GO:0044449 | contractile fiber part | 2.54e-9 | 5.52e-6 |
| Nerve | GO:0015277 | kainate selective glutamate receptor activity | 1.39e-14 | 1.51e-10 |
| Nerve | GO:0004872 | receptor activity | 2.54e-12 | 1.38e-8 |
| Nerve | GO:0048172 | regulation of short-term neuronal synaptic plasticity | 5.16e-12 | 1.87e-8 |
| Nerve | GO:0004970 | ionotropic glutamate receptor activity | 1.02e-11 | 2.77e-8 |
| Nerve | GO:0048168 | regulation of neuronal synaptic plasticity | 4.92e-11 | 1.07e-7 |
| Ovary | GO:0016459 | myosin complex | 1.43e-7 | 1.56e-3 |
| Ovary | GO:0018298 | protein-chromophore linkage | 1.67e-6 | 9.10e-3 |
| Ovary | GO:0036002 | pre-mRNA binding | 1.37e-5 | 4.96e-2 |
| Testis | GO:0008499 | UDP-galactose:beta-N-acetylglucosamine beta-1,3-galactosyltransferase activity | 5.07e-17 | 5.52e-13 |
| Testis | GO:0035250 | UDP-galactosyltransferase activity | 1.45e-16 | 7.86e-13 |
| Testis | GO:0005797 | Golgi medial cisterna | 1.38e-15 | 5.00e-12 |
| Testis | GO:0048531 | beta-1,3-galactosyltransferase activity | 6.12e-15 | 1.66e-11 |
| Testis | GO:0008378 | galactosyltransferase activity | 1.26e-14 | 2.75e-11 |
Carcinus maenas pathogen associated molecular pattern recognition genes
| PRP group | Transcript | Identity (%) | Length | E-value | Query | Ancestor |
|---|---|---|---|---|---|---|
| GNBP | comp44152_c0_seq1 | 42.06 | 340 | 1.00e-65 | gi|300507044 : gram-negative binding protein [ | Crustacea |
| comp44453_c0_seq (1–2) | 58.06 | 341 | 3.00e-123 | gi|62122584 : GNBP [ | Bilateria | |
| comp74133_c0_seq1 | 44.8 | 346 | 8.00e-88 | gi|62122584 : GNBP [ | Bilateria | |
| comp83740_c0_seq (1–5) | 46.02 | 339 | 5.00e-87 | gi|62122584 : GNBP [ | Bilateria | |
| comp19734_c0_seq1 | 41.55 | 142 | 8.00e-32 | gi|62122584 : GNBP [ | Bilateria | |
| comp136078_c0_seq1 | 62.96 | 81 | 3.00e-26 | gi|62122584 : GNBP [ | Bilateria | |
| comp75261_c0_seq1 | 27.57 | 243 | 6.00e-22 | gi|62122584 : GNBP [ | Bilateria | |
| TECP | comp85313_c2_seq1 | 39.94 | 318 | 6.00e-63 | gi|385049105 : thioester containing protein 3, partial [ | Crustacea |
| comp65627_c0_seq1 | 46.34 | 246 | 3.00e-58 | gi|54644242 : Thioester-containing protein 6 [ | Pancrustacea | |
| comp87629_c0_seq4 | 74.36 | 234 | 6.00e-101 | gi|331031264 : TEP isoform 2 [ | Pleocyemata | |
| comp74624_c1_seq1 | 40.65 | 310 | 8.00e-56 | gi|385049099 : thioester containing protein 3, partial [ | Crustacea | |
| comp65627_c1_seq1 | 36.78 | 590 | 6.00e-118 | gi|54644242 : Thioester-containing protein 6 [ | Pancrustacea | |
| comp74624_c2_seq1 | 37.83 | 534 | 1.00e-105 | gi|54644242 : Thioester-containing protein 6 [ | Pancrustacea | |
| comp85313_c0_seq1 | 38.14 | 430 | 1.00e-80 | gi|568250870 : thioester-containing protein [ | Pancrustacea | |
| comp103781_c0_seq1 | 43.36 | 113 | 4.00e-22 | gi|54644242 : Thioester-containing protein 6 [ | Pancrustacea | |
| C-Type Lectin | comp69837_c0_seq1 | 43.15 | 146 | 1.00e-25 | gi|558633447 : C-type lectin [ | Decapoda |
| comp86095_c0_seq (1–2) | 43.92 | 148 | 2.00e-25 | gi|558633447 : C-type lectin [ | Decapoda | |
| comp68699_c0_seq1 | 38.89 | 144 | 1.00e-24 | gi|558633447 : C-type lectin [ | Decapoda | |
| comp87731_c3_seq (2–3) | 33.78 | 225 | 8.00e-25 | gi|657397985 : C-type lectin receptor-like tyrosine-kinase plant [ | Eukaryota | |
| comp88573_c0_seq (1–2) | 57.5 | 80 | 4.00e-22 | gi|676264911 : C-type lectin domain family 3 member A [ | Bilateria | |
| comp90611_c0_seq1 | 64.56 | 158 | 5.00e-60 | gi|575878533 : C-type lectin [ | Portunoidea |
Fig. 2Toll-like receptor signalling pathway coverage. The Toll-like receptor signalling pathway in the KEGG database. Proteins in the pathway are depicted by boxes while arrows depict signalling routes. Pathway components with homologues in the Carcinus maenas transcriptome are highlighted in pink
Fig. 3TNF signalling pathway. Overview of the KEGG TNF signalling pathway, components depicted as in Fig. 2. Components with homologues in the Carcinus maenas transcriptome are highlighted in pink
Fig. 4JAK-STAT signalling pathway. Overview of the KEGG JAK-STAT signalling pathway. Components with homologues in the Carcinus maenas transcriptome are highlighted in pink
Carcinus maenas Immune system response proteins
| Response protein | Transcript | Identity (%) | Length | E-value | Query | Ancestor |
|---|---|---|---|---|---|---|
| ALF | comp79835_c0_seq2 | 65.98 | 97 | 2.00e-34 | gi|302138013 : anti-lipopolysacharide factor [ | Decapoda |
| Crustin | comp88229_c1_seq1 | 56.36 | 110 | 8.00e-31 | gi|162945361 : crustin antimicrobial peptide [ | Portunoidea |
| comp91133_c0_seq1 | 65.38 | 78 | 7.00e-24 | gi|255653868 : crustin 1 [ | Pleocyemata | |
| Carcinin | comp88229_c1_seq1 | 86.36 | 110 | 1.00e-49 | gi|18157188 : carcinin [ | Carcinus maenas |
| Lysozyme | comp83352_c1_seq4 | 41.13 | 124 | 4.00e-23 | gi|675374133 : Lysozyme 1, partial [ | Arthropoda |
| comp83352_c1_seq2 | 41.13 | 124 | 4.00e-23 | gi|675374133 : Lysozyme 1, partial [ | Arthropoda | |
| comp83352_c1_seq1 | 41.13 | 124 | 4.00e-23 | gi|675374133 : Lysozyme 1, partial [ | Arthropoda | |
| iNOS | comp89503_c2_seq (1–26) | 52.6 | 308 | 1.00e-96 | gi|13359094 : nitric oxide synthase 2 [ | Pancrustacea |
Fig. 5Melanization pathway expression. Expression of melanization pathway components in twelve Carcinus maenas tissues. The expression values are presented in FPKM, values of 0 are coloured white and values over 10000 FPKM are binned together
Carcinus maenas RNAi pathway components
| RNAi | Transcript | Identity (%) | Length | E-value | Query | Ancestor |
|---|---|---|---|---|---|---|
| TRBP | comp79785_c0_seq (1–2) | 83.97 | 343 | 2.00e-167 | gi|332271591 : TAR RNA-binding protein isoform 1 [ | Decapoda |
| comp79200_c0_seq (1–2) | 36.74 | 460 | 4.00e-77 | gi|110825988 : probable methyltransferase TARBP1 [ | Bilateria | |
| comp49673_c0_seq1 | 46.34 | 205 | 2.00e-41 | gi|444174849 : TAR RNA-binding protein 1 [ | Decapoda | |
| R2D2 | comp79785_c0_seq (1–2) | 48.86 | 350 | 8.00e-81 | gi|619831236 : R2D2 [ | Pancrustacea |
| comp49673_c0_seq1 | 38.32 | 167 | 6.00e-24 | gi|619831236 : R2D2 [ | Pancrustacea | |
| drosha | comp87202_c0_seq1 | 93.37 | 829 | 0 | gi|396941645 : drosha [ | Decapoda |
| Dicer2 | comp90354_c0_seq (1–11) | 47.73 | 1253 | 0 | gi|402534262 : Dicer-2 [ | Decapoda |
| Dicer1 | comp85246_c1_seq1 | 77.95 | 1578 | 0 | gi|195424855 : dicer-1 [ | Decapoda |
| comp90354_c0_seq (5–6) | 31 | 658 | 1.00e-83 | gi|195424855 : dicer-1 [ | Decapoda | |
| comp55144_c0_seq1 | 61.06 | 113 | 3.00e-37 | gi|283827860 : dicer-1 [ | Decapoda | |
| comp77864_c(1–2)_seq (1–2) | 83.67 | 98 | 2.00e-40 | gi|195424855 : dicer-1 [ | Decapoda | |
| ago2 | comp81967_c(1–2)_seq1 | 37.16 | 802 | 5.00e-139 | gi|563729913 : argonaute2 [ | Decapoda |
| comp41784_c0_seq1 | 52.74 | 876 | 0 | gi|563729913 : argonaute2 [ | Decapoda | |
| comp76466_c0_seq1 | 42.51 | 821 | 0 | gi|563729913 : argonaute2 [ | Decapoda | |
| ago1 | comp81967_c1_seq1 | 89.45 | 758 | 0 | gi|321468117 : putative Argonaute protein [ | Crustacea |
| comp41784_c0_seq1 | 43.65 | 811 | 0 | gi|321468117 : putative Argonaute protein [ | Crustacea | |
| comp76466_c0_seq1 | 41.58 | 671 | 4.00e-148 | gi|321468117 : putative Argonaute protein [ | Crustacea |
Fig. 6Endocytosis pathway. Overview of the KEGG endocytosis pathway. Components with homologues in the Carcinus maenas transcriptome are highlighted in pink
Fig. 7Endocytosis pathway expression. Expression of endocytosis pathway components in twelve Carcinus maenas tissues. The expression values are in FPKM, values of 0 are coloured white and values over 1000 FPKM are binned together