| Literature DB >> 27809760 |
Adam J Northcutt1, Kawasi M Lett1, Virginia B Garcia1, Clare M Diester1, Brian J Lane1, Eve Marder2, David J Schulz3.
Abstract
BACKGROUND: Crustaceans have been studied extensively as model systems for nervous system function from single neuron properties to behavior. However, lack of molecular sequence information and tools have slowed the adoption of these physiological systems as molecular model systems. In this study, we sequenced and performed de novo assembly for the nervous system transcriptomes of two decapod crustaceans: the Jonah crab (Cancer borealis) and the American lobster (Homarus americanus).Entities:
Keywords: Crustacean; Ion channel; Neurotransmitters; Stomatogastric; Transcriptome
Mesh:
Substances:
Year: 2016 PMID: 27809760 PMCID: PMC5096308 DOI: 10.1186/s12864-016-3215-z
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Overview of transcriptome assembly statistics for C. borealis and H. americanus
|
|
| |||
| Raw reads | 414,978,768 | 452,237,240 | ||
| Clean reads | 391,060,790 | 426,712,238 | ||
| % Q Scores ≥ 30 | 92.96 | 92.72 | ||
| % GC | 43.4 | 39.4 | ||
| Average clean read length (bp) | 97.05 | 97.16 | ||
| Assemblers | CLC Genomics | SeqMan NGen | CLC Genomics | SeqMan NGen |
| Number of Contigs | 42,766 | 67,380 | 60,273 | 45,043 |
| N50 (bp) | 2,178 | 1,239 | 2,357 | 2,258 |
| N75 (bp) | 1,058 | 763 | 1,169 | 1,085 |
| Mean contig length (bp) | 1,544 | 1,076 | 1,657 | 1,799 |
| Longest contig (bp) | 21,761 | 14,125 | 25,723 | 17,700 |
| Shortest contig (bp) | 454 | 451 | 453 | 450 |
| Total assembled bases | 66,058,464 | 72,508,321 | 99,847,148 | 81,065,797 |
Accession numbers for ion channels identified from transcriptome assemblies of C. borealis and H. americanus
| Channel Family | Gene Name | Current/Channel Type |
|
|
|---|---|---|---|---|
| Voltage-dependent K+ Channels |
| Voltage-gated A-type potassium (IA or Kv1) | FJ263946 | KU702655 |
|
| Voltage-gated delayed rectifier (IKd or Kv2) | DQ103255 | KU702656 | |
|
| Voltage-gated delayed rectifier (IKd or Kv3) | KU681456 | KU681443 | |
|
| Voltage-gated delayed rectifier (IKd or Kv3) | KU681455 | KU681444 | |
|
| Voltage-gated A-type potassium (IA or Kv4) | DQ103254 | KU702654 | |
|
| Voltage-gated slow delayed rectifier (M-type or Kv7) | KU681453 | KU681441 | |
|
| Voltage-gated slow delayed rectifier (M-type or Kv7) | KU681452 | KU681440 | |
|
| Ether-a-go-go type potassium (Kv10) | KU681458 | KU681446 | |
|
| Ether-a-go-go-related potassium (elk or Kv12) | KU681459 | KU681447 | |
|
| Ether-a-go-go-related potassium (erg of Kv11) | KU681460 | KU681448 | |
| Other K+ channels |
| Large conductance (BK) voltage/Ca2 + −activated potassium | DQ103256 | KU712072 |
|
| Small conductance (SK) Ca2 + −activated potassium | KU710383 | KU712071 | |
|
| Sodium-activated potassium | KU681454 | KU681442 | |
|
| Inward-rectifier potassium (IRK) | KU681451 | KU681439 | |
|
| Two-pore domain leak potassium (K2p) | KU681438 | KU681450 | |
|
| Two-pore domain leak potassium (K2p) | KU681437 | KU681449 | |
| Ca2+ Channels |
| L-type high-voltage-activated (HVA) calcium | N809809 | KU702651 |
|
| P/Q-N high-voltage-activated (HVA) calcium | JN809808 | KU702650 | |
|
| T-type low-voltage-activated (LVA) calcium | JN809810 | KU702652 | |
| Na + Channels |
| Voltage-gated fast sodium | EF089568 | KU702653 |
|
| non-selective sodium leak | KU681457 | KU681445 | |
| Hyperpolarization-Activated/Cyclic Nucleotide Gated Channels |
| Hyperpolarization-activated cyclic nucleotide-gated | DQ103257 | KU712077 |
|
| Cyclic nucleotide-gated channel alpha 1 | KU716097 | KU712074 | |
|
| Cyclic nucleotide-gated channel alpha 2 | KU716098 | KU712075 | |
|
| Cyclic nucleotide-gated channel alpha 3 | KU716099 | KU712076 | |
|
| Cyclic nucleotide-gated channel beta 1 | KU716096 | KU712073 | |
| Transient Receptor Potential (TRP) Channels |
| Transient receptor potential cation channel, subfamily A, member 1 | KX037435 | KX037441 |
|
| Transient receptor potential cation channel, subfamily A, member | KX037434 | KX037440 | |
|
| Transient receptor potential cation channel, subfamily M, member 1 | KX037436 | - | |
|
| Transient receptor potential cation channel, subfamily M, member 3 | KX037433 | KX037439 | |
|
| Transient receptor potential cation channel, subfamily M, member | KX037437 | KX037444 | |
|
| Transient receptor potential cation channel, subfamily V, member 5 | KX037438 | KX037445 | |
|
| Transient receptor potential cation channel, subfamily V, member 6 | - | KX037443 | |
|
| Pyrexia transient receptor potential channel | - | KX037442 |
Accession numbers for biogenic amine and GABA receptor subtypes from transcriptome assemblies of C. borealis and H. americanus
| Receptor Family | Gene Name |
|
|
|---|---|---|---|
| Octopamine/Tyramine Receptors |
| KU710373 | KU712061 |
|
| KU710375 | KU712062 | |
|
| KU710372 | KU712063 | |
|
| KU710374 | KU712064 | |
|
| KU710370 | KU712065 | |
|
| KU710371 | KU712066 | |
| Dopamine Receptors |
| KU710377 | KU712059 |
|
| KU710376 | KU712060 | |
|
| KU710378 | KU712058 | |
| Serotonin Receptors |
| KU710381 | KU712070 |
|
| KU710382 | KU712069 | |
|
| KU710380 | KU712067 | |
|
| KU710379 | KU712068 | |
| Histamine Receptors |
| KU716100 | KU716104 |
|
| KU716101 | KU716106 | |
|
| KU716102 | KU716103 | |
|
| - | KU716105 | |
| GABA Receptors |
| KU986868 | KU986874 |
|
| KU986869 | KU986875 | |
|
| KU986871 | KU986878 | |
|
| KU986872 | KU986876 | |
|
| KU986873 | KU986877 |
Accession numbers for glutamate and acetylcholine receptor subtypes from transcriptome assemblies of C. borealis and H. americanus
| Receptor Family | Gene Name |
|
|
|---|---|---|---|
| Metabotropic Glutamate Receptors |
| KU986879 | KU986885 |
|
| KU986880 | KU986887 | |
|
| KU986881 | KU986888 | |
|
| KU986882 | KU986890 | |
|
| KU986883 | KU986886 | |
|
| KU986884 | KU986889 | |
| Kainate-Like Receptors |
| KX016772 | KX016777 |
|
| KX016773 | KX016778 | |
|
| KX016774 | KX016779 | |
|
| KX016775 | KX016780 | |
|
| KX016776 | KX016781 | |
| NMDA-like Receptors |
| KX016782 | KX016787 |
|
| KX016783 | KX016788 | |
|
| KX016785 | KX016789 | |
|
| KX016786 | KX016791 | |
|
| KX016784 | KX016790 | |
| Glutamate-Gated Chloride Channel |
| KX059698 | KX059699 |
| Acetylcholine Receptors |
| KX021822 | KX021833 |
|
| KX021821 | KX021832 | |
|
| KX021828 | KX021840 | |
|
| KX021827 | KX021839 | |
|
| KX021829 | KX021841 | |
|
| KX021830 | KX021842 | |
|
| KX021824 | KX021836 | |
|
| KX021825 | KX021837 | |
|
| KX021831 | - | |
|
| - | KX021835 | |
|
| KX021826 | KX021838 | |
|
| KX021823 | KX021834 |
Accession numbers for Innexin subtypes from transcriptome assemblies of C. borealis and H. americanus
| Gene Name |
|
| |
|---|---|---|---|
| Innexins |
| JQ994479 | KM984498 |
|
| JQ994480 | KM984499 | |
|
| JQ994481 | KM984500 | |
|
| KJ642222 | KM984501 | |
|
| KJ817410 | - | |
|
| KJ817411 | KM984502 | |
|
| - | KM984503 |
Fig. 1Length distribution of the de novo assemblies and annotation coverage of the C. borealis and H. americanus transcriptomes. a Size distribution of contigs shown for two different de novo assemblies of the C. borealis and H. americanus nervous system transcriptomes. Each assembly is shown individually, and overlaid contig lengths are shown in the right panels. Assembly statistics are shown in Table 1. b Horizontal stacked bar plots showing proportions of gene sets in BUSCO quality categories for the 4 different assemblies shown in panel A (CLC and SeqMan NGen, noted in bold). A previously published nervous system transcriptome from H. americanus (denoted by *) is also provided for comparison [5]. Quality categories are as follows: i) Complete BUSCOs: genes that match a single gene in the BUSCO reference group; ii) Fragmented BUSCOs: genes only partially recovered with gene length exceeding alignment length cut-off; iii) Missing BUSCOs: non-recovered genes
Fig. 2Annotation-score distribution of C. borealis and H. americanus transcripts. a Distribution of annotation scores for all Gene Ontology (GO) terms assigned during the Blast2GO annotation process of the CLC assembled contigs. b Distribution of GO terms for C. borealis and H. americanus. Absolute values of GO annotation for the ontology categories of Cellular Component, Molecular Function, and Biological Process. Order was based on top GO counts for H. americanus, except for one case (RNA-dependent DNA Replication) due to high incidence in C. borealis biological process
Fig. 3Gene ontology comparison between C. borealis and H. americanus neural transcriptomes. GO annotation categories Cellular Component, Molecular Function, and Biological Process were plotted as a percentage of their total annotation counts for each category
Fig. 4Species distribution of blast hits of C. borealis and H. americanus neural transcriptomes. Total hits and top-hit numbers for a given species from C. borealis and H. americanus transcriptomes
Fig. 5Comparison of overlap of C. borealis and H. americanus neural transcriptomes. a VennBLAST comparison of C. borealis and H. americanus neural transcriptomes. Alignment of top hit sequence comparison was performed with a tblastx of both C. borealis and H. americanus against a common top hit species, D. pulex, allowing for a highly annotated crustacean database for reference. Filtering added another further stringency on top of that from the tblastx by requiring an amino acid identity percent of 70 % and E-value threshold of 1.0e-5. b Percent amino acid sequence identity (blue points) and similarity (red points) for selected neural function related gene products. For the specifically curated gene products described in the remainder of the study, we found very high (>90 %) amino acid sequence identity and similarity between C. borealis Innexins (gap junction proteins), ion channels, and ionotropic receptors and the corresponding sequence in H. americanus. We saw a significant drop (one-way ANOVA with post-hoc t-tests) in similarity in sequences for metabotropic receptor subtypes. This indicates that channel proteins (including gap junction, voltage-gated, and ligand gated) show more highly conserved amino acid sequence than receptors that work via intracellular signal transduction cascades. *** indicates significant difference (P < 0.001, t-test) between metabotropic receptors and each of the other three groups. None of the other groups were significantly different from one another
Fig. 6Ion channel subtypes and families identified in C. borealis and H. americanus transcriptome assemblies. Amino acid sequence alignment was carried out using ClustalW, and subsequent trees were generated using ClustalW2_Phylogeny. We were not attempting to generate true phylogenetic relationships, but rather simply used tree-based analysis to identify putative gene families. Hence no bootstrap values are calculated. There was a 100 % overlap in identified genes from both species, with the exception of TRP channels. A comprehensive list of channel types identified, their putative membrane currents, and accession numbers are provided in Table 2. Transcripts with the prefix “Cb” were identified from C. borealis, while those with “Ha” were identified from H. americanus
Fig. 7Biogenic amine receptor subtypes identified in C. borealis and H. americanus transcriptome assemblies. Trees were generated as described in Fig. 6. Once again a 100 % overlap in transcript types was found between the two species, with one exception – a histamine receptor (Ha-HisR4) was identified in lobsters that had no corresponding sequence from the crab transcriptome. In the case of serotonin (5HT) and dopamine receptor types, existing crustacean sequence from two different decapod species (Panulirus interruptus and Procambarus clarkii) were used to help identify orthologs from C. borealis and H. americanus. These are also included in their respective trees as points of reference. Pan- represents P. interruptus while Pro- represent P. clarkii. A comprehensive list of amine receptor subtypes, including accession numbers, is provided in Table 3
Fig. 8Glutamate receptor subtypes identified in C. borealis and H. americanus transcriptome assemblies. Trees were generated as described in Fig. 6. We separated glutamate receptor subtypes into metabotropic (G-protein coupled), and the ionotropic Kainate- and NMDA-like. A single glutamate-gated chloride channel (GluCl) sequence was identified in both C. borealis and H. americanus, and is not included as a member of a given receptor subclass in the figure. A comprehensive list of glutamate receptor subtypes, including accession numbers, is provided in Table 4
Fig. 9GABA and acetylcholine receptor subtypes identified in C. borealis and H. americanus transcriptome assemblies. Trees were generated as described in Fig. 6. GABA and acetylcholine are both small molecule transmitters in crabs and lobsters. Both transmitters act through ionotropic and metabotropic receptor subtypes. Metabotropic GABA receptors (GABAB-type) and ionotropic GABA subunits (GABAA-type) were identified in both species. A comprehensive list of GABA receptor subtypes, including accession numbers, is provided in Table 3. Both nicotinic (ionotropic) and muscarinic (metabotropic) acetylcholine receptors were identified from both species, including one nicotinic beta-subunit and 8 alpha-subunit types. A comprehensive list of acetylcholine receptor subtypes, including accession numbers, is provided in Table 4
Fig. 10Innexin subtypes identified in C. borealis and H. americanus transcriptome assemblies. Trees were generated as described in Fig. 6. Innexins are proteins responsible for gap junctions in invertebrates. Six distinct Innexin subtypes were identified in both C. borealis and H. americanus (see also [103]). Of these six, one from each species did not contain enough sequence homology to classify as the same type across species (Cb-INX5 and Ha-INX7) hence are named as distinct subtypes. A comprehensive list of Innexin subtypes, including accession numbers, is provided in Table 5