Literature DB >> 30060147

Conotoxin Diversity in Chelyconus ermineus (Born, 1778) and the Convergent Origin of Piscivory in the Atlantic and Indo-Pacific Cones.

Samuel Abalde1, Manuel J Tenorio2, Carlos M L Afonso3, Rafael Zardoya1.   

Abstract

The transcriptome of the venom duct of the Atlantic piscivorous cone species Chelyconus ermineus (Born, 1778) was determined. The venom repertoire of this species includes at least 378 conotoxin precursors, which could be ascribed to 33 known and 22 new (unassigned) protein superfamilies, respectively. Most abundant superfamilies were T, W, O1, M, O2, and Z, accounting for 57% of all detected diversity. A total of three individuals were sequenced showing considerable intraspecific variation: each individual had many exclusive conotoxin precursors, and only 20% of all inferred mature peptides were common to all individuals. Three different regions (distal, medium, and proximal with respect to the venom bulb) of the venom duct were analyzed independently. Diversity (in terms of number of distinct members) of conotoxin precursor superfamilies increased toward the distal region whereas transcripts detected toward the proximal region showed higher expression levels. Only the superfamilies A and I3 showed statistically significant differential expression across regions of the venom duct. Sequences belonging to the alpha (motor cabal) and kappa (lightning-strike cabal) subfamilies of the superfamily A were mainly detected in the proximal region of the venom duct. The mature peptides of the alpha subfamily had the α4/4 cysteine spacing pattern, which has been shown to selectively target muscle nicotinic-acetylcholine receptors, ultimately producing paralysis. This function is performed by mature peptides having a α3/5 cysteine spacing pattern in piscivorous cone species from the Indo-Pacific region, thereby supporting a convergent evolution of piscivory in cones.

Entities:  

Mesh:

Substances:

Year:  2018        PMID: 30060147      PMCID: PMC6178336          DOI: 10.1093/gbe/evy150

Source DB:  PubMed          Journal:  Genome Biol Evol        ISSN: 1759-6653            Impact factor:   3.416


Introduction

The family of Conidae (Fleming, 1822 sensu lato) that includes cone snails is well known for their astonishing species diversity (> 800 species; Tucker and Tenorio 2013) as well as for their sophisticated feeding behavior, which includes the production and injection of venom in preys through a specialized harpoon-like radular tooth (Salisbury et al. 2010; Dutertre et al. 2014; Olivera et al. 2015). Although all cone snails were traditionally classified into the single genus Conus, recent phylogenetic studies based on morphological (Tucker and Tenorio 2009) and molecular (Puillandre, Bouchet, et al. 2014; Uribe et al. 2017) data supported the split of Conus into several lineages, which are ranked either at the family or genus levels, respectively. According to Puillandre, Duda, et al. (2014) and Uribe et al. (2017), the following six genera are recognized: Profundiconus, Californiconus, Lilliconus, Pygmaeconus, Conasprella, and Conus. The latter genus holds most of the species diversity with up to 60 monophyletic groups, either recognized as subgenera (Puillandre, Duda, et al. 2014) or genera (Tucker and Tenorio 2009) depending on the author (herein we will use the taxonomy of Tucker and Tenorio 2009). The last common ancestor of Conidae likely fed on worms, as most of all living species have been suggested to do (Duda et al. 2001; Puillandre, Bouchet, et al. 2014). During the evolution and diversification of the group, there was suggested to be one diet shift to prey on other snails in the last common ancestor of genera Calibanus, Cylinder, Conus, Darioconus, Eugeniconus, and Leptoconus (Puillandre, Bouchet, et al. 2014). Instead, phylogenetic analyses suggested at least two diet shifts to prey on fishes as the Atlantic/Eastern Pacific genus Chelyconus did not share a most recent common ancestor with Indo-Pacific piscivorous genera: Phasmoconus, Gastridium, Pionoconus, Textilia, Afonsoconus, Embrikena, and Asprella (for the latter three there is no direct observation of prey capture; Duda et al. 2001; Duda and Palumbi 2004; Puillandre, Bouchet, et al. 2014; Olivera et al. 2015). Here, we reconstructed a simplified maximum likelihood (ML) phylogeny of cone snails based on complete mitochondrial (mt) genomes showing the same evolutionary trends in feeding behavior (fig. 1). A remarkable singularity within the group is Californiconus californicus, which has a diverse diet including fish, snails, worms, and shrimps (Biggs et al. 2010). The shape and number of barbs of the hollow radular tooth as well as the feeding behavior of cone snails appear to be, at least in some cases, adapted to capturing most efficiently the different types of prey. For instance, some molluscivorous cones make successive injections of radular teeth into the prey (Prator et al. 2014) whereas piscivorous cones show up to three different hunting modes including electrical stunning and tethering of single preys using the proboscis, engulfing of several prey fish at once by the rostrum, and flailing the proboscis around the fish without tethering (Olivera et al. 2015).
. 1.

—Simplified ML phylogeny of cone snails based on complete mitochondrial genomes (concatenated 13 protein-coding genes plus two rRNA genes analyzed at the nucleotide level). The evolution of diet is mapped onto the phylogeny. Bootstrap values are indicated above each node. Scale bar indicates substitutions/site. GenBank accession numbers are indicated for each species mt genome.

—Simplified ML phylogeny of cone snails based on complete mitochondrial genomes (concatenated 13 protein-coding genes plus two rRNA genes analyzed at the nucleotide level). The evolution of diet is mapped onto the phylogeny. Bootstrap values are indicated above each node. Scale bar indicates substitutions/site. GenBank accession numbers are indicated for each species mt genome. The success of a strike relies on the high efficacy of the injected venoms, which readily elicit sedation, paralysis or sensory overload in the prey (Robinson, Li, Bandyopadhyay, et al. 2017). Cone venoms are for the most part complex mixtures of short bioactive peptides termed conotoxins or conopeptides (Lavergne et al. 2013; Robinson and Norton 2014). It is possible to distinguish at least two different components based on their ultimate target: 1) specific peptides, which target voltage-gated and ligand-gated ion channels, neurotransmitter transporters, and receptors in the central and peripheral nervous system of the preys (Olivera 2002; Terlau and Olivera 2004; Lewis et al. 2012; Lavergne et al. 2013) and 2) hormone/neuropeptide-like components, which target neuroendocrine processes in the prey (Safavi-Hemami et al. 2015; Robinson, Li, Bandyopadhyay, et al. 2017). In addition, the venom duct also produces several proteins that are involved in the processing of conotoxins or in enhancing venom activity (Safavi-Hemami et al. 2010, 2014; Hu et al. 2011; Terrat et al. 2012; Barghi et al. 2015b). Each cone species biosynthesizes in the apical secretory cells of the venom duct (Endean and Duchemin 1967) its own venom profile, which shows remarkable intraspecific variability (Davis et al. 2009; Rivera-Ortiz et al. 2011; Rodriguez et al. 2015; Chang and Duda 2016; Peng et al. 2016) as well as striking changes in composition over time within single individuals (Dutertre et al. 2010; Prator et al. 2014). Moreover, it has been shown that conotoxin expression is regionalized along the venom duct (Garrett et al. 2005; Tayo et al. 2010; Hu et al. 2012; Jin et al. 2015; Prashanth et al. 2016) and that the proximal and distal regions (with respect to the bulb) of the venom duct of several cones produce distinct defense- and predation-evoked conotoxin cocktails, respectively (Dutertre et al. 2014). Conotoxins are generally synthesized as precursors with a typical three domain structure including: 1) a highly conserved hydrophobic N-terminal signal region, which guides the conotoxin precursor to the endoplasmic reticulum and the cellular secretory pathway; 2) an intervening, moderately conserved propeptide region, which for some conotoxins participates in secretion, post-translational modification and folding (Bandyopadhyay et al. 1998; Conticello et al. 2003; Buczek et al. 2004); and 3) a C-terminal region, which constitutes the mature functional peptide (Woodward et al. 1990; Kaas et al. 2010). The conserved sequence profiles of the signal region have been used to classify conotoxin precursors into 40–50 protein superfamilies, originally named with alphabet letters (Terlau and Olivera 2004; Corpuz et al. 2005; Kaas et al. 2010; Puillandre et al. 2012; Lavergne et al. 2013; Robinson and Norton 2014; Li et al. 2017). Furthermore, sequence comparison of mature peptides revealed up to 26 conserved cysteine frameworks named with roman numbers, which generally correlate with conotoxin precursor superfamilies (Lavergne et al. 2015), although most of these protein superfamilies have been shown to have more than one cysteine framework (Terlau and Olivera 2004; Corpuz et al. 2005; Luo et al. 2006; Kaas et al. 2010; Puillandre et al. 2010, 2012; Robinson and Norton 2014; Lavergne et al. 2015). Notably, some mature peptides are cysteine-poor or completely lack cysteines (Puillandre et al. 2012). The renowned conotoxin hyperdiversity, which is typical for gene families that mediate interactions between organisms (Conticello et al. 2001; Barghi et al. 2015a), can be explained by a combination of several (evolutionary) processes including: 1) extensive gene duplication (Duda and Palumbi 1999; Espiritu et al. 2001; Puillandre et al. 2010; Chang and Duda 2012); 2) high mutation rates and diversifying selection of the mature domain (Conticello et al. 2001); 3) recombination events (Espiritu et al. 2001; Wu et al. 2013); and 4) variable peptide processing (Dutertre et al. 2013) and post-translational modifications (Lu et al. 2014; Peng et al. 2016). The advent of high-throughput sequencing techniques, and in particular of RNA sequencing (first using the 454 GS FLX Titanium and currently the Illumina HiSeq platforms) has produced a quantum leap in the characterization of whole conotoxin precursor repertoires (Prashanth et al. 2012; Barghi et al. 2015b) compared with the more traditional sequencing of individual cDNA clones (Pi, Liu, Peng, Jiang, et al. 2006; Pi, Liu, Peng, Liu, et al. 2006; Liu et al. 2012; Lu et al. 2014). RNA sequencing is highly sensitive and even rare transcripts with low expression levels can be identified (Barghi et al. 2015b). Therefore, several recent studies have determined the complete conotoxin diversity of 1) piscivorous species such as Textilia bullatus (Hu et al. 2011), Pionoconus consors (Terrat et al. 2012; Violette et al. 2012), Gastridium geographus (Hu et al. 2012; Dutertre et al. 2014; Safavi-Hemami et al. 2014), and Pionoconus catus (Himaya et al. 2015); 2) molluscivorous species such as Conus marmoreus (Dutertre et al. 2013; Lavergne et al. 2013), Darioconus episcopatus (Lavergne et al. 2015), and Cylinder gloriamaris (Robinson, Li, Lu, et al. 2017); and 3) vermivorous species such as Puncticulis pulicarius (Lluisma et al. 2012), Rhizoconus miles (Jin et al. 2013), Kioconus tribblei (Barghi et al. 2015b), Dendroconus betulinus (Peng et al. 2016), and the congeneric species Turriconus andremenezi and Turriconus praecellens (Li et al. 2017), among others. Altogether, these transcriptome studies show that each cone species produces at least 100–400 different conotoxin precursors, and have been very successful in discovering new superfamilies and cysteine frameworks (Barghi et al. 2015b; Lavergne et al. 2015; Peng et al. 2016; Li et al. 2017). The emerging general pattern is that several gene superfamilies are widespread among cone species, although with different degrees of expansion, whereas others are restricted to a few lineages (Duda and Remigio 2008; Puillandre et al. 2012). To further contribute to the cataloguing of conotoxin diversity in the main lineages of cone snails, we characterized the full transcriptome of Chelyconus ermineus (Born 1778). This is a cone species, which can be found on both shores of the Atlantic Ocean and feeds on fishes. Together with Chelyconus purpurascens from the Eastern Pacific region, it forms a clade, which according to the phylogeny of Conidae (Duda and Palumbi 2004; Puillandre, Bouchet, et al. 2014; fig. 1) may underwent a shift to piscivory independent to the one occurred in the ancestor of Indo-Pacific piscivorous genera. Thus far, the study of conotoxins in C. ermineus has been limited to the identification of few mature peptides (Martinez et al. 1995; Jacobsen et al. 1997; Barbier et al. 2004; Rivera-Ortiz et al. 2011; Echterbille et al. 2017) and some conotoxin precursors belonging to the A, B1, and O1 superfamilies (Duda and Palumbi 2004; Gowd et al. 2008; Puillandre et al. 2010; 18 entries in ConoServer, Kaas et al. 2012). Notably, however, some of the mature conotoxins of C. ermineus such as EVIA are among the few peptides whose functional activity (Barbier et al. 2004) and tertiary structure (Volpon et al. 2004) have been determined experimentally. There are a few more conotoxin precursors identified in the closely related C. purpurascens belonging to the A, B1, M, O1, and T superfamilies (e.g., Shon et al. 1995, 1998; Duda and Palumbi 2004; 45 entries in ConoServer, Kaas et al. 2012). By sequencing the transcriptome of C. ermineus, the first one of an Atlantic cone species, we aimed: 1) to catalogue the diversity of conotoxin precursors in this species and classify them into superfamilies; 2) to identify other proteins that are transcribed in the venom duct and are potentially involved in the processing of conotoxins or in enhancing venom activity; 3) to estimate intraspecific variation of conotoxin precursors; 4) to determine differences in the spatial distribution of conotoxin precursors along the distal, medium, and proximal regions of the venom duct; 5) to quantify the expression levels of conotoxin genes in the different individuals and along the venom duct; and 6) to compare the venom composition of Atlantic and Indo-Pacific piscivorous cones, and to identify putative differences with the venoms of cones preying on snails and worms.

Materials and Methods

Sampling and RNA Extraction

Three adult specimens of C. ermineus were captured, respectively, in Boa Vista (CVERM3; hereafter ERM1), Sal (CVERM13; hereafter ERM2), and Santa Luzia (CV1446; hereafter ERM3) islands in Cabo Verde with corresponding permits (table 1). Each individual, in a resting stage, was extracted from the shell and dissected to remove the venom duct, which was excised into three equal parts: proximal, medium, and distal with respect to the venom bulb (following Tayo et al. 2010). These fragments were stored in 1 ml RNAlater (Invitrogen, Life Technologies), first at 4°C and for the long term at −20°C.
Table 1

Specimens of Chelyconus ermineus Analyzed in This Study and Main Statistics of Illumina Sequencing and Assembly

SpecimenVoucher ID MNCNIslandSegmentSRA Accesion No.# Raw Reads# Clean Reads# Contigs% Mappinga# Conotoxins% Mappingb
ERM115.05/80980Boa VistaProximalSRR698316813,023,11412,882,97064,233925961
ERM115.05/80980Boa VistaMediumSRR698316925,823,48125,541,08769,836837570
ERM115.05/80980Boa VistaDistalSRR698316627,702,51327,160,103119,3848811717
ERM215.05/80013SalProximalSRR698316726,754,50926,754,50952,506697569
ERM215.05/80013SalMediumSRR698316426,986,67826,986,22057,887768963
ERM215.05/80013SalDistalSRR698316526,107,66626,107,19573,8099110940
ERM315.05/78606Santa LuziaProximalSRR698316227,163,84927,163,36849,195767178
ERM315.05/78606Santa LuziaMediumSRR698316331,223,31231,222,73368,103929058
ERM315.05/78606Santa LuziaDistalSRR698316131,717,50531,716,94871,785568358

Percentage of clean reads that map onto assembled contigs.

Percentage of clean reads that map onto assembled conotoxin precursors.

Specimens of Chelyconus ermineus Analyzed in This Study and Main Statistics of Illumina Sequencing and Assembly Percentage of clean reads that map onto assembled contigs. Percentage of clean reads that map onto assembled conotoxin precursors. For RNA extraction, each venom duct portion was incubated independently in a 2 ml eppendorf with 500 µl of TRIzol LS Reagent (Invitrogen, Life Technologies) and grinded with ceramic beads in a Precellys Evolution tissue homogenizer. The solution was mixed with 100 µl of chloroform. After centrifugation (12,000 x g for 15 minutes at 4°C), the aqueous phase was recovered and RNA precipitated with 250 µl of isopropanol and stored overnight at −80°C. The Direct-zol RNA miniprep kit (Zymo Research, Irvine) was used to purify total RNA (5–15 µg) following manufacturer’s instructions.

Library Preparation and Sequencing

Dual-indexed cDNA libraries (307–345 bp insert average size) for each sample were constructed using the TruSeq RNA Library Prep Kit v2 (Illumina, San Diego) and following manufacturer’s instructions at Sistemas Genómicos (Valencia, Spain). Briefly, the poly(A)+ mRNA fraction was isolated using oligo-(dT)25 magnetic beads. Subsequently purified mRNA was chemically fragmented prior to reverse transcription and the construction of the cDNA library. The quality of the libraries was analyzed with the TapeStation 4200, High Sensitivity assay; the quantity of the libraries was determined by real-time PCR in LightCycler 480 (Roche). The pool of libraries (including other cone species for different projects) was split into different lanes and sequenced by paired-end sequencing (100×2) in an Illumina HiSeq2500 (two flowcells) following standard procedures at Sistemas Genómicos (Valencia, Spain).

Assembly

The reads corresponding to the different regions of the venom duct and individuals were sorted using the corresponding library indices. Adapter sequences were removed using SeqPrep (St John 2011). Assembly was performed using the TRUFA webserver (Kornobis et al. 2015). Briefly, the quality of the sequencing was checked using FastQC v.0.10.1 (Andrews 2010). Ends of reads were trimmed (PHRED < 30) and resulting trimmed reads were filtered out according to their mean quality scores (PHRED < 20) using PRINSEQ v.0.20.3 (Schmieder and Edwards 2011). This step also ensured minimizing cross-contamination resulting from potential index misassignment, as this tends to be associated to low quality scores (Wright and Vetsigian 2016). Filtered reads were used for de novo assembly of trancriptomes with Trinity r2012-06-08 (Grabherr et al. 2011) with default settings (minimum contig length: 200; sequence identity threshold: 0.95). The transcriptome raw reads produced in this project have been deposited at the NCBI SRA database under accession SRP139515 (see also table 1).

Prediction and Annotation of Conotoxin Precursors and Associated Proteins

The sequences of all conotoxin precursors and associated proteins of cone venoms available in GenBank release 217 (Benson et al. 2005), Uniprot release 2016_11 (Uniprot Consortium 2017), and ConoServer release 12-26-2016 (Kaas et al. 2012) were downloaded in December 26, 2016 to construct a local reference database. Redundant entries from the three databases were removed. Subsequently, BLASTX was used to identify those sequences encoding putative conotoxin precursors and associated proteins (with an E-value of 1e-5) among the assembled contigs by similarity searches against the reference database. These sequences were translated into amino acids using the universal genetic code and manually inspected, in order to discard false positives (hits not corresponding to canonical conotoxins) or assembly artifacts (due to indels that interrupt open reading frames). Duplicate and highly truncated (>55% of the estimated total length of a precursor) sequences were removed to produce the final working list of conotoxin precursors and associated proteins of C. ermineus (provided in supplementary table 1, Supplementary Material online). The three domains of the predicted conotoxin precursors and the cysteine frameworks of the mature peptides were identified using the Conoprec tool (Kaas et al. 2012). Assignment of amino acid sequences to different superfamilies was based on the two highest scoring full-length conotoxin precursor hits in the BLAST results and taking into account the percentage of sequence identity (>90%) to the highly conserved signal region (Robinson and Norton 2014; Barghi et al. 2015b). Further refinement of the superfamily assignment was achieved by aligning conotoxin precursor amino acid sequences of C. ermineus to selected canonical representatives of each superfamily using Mafft v7 (Katoh and Standley 2013) with default parameters (see supplementary file 1, Supplementary Material online). This step revealed important diversity (i.e., presence of potential paralogs) at the propeptide domain within several superfamilies, which was further analyzed. All C. ermineus conotoxin precursor amino acid sequences are deposited (as nucleotide sequences) in GenBank under accession numbers MH360289–MH360712.

Phylogenetic Analyses of the M and T Superfamilies

In order to infer the evolutionary origin of cysteine-poor conotoxins, we performed phylogenetic analyses of the M and T conotoxin precursor superfamilies, which have both cysteine-rich and cysteine poor members. Concatenated amino acid alignments of the signal and propeptide domains of the M and T superfamilies, respectively, were constructed using Mafft v7 (Katoh and Standley 2013) with default parameters. Phylogenetic relationships were inferred using ML (Felsenstein 1981) with PhyML v3.0 (Guindon et al. 2010) with default settings in the ATGC platform (http://www.atgc-montpellier.fr/phyml/; last accessed September 05, 2018) and using the smart model selection option. Statistical support was assessed with 1,000 bootstrap pseudoreplicates (BP).

Expression Analyses

Approximate relative expression levels were estimated by mapping only clean reads, back to all assembled contigs of C. ermineus. TPM (transcripts per kilobase million), which normalize for gene length and sequencing depth, were estimated with the RSEM package (which uses the mapper Bowtie 2; Langmead and Salzberg 2012) included in Trinity r2012-06-08 (Grabherr et al. 2011). In addition, we run the EBSeq software (Leng et al. 2013) as implemented in Trinity to estimate the posterior probability of being differentially expressed (PPDE), setting the False Discovery Rate (FDR) at 0.95, of conotoxins as a whole and of each of the different superfamilies along the different regions of the venom duct using the three individuals as biological replicates.

Reconstruction of Cone Snail Phylogeny

In order to determine diet shifts during the evolutionary history of cone snails, a simplified phylogeny was reconstructed using ML based on complete mt genomes (13 protein-coding and two rRNA genes) available in GenBank. Protein-coding genes were individually aligned using TranslatorX (Abascal et al. 2010), which generates a nucleotide alignment based on corresponding deduced amino acid alignments. The rRNA genes were aligned using Mafft v7 (Katoh and Standley 2013). All ambiguously aligned positions were removed using GBlocks v.0.9.1b (Castresana 2000) with the following settings: minimum sequence for flanking positions: 85%; maximum contiguous nonconserved positions: 8; minimum block length: 10; gaps in final blocks: no. Finally, the different single alignments were concatenated using Geneious 8.1.8. The best-fit partition scheme and models of substitution for the data set were identified using PartitionFinder (Lanfear et al. 2012) with the Bayesian Information Criterion (Schwarz 1978). The following partitions were tested: all genes together, all genes arranged in subunits (atp, cob, cox, nad, and rrn), and all genes separated (except atp6-atp8 and nad4-nad4L). In addition, we also tested separately the three codon positions in the protein-coding genes. The best partition scheme was the one considering each codon position separately, all protein-coding genes concatenated, and rRNA genes concatenated. For each partition, the selected best-fit model was GTR +I + G. In order to reconstruct the ML tree, we used RAxML-HPC2 on XSEDE 8.2.10 (Stamatakis 2014) as implemented in the CIPRES Science Gateway v 3.3 (http://www.phylo.org/; last accessed September 05, 2018) with the rapid hill-climbing algorithm and 1,000 BP. The outgroups were Rhizoconus capitaneus and K. tribblei based on Puillandre et al. (2014).

Results

Sequencing and Assembly

A total of nine samples were sequenced corresponding to three regions (distal, medium, and proximal) of the venom duct of three individuals (ERM1-3) of C. ermineus. The main statistics associated to the sequencing and assembly procedures are summarized in table 1. Sequencing generated between 13 and 32 million raw reads per sample. Most (99–100%) of the reads were kept as clean after adapter and quality trimming. The number of assembled contigs varied between 49,195 and 119,384 with a mean of 69,637.6 per sample. Mapping of clean reads onto assembled contigs indicated that on an average 80% of the reads were used for further analyses (table 1). After BLASTX searches against a local reference database, the number of distinct (with at least one amino acid difference) putative conotoxin precursor sequences per sample (i.e., venom duct portion of an individual) varied between 59 and 117 (table 1). The majority of these sequences were full-length but a few were slightly truncated at the N- or C-terminus (see supplementary table 1, Supplementary Material online). Mapping of clean reads onto assembled transcripts indicated that conotoxin production made up on an average 57% of the transcriptome in the venom duct, although there was important variability among venom duct regions within the same individual (table 1).

Intraspecific Variation in Venom Composition

The sequences of the conotoxin precursor transcripts expressed in the venom duct of individuals ERM1-3 of C. ermineus inhabiting three different islands (Boa Vista, Sal, and Santa Luzia) of the archipelago of Cabo Verde were determined (fig. 2). The total numbers of inferred conotoxin precursors in each specimen were 161, 176, and 144, respectively. Of these, 25 were found in all specimens, nine were common to ERM1 and ERM2 (but not found in ERM3), 18 were shared by ERM1 and ERM3 (not present in ERM2), and 26 were common to ERM2 and ERM3 (and not found in ERM1). The number of peptides common to all three specimens rose up to 33 when only differences in the functional mature peptide were taken into consideration (fig. 2). In such analyses, the numbers of sequences exclusive to each specimen were 96, 104, and 56, respectively (fig. 2). We also estimated intraspecific diversity taking into account putative allele variation by clustering together sequences, which diverged in one or less, two or less, and three or less amino acids, respectively. The numbers of conotoxin precursor sequences common to all three specimens rose up to 44, 48, and 53, respectively (supplementary fig. 1, Supplementary Material online).
. 2.

—Distinct conotoxin precursors (left) and mature peptides (right) identified in the three analyzed individuals of Chelyconus ermineus.

—Distinct conotoxin precursors (left) and mature peptides (right) identified in the three analyzed individuals of Chelyconus ermineus.

Diversity of Conotoxin Precursor Sequences in C. ermineus

Of the 422 distinct transcripts related to venom activity identified as produced in the venom duct of the three individuals of C. ermineus, a total of 296 could be assigned (based on the signal region sequence) to 33 known conotoxin precursor superfamilies already described in other cone venom ducts (fig. 3). In addition, 82 conotoxin precursor sequences were grouped, using reciprocal BLASTs and taking into account a 90% identity threshold per superfamily, into 22 unassigned conotoxin superfamilies, not formally described in other cone species but also present in some of them (see supplementary file 1, Supplementary Material online). Finally, 44 peptides corresponded to six associated protein families (see supplementary table 1 and file 1, Supplementary Material online). All but one (alpha conotoxin EI; P50982) previously reported conotoxin precursors and mature peptides from C. ermineus (Martinez et al. 1995; Jacobsen et al. 1997; Barbier et al. 2004; Duda and Palumbi 2004; Gowd et al. 2008; Puillandre et al. 2010; Rivera-Ortiz et al. 2011) were detected (see supplementary file 2, Supplementary Material online). Homologs to conotoxin precursors and mature peptides from C. purpurascens were identified as well (Shon et al. 1995, 1998; Duda and Palumbi 2004). The six most diverse conotoxin precursor superfamilies (in terms of the number of distinct members) were T, W, O1, M, O2, and Z, accounting for 57% of all observed diversity (fig. 3). Conversely, several of the conotoxin precursor superfamilies were restricted to only one or two representatives (e.g., A2, E, J, K, P, R, and several unassigned superfamilies).
. 3.

—Distribution in superfamilies of the 296 identified conotoxin precursors. Numbers in parentheses indicate the number of mature peptides without cysteine framework.

—Distribution in superfamilies of the 296 identified conotoxin precursors. Numbers in parentheses indicate the number of mature peptides without cysteine framework. Some of the inferred mature domains in the conotoxin precursors showed no cysteine framework. In some cases, these mature peptides belonged to superfamilies exclusively formed by members without cysteines, such as the W and Z superfamilies. In other cases, mature conotoxins with and without framework were grouped together within the same superfamily (fig. 3). Notably, the M and T superfamilies had both types of mature conotoxins. Phylogenetic trees of both superfamilies were reconstructed based on the amino acid sequences of the signal and propeptide regions, and allowed distinguishing several paralog groups within each conotoxin precursor superfamily (fig. 4). While mature conotoxins without cysteine framework form a distinct paralog group within the M superfamily, they seem to have originated independently and recurrently in the different paralogs within the T superfamily (fig. 4).
. 4.

—Reconstructed ML phylogenies of the M and T superfamilies, recovering several clades (paralogs; in different colors) and indicating the differential evolutionary origin of cysteine-poor (in brown; WF meaning without framework) mature peptides. Bootstrap values of main clades are indicated. Scale bar indicates substitutions/site. GenBank accession numbers are indicated after each species except for C. ermineus.

—Reconstructed ML phylogenies of the M and T superfamilies, recovering several clades (paralogs; in different colors) and indicating the differential evolutionary origin of cysteine-poor (in brown; WF meaning without framework) mature peptides. Bootstrap values of main clades are indicated. Scale bar indicates substitutions/site. GenBank accession numbers are indicated after each species except for C. ermineus. Within unassigned superfamilies, most precursors have the canonical three-domain structure (see signal sequences in table 2). However, an interesting case was that of several precursors (unassigned superfamilies 16–22), which could not be assigned to any known superfamily because they lacked a signal region, but had a mature peptide, which could be confidently aligned to the mature peptides normally associated to the O3 and T superfamilies in other cone species (see supplementary file 1, Supplementary Material online). In trying to assign these sequences to known superfamilies, we used hidden Markov model searches as implemented in Conodictor (Koua et al. 2012) but retrieved no significant hit.
Table 2

New Signal Sequences in Conotoxin Precursors of Chelyconus ermineus

Unassigned SuperfamilySignalCysteine FrameworkAlso Found In:Best-Hit Known Superfamily
% Coverage% IdentitySuperfamily
1MRFYMLLAVALLLTSVMSVI/VII6675O2 (Q9NDA7)
2MRFLLFLCIAVLLTSFRETEAVI/VIIbetulinus8535T (BAS25421)
3MKLSMMFILSLVLTLSMTDGXIVpraecellens9067L (ABC74975)
4MKLSVMVIVLVLAMAFTPGLLXIVbetulinus8067L (ABC74975)
5MNFSVMFILALVLTLSMTDAXIVbetulinus, praecellens9061L (ABC74975)
6MKVVVVLLAVLVAASAXIVbetulinus10056Hyaluronidase (C0HKM3)
7MCLSTMPSVILMMVLMFAFDNVDGIXbetulinus, imperialis5857P (ATF27727)
8MKLFMFTAIIFTMASTTVTVIIIandremenezi, caracteristicus7853O1 (Q5K0B8)
9MSKTGLVLVVLYLLSSPVNLXIIImiles, praecellens, andremenezi8560M (ACV87169)
10MKFTTFVMVLMAAVLLTSILETEAVI/VIIbetulinus, praecellens5462Con-ikot-ikot (BAO65537)
11MEFRRLVTVGLLLTLVMSTDSIXbetulinus4788Insulin (AOF40168)
13MLSMLAWTLMTAMVVMNAKS(C)12praecellens, gloriamaris5573O1 (BAS22670)
14MNMRMTIIVFVVVATAATVVGSTCC-C-C-C-C-C-CC-C-C-C-Clenavati, tribblei10061Con-ikot-ikot (P0CB20)
15MSVVYCKPSVPVDSVSSNFCVRGPDNGHQAVI/VII4086T ( Q9BPD9)
New Signal Sequences in Conotoxin Precursors of Chelyconus ermineus

Differential Spatial Distribution of Conotoxin Precursor Transcripts along the Venom Duct

Changes in the diversity of conotoxin precursors were analyzed along the three regions (distal, medium, and proximal) of the venom duct (fig. 5). The numbers of distinct conotoxin precursor sequences found in the distal, medium, and proximal regions were 190, 162, and 130, respectively. Up to 30–33 (depending on the individual) precursor transcripts were expressed throughout the venom duct (fig. 5). Of these, seven were common to the three individuals (not shown). A total of 33–64, 24–34, and 15–25 conotoxin precursors were found exclusively in the distal, medium, and proximal portions, respectively (fig. 5). Most conotoxin precursor superfamilies were identified in the three regions of the venom duct with the exception of E, K, and Thyrostimulin β, which were only detected in the distal portion; L and Q, which were missing in the proximal portion; A2, which was absent in the medium portion; and J, which was missing in the distal portion (fig. 5). In general, the diversity of members of the different conotoxin precursor superfamilies was relatively uniform across venom duct segments. However, the H, I2, I3, L, M, O1, Q, T, and W superfamilies, and the hormone conophysin showed more diversity toward the distal portion; A2 and B2 superfamilies had more diversity toward the proximal portion; and most of the members of Z superfamily were detected in the medium portion (fig. 5).
. 5.

—Distribution of conotoxin precursor superfamily diversity (number of members) along the distal (D), medium (M) and proximal (P) portion of the venom duct (with respect to the venom bulb) is shown in panel (A). Number of common members in the three analyzed individuals per venom duct region is depicted in panel (B). Cerm code indicates unique peptide sequences after considering the three analyzed individuals (see supplementary table 1, Supplementary Material online).

—Distribution of conotoxin precursor superfamily diversity (number of members) along the distal (D), medium (M) and proximal (P) portion of the venom duct (with respect to the venom bulb) is shown in panel (A). Number of common members in the three analyzed individuals per venom duct region is depicted in panel (B). Cerm code indicates unique peptide sequences after considering the three analyzed individuals (see supplementary table 1, Supplementary Material online).

Expression of Conotoxin Precursor Transcripts along the Venom Duct

The relative expression levels of the different conotoxin precursor superfamilies along the distal, medium and proximal regions of the venom duct in the three individuals were estimated as TPMs, thus normalizing for gene length and sequencing depth (fig. 6). Expression levels varied extensively among individuals hindering the inference of expression patterns. TPM values were declared reliable when they were of similar level in at least two out of the three individuals. Taking this into consideration, the most expressed conotoxin precursor superfamilies were: A, which accounted for much of the conotoxin precursor expression in the medium and proximal fractions, O2, which was expressed abundantly in the distal region; and O1, which showed expression throughout the venom duct but particularly in the distal and medium portions (fig. 6). A second batch of midexpressed superfamilies included: T, which showed higher values in the distal region; M, which was mostly expressed in the proximal region; and S with expression levels higher in the medium and distal regions (fig. 6). The remaining conotoxin precursor superfamilies had much lower expression levels, mostly concentrated in the medium and distal regions of the venom duct with the exception of O3 members, which were higher expressed in the medium and proximal regions (fig. 6). We tested, within a Bayesian framework, whether any of the conotoxin precursor superfamilies had differential expression among regions of the venom duct using the individuals as biological replicates (table 3). Only the A and I3 superfamilies showed significant posterior probabilities. Within the A superfamily, paralogs A-1 (named alpha 4/4; see below) and A-2 (name kappa; see below) had PPDE values of 0.94 and 1, respectively (table 3). Within each paralog, only conotoxin precursors Cerm_405 (A-1) and Cerm_342 (A-2) both detected in ERM3 showed significant differential expression in the proximal and distal portions, respectively (table 3).
. 6.

—Distribution of conotoxin precursor superfamily transcript relative expression (TPMs) along the distal (D), medium (M) and proximal (P) portion of the venom duct (with respect to the venom bulb). Highly-medium expressed transcripts are shown in panel (A). Low expressed components are depicted in panel (B). Overall conotoxin expression with respect to other protein expression in the venom duct is shown in panel (C).

Table 3

Differential Expression of Conotoxins in the Different Regions of the Venom Gland of Chelyconus ermineus

 RegionPPDE
ConotoxinsPM1
Superfamilies
APM0.96
I3D0.99
Superfamily A
A-1 (alpha 4/4)PM0.94
A-2 (Kappa)P1
Superfamily A-1
Cerm_138M0.71
Cerm_255P0.70
Cerm_405P1
Superfamily A-2
Cerm_008PD0.89
Cerm_145PM0.82
Cerm_268P0.76
Cerm_342D1
Differential Expression of Conotoxins in the Different Regions of the Venom Gland of Chelyconus ermineus —Distribution of conotoxin precursor superfamily transcript relative expression (TPMs) along the distal (D), medium (M) and proximal (P) portion of the venom duct (with respect to the venom bulb). Highly-medium expressed transcripts are shown in panel (A). Low expressed components are depicted in panel (B). Overall conotoxin expression with respect to other protein expression in the venom duct is shown in panel (C). Altogether, conotoxin precursor transcripts showed differential expression along the venom duct (table 3), and accounted for ∼60% and 70% of the overall expression in the medium and proximal regions, respectively (fig. 6). In contrast, conotoxin precursor expression in the distal region was restricted to 30% whereas other (house-keeping) transcripts dominated (fig. 6). Ferritin showed important levels of expression, being preferentially expressed in the distal region (fig. 6).

Member Diversity of the a Superfamily across Cone Species

Given the importance of the A superfamily in the overall expression of the venom duct of C. ermineus, we performed a more detailed analysis of the diversity of its members across species (table 4; see also classification of the 288 A superfamily conotoxin precursors available in ConoServer in supplementary file 3, Supplementary Material online). Within the A superfamily (Santos et al. 2004), there are two main groups of conotoxins with very distinct structure and function (Azam and McIntosh 2009; Puillandre et al. 2012; Robinson and Norton 2014). One group (alpha, α) has cysteine framework I and selectively target nicotinic-acetylcholine receptors (nAChRs), ultimately inhibiting neuromuscular transmission (Azam and McIntosh 2009). The other group (Kappa, κ) has cysteine framework IV and their members target preferentially K+ channels, producing an excitatory effect (Robinson and Norton 2014). The Kappa subfamily is, thus far, only found in piscivorous cones (table 4; Santos et al. 2004). Within framework IV, the most frequent cysteine spacing pattern is cc7c2c1c3c (Puillandre et al. 2012), which is shared by C. ermineus, several Pionoconus species, and Embrikena (table 4). In contrast, specific cysteine spacing variations have been reported for C. purpurascens, Gastridium, and Textilia (table 4). Within the alpha subfamily, the most frequent cysteine spacing pattern is cc4c7c (named α4/7; Puillandre et al. 2012). Vermivorous (including Rhombiconus imperialis, which has a strict diet on amphinomids) and molluscivorous species show important diversity of α4/7 conotoxins whereas this subfamily is represented by only 1–2 members in most piscivorous species but G. geographus (table 4; a striking exception is Asprella, which has been proposed to be piscivorous, although not based on direct evidence, and has seven α4/7 conotoxin precursors, a pattern typical of vermivorous or molluscivorous cones). A second frequent cysteine spacing pattern is cc3c5c (named α3/5; Puillandre et al. 2012), which is almost exclusive of piscivorous species, and particularly diverse in G. geographus, P. consors, and P. striatus but not found in Chelyconus (table 4). Finally, a third cysteine spacing pattern, which is also relatively frequent, is cc4c4c (named α4/4; Puillandre et al. 2012). It is particularly diverse in the piscivorous genera Chelyconus and Textilia, and in the vermivorous genera Virgiconus emaciatus and Calamiconus quercinus (table 4). Interestingly, it has been also reported with lower diversity in the piscivorous genus Pionoconus and in the putative piscivorous genus Afonsoconus (Puillandre et al. 2012).
Table 4

Superfamily A Diversity in Several Cone Snails with Different Diets Based on Conoserver Entries

Framework IV
Framework I
Framework IX
Genusspeciescc7c2c1c3ccc7c2c1c6ccc6c2c1c3ccc6c2c1c4ccc4c7ccc4c3ccc4c4ccc4c5ccc4c6ccc4c8ccc3c5ccc3c6ccc5c10cc2c4c6cc10c1c3c
Chelyconusermineus4 (3)2 (2)5 (3)
Chelyconuspurpurascens1 (1)3 (2)(4)a
Gastridiumgeographus3 (2)5 (5)1 (1)6 (3)1 (1)
Pionoconusachatinus4 (4)1 (1)5 (2)1 (1)
Pionoconuscatus3 (3)1 (1)
Pionoconusconsors5 (5)1 (1)7 (4)2 (1)
Pionoconusmagus3 (3)1 (1)1 (1)
Pionoconusmonachus1 (1)1 (1)
Pionoconusstriatus4 (4)1 (1)8 (6)
Textiliabullatus3 (3)1 (1)5 (4)
Embrikenapergrandis(2)1 (1)
Afonsoconuskinoshitai2 (2)
Asprellasulcata7 (5)
Conusmarmoreus7 (5)
Cylindertextile7 (5)
Rhombiconusimperialis1 (1)4 (4)
Calamiconusquercinus12 (11)3 (2)
Dendroconusbetulinus8 (8)1 (1)
Fraterconusdistans2 (2)1 (1)
Lithoconusleopardus11 (9)3 (2)
Lividoconuslividus6 (6)3 (2)
Puncticulispulicarius5 (4)1 (1)1 (1)2 (2)1 (1)
Virgiconusemaciatus1 (1)4 (1)1 (1)2 (2)
Virgiconusflavidus5 (4)
Vituliconusvitulinus1 (1)2 (2)

Note.—Number of precursors and mature proteins (in parentheses) are shown.

Red, piscivorous; green, molluscivorous; brown, vermivorous (fire worms); blue, vermivorous.

Puillandre et al. (2012), Hoggard et al. (2017).

Superfamily A Diversity in Several Cone Snails with Different Diets Based on Conoserver Entries Note.—Number of precursors and mature proteins (in parentheses) are shown. Red, piscivorous; green, molluscivorous; brown, vermivorous (fire worms); blue, vermivorous. Puillandre et al. (2012), Hoggard et al. (2017).

Discussion

The cocktail of bioactive peptides produced in the venom duct of a cone snail is a complex mixture aimed at paralyzing specific preys and deterring predators (Dutertre et al. 2014). The composition of this cocktail greatly varies among species and currently, we are just starting to catalogue the repertoire of conotoxins and associated proteins produced by the more than 800 species of cones (e.g., Peng et al. 2016; Phuong et al. 2016; Li et al. 2017; Robinson, Li, Lu, et al. 2017). The recent advent of next generation RNA sequencing allows a robust approach to cataloguing all the transcripts expressed in the venom duct of a cone snail: not only it is possible to determine the bulk of mRNAs that are synthesized but also the variability of expression at distinct portions of the venom duct (Dutertre et al. 2014), among different individuals (Li et al. 2017), and under variable external conditions (Dutertre et al. 2010). Beyond identifying and describing all the key components of the venom, the ultimate, more ambitious goal of cataloguing studies is to tackle long-standing evolutionary and ecological questions, for example, how the diversity of conotoxins evolved (Duda and Palumbi 1999), how the venom mixtures were adapted to highly distinct diets (Duda et al. 2001) and predation strategies (Olivera et al. 2015), how a cone individual is able to modulate the composition of the venom both to prey and to deter predators (Dutertre et al. 2014), or what is the role of conotoxin diversification in speciation (Li et al. 2017). According to reconstructed phylogenies of the family Conidae, the piscivorous diet evolved independently in some Atlantic and Indo-Pacific cone genera (Duda et al. 2001; Duda and Palumbi 2004; Puillandre, Bouchet, et al. 2014; this work). The transcriptomes of several piscivorous genera (Gastridium, Pionoconus, and Textilia) from the Indo-Pacific have been reported but none from Atlantic piscivorous genera was available yet. Hence, the importance of sequencing the transcriptome of C. ermineus: its comparison with those of piscivorous cone snails from the Indo-Pacific, and against those of cones eating snails and worms could provide important clues on which conotoxins are needed specifically for fish hunting and how they evolved (Duda and Palumbi 2004). The obtained number of assembled contigs per C. ermineus venom duct transcriptome is comparable to those typically reported in equivalent studies also based on the Illumina platform (Peng et al. 2016; Li et al. 2017; Robinson, Li, Lu, et al. 2017). Moreover, the total number of clean reads, which mapped onto conotoxin transcripts constituted 57%. This number is somewhat higher than those reported by studies based on direct sequencing of individual cDNA clones such as, for example, the 39–50% of transcripts being conotoxins reported for Virgiconus virgo, Tesselliconus eburneus, R. imperialis, and C. marmoreus (Liu et al. 2012) or based on 454 sequencing, for example, the 42.7% of transcripts being conotoxins reported for P. consors (Terrat et al. 2012) but lower to the 88% reported for G. geographus (Hu et al. 2012). A total of 378 transcripts encoding conotoxin precursors were identified in the venom duct of C. ermineus. This number is similar to those reported for the venom duct transcriptomes of Virroconus coronatus (331; Phuong et al. 2016), Puncticulis arenatus (326; Phuong et al. 2016), or Harmoniconus sponsalis (401; Phuong et al. 2016) and larger than others such as those reported in D. betulinus (215; Peng et al. 2016), C. marmoreus (158; Lavergne et al. 2013), T. praecellens (149–155; Li et al. 2017), T. andremenezi (107–128; Li et al. 2017), or C. gloriamaris (108; Robinson, Li, Lu, et al. 2017). Until recently, most studies were based on single individuals whereas the current trend is to sequence several specimens as here (Barghi et al. 2015a; Peng et al. 2016; Li et al. 2017), which affects the comparison of numbers. In fact, the C. ermineus individuals produced each 145–176 conotoxin precursors, which is consistent with most reported single individual transcriptomes (see above). In C. ermineus, ∼20% of the inferred mature conotoxins were common to the three analyzed individuals and strikingly, each of them showed an important number of sequences not found in the other two. Therefore, it is very likely that even the number here reported underestimates the whole diversity of conotoxins produced by this species. Such interindividual differences are congruent with results reported in the closely related C. purpurascens (Rodriguez et al. 2015) and for three specimens of D. betulinus with different body sizes (Peng et al. 2016). In this regard, it has been suggested that differences in age/size could be a factor fostering intraspecific variation (Barghi et al. 2015a; Peng et al. 2016). The three specimens of C. ermineus here analyzed were presumably adults and they differed in their size: ERM1-3 had shells of 73.1, 55 and 46 mm in length, respectively. In contrast, other studies found little intraspecific variation within species of the genus Turriconus (Li et al. 2017) and of the genus Kioconus (Barghi et al. 2015a). Here, it is important to note that despite our specimens were from different islands (Boa Vista, Sal, and Santa Luzia), they have not accumulated larger cox1 sequence divergences than individuals within species of Turriconus or Kioconus (see additional supplementary fig. 2, Supplementary Material online). Altogether, our results suggest that, in the near future, many more individuals than those currently analyzed would be required to describe the whole richness of the venom of any cone species (Dutertre et al. 2010). The Illumina-based venom repertoires of C. ermineus and of other recently investigated cone species with various diets were compared in table 5. Most conotoxin precursor superfamilies reported in other cone species (Peng et al. 2016; Phuong et al. 2016; Li et al. 2017; Robinson, Li, Lu, et al. 2017) were also identified in C. ermineus. The most diverse (in number of members) superfamilies in C. ermineus were O1, O2, M, and T, a pattern that is conserved in other cone species (table 5). Interestingly, the W and Z superfamilies, which have a cysteine-poor mature peptide and were originally reported in C. marmoreus (Lavergne et al. 2013), were particularly abundant in C. ermineus. Different cone species seem to have undergone diversification bursts of particular superfamilies, which are otherwise poorly represented in other species (Duda and Remigio 2008; Puillandre et al. 2012; Barghi et al. 2015a). A paradigmatic case is the A superfamily, which is highly diverse in the Indo-Pacific piscivorous G. geographus (Safavi-Hemami et al. 2014), P. consors (Terrat et al. 2012), Pionoconus catus (Himaya et al. 2015), and T. bullatus (Hu et al. 2011) but underrepresented in C. ermineus. Other examples are: the superfamilies P and O1d, which are particularly rich in Turriconus (Li et al. 2017), the B1 superfamily in G. geographus, the I2 superfamily in V. virgo, the A, I3, and N superfamilies in Rolaniconus varius, the D superfamily in Rhizoconus vexillum (Prashanth et al. 2016), and the con-ikot-ikot and B2 superfamilies in K. tribblei and K. lenavati (Barghi et al. 2015a).
Table 5

Comparison of the Diversity of Main Conotoxin Precursor Superfamilies in Cone Snails with Different Diets

Species
Chelyconus ermineus
Gastridium geographus
Pionoconus consors
Cylinder gloriamaris
Conus marmoreus
Conus marmoreus
Dendroconus betulinus
Turriconus andremenezi
Turriconus praecellens
Rolaniconus varius
Virgiconus virgo
Rhombiconus imperialis













Reference
This work
Hu et al. 2012
Terrat et al. 2012
Robinson et al. 2017
Phuong et al. 2016
Lavergne et al. 2013
Peng et al. 2016
Li et al. 2017
Li et al. 2017
Phuong et al. 2016
Phuong et al. 2016
Phuong et al. 2016
NGS platform
HiSeq2000
FLX Titanium
454
HiSeq2000
HiSeq2000
454
HiSeq2000
HiSeq2000
HiSeq2000
HiSeq2000
HiSeq2000
HiSeq2000
Dietfishfishfishsnailsnailsnailwormwormwormwormwormfire worms
A812143109001653
A22a42
B1 (conantokin)721100800210
B2800110123201
B4020800
C000000100000
Con-ikot-ikot560200425810
Conkunitzin1317010800210
D300000002001
E200140200211
F400140300210
H700330313000
I1410220532202
I29003008453142
I34000002651600
I45442046000
J240400309100
K100000000205
L300000003300
M2728720553028332378
N0001206011161
O131181512136133223622269
O1d512
O2 (contryphan)241018341617175133
O3600100400210
P20130013263214012
Q3001000020
R101000
S853110123501
T36652120717413919138
U0101002010
V400000033030
W322
X2
Y000000100020
Y31
Z201
conoCAP01
Conomap00
Conophysin (Conopressin)611100012701
Conorfarmide01023
Contulakin014000000000
Insulin712
Prohormone-403021
Thyrostimulin α2
Thyrostimulin β1

Hyphen indicates that the study apparently did not search for the corresponding peptide.

Comparison of the Diversity of Main Conotoxin Precursor Superfamilies in Cone Snails with Different Diets Hyphen indicates that the study apparently did not search for the corresponding peptide. About 20% of the newly identified conotoxin precursors could not be assigned to known superfamilies based on their signal domain, and new unassigned conotoxin precursor superfamilies had to be proposed temporarily as in other studies (e.g., Barghi et al. 2015b; Peng et al. 2016). However, most of these unassigned superfamilies had homologs in other cone species, and thus, it is foreseen that as more cone transcriptomes become available, formal (phylogeny-based) classification of these unassigned superfamilies will be possible (Puillandre et al. 2012; Lavergne et al. 2013). Remarkably, we found a combination of mature peptides normally associated to the signal and propeptide domains of the O3 and T superfamilies, which in C. ermineus were associated to amino terminal sequences lacking signal and/or propeptide domains (unassigned families 16–22). This observation may evoke the possibility of domain shuffling as one of the underlying mechanisms for generating precursor diversity (Pi, Liu, Peng, Liu, et al. 2006). Despite the coverage is high and homogeneous throughout these assembled transcripts (including the boundary of the mature peptide and the rest of the putative precursor), the possibility of an assembly artifact cannot be fully excluded given that most of these sequences do not show a canonical three-domain structure (and should be validated experimentally). One hot debate in the past has been the relevance of classifying conotoxins into cysteine-rich and cysteine-poor categories (Puillandre et al. 2012; Robinson, Li, Bandyopadhyay, et al. 2017). Our results support that this dichotomy is irrelevant from an evolutionary perspective, although it may have some functional meaning. According to our phylogenetic and sequence comparison analyses, there are at least three different evolutionary origins for cysteine-poor mature peptides: 1) whole conotoxin precursor superfamilies carrying mature peptides with no cysteine framework, such as, for example, R, W, and Z; 2) conotoxin precursor superfamilies in which cysteine-poor mature peptides are associated to a distinct signal and propeptide combination, and thus have a single evolutionary origin, such as, for example, M; and 3) conotoxin precursor superfamilies in which cysteine-poor mature peptides are associated to signal and propeptide combinations that also can be linked to mature peptides with a known cysteine framework, indicating multiple evolutionary origins, such as, for example, T. This third evolutionary pattern supports the importance of modularity as evolutionary mechanism for generating conotoxin diversity (Pi, Liu, Peng, Liu, et al. 2006). Importantly, the use of the amino acid sequences of the propeptide domain in addition to those of the signal domain for phylogenetic analyses and sequence similarity comparisons proved to be very informative (Lavergne et al. 2013) and led to discrimination of potential distinct paralogs within the different conotoxin precursor superfamilies. For instance, at least four paralogs were identified within the M superfamily (named M1-3, M-WF), three within the T superfamily (T1-3; see Liu et al. 2012), which detected up to four clades in their phylogenetic analysis), six within the O1 superfamily (O1-1-6; see Li et al. 2017), which already distinguish O1d from the remaining members of O1) and six within the O2 superfamily (O2-1-6). This, thus far, mostly overlooked diversity within each superfamily will need to be taken into account in future studies when updating classifications of conotoxin precursor superfamilies (Lavergne et al. 2013) and when summarizing the composition of venoms in the different species as the different paralogs may well have different functions (Altenhoff et al. 2012). The current classification system of conotoxin superfamilies originally based on the alphabet can barely integrate the many novel signal sequences (and corresponding superfamilies) found in every new study and would have even more serious problems in dealing with paralog diversity within each superfamily, if this information is also incorporated. Therefore, a radically new evolutionary-based classification (beyond the goals of the present study) is urgently needed, and should come out from the consensus among experts in the field. The distribution of conotoxin precursor diversity along the venom duct of C. ermineus showed some degree of regionalization. The A2 and B2 superfamilies had more diversity toward the proximal portion; Z was more diverse in the medium portion; and O1, M, and T superfamilies had more diversity toward the distal portion. The corresponding pattern in G. geographus showed the A and M superfamilies to be more abundant in the proximal portion, and O1 and T more diverse in the distal portion (Hu et al. 2012). The distribution pattern found in the molluscivorous Cylinder textile, showed the M and T superfamilies as more abundant in the proximal region whereas the O superfamily was more diverse in the distal region (Garrett et al. 2005). Overall, these comparisons show two discriminant patterns: 1) the T superfamily is more diverse in the distal portion of piscivorous cones and in the proximal region of the molluscivorous cone; and 2) the M superfamily is more diverse in the distal region of C. ermineus and in the proximal portion of G. geographus. The venom duct of a cone snail is a specialized convoluted duct mostly devoted to the biosynthesis of conotoxins (Safavi-Hemami et al. 2014), as further demonstrated here by the elevated proportion of conotoxin transcripts detected in the transcriptome of the C. ermineus venom duct. Moreover, our results support that conotoxin expression is localized preferentially in the medium and proximal regions of the venom duct of C. ermineus whereas the distal region is mostly devoted to the expression of house-keeping genes. The inferred expression patterns of conotoxin precursor superfamilies in C. ermineus showed drastic variations among individuals. This may reflect natural conditions or potential methodological biases (despite the use of common sample handling, laboratory, sequencing, and analytical procedures), although we cannot discern between both possibilities. In any case, these results highlight the need of sequencing a fair amount of specimens to generate statistically robust quantitative comparisons and conclusions, as it is becoming the rule for model system species (Schurch et al. 2016). Being cautious in the interpretation of the results of our expression analyses (i.e., considering reliable only those TPM values, which are similar in at least two individuals), we observed that those conotoxin precursor superfamilies showing higher levels of expression are also those having more member diversity (O1, O2, M, and T). A striking exception to this pattern is the A superfamily, which has few distinct members in C. ermineus, but the highest levels of expression (see below). Most superfamilies showed low expression levels, suggesting a subtle contribution of them to the final venom composition. The expression of most superfamilies appears to reflect some degree of compartmentalization. In particular, the A superfamily is preferentially expressed in the medium and proximal regions of the venom duct whereas most other superfamilies tend to be expressed toward the distal region (as occurs in Indo-Pacific cones such as G. geographus; Hu et al. 2012). Moreover, we found that this compartmentalization of the expression of the A superfamily is n statistically significant, further indicating its functional importance in the venom of C. ermineus. The different strategies of prey capture among piscivorous cone species determine the exact mixture (termed “cabal”) of venom components, which will act coordinately to produce a specific physiological response (Olivera et al. 2016). While the Indo-Pacific species G. geographus has a “net engulfment” strategy, the Indo-Pacific species of the genus Pionoconus and Textilia as well as the Atlantic and Eastern Pacific species of the genus Chelyconus have a “taser and tether” or “hook and line” strategy (Olivera et al. 2015, 2016). The “net engulfment” strategy requires disorienting the fish by the release of the “nirvana cabal” into the water. This is a mixture, among others, of B1 superfamily (Hu et al. 2012) and insulin-like peptides (Safavi-Hemami et al. 2015; Robinson, Li, Bandyopadhyay, et al. 2017). Once fishes are disoriented and engulfed, the cone injects into each captured fish, a group of paralytic conotoxins, the “motor cabal,” which includes the αA conotoxins, and the M superfamily µ- and ψ-conotoxins. In the case of the “taser and tether” strategy, the capture of the prey occurs through direct injection of two different mixtures, the “lightning-strike” and the motor cabals (Olivera 2002). The lightning-strike cabal induces an excitatory response, which ultimately causes tetanic paralysis. This cabal includes δ-conotoxins and κ-conotoxins from the O1 superfamily, conkunitzins, and κA conotoxins (Himaya et al. 2018). Therefore, while the nirvana cabal is exclusive of some of species of the genus Gastridium, the lightning-strike cabal is found in genera such as Pionoconus, Textilia, and Chelyconus, and the motor cabal is found in all four above-mentioned genera (Olivera et al. 2016). The κA conotoxins of the lightning-strike cabal have generally the same cysteine spacing pattern in Pionoconus, Textilia, and Chelyconus (except C. purpurascens). However, the main blockers of K+ channels belong to the O superfamily in Chelyconus, whereas this physiological role is accomplished by conkunitzins in Pionoconus (Olivera et al. 2016). With regards to the motor cabal, there are striking instances of differential recruitment of the αA conotoxins. All piscivorous genera have generally at least one member of the α4/7 subfamily, which blocks neuronal nAChRs (Azam and McIntosh 2009). In addition, Gastridium and Pionoconus have members of the α3/5 subfamily, which inhibit muscle nAChRs. However, Chelyconus and Textilia have instead members of the α4/4 subfamily. The α4/4 conotoxins of C. ermineus and C. purpurascens have been shown to selectively bind muscle nAChRs (López-Vera et al. 2007; Quinton et al. 2013) whereas those of Textilia block neuronal nAChRs (Chi et al. 2006). The presence of α4/4 conotoxins has been detected also in Afonsoconus, and three species of Pionoconus, but their functions have not been determined (Puillandre et al. 2012). The α4/4 sequences of Pionoconus and Textilia (and probably Afonsoconus) are more closely related phylogenetically than those of Chelyconus (Puillandre et al. 2012). Altogether, our results suggest independent genetic and biochemical pathways to evolve the same diet adaptation, and thus, favor the hypothesis of a convergent origin of piscivory in cones from the Indo-Pacific and Atlantic oceans (Duda et al. 2001; Puillandre, Bouchet, et al. 2014). Finally, it is interesting to note that recent analysis of conotoxin envenomation in C. purpurascens showed that different individuals could include alternatively either the lightning-strike, the motor or both cabals in the composition of their venoms when preying (Himaya et al. 2018). In our case (see supplementary table 1, Supplementary Material online), the three individuals of C. ermineus produced conotoxins belonging to the motor cabal including α4/4 conotoxins (the α4/7 conotoxin was not detected), and M superfamily µ- and ψ-conotoxins as well as had conotoxins belonging to the lightning-strike cabal including the κA conotoxins, and the O1 superfamily δ- (Aman et al. 2015) and κ-conotoxins. Moreover, the differential high levels of A superfamily transcripts in the proximal region of the venom duct in G. geographus as part of the motor cabal have been associated to defense-evoked responses (Dutertre et al. 2014). In C. ermineus (see supplementary table 1, Supplementary Material online), for the A superfamily, the members involved in the motor cabal (α4/4 conotoxins) and the lightning-strike cabal (κA conotoxins) showing differential expression are located in the proximal and distal regions of the venom duct, respectively, supporting the regionalization of the cabals as reported in Protostrioconus obscurus (Dutertre et al. 2014). Nevertheless, all the above-mentioned inferences need to be interpreted with caution and as tentative until further comparative analyses based on more individuals are carried out.

Conclusions

The venom duct of C. ermineus produces a great diversity of conotoxin precursors, most corresponding to known superfamilies and several showing novel signal domains. Comparison of these data to the venom repertoires reported from different cone species with various diets supports that some superfamilies (O1, O2, T, M) are widespread among cone species, making the basic venom toolkit, whereas others are restricted to fewer lineages. The different superfamilies show various degrees of expansion depending on the species. In the case of C. ermineus, the cysteine-poor superfamilies W and Z are particularly diverse. In this regard, the wide distribution of cysteine-poor mature peptides among superfamilies indicate multiple and diverse origins. Both, diversity and expression of conotoxins are regionalized along the venom duct. Diversity in the number of members of a superfamily increases toward the distal region whereas the less diverse superfamilies in the proximal region show higher expression levels. In particular, the A superfamily, which is highly diverse in piscivorous cones from the Indo-Pacific Ocean, consists of rather few and distinct (α4/4) members in the C. ermineus venom, but these show differentially and significantly high expression levels toward the proximal region. These contrasting patterns support convergent strategies to produce the motor cabal, which targets nicotinic acetylcholine receptors, and seems essential for deterring/preying fishes. Our results show that each newly analyzed cone species uncovers additional conotoxin diversity and thus, that we are still far from covering the whole repertoire of conotoxins, as the venom duct transcriptome of the majority of cone species awaits sequencing and analysis. Moreover, the numerous unassigned superfamilies, which are discovered in every new cone transcriptome together with the emerging evidence of the existence of distinct paralogs within each superfamily prompt for a revision and an update of the nomenclature of conotoxins as the use of the alphabet-based classification seems to be too constrained and obsolete in evolutionary terms. Click here for additional data file.
  97 in total

1.  Mechanisms for evolving hypervariability: the case of conopeptides.

Authors:  S G Conticello; Y Gilad; N Avidan; E Ben-Asher; Z Levy; M Fainzilber
Journal:  Mol Biol Evol       Date:  2001-02       Impact factor: 16.240

2.  Selection of conserved blocks from multiple alignments for their use in phylogenetic analysis.

Authors:  J Castresana
Journal:  Mol Biol Evol       Date:  2000-04       Impact factor: 16.240

3.  The prodomain of a secreted hydrophobic mini-protein facilitates its export from the endoplasmic reticulum by hitchhiking on sorting receptors.

Authors:  Silvestro G Conticello; Noga D Kowalsman; Christian Jacobsen; Guennady Yudkovsky; Kazuki Sato; Zvulun Elazar; Claus Munck Petersen; Ami Aronheim; Mike Fainzilber
Journal:  J Biol Chem       Date:  2003-05-27       Impact factor: 5.157

4.  NMR solution structures of delta-conotoxin EVIA from Conus ermineus that selectively acts on vertebrate neuronal Na+ channels.

Authors:  Laurent Volpon; Hung Lamthanh; Julien Barbier; Nicolas Gilles; Jordi Molgó; André Ménez; Jean-Marc Lancelin
Journal:  J Biol Chem       Date:  2004-02-19       Impact factor: 5.157

5.  Biochemical and gene expression analyses of conotoxins in Conus textile venom ducts.

Authors:  James E Garrett; Olga Buczek; Maren Watkins; Baldomero M Olivera; Grzegorz Bulaj
Journal:  Biochem Biophys Res Commun       Date:  2005-03-04       Impact factor: 3.575

6.  Conantokin-G precursor and its role in gamma-carboxylation by a vitamin K-dependent carboxylase from a Conus snail.

Authors:  P K Bandyopadhyay; C J Colledge; C S Walker; L M Zhou; D R Hillyard; B M Olivera
Journal:  J Biol Chem       Date:  1998-03-06       Impact factor: 5.157

7.  A novel alpha conotoxin (alpha-PIB) isolated from C. purpurascens is selective for skeletal muscle nicotinic acetylcholine receptors.

Authors:  Estuardo López-Vera; Richard B Jacobsen; Michael Ellison; Baldomero M Olivera; Russell W Teichert
Journal:  Toxicon       Date:  2007-02-24       Impact factor: 3.033

8.  alpha-Conotoxin EI, a new nicotinic acetylcholine receptor antagonist with novel selectivity.

Authors:  J S Martinez; B M Olivera; W R Gray; A G Craig; D R Groebe; S N Abramson; J M McIntosh
Journal:  Biochemistry       Date:  1995-11-07       Impact factor: 3.162

9.  Gene expression and feeding ecology: evolution of piscivory in the venomous gastropod genus Conus.

Authors:  Thomas F Duda; Stephen R Palumbi
Journal:  Proc Biol Sci       Date:  2004-06-07       Impact factor: 5.349

10.  A delta-conotoxin from Conus ermineus venom inhibits inactivation in vertebrate neuronal Na+ channels but not in skeletal and cardiac muscles.

Authors:  Julien Barbier; Hung Lamthanh; Frédéric Le Gall; Philippe Favreau; Evelyne Benoit; Haijun Chen; Nicolas Gilles; Nitza Ilan; Stefan H Heinemann; Dalia Gordon; André Ménez; Jordi Molgó
Journal:  J Biol Chem       Date:  2003-11-13       Impact factor: 5.157

View more
  12 in total

1.  Conodipine-P1-3, the First Phospholipases A2 Characterized from Injected Cone Snail Venom.

Authors:  Carolina Möller; W Clay Davis; Evan Clark; Anthony DeCaprio; Frank Marí
Journal:  Mol Cell Proteomics       Date:  2019-02-14       Impact factor: 5.911

Review 2.  Hormone-like conopeptides - new tools for pharmaceutical design.

Authors:  Ashlin Turner; Quentin Kaas; David J Craik
Journal:  RSC Med Chem       Date:  2020-09-24

3.  Comparative transcriptomics of the venoms of continental and insular radiations of West African cones.

Authors:  Samuel Abalde; Manuel J Tenorio; Carlos M L Afonso; Rafael Zardoya
Journal:  Proc Biol Sci       Date:  2020-06-17       Impact factor: 5.349

4.  Conotoxin Diversity in the Venom Gland Transcriptome of the Magician's Cone, Pionoconus magus.

Authors:  José R Pardos-Blas; Iker Irisarri; Samuel Abalde; Manuel J Tenorio; Rafael Zardoya
Journal:  Mar Drugs       Date:  2019-09-27       Impact factor: 5.118

Review 5.  Biomedical Potential of the Neglected Molluscivorous and Vermivorous Conus Species.

Authors:  Yihe Zhao; Agostinho Antunes
Journal:  Mar Drugs       Date:  2022-01-27       Impact factor: 5.118

6.  Identification of Novel Conotoxin Precursors from the Cone Snail Conus spurius by High-Throughput RNA Sequencing.

Authors:  Roberto Zamora-Bustillos; Mario Alberto Martínez-Núñez; Manuel B Aguilar; Reyna Cristina Collí-Dula; Diego Alfredo Brito-Domínguez
Journal:  Mar Drugs       Date:  2021-09-28       Impact factor: 5.118

7.  Venom duct origins of prey capture and defensive conotoxins in piscivorous Conus striatus.

Authors:  Ai-Hua Jin; Brett Hamilton; Subash K Rai; S W A Himaya; Paul Alewood; Richard J Lewis
Journal:  Sci Rep       Date:  2021-06-24       Impact factor: 4.996

8.  A phylogeny-aware approach reveals unexpected venom components in divergent lineages of cone snails.

Authors:  Alexander Fedosov; Paul Zaharias; Nicolas Puillandre
Journal:  Proc Biol Sci       Date:  2021-07-07       Impact factor: 5.530

9.  Venom Diversity and Evolution in the Most Divergent Cone Snail Genus Profundiconus.

Authors:  Giulia Fassio; Maria Vittoria Modica; Lou Mary; Paul Zaharias; Alexander E Fedosov; Juliette Gorson; Yuri I Kantor; Mandё Holford; Nicolas Puillandre
Journal:  Toxins (Basel)       Date:  2019-10-28       Impact factor: 4.546

10.  A Combined Transcriptomics and Proteomics Approach Reveals the Differences in the Predatory and Defensive Venoms of the Molluscivorous Cone Snail Cylinder ammiralis (Caenogastropoda: Conidae).

Authors:  Samuel Abalde; Sébastien Dutertre; Rafael Zardoya
Journal:  Toxins (Basel)       Date:  2021-09-10       Impact factor: 4.546

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.