| Literature DB >> 30917600 |
Ge Yao1, Chao Peng2, Yabing Zhu3, Chongxu Fan4, Hui Jiang5, Jisheng Chen6, Ying Cao7, Qiong Shi8,9.
Abstract
The venom of each Conus species consists of a diverse array of neurophysiologically active peptides, which are mostly unique to the examined species. In this study, we performed high-throughput transcriptome sequencing to extract and analyze putative conotoxin transcripts from the venom ducts of 3 vermivorous cone snails (C. caracteristicus, C. generalis, and C. quercinus), which are resident in offshore waters of the South China Sea. In total, 118, 61, and 48 putative conotoxins (across 22 superfamilies) were identified from the 3 Conus species, respectively; most of them are novel, and some possess new cysteine patterns. Interestingly, a series of 45 unassigned conotoxins presented with a new framework of C-C-C-C-C-C, and their mature regions were sufficiently distinct from any other known conotoxins, most likely representing a new superfamily. O- and M-superfamily conotoxins were the most abundant in transcript number and transcription level, suggesting their critical roles in the venom functions of these vermivorous cone snails. In addition, we identified numerous functional proteins with potential involvement in the biosynthesis, modification, and delivery process of conotoxins, which may shed light on the fundamental mechanisms for the generation of these important conotoxins within the venom duct of cone snails.Entities:
Keywords: Conus; conotoxin; phylogeny; transcriptome sequencing; venom duct
Mesh:
Substances:
Year: 2019 PMID: 30917600 PMCID: PMC6471451 DOI: 10.3390/md17030193
Source DB: PubMed Journal: Mar Drugs ISSN: 1660-3397 Impact factor: 5.118
Statistics of venom duct transcriptome sequencing data for the 3 Conus species.
| Species | Raw Data (Gb) | Clean Data (Gb) | Q20* (%) | Nonsequenced (%) | GC Content (%) |
|---|---|---|---|---|---|
| 5.51 | 4.57 | 95.99 | 0 | 47.84 | |
| 3.47 | 3.21 | 98.31 | 0.01 | 47.3 | |
| 5.32 | 4.37 | 96.00 | 0 | 47.09 |
* A quality score for the percentage of incorrect bases at less than 1%.
Summary of sequences produced by the assembling for the 3 Conus species.
| Species | |||
|---|---|---|---|
| Total reads (n) | 50,788,576 | 48,557,734 | 35,694,024 |
| Base pairs (Mb) | 4,570.97 | 4,370.2 | 3,212.46 |
| Mean length (bp) | 90 | 90 | 90 |
| Total number | 213,155 | 219,692 | 153,249 |
| Base pairs (Mb) | 47.84 | 60.75 | 40.22 |
| Mean length (bp) | 224 | 276 | 262 |
| N50 (bp) | 236 | 307 | 313 |
| Total number | 79,324 | 103,682 | 61,926 |
| Base pairs (Mb) | 47.57 | 65.38 | 34.96 |
| Mean length (bp) | 599 | 630 | 564 |
| N50 (bp) | 794 | 891 | 717 |
| Total number | 72,462 | 95,438 | 61,002 |
| Base pairs (Mb) | 39.61 | 54.87 | 33.67 |
| Mean length (bp) | 546 | 574 | 552 |
| N50 (bp) | 670 | 749 | 688 |
Classification and cysteine patterns of the conotoxins identified from C. caracteristicus.
| Superfamily | Number | Cysteine Pattern (Number of Conotoxins) | |
|---|---|---|---|
| 11 | CC-C-C (8), CC-C (3) | ||
| 2 | Cysteine free | ||
| 1 | Cysteine free | ||
| 2 | C-C-CC-C-C-C-C (1), C-CC-C-CC-C-C-C-C (1) | ||
| 1 | C-C-CC-CC-C-C | ||
| 6 | C-C-CC-CC-C-C (1), C-C-C-C-CC-C-C (4), C-C-CC-C-C (1) | ||
| 4 | C-C-CC-CC-C-C (3), C-C-CC-C-C (1) | ||
| 7 | C-C-C-C | ||
| 4 | C-C-C-C | ||
| 6 | CC-C-C-CC (5), CC-C-C-C-C (1) | ||
| 22 | C-C-CC-C-C | ||
| 11 | C-C-CC-C-C (3), C-C-CC-C-C-C-C (3), C-C (5) | ||
| 6 | C-C-CC-C-C | ||
| 3 | C-C-C-C-C-C-C-C-C-C | ||
| 9 | CC-CC (8), C-C-CC (1) | ||
| 1 | C-C-CC-C-CC-C | ||
| 1 | C-C-C-C-C-C | ||
| 21 | C-C-C-C-C-C (19), C-C-C-C (1), CC-C-C-C-C (1) | ||
Classification and cysteine patterns of the conotoxins identified from C. generalis.
| Superfamily | Number | Cysteine Pattern (Number of Conotoxins) | |
|---|---|---|---|
| 2 | CC-C-C | ||
| 1 | Cysteine free | ||
| 1 | Cysteine free | ||
| 1 | C-CC-C-CC-C-C-C-C | ||
| 1 | C-C-CC-CC-C-C | ||
| 4 | C-C-CC-CC-C-C (2), C-C-C-C-CC-C-C (2) | ||
| 1 | C-C-CC-CC-C-C | ||
| 3 | C-C-C-C | ||
| 4 | CC-C-C-CC (3), C-C-CC (1) | ||
| 12 | C-C-CC-C-C | ||
| 4 | C-C-CC-C-C (3), C-C-CC-C-C-C-C (1) | ||
| 3 | C-C-CC-C-C | ||
| 2 | C-C-C-C-C-C | ||
| 1 | C-C-C-C-C-C-C-C-C-C | ||
| 5 | CC-CC | ||
| 1 | CC-C-C-C-CC-C-C-C | ||
| 1 | CC-C-C | ||
| 1 | C-C-C-CCC-C-C-C-C | ||
| 13 | C-C-C-C-C-C | ||
Classification and cysteine patterns of the conotoxins identified from C. quercinus.
| Superfamily | Number | Cysteine Pattern (Number of Conotoxins) | |
|---|---|---|---|
| 5 | CC-C-C | ||
| 3 | Cysteine free | ||
| 3 | C-C-CC-CC-C-C (2), C-C-C-C-CC-C-C (1) | ||
| 10 | CC-C-C-CC (9), C-CC-C-C-C (1) | ||
| 8 | C-C-CC-C-C | ||
| 3 | C-C-CC-C-C | ||
| 1 | C-C-CC-C-C | ||
| 1 | CC-CC | ||
| 3 | C-C-CC-C-C-C-C | ||
| 1 | C-C-CC-C-CC-C | ||
| 1 | CC-C-C-C-C-CC-C-C-C-C | ||
| 2 | C-C-C-C-C-C | ||
| 7 | C-C-C-C-C-C | ||
Figure 1Summary of the conotoxins identified from the 3 Conus species. Many superfamilies or groups of conotoxins were classified in (A) C. caracteristicus, (B) C. generalis, and (C) C. quercinus.
Figure 2Comparison of the top 10 conotoxins (with the highest FPKM values) from the 3 transcriptome datasets.
Figure 3A Bayesian phylogenetic tree of the O-superfamily conotoxins. Posterior probabilities are labeled at each node.
Figure 4Alignment of the achieved new superfamily conotoxins from venom duct transcriptomes of C. betulinus [32] (betu), C. caracteristicus (cara), C. generalis (gene), and C. quercinus (quer).