| Literature DB >> 29522462 |
S W A Himaya1, Richard J Lewis2.
Abstract
Cone snail venoms are considered a treasure trove of bioactive peptides. Despite over 800 species of cone snails being known, each producing over 1000 venom peptides, only about 150 unique venom peptides are structurally and functionally characterized. To overcome the limitations of the traditional low-throughput bio-discovery approaches, multi-omics systems approaches have been introduced to accelerate venom peptide discovery and characterisation. This "venomic" approach is starting to unravel the full complexity of cone snail venoms and to provide new insights into their biology and evolution. The main challenge for venomics is the effective integration of transcriptomics, proteomics, and pharmacological data and the efficient analysis of big datasets. Novel database search tools and visualisation techniques are now being introduced that facilitate data exploration, with ongoing advances in related omics fields being expected to further enhance venomics studies. Despite these challenges and future opportunities, cone snail venomics has already exponentially expanded the number of novel venom peptide sequences identified from the species investigated, although most novel conotoxins remain to be pharmacologically characterised. Therefore, efficient high-throughput peptide production systems and/or banks of miniaturized discovery assays are required to overcome this bottleneck and thus enhance cone snail venom bioprospecting and accelerate the identification of novel drug leads.Entities:
Keywords: cone snails; proteomics; transcriptomics; venom; venomics; visualisation
Mesh:
Substances:
Year: 2018 PMID: 29522462 PMCID: PMC5877649 DOI: 10.3390/ijms19030788
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Performance of next-generation sequencing platforms.
| Developer | System | Read Length (bp) | Error Rate (%) |
|---|---|---|---|
| Illumina | Illumina Hi-Seq2000 | 2 × 150 bp | 0.1 |
| Illumina Hi-Seq3000/4000 | 2 × 150 bp | 0.1 | |
| Illumina Mi-Seq (Version 3) | 2 × 300 bp | 0.1 | |
| Thermo Fisher Scientific | Ion Torrent PGM 318 | 400 bp | 1 |
| Applied Biosystems | ABI 3730/ABI 3730xl | 500 bp | 0.4 |
| Pacific Biosciences | PacBio RS II | 10–20 kb | 10–15 |
| PacBio Sequel | >20 kb | 10–15 | |
| Oxford | Nanopore PromethION | 900 kb | 4 |
| Roche | 454 (discontinued in 2013) | 700 bp | 1 |
New superfamilies discovered by transcriptomics.
| Species | New Superfamily | Cysteine Framework | Sequencing | Reference |
|---|---|---|---|---|
| MRFYM- | VI/VII | Illumina HiSeq 2000 | [ | |
| MKISL- | VI/VII | Illumina HiSeq 2000 | [ | |
| N, B, H, E, F, H2, I4, M2, N2, O4, Q, R, U, W, X, Y2, Y3, Z | XV, VIIII, VI/VII, N, N, III, C1, C7 *, N, VI/VII, N, N, VI/VII, N, C2, C4 *, C2, C2 | Roche 454 | [ | |
| Cat-NSF1 | VI/VII | Roche 454 | [ | |
| SF-mi 1–8 | XIII, C8-novel, VI/VII, N, N, XV, NA, XIII | Roche 454 | [ | |
| NSVx1-4 | XIV, XXIV, VI/VII, XXVIII (novel) | Roche 454 | [ | |
| NSF-bt01–09 | IX, XV, VI/VII, XIV, VI/VII, IX, VIII, VI/VII, IX | ABI 3730 | [ | |
| SF-Epi 1–16 | variable, III a, V, C7 *, V, V, V, XI, C7 *, N, C5 *, NA, XVIII, VI/VII, IV, C5 * | Illumina MiSeq | [ | |
| SF-01-04 | C12 *, C12 *, IX, XIII, | Illumina HiSeq 2000 | [ | |
| J2 | XIV | Roche GS-FLX | [ | |
| MKAVA-, MSRLF-, MMLFM-, MLSML- | XXII, N, VIII, C12 * | Illumina HiSeq 2000 | [ | |
| SF-05-06 | XIV, C12 * | Illumina HiSeq 2000 | [ | |
| Put.MGGRF, Put.MKAVA | N, C8 * | Illumina HiSeq 2000 | [ | |
| Put.MSGLR, Put.MUSGK | VI/VII, C6 * | Illumina HiSeq 2000 | [ |
N, no cysteines were detected in the mature sequence; * no framework name has been defined, possibly a novel cysteine framework; a probable predicted framework.
Commonly used search algorithms and proteomic analysis tools. PTM: post-translational modification.
| Tool | Function |
|---|---|
| PEAKS DB | Database search engine, run in parallel with de novo sequencing, to automatically validate the search results |
| ProteinPilot Software | Enables peptide identification while considering PTMs, non-tryptic cleavages, and amino acid substitutions. Supports quantification using iTRAQ, mTRAQ, and SILAC |
| Mascot | Peptide mass fingerprinting and MS/MS database searching |
| Protein Prospector | Proteomic analysis tools including “Batch-Tag” for instrument- and fragmentation mode-optimised analysis |
| MassMatrix | Search algorithm for tandem MS data that ranks peptide and protein matches |
| Byonic | MS/MS data search for fragment identifications to produce protein scores and identification probabilities |
| MarkerView Software | Statistical analysis and visualisation of quantitative mass spec data sets from proteomic profiling applications |
| MultiQuant Software | Quantitation and targeted visualisation of TripleTOF or QTRAP data, including MRM and SWATH acquisition |
| MaxQuant | Quantitative proteomics tool for analysis of label-free and SILAC-based proteomics data |
Figure 1A rational venomic approach integrating proteomic data visualization approaches to accelerate the comparisons of complex venoms and the rapid identification of likely functionally relevant novel peptides.
Figure 2Proteomic visualisation of the known peptides and most abundant novel peptides of the injected predatory venom collected from nine individual specimens of C. purpurascens. The heatmap matrix shows peptides that co-cluster with each of the three groups, revealing remarkable venom variability across these specimens. The red, open arrows indicate known excitatory peptides, and the blue, open arrows indicate known neuromuscular blockers. The relative intensity of each peptide is shown in a gradient of brown, with the absence of peptide indicated by the blue color. This figure is adapted from Himaya et al. 2018.