| Literature DB >> 30083468 |
Jason Macrander1,2, Jyothirmayi Panda3, Daniel Janies3,4, Marymegan Daly2, Adam M Reitzel1.
Abstract
The advent of next-generation sequencing has resulted in transcriptome-based approaches to investigate functionally significant biological components in a variety of non-model organism. This has resulted in the area of "venomics": a rapidly growing field using combined transcriptomic and proteomic datasets to characterize toxin diversity in a variety of venomous taxa. Ultimately, the transcriptomic portion of these analyses follows very similar pathways after transcriptome assembly often including candidate toxin identification using BLAST, expression level screening, protein sequence alignment, gene tree reconstruction, and characterization of potential toxin function. Here we describe the Python package Venomix, which streamlines these processes using common bioinformatic tools along with ToxProt, a publicly available annotated database comprised of characterized venom proteins. In this study, we use the Venomix pipeline to characterize candidate venom diversity in four phylogenetically distinct organisms, a cone snail (Conidae; Conus sponsalis), a snake (Viperidae; Echis coloratus), an ant (Formicidae; Tetramorium bicarinatum), and a scorpion (Scorpionidae; Urodacus yaschenkoi). Data on these organisms were sampled from public databases, with each original analysis using different approaches for transcriptome assembly, toxin identification, or gene expression quantification. Venomix recovered numerically more candidate toxin transcripts for three of the four transcriptomes than the original analyses and identified new toxin candidates. In summary, we show that the Venomix package is a useful tool to identify and characterize the diversity of toxin-like transcripts derived from transcriptomic datasets. Venomix is available at: https://bitbucket.org/JasonMacrander/Venomix/.Entities:
Keywords: Bioinformatics; Transcriptome; Venom
Year: 2018 PMID: 30083468 PMCID: PMC6074769 DOI: 10.7717/peerj.5361
Source DB: PubMed Journal: PeerJ ISSN: 2167-8359 Impact factor: 2.984
Figure 1Venomix Pipeline Outline.
Outline showing the stepwise progression of the Venomix pipeline, including the necessary inputs and files included for every Toxin Group directory. Orange boxes indiciate programs used.
Species-specific Venomix outputs following different search strategies
| Original publication | Stringent ( | Less stringent ( | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Groups | Transcripts | Evaluated | Groups | Transcripts | Evaluated | Groups | Transcripts | Evaluated | ||
| 35 | 780 | 393(363) | 22 | 61 | 44(13) | 75 | 293 | 246(45) | ||
| 8 | 82 | 62(35) | 72 | 339 | 147(116) | 130 | 775 | 202(143) | ||
| 32 | 527 | 287/62 | 36 | 289 | 95(14) | 75 | 761 | 201(36) | ||
| 11 | 210 | 71(6) | 50 | 277 | 48(34) | 117 | 689 | 179(38) | ||
Notes.
N, number of candidate toxins identified in original study; Groups, number of toxins types identified based on sequence similarity; , conotoxins only (Phuong, Mahardika & Alfaro, 2016); , scorpion toxins only (Luna-Ramírez et al., 2015); Transcripts, total number of unique transcripts evaluated; †, includes duplicates as cumulative after three iterations in Trinity [see Phuong, Mahardika & Alfaro, 2016]; β, >100 TPM difference upregulated in the venom gland compared the ant carcass; , number of candidates based on different E-values 10/1E−3 thresholds; Evaluated, number of unique transcripts retained after using BLAST screening, parenthesis indicates number of transcripts identified using a Toxclassifier score of 1 or greater.
Figure 2Comparison of Conotoxin Transcripts for C. sponsalis.
Number of candidate toxin transcripts from each toxin gene family from the original study (Phuong, Mahardika & Alfaro, 2016) and Venomix.
Figure 3Number of previously predicted toxin compared to those derived from Venomix.
Number of candidate toxin transcripts from each toxin gene family from the original study (Hargreaves et al., 2014) and Venomix candidates most highly expressed in the venom gland with a TPM >1.0.
Figure 4Candidate toxins from U. yaschenkoi.
(A) Candidate toxins from U. yaschenkoi highlighting alignment difference in the candidate Toxin-like protein 14 sequencing in both analyses. Conserved residues across the alignment are highlighted. (B) Maximum likelihood neprilysin gene tree highlighting the abundance and diversity of candidate neprilysin toxins and non-venomous neprilysin genes. Branches associated with transcripts from U. yaschenkoi are highlighted in orange throughout the tree. Venomous taxa emphasized with bold font.
Figure 5Taxonomic distribution of venom and toxin proteins in the ToxProt dataset.
Deuterostomes are highlighted in green, protostomes in brown, and cnidarians in grey.