| Literature DB >> 27089325 |
Bing Xie1,2, Xiaofeng Li3, Zhilong Lin4, Zhiqiang Ruan5, Min Wang6, Jie Liu7, Ting Tong8, Jia Li9, Yu Huang10, Bo Wen11, Ying Sun12,13, Qiong Shi14,15,16.
Abstract
Fish venom remains a virtually untapped resource. There are so few fish toxin sequences for reference, which increases the difficulty to study toxins from venomous fish and to develop efficient and fast methods to dig out toxin genes or proteins. Here, we utilized Chinese yellow catfish (Pelteobagrus fulvidraco) as our research object, since it is a representative species in Siluriformes with its venom glands embedded in the pectoral and dorsal fins. In this study, we set up an in-house toxin database and a novel toxin-discovering protocol to dig out precise toxin genes by combination of transcriptomic and proteomic sequencing. Finally, we obtained 15 putative toxin proteins distributed in five groups, namely Veficolin, Ink toxin, Adamalysin, Za2G and CRISP toxin. It seems that we have developed a novel bioinformatics method, through which we could identify toxin proteins with high confidence. Meanwhile, these toxins can also be useful for comparative studies in other fish and development of potential drugs.Entities:
Keywords: Chinese yellow catfish; proteome; transcriptome; venom
Mesh:
Substances:
Year: 2016 PMID: 27089325 PMCID: PMC4849012 DOI: 10.3390/ijms17040556
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Figure 1Schematic workflow for the transcriptomics and peptidomics analyses of the venom from the Chinese yellow catfish. ORFs, open reading frames; SignalP, signal peptides.
Figure 2Length distribution of contigs. The horizontal coordinate represents contig length and the vertical coordinates stand for the number of contigs.
Figure 3Length distribution of unigenes. The horizontal coordinate represents unigene length and the vertical coordinate stands for number of unigenes.
Summary of sequence numbers in our achieved Toxin Database.
| Group of Species | Taxonomy Name | Numbers of Sequences |
|---|---|---|
| Snakes | Serpents | 1406 |
| Scorpions | Scorpions | 1510 |
| Spiders | Araneae | 1047 |
| Cone snails | Conus | 3860 |
| Sea anemones | Actiniaria | 308 |
| Insects | Hexapoda | 162 |
| Fish | Teleostei | 31 |
| Mammals | Mammalias | 106 |
| Lizards | Heloderma | 241 |
| Jellyfish | Cubomedusae/Scyphozoa | 175 |
| Sea stars | Asteroidea | 8 |
| Hydra | Hydroida | 14 |
| Worms | Cerebratulus | 5 |
| Forg ,Toad | Amphibia | 64 |
| Sea-urchin | Echinoidea | 2 |
| Sea hare | Aplysia | 22 |
| Annelida | 11 | |
| Scolopendra | Myriapoda | 9 |
| All | Metazoa | 8863 |
Figure 4The statistics for SignalP, HMM search, LC-MS/MS and their mutual overlaps.
Figure 5The putative Veficolin toxin sequences from Chinese yellow catfish. Sequences in red background are signal peptides. Peptides in yellow background are verified by LC-MS/MS analysis. The amino acid Glycine are marked in green color. The G-X-Y repeats are included in the red box. Dashed lines means the absence of corresponding amino acid sequences.
Figure 6The putative Ink toxin sequences from Chinese yellow catfish. Sequences in the red background are signal peptides. Peptides in the yellow background are verified by LC-MS/MS analysis. The red underlined regions are the FAD domains.
Figure 7The putative Adamalysin toxin sequences from Chinese yellow catfish. Sequences in red background are signal peptides. Peptides in yellow background are verified by LC-MS/MS analysis. The red box includes the elongated zinc binding consensus sequence HEXXHXXGXXH. Dashed lines means the absence of corresponding amino acid sequences.
Figure 8The putative Za2G toxin sequences from Chinese yellow catfish. Sequences in red background are signal peptides. Peptides in yellow background are verified by LC-MS/MS analysis. The red underlined domain are highly homologous to the MHC_I superfamily. Dashed lines means the absence of corresponding amino acid sequences.
Figure 9The putative CRISP toxin sequences from Chinese yellow catfish. Sequences in red background are signal peptides. Peptides in yellow background are verified by LC-MS/MS analysis. The amino acid Cysteine is marked in green. Dashed lines means the absence of corresponding amino acid sequences.