| Literature DB >> 27999389 |
Dehong Xu1,2, Xianchun Wang3.
Abstract
Latrodectus tredecimguttatus is a kind of highly venomous black widow spider, with toxicity coming from not only venomous glands but also other parts of its body as well as newborn spiderlings and eggs. Up to date, although L. tredecimguttatus eggs have been demonstrated to be rich in proteinaceous toxins, there is no systematic investigation on such active components at transcriptome level. In this study, we performed a high-throughput transcriptome sequencing of L. tredecimguttatus eggs with Illumina sequencing technology. As a result, 53,284 protein-coding unigenes were identified, of which 14,185 unigenes produced significant hits in the available databases, including 280 unigenes encoding proteins or peptides homologous to known proteinaceous toxins. GO term and KEGG pathway enrichment analyses of the 280 unigenes showed that 375 GO terms and 18 KEGG pathways were significantly enriched. Functional analysis indicated that these unigene-coded toxins have the bioactivities to degrade tissue proteins, inhibit ion channels, block neuromuscular transmission, provoke anaphylaxis, induce apoptosis and hyperalgesia, etc. No known typical proteinaceous toxins in L. tredecimguttatus venomous glands, such as latrotoxins, were identified, suggesting that the eggs have a different toxicity mechanism from that of the venom. Our present transcriptome analysis not only helps to reveal the gene expression profile and toxicity mechanism of the L. tredecimguttatus eggs, but also provides references for the further related researches.Entities:
Keywords: Illumina sequencing; Latrodectus tredecimguttatus; eggs; proteinaceous toxin; transcriptome
Mesh:
Substances:
Year: 2016 PMID: 27999389 PMCID: PMC5198572 DOI: 10.3390/toxins8120378
Source DB: PubMed Journal: Toxins (Basel) ISSN: 2072-6651 Impact factor: 4.546
Summary of RNA-sequencing and read assembly.
| Analysis of Read Assembly | Amount |
|---|---|
| Total number of reads | 47,970,296 |
| Total base pairs (bp) | 5,836,590,233 |
| Average read length (bp) | 121 |
| Total number of transcripts | 69,684 |
| Total number of unigenes | 53,284 |
| Average length of unigenes (bp) | 738 |
| Total number of unigenes >2000 bp in length | 4376 |
| Total number of unigenes annotated in at least one database | 14,185 |
Figure 1GO classification of the unigenes from L. tredecimguttatus eggs. Unigenes were annotated within three categories: biological process (A); molecular function (B); and cellular component (C). The x-axis represents the different categories. The number and percent of the unigenes matching GO annotation terms are presented on the y-axis.
Figure 2KOG classification of the egg unigenes: 9528 annotated unigenes were classified into 25 KOG categories. The x-axis represents the different KOG categories. The number and percent of unigenes matching KOG classification are presented on the y-axis.
Figure 3KEGG classification of the egg unigenes: 3141 unigenes were classified into 317 pathways. The different colors represent different KEGG categories. The number and percent of the unigenes matching KEGG classification are presented on the upper and lower axes, respectively.
Figure 4GO enrichment analysis scatterplot for the 280 unigenes encoding toxins. The size of black circle represents the unigene number. Different colors represent different p values for significance test.
Figure 5KEGG enrichment analysis scatterplot for the 280 unigenes encoding toxins. The size of black circle represents the unigene number. Different colors represent different p values for significance test.
Figure 6Distribution of the egg unigenes. Left pie graph shows the composition of the egg unigenes. “Not annotated” represents the unigenes not annotated in the available databases. Unigenes annotated to encode the proteins matching known toxins are labeled as “Putative toxins”, and those matching other proteins are labeled as “Non-toxins”. The number of the unigenes in each subcategory is given followed by its percentage shown in the bracket. Right pie graph is further classification of the putative toxins. Putative toxins are further divided into four types. The number of the unigenes in each type is given followed by its percentage shown in the bracket. ICK, inhibitor cystine knot.
Figure 7Primary and secondary structure analyses of the predicted ICK toxins. The amino acids forming an alpha helix are colored in pink. Yellow arrows represent β-sheet. Red and green rectangles indicate predicted signal peptides and propeptides, respectively. Blue lines show the connecting pattern of disulfide bonds.
Figure 8Sequence alignment and phylogenetic analysis of comp21602_c0_seq2. Alignment was performed by DNAman soft. Strictly conserved cysteins are indicated by deep blue and other less conserved amino acids are marked using different colors. Phylogenetic analysis was implemented in MEGA 3.1 using the neighbor-joining method. ArachnoServer accession numbers precede the species name for each sequence. Numbers at the nodes indicate the bootstrap values based on 10,000 replicates.
Figure 9Sequence alignment and phylogenetic analysis of comp25199_c0_seq1. Alignment was performed by DNAman soft. Strictly conserved cysteins are indicated by deep blue and other less conserved amino acids are marked using different colors. Phylogenetic analysis was implemented in MEGA 3.1 using the neighbor-joining method. ArachnoServer accession numbers precede the species name for each sequence. Numbers at the nodes indicate the bootstrap values based on 10,000 replicates.
Putative non-ICK toxins.
| Sequence ID | Signal Peptide | Score (Bits) | E-Value | Identity | BLAST Annotation |
|---|---|---|---|---|---|
| comp24914_c0_seq1 | N | 64.7 (156) | 8 × 10−11 | 34% | gb|AGA82764.1|toxin-like protein 14 precursor ( |
| comp6833_c0_seq1 | N | 92.0 (227) | 4 × 10−22 | 56% | gb|ABR21046.1|venom toxin-like peptide-6 ( |
| comp22890_c0_seq2 | N | 90.9 (224) | 9× 10−22 | 56% | gb|ABR21046.1|venom toxin-like peptide-6 ( |
| comp17051_c0_seq1 | N | 53.5 (127) | 3× 10−10 | 32% | as: U15-SYTX-Sth1a||1459 Translation of a toxin from the spider |
| comp25199_c0_seq1 | Y | 55.8 (133) | 7× 10−11 | 32% | as: U12-SYTX-Sth1a||1449 Translation of a toxin from the spider |
| comp21232_c0_seq1 | Y | 277 (709) | 5× 10−86 | 98% | gb|ADV40303.1|cystatin-like protein ( |
| comp20935_c0_seq1 | Y | 102 (255) | 7× 10−25 | 40% | as: U24-ctenitoxin-Pn1a|sp:P84032|Toxin from venom of the spider |
| comp213809_c0_seq1 | Y | 65.5 (158) | 1× 10−13 | 29% | as: U24-ctenitoxin-Pn1a|sp:P84032|Toxin from venom of the spider |
| comp27859_c0_seq1 | Y | 105 (263) | 8× 10−26 | 44% | as: U24-ctenitoxin-Pn1a|sp:P84032|Toxin from venom of the spider |
| comp96908_c0_seq1 | Y | 48.5 (114) | 1× 10−8 | 35% | as: U16-aranetoxin-Av1a_1||2248 Toxin from venom of the spider |
| comp5553_c0_seq1 | Y | 53.1 (126) | 9× 10−7 | 32% | sp|Q8MTX1|TXCA_CAEEX U3-aranetoxin-Ce1a OS = |
Figure 10Amino acid sequence alignment of comp31734_c0_seq1 and comp19276_c0_seq1: (A) Multisequence alignment of comp31734_c0_seq1 with AP00140 and its precursor (GenBank:GJ19999). The underlined amino acid residues indicate a putative signal peptide sequence. (B) Sequence alignment of comp19276_c0_seq1 with AP2030. Identical residues are shaded in blue in both (A) and (B).
Overview of the unigenes encoding toxin-like proteins or enzymes.
| Name | Number of Unigenes | Percent of Unigenes (%) | Putative Activities |
|---|---|---|---|
| Metalloprotease | 62 | 25 | Activating proteinogen or zymogen [ |
| Degradating tissue to facilitate the spreading of toxins [ | |||
| Serine protease | 55 | 22.2 | Activating proteinogen or zymogen [ |
| Degradating tissue to facilitate the spreading of toxins [ | |||
| Phospholipase | 41 | 16.5 | Neurotoxicity, myotoxicity, etc. [ |
| Serpin | 30 | 12.1 | Inhibiting degradation of proteinaceous toxins by protease [ |
| Acting on ion channels, e.g., K+ channel [ | |||
| Phosphatase | 18 | 7.3 | Assisting the liberation of purines [ |
| Cholinesterase | 12 | 4.8 | Blocking the neuromuscular transmission [ |
| Allergen/Lipocalin | 11 | 4.4 | Provoking anaphylaxis [ |
| Disrupting hemostasis [ | |||
| Chitinase | 10 | 4 | Degrading the chitin [ |
| CAP superfamily | 5 | 2 | Modulating ion channels [ |
| Plancitoxin-1 | 2 | 0.9 | Inducing apoptosis [ |
| Hyaluronidase | 1 | 0.4 | Enhancing tissue permeability to allow the spreading of toxins [ |
| Prokineticin/AVIT | 1 | 0.4 | Inhibiting the feeding or inducing hyperalgesia [ |