| Literature DB >> 32422990 |
Samira R Aili1, Axel Touchard2, Regan Hayward1, Samuel D Robinson3,4, Sandy S Pineda5, Hadrien Lalagüe2, Irina Vetter3, Eivind A B Undheim4,6,7, R Manjunatha Kini8,9, Pierre Escoubas10, Matthew P Padula1, Garry S A Myers1, Graham M Nicholson1.
Abstract
A critical hurdle in ant venom proteomic investigations is the lack of databases to comprehensively and specifically identify the sequence and function of venom proteins and peptides. To resolve this, we used venom gland transcriptomics to generate a sequence database that was used to assign the tandem mass spectrometry (MS) fragmentation spectra of venom peptides and proteins to specific transcripts. This was performed alongside a shotgun liquid chromatography-mass spectrometry (LC-MS/MS) analysis of the venom to confirm that these assigned transcripts were expressed as proteins. Through the combined transcriptomic and proteomic investigation of Paraponera clavata venom, we identified four times the number of proteins previously identified using 2D-PAGE alone. In addition to this, by mining the transcriptomic data, we identified several novel peptide sequences for future pharmacological investigations, some of which conform with inhibitor cysteine knot motifs. These types of peptides have the potential to be developed into pharmaceutical or bioinsecticide peptides.Entities:
Keywords: DRG neurons; RP-HPLC; hyaluronidase; neurotoxins; paraponeritoxin; phospholipases
Mesh:
Substances:
Year: 2020 PMID: 32422990 PMCID: PMC7290781 DOI: 10.3390/toxins12050324
Source DB: PubMed Journal: Toxins (Basel) ISSN: 2072-6651 Impact factor: 4.546
Assembly statistics and downstream metrics from the P. clavata venom gland transcriptome analysis.
| After QC | Counts |
|---|---|
| Total number of reads (paired end) | 17,026,962 |
|
| |
| Total number of contigs | 54,242 |
| GC content | 37.3% |
| Raw reads mapped to contigs | 99.3% |
|
| |
| Total BLASTx hits | 37,140 |
| Assembled contigs with BLASTx hits | 32, 003 |
| Contigs with gene ontology annotation | 6732 |
| At least one ORF | 31,586 |
| * Toxins | 295 |
| Proportion of contigs that are toxins | 4.4% |
* Only toxins with a TPM ≥ 1 are included.
Figure 1Abundance and expression levels of P. clavata toxin and toxin-like transcripts. The heat map shows contigs classified into 17 categories. Toxin expression levels are represented by transcript per million (TPM) and are depicted by a colour scale with yellow being least abundant (≥1 TPM) and purple signifying highest expressed (see right-hand scale). The x-axis indicates the number of transcripts within each category. The “Protease” category is comprised of serine and aspartic proteases and cathepsins.
Top 20 most expressed (based on TPM) toxin and toxin-like transcripts from the P. clavata venom gland transcriptome BLASTx and Tox|Note searches. A complete list of all the BLASTx toxin and toxin-like transcripts can be found in Supplementary File S1.
| Toxin Name | BLASTx/ | NCBInr Accession | Species | Trinity ID | TPM | Toxin Category |
|---|---|---|---|---|---|---|
| δ-Paraponeritoxin-Pc1e_1 | Poneratoxin | P41736.1 |
| DN10254_c0_g26_i11 | 39,657 | Neurotoxic |
| δ-Paraponeritoxin-Pc1e_2 | DN9710_c0_g11_i2 | 9279 | ||||
| δ-Paraponeritoxin-Pc1e_3 | DN10254_c0_g26_i12 | 2414 | ||||
| δ-Paraponeritoxin-Pc1e_4 | DN10254_c0_g26_i5 | 797 | ||||
| Phospholipase-A2-1a_1-P-clavata | PREDICTED: phospholipase A2 | XP_011262712 |
| DN9894_c0_g8_i1 | 6328 | Phospholipase |
| Translationally-controlled-tumor-protein-1a-P-clavata | PREDICTED: translationally controlled tumour protein homolog | XP_014489127 |
| DN9555_c0_g10_i3 | 1229 | Inflammation |
| Icarapin-1a-P-clavata | PREDICTED: icarapin-like | XP_011863857 |
| DN9969_c1_g1_i4 | 969 | Allergenic |
| Serine-protease-1a-P-clavata | Serine proteinase stubble | EZA53191.1 |
| DN10218_c2_g11_i1 | 835 | Protease |
| Serine-protease-2a-P-clavata | DN10218_c2_g7_i1 | 364 | ||||
| Arginine-kinase-1a_1-P-clavata | PREDICTED: arginine kinase isoform X1 | XP_014467324 |
| DN9942_c0_g1_i6 | 382 | Neurotoxic |
| U1-Paraponeritoxin-Pc1a | PREDICTED: omega-conotoxin-like protein 1 | XP_011257508 |
| DN5808_c0_g1_i1 | 253 | Unknown |
| Serine-protease-3a-P-clavata | PREDICTED: serine protease easter-like isoform X2 | XP_014488937 |
| DN10218_c1_g1_i2 | 161 | Protease |
| Serine-protease-4a-P-clavata | Serine protease snake | KMR05322.1 |
| DN11547_c0_g1_i1 | 138 | Protease |
| Venom-allergen3-1a-P-clavata | PREDICTED: venom allergen 3-like | XP_012534360 |
| DN10202_c0_g4_i2 | 127 | Allergenic |
| Cysteine-rich protein-1a-P-clavata | Cysteine-rich protein 1 | KYN02327.1 |
| DN7413_c0_g4_i1 | 123 | Ion channel modulator |
| Serine-protease-1b-P-clavata | PREDICTED: serine protease 42-like isoform X3 | XP_014488938 |
| DN10218_c2_g8_i1 | 113 | Protease |
| Serine-protease-5a-P-clavata | DN10218_c2_g6_i7 | 103 | ||||
| Disintegrin-and-metalloproteinase-domain-containing-protein-1a-P-clavata | Disintegrin and metalloproteinase domain-containing protein | XP_012242558 |
| DN21256_c0_g1_i1 | 113 | Metalloproteinase |
| Disintegrin-and-metalloproteinase-domain-containing-protein-2a-P-clavata | EZA52689.1 |
| DN20154_c0_g1_i1 | 75 | ||
| Lysosomal-aspartic-protease-1a-P-clavata | PREDICTED: lysosomal aspartic protease | XP_014468660 |
| DN7128_c0_g1_i1 | 111 | Protease |
| Defensin-2-like-1a-P-clavata | PREDICTED: defensin-2-like | XP_012537462 |
| DN5917_c0_g1_i1 | 89 | Antibacterial |
| Serine-protease-inhibitor-1a-P-clavata | PREDICTED: serine protease inhibitor 3/4-like isoform X2 | XP_012539061 |
| DN9965_c0_g10_i1 | 86 | Protease |
| Serine-protease-6a-P-clavata | PREDICTED: venom serine protease-like | XP_014477136 |
| DN9366_c0_g3_i1 | 85 | Protease |
| Serine-protease-7a-P-clavata | PREDICTED: serine protease 52-like | XP_011641802 |
| DN10218_c2_g11_i2 | 84 | Protease |
| Serine-protease-8a-P-clavata | PREDICTED: serine protease snake-like | XP_011142823 |
| DN9987_c0_g2_i9 | 83 | Protease |
| Cathepsin-L-1a-P-clavata | Cathepsin L | EFN75465.1 |
| DN9254_c0_g3_i1 | 77 | Protease |
Figure 2Protein categories detected in P. clavata venom. P. clavata contigs were identified by mapping the proteome to the transcriptome. Annotation from BLASTx was used to place each contig into one of the above categories. The green portion shows the proportion of toxin-like contigs (10%).
Paraponera clavata toxins identified by proteomic analysis and their BLASTx hit. The table shows transcripts which had at least two peptides from the shotgun MS/MS analysis identified using PEAKS. Also included are the BLASTx match (or BlastP if no match by BLASTx) for each transcript. Additional information regarding transcripts, accessions and full protein names can be found in Supplementary File S2. The presence (Y), or absence (N), of a predicted signal peptide is denoted in the last column of the table
| Toxin Name | BLASTx Hit | Species | Trinity ID | Unique Peptide | % Cover | −10lgP | TPM | SigP (y/n) |
|---|---|---|---|---|---|---|---|---|
| Hyalyronidase-1a_1-P-clavata | PREDICTED: hyaluronidase-like |
| DN10062_c0_g13_i9 | 70 | 65 | 333 | 38 | Y |
| Hyalyronidase-1a_2-P-clavata | DN10062_c0_g13_i12 | 70 | 65 | 333 | 23 | Y | ||
| Hyalyronidase-1a_3-P-clavata | DN10062_c0_g13_i7 | 70 | 65 | 333 | 3 | Y | ||
| Hyalyronidase-1a_4-P-clavata | DN10062_c0_g13_i16 | 70 | 65 | 333 | 3 | Y | ||
| Hyalyronidase-1a_5-P-clavata | DN10062_c0_g13_i15 | 70 | 65 | 333 | 1 | Y | ||
| Hyalyronidase-1a_6-P-clavata | DN10062_c0_g13_i3 | 70 | 65 | 333 | 1 | Y | ||
| Hyalyronidase-1a_7-P-clavata | DN10062_c0_g13_i10 | 70 | 65 | 333 | 1 | Y | ||
| Hyalyronidase-1a_8-P-clavata | DN10062_c0_g13_i5 | 70 | 65 | 333 | 1 | Y | ||
| Venom-dipeptidyl-peptidase-1a_1-P-clavata | PREDICTED: venom dipeptidyl peptidase 4 |
| DN8858_c0_g1_i1 | 26 | 29 | 281 | 60 | N |
| Venom-dipeptidyl-peptidase-1a_2-P-clavata | PREDICTED: venom dipeptidyl peptidase 4 | DN8858_c0_g1_i2 | 26 | 29 | 281 | 21 | N | |
| δ-Paraponeritoxin-Pc1e_2 | Poneratoxin |
| DN9710_c0_g11_i2 | 20 | 45 | 260 | 9279 | Y |
| δ-Paraponeritoxin-Pc1e_5 | DN10254_c0_g26_i8 | 20 | 45 | 260 | 2 | Y | ||
| δ-Paraponeritoxin-Pc1e_6 | DN10254_c0_g26_i3 | 20 | 45 | 260 | 0 | Y | ||
| Phospholipase-A2-1a_1-P-clavata | PREDICTED: phospholipase A2 |
| DN9894_c0_g8_i1 | 24 | 48 | 235 | 6328 | Y |
| Phospholipase-A2-1a_2-P-clavata | DN9894_c0_g2_i2 | 24 | 48 | 235 | 7 | Y | ||
| Venom-allergen3-1a-P-clavata | PREDICTED: venom allergen 3-like |
| DN10202_c0_g4_i2 | 14 | 44 | 198 | 127 | Y |
| Venom-allergen3-1b-P-clavata | DN10202_c0_g4_i1 | 14 | 53 | 198 | 1 | Y | ||
| Carboxypeptidase-Q-1a_1-P-clavata | PREDICTED: carboxypeptidase Q |
| DN8288_c0_g1_i2 | 12 | 22 | 180 | 3 | N |
| Carboxypeptidase-Q-1a_2-P-clavata | DN8288_c0_g1_i1 | 12 | 23 | 180 | 2 | N | ||
| Arginine-kinase-1a_1-P-clavata | PREDICTED: arginine kinase isoform X1 |
| DN9942_c0_g1_i6 | 10 | 32 | 169 | 382 | Y |
| Arginine-kinase-1a_2-P-clavata | DN9942_c0_g1_i1 | 10 | 32 | 169 | 4 | N | ||
| Pancreatic-lipase-1a-P-clavata | Pancreatic lipase-related protein 2 |
| DN10238_c1_g31_i1 | 8 | 23 | 169 | 15 | Y |
| Pancreatic-lipase-1b-P-clavata | DN10238_c1_g10_i1 | 8 | 25 | 169 | 3 | Y | ||
| Serine-protease-6a-P-clavata | PREDICTED: venom serine protease-like |
| DN9366_c0_g3_i1 | 6 | 10 | 135 | 85 | Y |
| Matrix-metalloproteinase-14-like-1a-P-clavata | Matrix metalloproteinase-14 |
| DN18586_c0_g1_i1 | 3 | 23 | 104 | 5 | N |
| Serine-protease-inhibitor-1a-P-clavata | PREDICTED: serine protease inhibitor 3/4-like isoform X2 |
| DN9965_c0_g10_i1 | 4 | 11 | 97 | 86 | Y |
| Alaserpin-X5-like-1a-P-clavata | PREDICTED: alaserpin-like isoform X5 |
| DN9965_c0_g10_i2 | 4 | 12 | 97 | 18 | Y |
| Pancreatic-triacylglycerol-lipase-X1-like-1a-P-clavata | PREDICTED: pancreatic triacylglycerol lipase-like isoform X1 |
| DN8997_c0_g2_i2 | 3 | 6 | 93 | 5 | Y |
| Pancreatic-triacylglycerol-lipase-X1-like-2a-P-clavata | DN8997_c0_g2_i3 | 1 | 3 | 37 | 2 | Y | ||
| Pancreatic-triacylglycerol-lipase-X1-like-3a-P-clavata | DN8997_c0_g2_i1 | 1 | 8 | 37 | 1 | Y | ||
| Venom-acid-phosphatase-1a-P-clavata | PREDICTED: venom acid phosphatase Acph-1 |
| DN5022_c0_g1_i1 | 3 | 7 | 63 | 5 | N |
| Iron-zinc-purple-acid phosphatase-like-1a_1-P-clavata | Iron zinc purple acid phosphatase-like protein |
| DN8672_c0_g1_i2 | 2 | 4 | 55 | 17 | N |
| Iron-zinc-purple-acid phosphatase-like-1a_2-P-clavata | DN8672_c0_g1_i1 | 2 | 4 | 55 | 9 | N | ||
| Alkaline-phosphatase-1a_1-P-clavata | PREDICTED: alkaline phosphatase 4-like |
| DN9670_c0_g1_i1 | 1 | 2 | 51 | 24 | N |
| Alkaline-phosphatase-1a_2-P-clavata | Alkaline phosphatase 4 |
| DN9670_c0_g1_i2 | 1 | 2 | 51 | 2.3 | N |
| Neprilysin-11-isoform-X2-1a_1-P-clavata | PREDICTED: neprilysin-11 isoform X2 |
| DN10125_c0_g9_i1 | 2 | 4 | 43 | 4.5 | Y |
| Neprilysin-11--isoform-X2-1a_2-P-clavata | PREDICTED: neprilysin-11 isoform X2 | DN10125_c0_g18_i1 | 2 | 4 | 43 | 1.7 | Y | |
| Matrix-metalloproteinase-14-like-2a-P-clavata | PREDICTED: matrix metalloproteinase-14 isoform |
| DN21699_c0_g1_i1 | 2 | 6 | 41 | 1.9 | Y |
| Protein-5NUC-1a-P-clavata | PREDICTED: protein 5NUC-like |
| DN10097_c1_g11_i1 | 1 | 2 | 37 | 25.2 | N |
| Icarapin-1a-P-clavata | PREDICTED: icarapin-like |
| DN9969_c1_g1_i4 | 1 | 7 | 32 | 969 | N |
| Cytosolic-dipeptidase-1a-P-clavata | PREDICTED: cytosolic non-specific dipeptidase |
| DN9768_c0_g1_i2 | 1 | 2 | 30 | 1.8 | Y |
| Phenoloxidase-2-like-1a-P-clavata | PREDICTED: phenoloxidase 2 |
| DN15311_c0_g1_i1 | 1 | 4 | 24 | 3.2 | Y |
Figure 3Amino acid sequence alignment of δ-paraponeritoxin-Pc1e isoforms to published δ-paraponeritoxin sequences. Predicted signal peptide (www.cbs.dtu.dk/services/SignalP/) and the predicted propeptide are indicated by green and orange tubes above the sequences, respectively. Identical residues in the peptide sequences are boxed in yellow while conservative substitutions are shown in red italic text. Percentage identity (%I) is relative to δ-paraponeritoxin-Pc1e while percentage similarity (%S) includes conservatively substituted residues. “UniProtKB ID” indicates the UniProt Knowledge base ID code or source reference. Dashes were inserted to maximise alignment. “Origin” refers to the country where the venom was collected. “Former name” refers to the previously published name assigned to that sequence. Previously published sequences (δ-paraponeritoxin-Pc1b, -Pc1c and -Pc1d) represent isoforms of -Pc1a (formerly poneratoxin) [4,10].
Figure 4δ-Paraponeritoxin-Pc1e isolation and sensory neuron assay. (A) RP-HPLC of P. clavata venom revealed a relatively simple profile, with a large late-eluting peak corresponding to δ-paraponeritoxin-Pc1e. (B) δ-paraponeritoxin-Pc1e was purified from the venom (Figure S3 illustrates the match to the δ-paraponeritoxin-Pc1e contig). (C) Addition of whole P. clavata venom (100 µg/mL) to dissociated mouse DRG cells produced an increase in [Ca2+]i in all DRG cells, which was particularly pronounced in neuronal cells, and was followed by some dye leakage from a small proportion of cells (see the right hand panels in C representing a snapshot at 180 s). An equivalent amount of isolated δ-paraponeritoxin-Pc1e produced an increase in [Ca2+]i specifically in excitable cells. Each trace represents a single DRG cell in the field of view. Snapshots shown of the recording are at 0 s (baseline) and 33 s (3 s after addition of venom or δ-paraponeritoxin-Pc1e). Scale bar: 100 µm.
Figure 5Amino acid alignment of P. clavata PLA2 isoforms. This figure shows the alignment of P. clavata transcripts with PLA2-like proteins from the ant species Camponotus floridanus, Wasmannia auropunctata, Monomorium pharaonis and the red flour beetle Tribolium castaneum. The predicted signal peptide (green tube) is indicated above the sequences. Residues identical to the P. clavata peptide sequences are boxed in yellow, while conservative substitutions are shown in red italic text. Cysteines are boxed in black. Gaps were introduced to optimize the alignments. The blue triangle indicates the predicted N-terminus [23]. The underlined coloured regions indicate the Ca2+-binding loop (blue), active site (orange) and conserved region (green) [23,26,28]. The red triangles indicate residues critical for calcium binding and the green triangles indicate the active site residues [23,26,29,30]. Percentage identity (%I) is relative to the first peptide of each family, while percentage similarity (%S) includes conservatively substituted residues.
Figure 6Distribution of novel P. clavata toxin-like peptides with four or more cysteines. Predicted peptide toxins were identified by Tox|Note. Upon closer inspection of the sequences that contained six cysteines, seven peptides were found that adhered to a canonical inhibitor cysteine knot (ICK) framework similar to the conotoxin VI/VII framework as shown in Table 4. There were also other peptides conforming to the ICK framework, however, they had eight and 10 cysteines.
Predicted P. clavata peptides conforming to ICK or conotoxin cysteine framework I frameworks. This table shows peptides identified using Tox|Note and their corresponding expression level indicated by TPM. However, peptides containing eight or more cysteines that conform to the ICK framework typically have four cysteines after the “CC” doublet, where two of the cysteines turn the fourth loop into a disulfide-stabilised hairpin. P. clavata peptides with eight or more cysteines may therefore represent putatively novel cysteine-rich scaffolds.
| Number of Cysteines | Scaffold | Mature Peptide Length | TPM |
|---|---|---|---|
| 4 | x8CCx1Cx19Cx5 | 37 | 772 |
| x1CCx34Cx1Cx1 | 41 | 18.4 | |
| x5CCx1Cx16Cx14 | 40 | 16.5 | |
| CCx23Cx7Cx6 | 40 | 13 | |
| x14CCx11Cx7Cx33 | 69 | 10.8 | |
| CCx1Cx27Cx12 | 44 | 7.1 | |
| x3CCx6Cx15Cx9 | 37 | 2.2 | |
| x6CCx7Cx11Cx8 | 36 | 0.76 | |
| x8CCx5Cx12Cx6 | 35 | 0.3 | |
| 6 | Cx6Cx5CCx4Cx6Cx29 | 56 | 43.8 |
| x18Cx6Cx3CC8Cx24Cx7 | 72 | 9.3 | |
| x7Cx8Cx21CCx5Cx22Cx20 | 89 | 5.9 | |
| Cx6Cx5CCx4Cx6Cx29 | 56 | 3.9 | |
| x8Cx6Cx5CCx4Cx6Cx25 | 60 | 3.4 | |
| x8Cx6Cx5CCx4Cx6Cx7 | 42 | 2.6 | |
| 8 | x14Cx9Cx3Cx5Cx5CCx3Cx4Cx26 | 77 | 1.76 |
| 10 | x10Cx5Cx4CCx11Cx9Cx28Cx1Cx5Cx6Cx1 | 90 | 12.67 |