| Literature DB >> 21605378 |
Jordi Durban1, Paula Juárez, Yamileth Angulo, Bruno Lomonte, Marietta Flores-Diaz, Alberto Alape-Girón, Mahmood Sasa, Libia Sanz, José M Gutiérrez, Joaquín Dopazo, Ana Conesa, Juan J Calvete.
Abstract
BACKGROUND: A long term research goal of venomics, of applied importance for improving current antivenom therapy, but also for drug discovery, is to understand the pharmacological potential of venoms. Individually or combined, proteomic and transcriptomic studies have demonstrated their feasibility to explore in depth the molecular diversity of venoms. In the absence of genome sequence, transcriptomes represent also valuable searchable databases for proteomic projects.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21605378 PMCID: PMC3128066 DOI: 10.1186/1471-2164-12-259
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Identity and relative abundance of venom protein entries identified in the whole 454 read sequence dataset of the 8 Costa Rican snake venom gland transcriptomes
| Number of reads | % of total venom protein entries | |
|---|---|---|
| Bradykinin potentiating peptide (BPP) | 9231 | 14.8 |
| Cysteine-Rich Secretoy Peptide (CRISP) | 1066 | 1.7 |
| C-type lectin-like protein (CTL) | 1039 | 1.6 |
| Growth factor (GF) | 789 | 1.2 |
| L-amino acid oxidase (LAO) | 2535 | 4.0 |
| Phospholipase A2 (PLA2) | 7065 | 11.3 |
| Metalloproteinase (SVMP) | 26646 | 42.7 |
| Serine Proteinase (SP) | 10019 | 16.0 |
| 5'-nucleotidase (5'-NTase) | 374 | 0.6 |
| Phosphodiesterase (PDE) | 119 | 0.2 |
| Glutaminyl cyclase (GC) | 170 | 0.3 |
| Cobra Venom Factor (CVF) | 8 | 0.01 |
| Crotamine (CRO) | 22 | 0.04 |
| Sarafotoxin (SARA) | 3 | 0.005 |
| Waprin (WAP) | 26 | 0.04 |
| Kunitz-type inhibitor (KUN) | 21 | 0.03 |
| Kazal-type inhibitor (KAZ) | 21 | 0.03 |
| Hyaluronidase (HYA) | 24 | 0.04 |
| Ohanin (OHA) | 2412 | 3.9 |
| Three-Finger Toxin (3FTx) | 845 | 1.3 |
Their relative abundance in each transcriptome is displayed in Table 2.
Relative contribution of the different venom protein family hits in each of the Costa Rican snake venom gland transcriptome
| Reads | % | Reads | % | Reads | % | Reads | % | Reads | % | Reads | % | Reads | % | Reads | % | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| BPP | 178 | 13.4 | 4232 | 15.0 | 169 | 12.2 | 818 | 8.3 | 1634 | 16.4 | 406 | 13.0 | 444 | 15.3 | 1350 | 23.6 |
| CRISP | 0 | 0 | 245 | 0.8 | 3 | 0.2 | 253 | 2.6 | 261 | 2.6 | 17 | 0.5 | 105 | 3.6 | 182 | 3.2 |
| CTL | 32 | 2.4 | 287 | 1.0 | 19 | 1.3 | 23 | 0.2 | 476 | 4.8 | 15 | 0.5 | 7 | 0.2 | 180 | 3.1 |
| GF | 60 | 4.5 | 275 | 0.9 | 38 | 2.7 | 64 | 0.6 | 56 | 0.6 | 50 | 1.6 | 142 | 4.9 | 103 | 1.8 |
| LAO | 50 | 3.8 | 1197 | 4.2 | 33 | 2.4 | 537 | 5.5 | 308 | 3.1 | 121 | 3.9 | 92 | 3.2 | 197 | 3.4 |
| PLA2 | 161 | 12.1 | 5026 | 17.8 | 195 | 14.2 | 777 | 7.9 | 228 | 2.3 | 224 | 7.2 | 278 | 9.6 | 176 | 3.0 |
| SVMP | 86 | 6.5 | 11733 | 41.6 | 559 | 40.6 | 4144 | 42.2 | 5583 | 56.1 | 1204 | 38.7 | 762 | 26.3 | 2575 | 44.9 |
| SP | 373 | 28.1 | 3790 | 13.4 | 114 | 8.3 | 2770 | 28.1 | 1089 | 10.9 | 692 | 22.2 | 594 | 20.5 | 597 | 10.4 |
| 5'-NTase | 3 | 0.2 | 214 | 0.7 | 3 | 0.2 | 67 | 0.7 | 13 | 0.1 | 27 | 0.8 | 5 | 0.17 | 42 | 0.7 |
| PDE | 3 | 0.2 | 56 | 0.2 | 3 | 0.2 | 13 | 0.1 | 8 | 0.08 | 0 | 0 | 2 | 0.06 | 33 | 0.6 |
| GC | 5 | 0.4 | 108 | 0.4 | 4 | 0.3 | 32 | 0.3 | 13 | 0.1 | 7 | 0.2 | 0 | 0 | 1 | 0.02 |
| CVF | 2 | 0.15 | 2 | 0.006 | 0 | 0 | 1 | 0.01 | 0 | 0 | 2 | 0.06 | 1 | 0.03 | 0 | 0 |
| CRO | 0 | 0 | 10 | 0.03 | 4 | 0.3 | 2 | 0.02 | 4 | 0.04 | 0 | 0 | 1 | 0.03 | 1 | 0.02 |
| SARA | 1 | 0.07 | 2 | 0.006 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| WAP | 0 | 0 | 26 | 0.09 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| KUN | 2 | 0.15 | 10 | 0.03 | 3 | 0.2 | 0 | 0 | 1 | 0.01 | 0 | 0 | 4 | 0.12 | 1 | 0.02 |
| KAZ | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 9 | 0.3 | 12 | 0.2 |
| HYA | 3 | 0.2 | 7 | 0.02 | 1 | 0.07 | 4 | 0.04 | 1 | 0.01 | 1 | 0.03 | 5 | 0.17 | 2 | 0.03 |
| OHA | 199 | 15.0 | 779 | 2.8 | 165 | 11.9 | 256 | 2. | 233 | 2.3 | 254 | 8.2 | 338 | 11.6 | 188 | 3.3 |
| 3FTx | 169 | 12.7 | 221 | 0.8 | 65 | 4.7 | 63 | 0.6 | 43 | 0.4 | 89 | 2.8 | 104 | 3.6 | 91 | 1.6 |
Protein family names are abbreviated as in Table 1.
Figure 1Gene Ontology annotation of the non-toxin . The figure represents the combination of the reads from all eight species. However, each species transcriptome exhibited similar relative expression levels of GO-annotated non-toxin transcript classes.
Figure 2Calculation of the minimum number of gene copies. Multiple alignment of the top six SVMP transcripts of B. asper (Car) (Additional file 1: Table S6) using the sequence (top) of the most similar database-annotated toxin sequence as template. Each line of the multiple sequence alignment displays a distinct set of contig(s), comprised by a unique set of reads indicated in parentheses (see also Additional file 1: Table S5). Since the short average length of the reads and the low coverage of reads per contig prevented the assemblage of reliable gene sequences, each line of the alignment corresponds to at least a distinct gene of the SVMP multigene family translated into the venom gland transcriptome of B. asper (car).
Estimation of the minimum number of toxin family gene copies translated in the venom gland transcriptomes of Costa Rican snakes
| BPP | 1 | 1 | 1 | 2 | 1 | 1 | 4 | 2 |
| CRISP | 0 | 2 | 1 | 2 | 4 | 1 | 2 | 1 |
| CTL | 2 | 5 | 2 | 3 | 9 | 1 | 0 | 5 |
| GF | 2 | 5 | 1 | 3 | 3 | 1 | 1 | 1 |
| LAO | 3 | 2 | 2 | 4 | 5 | 2 | 3 | 3 |
| PLA2 | 3 | 9 | 4 | 4 | 2 | 2 | 1 | 3 |
| SVMP | 9 | 29 | 5 | 19 | 15 | 4 | 14 | 20 |
| SP | 6 | 15 | 1 | 13 | 7 | 6 | 8 | 11 |
| 5'-NTase | 1 | 3 | 1 | 2 | 2 | 2 | 1 | 3 |
| PDE | 1 | 1 | 1 | 2 | 2 | 0 | 1 | 2 |
| GC | 1 | 2 | 1 | 1 | 1 | 1 | 0 | 1 |
| WAP | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 |
| HYA | 2 | 2 | 0 | 1 | 1 | 1 | 0 | 1 |
| OHA | 0 | 0 | 0 | 0 | 1 | 1 | 0 | 0 |
| 3FTx | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| KUN | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 |
| KAZ | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 |
Protein family names are abbreviated as in Table 1.
Figure 3Cartesian graph and corresponding chart pie displaying the uneven distribution of the number of reads per contig among the 29 SVMP genes identified in the venom gland transcriptome of .
Figure 4Transcriptomes versus proteomes. Comparison of the protein composition of the venom of Costa Rican snakes reported from proteomic analysis (chart pies labelled "a") (Additional file 1: Table S4)49-52 and predicted from their venom gland transcriptomes (this work). Chart pies "b" display the relative occurrence of ORF-coding reads listed in Additional file 1: Table S4 normalized for the full-length DNA sequence of a canonical member of the protein family (%mol). Panels c show the relative abundance (mol%) of toxin families as in panels "b" but computing only toxins previously identified in the venom proteome49-52. Chart pies depicted in panels d show the relative composition (reads%) of all venom protein family hits in each of the Costa Rican snake venom gland transcriptome (Table 2). Protein family names are abbreviated as in Table 1.
Figure 5Principal Component Analysis (PCA) of the Costa Rican snake venom gland transcriptomes (A, B) and the corresponding proteomes (C, D). Panels A and C show, respectively, the contributions to PC1 and PC2 of the different toxin families of the transcriptomes and the proteomes. Panels B and D, score plots displaying the segregation of the transcriptomes (B) and the proteomes (D) into different categories. In B, PC1 and PC2 contribute equally and together explain 65% of the observed transcriptome variability; in D, PC1 and PC2 explain 70% of the variability among proteomes.
Figure 6Cladogram of phylogenetic alliances among Central American snakes inferred from comparison of concatenated consensus sequences for SVMP, SP, BPP, LAO and PLA. Numbers at branching points indicate the degree of sequence divergence (0.1 = 10% sequence divergence).