| Literature DB >> 33923919 |
Caterina Peggion1, Fiorella Tonello2.
Abstract
Snake venom phospholipases A2 (PLA2s) have sequences and structures very similar to those of mammalian group I and II secretory PLA2s, but they possess many toxic properties, ranging from the inhibition of coagulation to the blockage of nerve transmission, and the induction of muscle necrosis. The biological properties of these proteins are not only due to their enzymatic activity, but also to protein-protein interactions which are still unidentified. Here, we compare sequence alignments of snake venom and mammalian PLA2s, grouped according to their structure and biological activity, looking for differences that can justify their different behavior. This bioinformatics analysis has evidenced three distinct regions, two central and one C-terminal, having amino acid compositions that distinguish the different categories of PLA2s. In these regions, we identified short linear motifs (SLiMs), peptide modules involved in protein-protein interactions, conserved in mammalian and not in snake venom PLA2s, or vice versa. The different content in the SLiMs of snake venom with respect to mammalian PLA2s may result in the formation of protein membrane complexes having a toxic activity, or in the formation of complexes whose activity cannot be blocked due to the lack of switches in the toxic PLA2s, as the motif recognized by the prolyl isomerase Pin1.Entities:
Keywords: PLA2G1B; PLA2G2A; glycogen synthase kinase 3; myotoxins; neurotoxins; prolyl isomerase; secretory phospholipases A2; sequence alignment; short linear motifs; snake venom phospholipases A2
Year: 2021 PMID: 33923919 PMCID: PMC8073766 DOI: 10.3390/toxins13040290
Source DB: PubMed Journal: Toxins (Basel) ISSN: 2072-6651 Impact factor: 4.546
Figure 1Conservation of the structures of snake venom and mammalian sPLA2s. (A) 3D structures of a group I neuro-myotoxin (notexin of Notechis scutatus scutatus, PDB entry AE7) and of human PLA2G1B (PDB entry 3ELO) and their superposition. (B) 3D structures of a group II myotoxin (bothropstoxin-I of Bothrops jararacussu, PDB entry 3I3I) and of human PLA2G2A (PDB entry 1BBC) and their superposition. The calcium binding loop region is colored in grey. PLA2G1B has an extra loop, named the pancreatic loop, represented in magenta. The lateral chains of the amino acids involved in the active site are represented in ball and sticks: H48 in yellow, D49 in orange. Bothropstoxin-I is a PLA2-like toxin (not-D49) with a lysine (in green) in the place of the glutamic acid 49.
Figure 2Consensus sequences analyses of snake venom PLA2s and their mammalian homologs. (A) Consensus sequences of group I snake venom neuro-myotoxic and neurotoxic PLA2s compared with that of mammalian PLA2G1B. (B) Consensus sequences of group II snake venom myotoxic, neuro-myotoxic, and neurotoxic PLA2s compared with that of mammalian PLA2G2A. In the scheme, myotoxins and neuro-myotoxins are divided in the “not D” group and “D” group, according to the absence or presence of the D amino acid in the position known as “49” of the active site, respectively. The sequences were collected from the Swiss-Prot database and aligned with the align tool of UniProt (Clustal O, https://www.uniprot.org/align/, accessed on 1 March 2021). The alignments were then visualized with SnapGene Viewer version 4.3.11. Conserved residues are reported in the consensus sequence, whereas the X code indicates an amino acid. The colored bars above the protein sequence show the degree of conservation, where a red bar indicates the maximum level of conservation (100%). Green, yellow, and grey squares represent α-helix, β-sheet, and calcium binding loop, respectively. The pancreatic loop (pink square, panel A) is present only in mammalian PLA2G1B. Blue circles indicate amino acids involved in the Ca2+ binding, whereas red circles represent the catalytic sites. The alignments are reported in the Supplementary Material.
Figure 3Mean amino acidic composition of the two low complexity regions of sPLA2s. Analyses of the mean amino acidic compositions of the central regions (55–85) (panel A and C) and of the C-terminal stretches (from 103) (panel B and D) belonging to toxins of group I and mammalian PLA2G1B (panel A and B) and to toxins of group II and mammalian PLA2G2B (panel C and D), respectively. Evaluation of amino acidic composition was performed for each sequence using the ProtParam tool (https://web.expasy.org/protparam/, accessed on 1 March 2021), and then the mean value for each amino acid was calculated and reported in the bar diagram. Amino acids reported in the x-axis were separated based on their chemical properties (S,T = phosphorylable sites of ser/thr kinases; FW: aromatic; DE: negatively charged at neutral pH; KR: positively charged at neutral pH; NQ: amine-added amino acids; Y: aromatic phosphorylable sites of Tyr kinases; Others: A, G, V, L, I, H, P, C, M).
Short Linear Motifs (SLiM) conserved in exposed regions of snake venom Group I.
| Motif | ELM Class | Sequence Section | Fraction of Proteins Containing the Motif in the Specified Section | ||
|---|---|---|---|---|---|
| Neuro-Myotoxins | Neurotoxins | PLA2G1B | |||
| N-terminal | DEG_Nend_UBRbox_3 | 1 | 7/7 | 14/14 | 0/10 |
| NEK2 phosph. site | MOD_NEK2_1 | 31–36 | 0/7 | 0/14 | 9/10 |
| I-BAR binding site | LIG_IBAR_NPY_1 | 67–69 | 0/7 | 0/14 | 10/10 |
| SH2 binding site | LIG_SH2_STAP1 | 69–73 | 0/7 | 0/14 | 10/10 |
| GSK3 phosph. sites * | MOD_GSK3_1 | 67–80 | 0/7 | 1/14 | 10/10 |
| SH2 binding site | LIG_SH2_NCK_1 | 105–109 | 3/7 | 6/14 | 0/10 |
| PTB binding sites | LIG_PTB_Apo_2LIG_PTB_Phospho_1 | 104–111 | 1/7 | 9/14 | 0/10 |
* Three to four consecutives.
Short Linear Motifs (SLiM) conserved in exposed regions of snake venom Group II.
| Motif | ELM Class | Sequence Section | Fraction of Proteins Containing the Motif in the Specified Section | |||||
|---|---|---|---|---|---|---|---|---|
| Myotoxins | Neuro-Myotoxins | Neurotoxins | PLA2G2A | |||||
| NOT D49 | D49 | NOT D49 | D49 | D49 | ||||
| Pin1 site, and | DOC_WW_Pin1_4 | 35–37 | 0/25 | 0/14 | 1/5 | 1/9 | 4/12 | 10/10 |
| SH2 binding site | LIG_SH2_CRK | 51–55 | 23/25 | 9/14 | 3/5 | 6/9 | 9/12 | 0/10 |
| PKA phosph. site | MOD_PKA_1 | 52–56 | 23/25 | 3/14 | 3/5 | 1/9 | 0/12 | 0/10 |
| FHA binding site | LIG_FHA_1 | 59–65 | 0/25 | 0/14 | 0/5 | 2/9 | 4/12 | 10/10 |
| Pin1 site and | DOC_WW_Pin1_4 | 119–121 | 0/25 | 0/14 | 0/5 | 0/9 | 0/12 | 7/10 |
| PDZ binding site | LIG_PDZ_Class_3 | 119–121 | 23/25 | 9/14 | 5/5 | 5/9 | 10/12 | 0/10 |
Short Linear Motifs (SLiM) description.
| ELM Class Identifier | Regular Expression | Interaction Partner(s) | Examples of Proteins * Containing the SLiM |
|---|---|---|---|
| DEG_Nend_ UBRbox_3 | ^M{0,1}([NQ]) | Aminohydrolases for deamidation | - |
| DOC_WW_Pin1_4 | ...([ST])P. | Peptidyl prolyl isomerase 1 | Many |
| LIG_FHA_1 | ..(T)..[ILV]. | Proteins containing the forkhead-associated domain, e.g., TIFA, TIFAB, AGGF1 | Kinesin interactors, TIFA |
| LIG_IBAR_NPY_1 | NPY | I-BAR domain-containing proteins, involved in membrane dynamic | The bacterial protein Tir, SHANK2 |
| LIG_PDZ_Class_3 | ...[DE].[ACVILF]$ | PDZ containing proteins, PDZ domains also bind to phospholipid headgroups | Many PDZ ligands are membrane proteins |
| LIG_PTB_Apo_2 | (.[^P].NP.[FY].)|(.[ILVMFY].N..[FY].) | Proteins containing phosphotyrosine binding (PTB) domains, e.g., insulin receptor substrate 1 (IRS-1) | Integrins, LRP1 |
| LIG_SH2_CRK | (Y)[^EPILVFYW][^HDEW][PLIV][^DEW] | Proteins containing SH2 domain of the CRK family | Transmembrane receptors, |
| LIG_SH2_NCK_1 | (Y)[DESTNA][^GWFY][VPAI][DENQSTAGYFP] | Proteins containing SH2 domain of the NCK family | Transmembrane proteins, adapter protein docking 1 |
| LIG_SH2_STAP1 | (Y)[DESTA][^GP][^GP][ILVFMWYA] | Proteins containing SH2 domain of the STAP1 family | Transmembrane proteins, |
| MOD_CDK_SPK_2 | ...([ST])P[RK] | Cyclin-dependent kinases, proline directed kinase | Proteins involved in different biochemical pathways |
| MOD_GSK3_1 | ...([ST])...[ST] | Glycogen synthase kinase 3, needs priming, inhibitory | |
| MOD_NEK2_1 | [FLM][^P][^P]([ST])[^DEP][^DE] | Never in mitosis A (NimA)-related kinases | |
| MOD_PKA_1 | [RK][RK].([ST])[^P].. | cAMP-dependent protein kinase A, basophilic kinase | |
| MOD_ProDKin_1 | ...([ST])P.. | MAP Kinase, proline directed kinase |
* Proteins involved in lipid metabolism, inflammation, redox reactions, lipid transfer, or in membrane complexes.
Figure 4Main SLiMs that differentiates snake venom PLA2s from their mammalian homologues. (A) 3D structures of the snake venom neuro-myotoxin notexin, of human PLA2G1B and their superimposition. The S/T residues (represented in yellow) in the central region of PLA2G1B, belong to four superimposed motifs phosphorylable by GSK3. The tyrosine is represented in magenta form, together with two residues of the pancreatic loop. The motif of interaction with I-BARREL proteins: the loop colored in orange in the group I toxin contains, in numerous cases, an SH2 and PTB binding sites. (B) 3D structures of the snake venom myotoxin bothropstoxin-I, of human PLA2G2A and their superimposition. The S/T residue evidenced in yellow in PLA2G2A, when phosphorylated, allows isomerization of the adjacent proline by Pin1. The loop following the second α-helix in the toxin structure, colored in orange, contains a PKA phosphorylation site and an SH2 binding site. The third and last C-terminal amino acid of the toxins (colored in red) forms a PDZ binding motif. The PDB files used for this figure are the same as those described for Figure 1.