| Literature DB >> 34357975 |
Jean-Luc Gatti1, Maya Belghazi2, Fabrice Legeai3, Marc Ravallec4, Marie Frayssinet4, Stéphanie Robin3, Djibril Aboubakar-Souna4,5, Ramasamy Srinivasan6, Manuele Tamò5, Marylène Poirié1, Anne-Nathalie Volkoff4.
Abstract
Meteorus pulchricornis (Ichneumonoidea, Braconidae) is an endoparasitoid wasp of lepidopteran caterpillars. Its parasitic success relies on vesicles (named M. pulchricornis Virus-Like Particles or MpVLPs) that are synthesized in the venom gland and injected into the parasitoid host along with the venom during oviposition. In order to define the content and understand the biogenesis of these atypical vesicles, we performed a transcriptome analysis of the venom gland and a proteomic analysis of the venom and purified MpVLPs. About half of the MpVLPs and soluble venom proteins identified were unknown and no similarity with any known viral sequence was found. However, MpVLPs contained a large number of proteins labelled as metalloproteinases while the most abundant protein family in the soluble venom was that of proteins containing the Domain of Unknown Function DUF-4803. The high number of these proteins identified suggests that a large expansion of these two protein families occurred in M. pulchricornis. Therefore, although the exact mechanism of MpVLPs formation remains to be elucidated, these vesicles appear to be "metalloproteinase bombs" that may have several physiological roles in the host including modifying the functions of its immune cells. The role of DUF4803 proteins, also present in the venom of other braconids, remains to be clarified.Entities:
Keywords: Braconidae; DUF-4803 proteins; Meteorus pulchricornis; metalloproteases; parasitoid wasp; proteomic; transcriptomic; venomics; virus-like particles (VLPs)
Mesh:
Substances:
Year: 2021 PMID: 34357975 PMCID: PMC8310156 DOI: 10.3390/toxins13070502
Source DB: PubMed Journal: Toxins (Basel) ISSN: 2072-6651 Impact factor: 4.546
Figure 1The venom apparatus of Meteorus pulchricornis. From the female wasp (A) the venom apparatus (B) was obtained by pulling out the ovipositor (ovi; white arrow in A). The ovarioles (ov), the two filamentous venom glands (vgf) and the large unique milky venom reservoir (r) are clearly visible. (C) TEM observation of a cross section of a venom gland filament showing a glandular cell with the secretory cell glandular canal (cgc) at its center that ends in the lumen of the collecting gland duct (lu). The nucleus (nu) surrounded by its nuclear envelope had and apparent normal shape. (D) MpVLPs may derive from the small electron dense vesicles contained within large cytoplasmic vesicles (Lv) found associated with the Golgi apparatus (Gg). (E) The internal canal (cgc) of the secretory gland cell, lined by microvilli, is filled with secreted mature MpVLPs. (F) MpVLPs purified by centrifugation of venom collected from M. pulchricornis venom reservoir (MpVLPs average longer diameter 250 nm, n = 20). These vesicles were apparently single membraned vesicles (membrane mean width 30 ± 13 nm, n = 17).
Figure 2Species classified by the number of hits reported for M. pulchricornis CDS protein sequences against the NCBI NR database.
Sequencing coverage of predicted M. pulchricornis venom gland CDS (TPMv; number of sequences for given TPM (Transcripts Per Kilobase Million) ranges in venom gland transcriptome).
| TPMv Range | Number of Predicted Proteins |
|---|---|
| 75,000–10,000 | 15 |
| 9999–5000 | 24 |
| 4999–1000 | 88 |
| 999–500 | 118 |
| 499–100 | 480 |
| 99–50 | 477 |
| 49–25 | 852 |
| 24–10 | 2370 |
| 9–0 | 12,404 |
List of the 39 CDS highly expressed in M. pulchricornis venom gland (TPMv > 5000). CDS with TPMv/TPMa ratio > 500 are indicated in pink, in orange 500 > ratio > 200 and in green ratio < 50.
| CDS ID | Sequence Description * | Species | TPMv | TPMa | TPMv/ | Orthogroup | Total Proteins in the OG | |
|---|---|---|---|---|---|---|---|---|
| 60.p3 | BAL70305.1 hypothetical protein |
| 70,871 | 549 | 129 | |||
| 64.p1 | BAL70301.1 hypothetical protein |
| 31,382 | 115 | 272 | OG0042547 | 1 | 2 |
| 32.p1 | BAL70307.1 hypothetical protein |
| 30,408 | 124 | 245 | |||
| 338.p1 | BAL70305.1 hypothetical protein |
| 28,057 | 174 | 161 | OG0042502 | 1 | 2 |
| 1043.p6 | uncharacterized protein LOC115116642 |
| 18,777 | 111 | 169 | |||
| 7952.p1 | No hit | 17,559 | 133 | 132 | ||||
| 38.p1 | uncharacterized protein LOC103570348 |
| 15,273 | 94 | 162 | OG0042509 | 1 | 2 |
| 174.p2 | BAL70304.1 hypothetical protein |
| 15,034 | 133 | 113 | OG0019101 | 5 | 6 |
| 29.p1 | BAL70308.1 hypothetical protein |
| 13,879 | 66 | 210 | OG0042489 | 2 | 2 |
| 2896.p4 | No hit | 13,698 | 123 | 111 | OG0042486 | 1 | 2 | |
| 35.p1 | No hit | 13,271 | 131 | 101 | OG0021629 | 4 | 5 | |
| 4874.p12 | No hit | 12,545 | 103 | 121 | ||||
| 5.p1 | BAL70306.1 hypothetical protein |
| 12,037 | 74 | 162 | OG0019101 | 5 | 6 |
| 42.p1 | No hit | 10,282 | 56 | 183 | OG0042514 | 1 | 2 | |
| 57.p1 | No hit | 10,256 | 105 | 97 | OG0042539 | 2 | 2 | |
| 39.p1 | BAL70290.1 hypothetical protein |
| 9755 | 71 | 137 | OG0019101 | 5 | 6 |
| 121.p1 | BAL70308.1 hypothetical protein |
| 9660 | 104 | 93 | OG0025719 | 4 | 4 |
| 6996.p3 | No hit | 8834 | 57 | 155 | ||||
| 6621.p3 | No hit | 8315 | 73 | 114 | ||||
| 52.p1 | BAL70302.1 hypothetical protein |
| 8230 | 13 | 633 | OG0021663 | 4 | 5 |
| 281.p1 | No hit | 8078 | 17 | 475 | ||||
| 86.p1 | No hit | 8049 | 18 | 447 | OG0025738 | 4 | 4 | |
| 30.p1 | No hit | 7603 | 30 | 253 | OG0042491 | 1 | 2 | |
| 421.p1 | Low quality protein, fibrillin-1- partial |
| 7188 | 63 | 114 | OG0009646 | 17 | 20 |
| 186.p3 | No hit | 6991 | 129 | 54 | ||||
| 72.p1 | XP.023948444.1 uncharacterized protein LOC112053291 isoform X2 |
| 6968 | 36 | 193 | OG0015773 | 7 | 8 |
| 73.p1 | No hit | 6695 | 28 | 239 | OG0042561 | 1 | 2 | |
| 21.p1 | 5NUC-like |
| 6656 | 4 | 1664 | OG0031454 | 2 | 3 |
| 11189.p3 | BAL70308.1 hypothetical protein |
| 6606 | 42 | 157 | OG0025719 | 4 | 4 |
| 155.p1 | No hit | 5916 | 4 | 1479 | OG0009638 | 18 | 20 | |
| 2291.p1 | PREDICTED: location of vulva defective 1 X1 |
| 5903 | 145 | 40 | OG0004499 | 3 | 36 |
| 4719.p2 | No hit | 5735 | 20 | 286 | ||||
| 285.p2 | No hit | 5626 | 29 | 194 | ||||
| 114.p1 | Lipase-related 2 |
| 5395 | 31 | 174 | OG0010100 | 8 | 18 |
| 380.p3 | No hit | 5346 | 22 | 243 | ||||
| 3115.p10 | No hit | 5227 | 99 | 52 | ||||
| 49.p1 | BAL70302.1 hypothetical protein |
| 5214 | 10 | 521 | |||
| 932.p4 | No hit | 5076 | 436 | 11 | ||||
| 132.p1 | No hit | 5042 | 47 | 107 | OG0015773 | 7 | 8 |
* best hit protein blast E < 10−8; §, Mp: M. pulchricornis.
Figure 3Protein profile of total venom (TV) compared to the supernatant (Super) and the MpVLPs pellet fractions obtained after centrifugation. A total of 25 and 13 bands were cut off for TV and the washed 15,000 g Pellets (MpVLPs) lanes, respectively, to be submitted to the MS-MS analysis. Left and right lanes correspond to molecular weight markers (MW in kDa). 12% SDS-PAGE in reducing conditions and gel was silver stained.
Names and functions of the most abundant MpVLPs proteins. They are classified starting from the most abundant MpVLP protein using the Mascot “ranks” that ranged from 1 (the most abundant) to 516 (the less abundant) for MpVLPs and from 1 to 1354 for TV CDS/proteins, respectively.
| CDS ° ID | Gel Bands * | Names and Functions of MpVLPs Proteins | MpVLPs Mascot Score (Rank) | TV Mascot Score (Rank) | TPMv/TPMa | OG |
|---|---|---|---|---|---|---|
| 38 | TV8-9 | Contig containing 2 CDS: | 44,936 (1) | 40,770 (1) | 15,273/95 | OG0042509 |
| 35 | TV10 | No hit | 24,427 (2) | 28,317 (2) | 13,271/131 | OG0021629 |
| 29 | TV14-17 | Hypothetical protein [ | 13,954 (3) | 21,697 (4) | 13,879/67 | OG0042489 |
| 39 | TV13-17 | Hypothetical protein [ | 11,019 (4) | 9659 (11) | 9755/71 | OG0019101 |
| 281 | TV6-7 | No hit | 9946 (5) | 11,374 (10) | 8078/17 | na |
| 72 | TV7 | Uncharacterized protein LOC106655966 [ | 9842 (6) | 7277 (19) | 6968/36 | OG0015773 |
| 52 | TV14-15 | Hypothetical protein [ | 7157 (7) | 12,900 (8) | 8230/13 | OG0021663 |
| 1173 | TV21 | Hypothetical protein [ | 6728 (8) | 5156 (27) | 4196/19 | OG0019101 |
| 110 | TV7 | Hypothetical protein [ | 6535 (9) | 3149 (41) | 763/5 | OG0000843 |
| 64 | TV22-25 | Hypothetical protein [ | 6517 (10) | 15,886 (6) | 31,382/115 | OG0042547 |
| 42 | TV14-15 | No hit | 5465 (11) | 6617 (21) | 10,282/57 | OG0042514 |
| 16 | TV12-13 | A disintegrin and metalloproteinase with thrombospondin motifs 6-like [ | 5007 (12) | 12,463 (9) | 3528/17 | OG0005678 |
| 6 | TV18-19 | Contig containing 2 CDSs: | 4948 (13) | 4063 (37) | 2924/23 | OG0009646 |
| 267 | TV13-14 | Uncharacterized protein LOC105263636 [ | 4058 (14) | 5138 (28) | 2950/28 | OG0025736 |
| 263 | TV13-14 | Hypothetical protein [ | 3397 (15) | 4678 (33) | 1340/6 | OG0001872 |
| 109 | TV13-14 | Hypothetical protein [ | 2190 (16) | 1630 (85) | 522/1 | OG0001872 |
| 73 | TV23-24 | No hit | 2131 (17) | 7876 (16) | 6695/28 | OG0042561 |
| 285 | TV22 | No hit | 2076 (18) | 1592 (73) | 5625/28 | na |
| 230 | TV5-6 | Unknown. | 2071 (19) | 2654 (48) | 907/10 | OG0031457 |
| 668 | TV8-9 | Hypothetical protein [ | 1812 (20) | 2198 (57) | 676/6 | OG0000843 |
| 339 | TV6 | Hypothetical protein [ | 1726 (21) | 2811 (46) | 544/5 | OG0001872 |
| 126 | TV11 & TV22 | A disintegrin and metalloproteinase with thrombospondin motifs 18-like [ | 1566 (22) | 8290 (13) | 1684/1 | OG0000306 |
| 927 | TV8 | Uncharacterized protein LOC118072154 [ | 1531 (23) | 2074 (61) | 285/23 | OG0000306 |
| 5 | TV20-21 | Hypothetical protein [ | 1511 (24) | 7780 (17) | 12,037/74 | OG0019101 |
| 8 | TV18-19 | No hit | 1486 (25) | 5084 (29) | 1622/7 | OG0031468 |
| 1266 | TV6 | No hit | 1476 (26) | 1598 (72) | 285/1 | OG0000306 |
| 1636 | TV7 | No hit | 1463 (27) | 1578 (88) | 235/0.3 | OG0000843 |
| 875 | TV9 | No hit | 1376 (28) | 1296 (78) | 1114/8 | OG0031455 |
| 208 | TV12-13 | Hypothetical protein [ | 1368 (29) | 1575 (68) | 1051/5 | OG0001872 |
| 1290 | TV15 | Uncharacterized protein LOC118072154 [ | 1356 (30) | 2543 (57) | 188/3 | OG0005678 |
| 1198 | TV7 | Hypothetical protein [ | 1276 (31) | 635 (146) | 948/6 | OG0000306 |
| 2114 | P3-4 | No hit | 1259 (32) | 918 (108) | 480/3 | OG0000306 |
| 1039 | P2 | Fibrillin-2 like protein [ | 1086 (33) | 800 (125) | 386/2 | OG0009646 |
| 379.p2 | P3 | Hypothetical protein [ | 1025 (34) | 1057 (94) | 429/7 | OG0000843 |
| 647 | TV7 | Hypothetical protein [ | 1023 (35) | 1282 (80) | 242/2 | OG0000843 |
| 897 | TV3-4 | Venom protein 2 [ | 952 (36) | 21,507 (5) | 262/7 | OG0007440 |
| 293 | TV8 | Hyaluronidase [ | 934 (37) | 902 (110) | 363/1 | OG0005785 |
| 686 | P2 | No hit | 930 (38) | 956 (104) | 342/2 | OG0042554 |
| 496 | P6 | Venom metalloproteinase 2-like [ | 921 (39) | 1168 (89) | 279/4 | OG0009638 |
| 91 | TV6 | Venom metalloproteinase 3-like [ | 915 (40) | 1185 (110) | 182/1 | OG0000306 |
| 575 | P3 | Hypothetical protein [ | 910 (41) | 839 (153) | 952/6 | OG0000843 |
| 590 | P3 | Unknown. | 896 (42) | 483 (223) | 641/4 | OG0042523 |
| 153 | TV13 | Hypothetical protein [ | 818 (43) | 2230 (56) | 1637/11 | OG0001872 |
| 651 | TV10-11 | Venom metalloproteinase 3-like [ | 804 (44) | 2022 (63) | 2059/4 | OG0009638 |
| 2103 | P4 | Uncharacterized protein LOC118072154 [ | 801 (45) | 1171 (112) | 100/3 | OG0000843 |
| 4685 | TV10 | No hit | 766 (46) | 881 (114) | 4416/30 | na |
| 375 | TV5 | Venom metalloproteinase 2-like [ | 760 (47) | 1638 (71) | 161/4 | OG0000843 |
| 216.p1 | TV7 | Uncharacterized protein LOC118072154 [ | 710 (48) | 1227 (84) | 959/7 | OG0000306 |
| 609 | P2 | Unknown. | 697 (49) | 601 (153) | 238/4 | OG0021629 |
| 429.p1 | TV10-11 & TV23 | Uncharacterized protein LOC118072154 [ | 691 (50) | 3131 (42) | 439/11 | OG0000306 |
| 286 | TV14 & TV22 | Hypothetical protein [ | 690 (51) | 1198 (107) | 506/1 | OG0001872 |
| 244 | TV11-12 | A disintegrin and metalloproteinase with thrombospondin motifs 15-like [ | 699 (52) | 3022 (44) | 501/4 | OG0009638 |
| 554 | P5 | Pancreatic lipase-related protein 2-like [ | 611 (53) | 1258 (82) | 602/9 | OG0010100 |
| 124 | TV20-21 | Hypothetical predicted protein [ | 594 (54) | 1018 (97) | 1243/6 | OG0019104 |
| 185 | TV12-13 | Hypothetical protein [ | 594 (55) | 2979 (50) | 789/3 | OG0001872 |
| 1150 | - | No hit. | 584 (56) | 530 (164) | 113/1 | OG0021629 |
| 166 | - | No hit | 549 (57) | 438 (181) | 557/5 | OG0000306 |
| 378.p2 | - | A disintegrin and metalloproteinase with thrombospondin motifs 1-like [ | 532 (58) | 204 (286) | 359/6 | OG0000306 |
| 943 | - | No hit | 530 (59) | 556 (157) | 101/2 | OG0000306 |
| 1411 | P6 | Hypothetical protein [ | 523 (60) | 738 (136) | 238/0.3 | OG0001872 |
° contigs are .p1 unless indicated; * presence in TV and Pellet (P) bands from Figure 3 are indicated when the protein was within the 15 highest validated mascot scores in the bands (a protein may have a high total mascot score, but not be among the most abundant in a single band). §, In italic, the domain search results.
Names and functions of the most abundant venom soluble proteins. Mascot “ranks” range from 1 (the most abundant) to 1354 (the less abundant) for TV CDS/proteins and from 1 to 516 for MpVLP CDS/proteins.
| CDS ° | Gel Bands * | Names and Functions of Venom Soluble Proteins (E-Value Protein Blast) § | TV | MpVLPs Score (Rank) | TPMv/TPMa | Contig |
|---|---|---|---|---|---|---|
| 10764 | TV9-12 | Uncharacterized protein LOC103570348 [ | 23,572 (3) | 16 (489) | 0.9/0.3 | OG0015773 |
| 536 | TV12-13 | No hit. | 14,497 (7) | 278 (87) | 581/30 | OG0015781 |
| 341 | TV3-4 | Venom protein 2 [ | 9650 (12) | 429 (69) | 196/4 | OG0025733 |
| 21 | TV4-5 | Protein 5NUC [ | 8248 (14) | 125 (136) | 6656/4 | OG0002704 |
| 114 | TV11-13 | Pancreatic lipase-related protein 2 [ | 7962 (15) | 106 (151) | 5394/31 | OG0010100 |
| 8250 | TV3-4 | Uncharacterized protein LOC105686340 isoform X1 [ | 7489 (18) | 60 (214) | 1/4 | OG0010765 |
| 7504 | TV4 | Uncharacterized protein LOC107042423 isoform X1 [ | 6704 (20) | 173 (118) | 0.4/5 | OG0010765 |
| 2792 | TV10 | Uncharacterized protein LOC103574503 [ | 6520 (22) | 73 (183) | 5/6 | OG0005090 |
| 136 | TV13 | No hit. | 6406 (23) | 193 (104) | 4945/25 | OG0000843 |
| 106 | TV5-6 | Predicted pancreatic triacylglycerol lipase [ | 6267 (24) | 63 (209) | 3310/21 | OG0031414 |
| 2331 | TV19-21 | No hit. | 5752 (25) | 175 (114) | 52/8 | OG0015781 |
| 3422 | TV10-11 | Arginine kinase [ | 5653 (26) | 49 (238) | 1/764 | OG0001469 |
| 58 | TV11-12 | A disintegrin and metalloproteinase with thrombospondin motifs 6 [ | 5079 (30) | 444 (66) | 1579/8 | OG0009638 |
| 8023 | TV4 | Venom protein [ | 4972 (31) | 276 (88) | 3/2 | OG0010765 |
| 11420 | TV4-5 | Uncharacterized protein LOC115884266 [ | 4758 (32) | 118 (141) | 298/6 | OG0010765 |
| 11339 | TV9-10, TV20 | γ-Glutamyltranspeptidase 1 isoform X1 [ | 4507 (34) | 101 (122) | 0.8 /3 | OG0000441 |
| 662.p2 | TV19 | Hypothetical protein [ | 4465 (35) | 181 (111) | 1321/49 | OG0042432 |
| 1164 | TV3 | Venom protein 2 [ | 4306 (36) | not found | 11/6 | OG0008161 |
| 2455 | TV3 | Hypothetical protein TSAR.007430 [ | 3922 (38) | not found | 32/1 | OG0000373 |
| 11685 | TV4 | Uncharacterized protein LOC105686340 isoform X2 [ | 3866 (39) | 118 (142) | 0.4/2 | OG0010765 |
| 275 | TV4 | Uncharacterized protein LOC108086051 [ | 3455 (40) | 65 (195) | 3317 /19 | OG0025743 |
| 6649 | TV1 | Myosin heavy chain, muscle isoform X5 [ | 3032 (43) | not found | 0.5/153 | OG0000051 |
| 69 | TV13 | Glyceraldehyde-3-phosphate dehydrogenase [ | 2911 (45) | not found | 205/1324 | OG0003321 |
| 229 | TV22 | No match | 2693 (47) | 109 (147) | 4413/13 | OG0001872 |
| 726 | TV3 | Uncharacterized protein LOC118068317 [ | 2551 (49) | not found | 197/2 | OG0019097 |
| 53 | TV4 | Receptor-type tyrosine-protein phosphatase epsilon isoform X2 [ | 2514 (51) | 190 (107) | 4243/31 | OG0010765 |
| 155 | TV11 | Hypothetical protein [ | 2458 (52) | not found | 5915 /4 | OG0009638 |
| 4840 | TV1 | Apolipophorins [ | 2451 (53) | 51 (232) | 0.4/189 | OG0013776 |
| 763 | TV12 | Predicted aldose reductase [ | 2365 (54) | not found | 112/257 | OG0000155 |
| 14605 | TV1 | Apolipophorins [ | 2303 (55) | not found | 0.1/17 | OG0025456 |
| 1036 | TV16 | Putative 14-3-3 protein zeta isoform X1, partial [ | 2188 (58) | 148 (126) | 309/970 | OG0000955 |
| 11280 | TV25 | Odorant-binding protein 3 [ | 2175 (59) | not found | 2/3700 | OG0000832 |
| 6971 | TV3 | Neprilysin-2-like isoform X2 [ | 2098 (60) | not found | 1/1 | OG0017160 |
° contigs are .p1 unless indicated; *, Band number from Figure 3; §, in italic the domain search results.
Figure 4Sequence alignment of nucleotidases from M. pulchricornis, human, E.coli, snake, drosophila and bee. Important conserved residues or motifs are shaded in grey or boxed. Functionally important residues are marked below the alignment with the following labels: “R” indicates the three arginine in the active site of the E. coli enzyme; “F”, the phenylalanines that bind the adenine moiety; “S” the residues that interact with the substrate and “s” those involved in non-polar interactions (from [61]). The only two cysteines in MpNUC that can form a bond are in red. USHA.ECOLI, Protein UshA Escherichia coli P07024 (uniport); 5NTD.HUMAN, 5’-nucleotidase Homo sapiens P21589; V5NTD.CROAD, Crotalus adamanteus venom 5’-nucleotidase F8S0Z7; NT5E-2PA, D. melanogaster Q8SZY4; 5NUC.Apis, protein 5NUC Apis mellifera XP.394018 (NCBI). Identical amino acids are indicated by a star, conservation by a colon and substitution by a dot.
Figure 5Alignment of the putative M. pulchricornis venom lipocalins. Grey boxes indicate the three putative lipocalin structural regions (SCR1, 2, 3). In green, the hydrophobic tryptophan (W) retrieved in SCRs, in red the cysteines. Identical amino acids are indicated by a star, conservation by a colon and substitution by a dot.
Figure 6Clustal alignment of the M. pulchricornis GGTs with the human (CAG30380.1) and A. ervi (CAL69624.1) GGT1 sequences. The catalytic threonine residue that forms the N-terminus of the small subunit (T381) after autocleavage to form the mature heterodimeric enzyme is highlighted with a black box containing a white “T” with an arrow above. Residues proposed to interact with glutathione in human GGT1 are shaded in grey and residues that significantly reduced human GGT enzymatic activity by site-directed mutagenesis are in red. Identical amino acids are indicated by a star, conservation by a colon and substitution by a dot.
Figure 7Sequence Alignments for Pancreatic Lipase Sequences. Key active site residues from Human Pancreatic lipase-related protein 2 (P54317) are in red shaded in grey; in shaded grey, amino acids conserved around the active site residues. The underline human sequence residues represent the two bonds between cysteines. Identical amino acids are indicated by a star, conservation by a colon and substitution by a dot.
Figure 8Alignment of hyaluronidases from M. pulchricornis (Hyal-Mp from contig 293), Human (Hyaluronidase 2, HYAL2-HUMAN, Q12891), bee (Apis mellifera, Q08169) and Cotesia congregata (CCQ71107). The putative active-site residues Asp(D), Glu(E), and Glu(E) (shaded in grey) of the hyaluronidase were conserved in the M. pulchricornis sequence, although a conservative replacement E → N occurred that may affect its activity [81]. The four cysteine residues forming two disulfide bridges are also conserved (in red). Identical amino acids are indicated by a star, conservation by a colon and substitution by a dot.
Figure 9Alignment of the first CAP domain of contig 2455 (Mp-CRISP) with selected sequences of hymenoptera antigen 5 allergens. The CAP signature motifs are shaded in grey and conserved residues that form the putative active site are shaded in cyan [85]. Cysteine residues that form disulphide bridges in Ag5 proteins are marked in red. Asterisks, colons and periods indicate identical, conserved and semi-conserved residues, respectively. Sequence from Vespula vulgaris (VA5.VESVU, Q05110.1), Dolichovespula maculata (bald-faced hornet)(VA52.DOLMA, P10736.1), Polistes dominula (European paper wasp) (VA5.polistes, NP.001310265.1), and Microctonus hyperodae (A9YME1.1).
Figure 10Amino acid sequence alignments of contig 590 protein (Mp_SERPIN) and human SERPINA1 (P01009|A1AT_HUMAN Alpha-1-antitrypsin). The RCL position of the human sequence is underlined. The cleavage site is situated between the position P1 (Met) and P1′ (Ser) of the human sequence, only the Ser is conserved in Mp_SERPIN. The important Alanine-rich hinge site shaded in grey is also absent from contig 590. Asterisks, colons and periods indicate identical, conserved and semi-conserved residues, respectively.