| Literature DB >> 17069636 |
Z Zou1, Dawn L Lopez, Michael R Kanost, Jay D Evans, Haobo Jiang.
Abstract
We have identified 44 serine protease (SP) and 13 serine protease homolog (SPH) genes in the genome of Apis mellifera. Most of these genes encode putative secreted proteins, but four SPs and three SPHs may associate with the plasma membrane via a transmembrane region. Clip domains represent the most abundant non-catalytic structural units in these SP-like proteins -12 SPs and six SPHs contain at least one clip domain. Some of the family members contain other modules for protein-protein interactions, including disulphide-stabilized structures (LDL(r)A, SRCR, frizzled, kringle, Sushi, Wonton and Pan/apple), carbohydrate-recognition domains (C-type lectin and chitin-binding), and other modules (such as zinc finger, CUB, coiled coil and Sina). Comparison of the sequences with those from Drosophila led to a proposed SP pathway for establishing the dorsoventral axis of honey bee embryos. Multiple sequence alignments revealed evolutionary relationships of honey bee SPs and SPHs with those in Drosophila melanogaster, Anopheles gambiae, and Manduca sexta. We identified homologs of D. melanogaster persephone, M. sexta HP14, PAP-1 and SPH-1. A. mellifera genome includes at least five genes for potential SP inhibitors (serpin-1 through -5) and three genes of SP putative substrates (prophenoloxidase, spätzle-1 and spätzle-2). Quantitative RT-PCR analyses showed an elevation in the mRNA levels of SP2, SP3, SP9, SP10, SPH41, SPH42, SP49, serpin-2, serpin-4, serpin-5, and spätzle-2 in adults after a microbial challenge. The SP41 and SP6 transcripts significantly increased after an injection of Paenibacillus larva, but there was no such increase after injection of saline or Escherichia coli. mRNA levels of most SPs and serpins significantly increased by 48 h after the pathogen infection in 1st instar larvae. On the contrary, SP1, SP3, SP19 and serpin-5 transcript levels reduced. These results, taken together, provide a framework for designing experimental studies of the roles of SPs and related proteins in embryonic development and immune responses of A. mellifera.Entities:
Mesh:
Substances:
Year: 2006 PMID: 17069636 PMCID: PMC1761132 DOI: 10.1111/j.1365-2583.2006.00684.x
Source DB: PubMed Journal: Insect Mol Biol ISSN: 0962-1075 Impact factor: 3.585
Serine proteases (SPs) and serine proteinase homologs (SPHs) in Apis mellifera
| Homologous proteins | Conserved regions | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Gene name | ID | Drosophila | other arthropods | TAAHC | DIAL | GDSGGP | Length | Activation site | Enzyme specificity | Domain structure |
| cSP1 | 16147 | ea CG1102 | MsHP5 | 376 | TEQK∧IFGG | T(DGA) | C-SP | |||
| cSP2 | 14247 | ea CG1102 | MsHP8 | ∼391 | LSQR∧IIGG | ?(D??) | C-SP | |||
| cSP3 | 11698 | MsHP17 | 353 | SHTR∧VVGG | T(DGG) | C-SP | ||||
| SP4 | 10646 | CG4914 | > 304 | EESR∧IVGG | T(DGG) | SP | ||||
| SP5 | 12300 | CG4386 CG18735 | 329 | VQRR∧IVGG | T(DGG) | SP | ||||
| cSP6 | 14077 | CG8172 | DVAL | 622 | RSNR∧IVGG | T(DGG) | 2LC-C-SP | |||
| cSP7 | 17145 | CG31728 | 512 | DQER∧IVGG | T(DGG) | C-SP | ||||
| SP8 | 18767 | CG9372 | MsHP21 | DIAV | > 292 | SRSR∧LTGG | T(DGG) | SP | ||
| cSP9 | 18732 | CG11843 | MsHP13 | 423 | DRKL∧IVGG | T(DGG) | pSP-SP | |||
| cSP10 | 17927 | Psh | MsHP21 CfSP | ?751 | PNKF∧IVGG | T(DGG) | 2(C-LC-SP) | |||
| Psh | MsHP21 CfSP | PNKF∧IVGG | T(DGG) | |||||||
| SP11 | 14654 | CG11836 | MsHP1 | DVAL | > 255 | QEDR∧IVGG | T(DGG) | SP | ||
| SP12 | 19856 | CG5255 CG31265 | DIGL | > 237 | EIPK∧IVGG | ?(GGD) | SP | |||
| SP13 | 15640 | CG7996 | MsHP21 Ag18D | > 448 | PNHL∧VIGG | T(DGG) | 3LC(Nr)-SP | |||
| cSP14 | 14044 | CG2056-PB,snake | MsHP6 | DVAI | > 385 | LSFH∧IFNG | T(DGG) | C-SP | ||
| SP15 | 18178 | > 294 | TTGR∧IFNG | ?(GGD) | SP | |||||
| SP16 | 12253 | CG16996 | TAGHC | DLAL | ?1149 | PETR∧IVGG | T(DGG) | 2LC(HTr)-SP | ||
| SP17 | 14603 | CG4316 | TAGHC | ?498 | LEPR∧ITGD | C(SAG) | SP-SP | |||
| CG4316 | TAGHC | FDTR∧IVGG | C(SGS) | |||||||
| SP18 | 10222 | CG31954 | CsSP | DVAL | > 247 | LQPR∧IIGG | T(DGG) | SP | ||
| cSPH19 | 17345 | CG4998 | TtFD | DIAI | GDGGGP | 741 | 4LC-LC(Yr)-C?-SPH | |||
| SP20 | 19590 | nudel corin | BmOvarianSP | DIGM | ?1645 | SQLR∧VVGG | T(DGG) | 3LDLA-SP-3LDLA-pSP-LDLA(RGD)-2LDLA(pSP) | ||
| cSP21 | 16220 | CG7432 | TtPCE | DIAV | > 408 | GKYR∧VVGG | T(DGG) | C-LC-SP | ||
| SP22 | 13791 | CsSP | > 259 | PDTQ∧IVGG | T(DGG) | SP | ||||
| SP23 | 12538 | Tequila CG4821 | Ag22D | ?2323 | IFQK∧VVRG | T(DGG) | 4LC-4CBD-SR-Clect-KR-LDLA-PA-2LDLA-SP | |||
| SP24 | 14233 | CG6865 | DIAI | > 236 | ? | T(DGG) | SP | |||
| cSP25 | 19719 | CG11824 | DLAL | ?942 | PESR∧IVGG | T(DGG) | C-10LC(STr)-SP | |||
| cSP26 | 18450 | CG8170 | CfSP | TAGHC | DVAV | ?667 | AQRR∧IVGG | T(DGG) | TM-2LC-C-SP | |
| SP27 | 11588 | CG31954 | AgTry | ?537 | MDGR∧IVGG | T(DGG) | 2(LC-SP) | |||
| DVAV | PTGQ∧IIGG | T(DGG) | ||||||||
| SP28 | 13489 | CG30375 | DVAL | MDSGGP | > 405 | NPSR∧IVGG | T(DGG) | TM-CUB-SP | ||
| SP29 | 14644 | CG18375 | DIAI | > 224 | ? | T(DGG) | SP | |||
| SP30 | 19649 | corin | TASHC | DVAL | ∼944 | AKTR∧IVGG | T(DGG) | cc-LC-TM-Fri(ZnF)-LDLA-LC-LDLA-SR-SP | ||
| SP31 | 11297 | AaTry | TAGHC | > 291 | EEDR∧IFGG | T(DGG) | SP | |||
| SP32 | 11511 | AaChy | TAGHC | > 260 | RPTR∧IVGG | C(AGS) | SP | |||
| cSP33 | 14309 | CG8213 | OnT2b | DLAL | > 1269 | KSGR∧IVGG | T(DGG) | C-5LC(Tr)-SP | ||
| SP34 | 11552 | CG30371 | CsChy | TAGHC | MGSGGP | ∼405 | NPSR∧IVGG | ?(DGN) | TM-CUB-SP | |
| SP35 | 16021 | CG5255 | DIAI | > 255 | NLEK∧IVGG | ?(GVD) | LC-SP | |||
| SP36 | 19846 | CG5255 | > 263 | PESK∧IVGG | ?(GGD) | SP | ||||
| SPH37 | 18944 | CG13318 | PlMasq | TVAHC | DVAV | GDGGSP | > 307 | pC?-SPH | ||
| SP38 | 16214 | CG10663 | DVAM | ∼481 | YFTR∧IIGG | T(DGG) | 2TSP1-SP | |||
| cSPH39 | 14366 | LD13269p | CrVn50 TmPPAF | DIAV | GDGGGP | ?783 | ZnF-LC-C-LC(Tr)-C-C(LC)-C-SPH | |||
| SP40 | 13263 | CG32808 | PlTry | ?725 | YNPK∧IING | C(GAT) | ZnF-LC-Sina-LC-SP | |||
| cSPH41 | 10943 | masquerade CG15002 | GDGGGP | 735 | 5[C-LC(STr)]-SPH | |||||
| cSPH42 | 11298 | CG5390 | BmMasq CrVn50 | DFAI | GDGGSP | 417 | LC-C-SPH | |||
| SP43 | 18530 | CG9564 | DVAV | 268 | PTGQ∧IIGG | T(DGG) | SP | |||
| SP44 | 15453 | DITI | GDSGGG | > 340 | LIGR∧IVNG | T(DGG) | SP | |||
| SP45 | 17654 | SAAHC | DIAM | 1748 | RRSR∧IVGG | ?(D??) | TM?-6LC-3LDLA-SP | |||
| SP46 | 16367 | CG13461 stubble gd | MsHP19 | DLAV | GDSGSG | 439 | FNLL∧VAGG | E(GSI) | SP | |
| SP47 | 14774 | ----- | > 157 | ? | ?(DI?) | pSP | ||||
| SP48 | 12379 | CG32376 | MsHP3 | TALHC | > 257 | ATIK∧IIGG | T(DGG) | SP | ||
| SP49 | 15317 | CG31217 | MsHP14 | DIAI | GDSGGG | ∼628 | SKTL∧IVNG | E(SSS) | LC-4LDLA-Sushi-Wonton-SP | |
| cSPH50 | 14001 | CG14945 | MsPAP1 | TTANC | NIAM | GYNGSP | 707 | TM-LC-PLCXc-C-SPH | ||
| SPH51 | 13397 | CG18735 CG4386 | TCGNC | ---- | LDVSSS | > 296 | SPH | |||
| SPH52 | 19292 | ----- | ---- | > 136 | pSP | |||||
| SPH53 | 15702 | TSAQC | NIAL | GNPGSP | > 294 | LC-SPH | ||||
| SPH54 | 15980 | ASYSC | NDEGAP | ?2733 | TM-LC-EGF-13LC(HEPSr)-5LDLA-SR-SPH | |||||
| cSPH55 | 15254 | CG11066 | TAANC | DLAT | TDIGSP | > 539 | C-SPH | |||
| SPH56 | 13019 | CG1632 | TTASC | TTVL | EFAGSP | ∼777 | LC-TM-SEA-LC-FRI-2LDLA-SPH | |||
| SPH57 | 16038 | CG31954 | ----- | ------ | > 159 | pSPH | ||||
Aa, Aedes aegypti; Ag, Anopheles gambiae; Bm, Bombyx mori; Cf, Ctenocephalides felis; Cr, Cotesia rubecula; Cs, Culicoides sonorensis; Ms, Manduca sexta; On, Ostrinia nubilalis; Pl, Pacifastacus leniusculus; Tm, Tenebrio molitor; Tt, Tachypleus tridentatus.
If not listed, sequences are identical to the conserved TAAHC, DIAL, or GDSGGP. -----: conserved region not identified.
>, incomplete sequence due to prediction errors; ∼, nearly complete (e.g. partial signal peptide); ?, prediction error?
∧, putative activation cleavage site; ?, not predicted; blank, not applicable (SPH).
Enzyme specificity predicted based on Perona and Craik (1995). T, trypsin; C, chymotrypsin; E, elastase; ?: not predictable; blank: not applicable (SPH). Letters in parentheses: amino acid residues determining the primary specificity of a serine proteinase.
C, clip domain; CBD: chitin-binding domain; cc, coiled coil region; Clect, C-type lectin domain; CUB, a domain identified in Complement 1r/s, Uegf, and Bmp1; EGF, Ca2+-binding EGF domain; FRI, frizzle domain; KR: kringle domain; LC, low complexity region; LDLA: low-density lipoprotein receptor class A domain; p, partial; PA, pan-apple domain; PLCXc, phospho-lipase C catalytic domain; SEA, a ∼120-residue domain in Sperm protein, Enterokinase and Agrin; Sina, a domain identified in Drosophilaevenbsentia; SP, serine proteinase catalytic domain; SPH, serine proteinase-like domain; SR: scavenger receptor cysteine-rich domain; Sushi, Sushi domain, also known as CCP or SCR. Wonton: a disulfide knotted domain found in M. sexta HP14; TSP1, thrombospondin type I repeat; TM, transmembrane region; XYr, regions rich in amino acid residues X and Y; ZnF, Zinc finger domain.
Figure 1Sequence comparison and phylogenetic relationships among the Apis mellifera clip-domain SPs and SPHs. A. alignment of the clip domain sequences. Six conserved Cys residues form 3 disulphide bonds. B. phylogenetic tree based on an alignment of the catalytic and protease-like domains. Vertical bars and numbers indicate the clip domain groups.
Figure 2Domain organization of some SPs in Apis mellifera and other insects. A. Apis mellifera SP49 is orthologous to Manducta sexta HP14, An. gambiae AgCP12488 and Drosophila melanogaster AY118964. Apis mellifera SP23 is similar to Anopheles gambiae SP22D and D. melanogaster Tequila, whereas honey bee SP30 is homologous to D. melanogaster corin. B. A proposed SP cascade (left) for establishing the dorsal-ventral axis of A. mellifera embryo, in comparison to a similar system discovered in D. melanogaster.
Figure 3Alignment of Drosophila spätzle and Apis spätzle-1 and −2. The first 127 residues at the amino terminus of the fly protein were not shown. *, identical:, similar.
Serine protease inhibitors (serpins) in Apis mellifera
| Homologous proteins | |||||||
|---|---|---|---|---|---|---|---|
| Accession number | G | Drosophila | Other arthropods | Length (aa) | Signal peptide | Predicted reactive site | Target enzyme specificity |
| serpin-1 | GB17012 | serpin-4 | MsSerpin-1,2/AgSRPN-10 | 334 | No | LR*RC | T |
| serpin-2 | GB16472 | serpin-4 | MsSerpin-1,2 | 342 | QG-ET | PL*SS | C |
| serpin-3 | GB12279 | spn-27 A | MsSerpin-3/AgSRPN-2 | 466 | DG-KE | NK*NQ | T |
| serpin-4 | GB13578 | CG7219 | AgSRPN-6 | 469 | FG-QL | ER*DG | T |
| serpin-5 | GB19582 | serpin-5 | MsSerpin-6 | 451 | SA-QC | FR*SG | T |
| GB10078 | CG14470 | 1543 | VG-SP | ER*AE | T | ||
| GB15070 | CG12807 | 612 | YC-VD | ER*AG | T | ||
Ag, Anopheles gambiae; Ms, Manduca sexta; C, chymotrypsin; T, trypsin.
GB10078 contains a carboxyl-terminal serpin domain; GB15070 contains a split serpin domain (maleszka3).
Figure 4Sequence alignment and phylogenetic relationships of serpins from Apis mellifera and other insects. A. Amino acid sequence alignment of the P17-P4′ region. Identical residues are indicated by ‘*’, and similar residues by ‘:’. B. Phylogenetic tree based on alignment of full-length serpins selected from A. mellifera, Anopheles gambiae, Drosophila melanogaster and Manducta sexta.
Figure 5Quantitative RT-PCR analyses of Apis mellifera SP-related transcripts. A. RNA samples from adult workers at 24 h after injections of Paenibacillus larvae (P.l.), E. coli (E.c.), or saline and uninjected control. B. RNA from the 2nd instar larvae at 24 h or 48 h after feeding on the regular diet (–) or diet with an infective dose of P. larvae. Gene expression is shown in grey scale, with darker squares indicating higher expression levels. SP10N and SP10C: N- and C-terminal halves.
Oligonucleotides used in real time PCR of Apis mellifera SP-related genes
| Locus | Forward Primer | Reverse primer | Gene ID |
|---|---|---|---|
| SP1 | TGCTCATTGCGTTACATCGT | TTGTCAGCGCAAACAACTTC | GB16147 |
| SP2 | GCGTTTAGAAAGCGTTCGTC | TCCGCGCAAAGTAAGCTATT | GB14247 |
| SP3 | ATGGACCCTTGTTACCACCA | GTTGCGAAGGGTTCAAAGAA | GB11698 |
| SP6 | CGATGACGATGACATTCCTG | TGTGTCCACCCACGATTCTA | GB14077 |
| SP7 | GGCTGGGTTCTTGGTGTTTA | GCTCGACTGTGGTGTAACGA | GB17145 |
| SP8 | GTTTGGTCGACGGAAGAAAA | CCGTCGACTCGAAATCGTAT | GB18767 |
| SP9 | GAGATGTTGAATGGCACGAA | CCACCACTATCTCCCTGACAA | GB18732 |
| SP10N | CCGGTGAACTTGGAAAAGAT | CTTCGCCAGGAATAATGGAA | GB17927 |
| SP10C | GAGATGTTGAATGGCACGAA | CCACCACTATCTCCCTGACAA | GB17927 |
| SP13 | CGGAGCTTAAATGCGAAGAA | TTGTTCCTAGAGCAACCATGTG | GB15640 |
| SP14 | GATTACCCAATGGCATCGAC | GCTGGTGAACCGCAAGTATT | GB14044 |
| SPH19 | ACCATCGAGAAAACCACGAC | GTACACGCTTTCCGTTGGAT | GB17345 |
| SP21 | GCCGGAAACTTACACGGTTA | CGATAATGTGCTTGCGGTAA | GB16220 |
| SP23 | AACGGAAACGAAATGGACAG | GAGCACATGCTTGAACGAAA | GB12538 |
| SP30 | CACCAGAAGGCACTCTCACA | CCTGAGCGAAGCCTAAATTG | GB19649 |
| SPH39 | GCGCCAGGAAACTCTGTTAG | ACGAAGCTTCCCCGTTTATT | GB14366 |
| SPH41 | ACCGGCACAAGCAAAATTAC | GCGAACTCTTCGTGTTGTCA | GB10943 |
| SPH42 | GAAGTCCCCTTGTTTGTCCA | TCGATCCAATCACGAACAGA | GB11298 |
| SP49 | TGTGATGGCATAGCAGATTGT | CAGGCACCATAATCACAACG | GB15317 |
| SPH50 | GCAAATCGAAAGGGAAATGA | CTGATGGAAAGCTGGTGGTT | GB14001 |
| SPH55 | GTCAACGACGTGGAAGGAAT | CGTTGGAAGACATCCCGTAT | GB13397 |
| serpin-1 | CATGGTGACATGCCAATGTT | CGAGTTGTATTTGCAAGCATTT | GB17012 |
| serpin-2 | TCCATGGAGGCAGCAAATA | CCATTGGCCTTTAAAATAAACTG | GB16472 |
| serpin-3 | CGGGAGACGAAACTGATGAT | TTCACCTTGAGCTCCTTCGT | GB12279 |
| serpin-4 | CTGGGCCACGTGTAGATTTT | ATGTCCATTGCTGCTTTTCC | GB13578 |
| serpin-5 | ACTCAGCGAACCGATTATGG | GGACAGCATTTGGATTCGTT | GB19582 |
| Spz-1 | TGCACAAATTGTTTTTCCTGA | GTCGTCCATGAAATCGATCC | GB15688 |
| Spz-2 | AATCGAAGGTTTCGCTGAAG | TTCCGGTATTATGGAACCATTT | GB13503 |
| PPO | AGATGGCATGCATTTGTTGA | CCACGCTCGTCTTCTTTAGG | GB18313 |