| Literature DB >> 34922456 |
Hui Zhang1,2, Mengxiao Tian1,2, Wenjing Qi1,2, Juan Wu1,2, Huajun Zheng3, Gang Guo1,2, Liang Zhang3, Shiwanthi L Ranasinghe4, Donald P McManus4, Jun Li5,6, Wenbao Zhang7,8.
Abstract
BACKGROUND: Cystic and alveolar echinococcosis caused by the tapeworms Echinococcus granulosus sensu stricto (s.s.) and E. multilocularis, respectively, are important zoonotic diseases. Protease inhibitors are crucial for the survival of both Echinococcus spp. Kunitz-type inhibitors play a regulatory role in the control of protease activity. In this study,we identified Kunitz-type domain protease inhibitors(KDPIs) present in the genomes of these two tapeworms and analyzed the gene sequences using computational, structural bioinformatics and phylogenetic approaches to evaluate the evolutionary relationships of these genes. Hi-seq transcriptome analysis showed that E. granulosus s.s. KDPIs were differentially expressed in the different developmental stages. We validated some of the genes expressed in adult worm, protoscolex and cyst germinal membrane of E. granulosus s.s. and E. multilocularis by quantitative PCR.Entities:
Keywords: Bioinformatic analysis; E. granulosus sensu stricto; E. multilocularis; Kunitz inhibitors; Stage expression
Mesh:
Substances:
Year: 2021 PMID: 34922456 PMCID: PMC8684439 DOI: 10.1186/s12864-021-08219-4
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Summary of physiological and biological characteristics of Kunitz protease inhibitors in E. multilocularis and E. granulosus s.s.
| Species |
|
|
|---|---|---|
| KDPIs/single KDPIs | 19/16 | 23/21 |
| Number of amino acids | 333.47 | 269.96 |
| Molecular weight (Da) | 37153.2 | 30117.02 |
| Isoelectric points | 7.44 | 7.84 |
| No. of trans-domain (%) | 4(21.05) | 5(21.74) |
| No. of cysteine in Kunitz domain | 5.63 | 5.65 |
| No. of cysteine in the protein | 24.89 | 18.13 |
| Instability index | 45.17 | 47.36 |
| Stable protein (No/yes) | 12/7 | 15/8 |
| Aliphatic index | 72.35 | 71.37 |
| Grand average of hydropathicity | -0.22 | -0.22 |
| Signal peptides (%) | 17(89.47) * | 14(60.87) |
| No. of Kunitz Motifs | 1.79 | 1.17 |
| En-t-In(T/C) | 9/2 | 9/4 |
Note: KDPIs, Kunitz domain protease inhibitors; No. of aa, number of amino acids; No. of tran-domain, percentage of containing transmembrane domains; Aver of cysteine, average of cysteine per sequence; Aliphatic indexes; GRSVY, hydropathic index; En-t-In (T/C), enzyme targeting inhibitors, trypsin inhibitors(T) or chymotrypsin inhibitors(C)
There are significant differences of signal peptides between E. multilocularis and E. granulosus s.s. KDPIs (χ2 = 13.544, P < 0.01)
Fig. 1Alignment and clustering of E. multilocularis and E. granulosus s.s. Kunitz-type domain protease inhibitors. These inhibitor protein sequences are compared with the homologue sequences (blocked in gray background) Homo sapiens including P48307(Hsa), P05067(Hsa), P12111(Hsa) and O95925(Hsa), Ovis aries (NP_001009291.1(Oar)), Bos taurus(1BTH_Q(Bt)), Schistosoma mansoni (Smp_147730 (Sm)), Stichodactyla haddoni(B1B5I8(Sha)), Stichodactyla helianthus (3OFW_A(She)), Trinorchestia longiramus(KAF2343884(Tl)), Haplopelma schmidti (P68425(Hsc)). “En”, “T”, “C” and “/” represent enzyme and inhibitor of trypsin, chymotrypsin, and nonprediction, respectively
The expression of Kunitz-type domain protease inhibitors in four stages of E. granulosus s.s. showing Hi-seq reads
| Gene ID | Adt | Onc | PSC | CGM | Seq. Description | Seq. | #GOs | GOs |
|---|---|---|---|---|---|---|---|---|
| EGR_07244 | 217 | 0 | 1 | 10 | serine protease inhibitor | 106 | 1 | F: serine-type endopeptidase inhibitor activity |
| EGR_08716 | 155 | 0 | 0 | 0 | kunitz-type protease inhibitor 3-like | 84 | 3 | F: peptidase inhibitor activity; F: protein binding; P: transforming growth factor beta receptor signaling pathway |
| EGR_10096 | 70 | 0 | 3 | 4 | kunitz domain-containing | 98 | 42 | C: cytoplasmic vesicle; P: apoptosis; P: neuromuscular process controlling balance; P: ionotropic glutamate receptor signaling pathway; P: regulation of epidermal growth factor receptor activity; F: receptor binding; P: regulation o |
| EGR_07242 | 46 | 0 | 1 | 5 | serine protease inhibitor | 83 | 7 | F: peptidase inhibitor activity; C: nematocyst; F: ion channel inhibitor activity; F: potassium channel inhibitor activity; F: serine-type endopeptidase inhibitor activity; C: extracellular region; P: pathogenesis |
| EGR_07243 | 35 | 0 | 2 | 5 | wap four-disulfide core domain 6b | 75 | 2 | C: cytoplasm; F: serine-type endopeptidase inhibitor activity |
| EGR_03480 | 24 | 0 | 1 | 5 | four-domain proteases inhibitor | 88 | 1 | F: peptidase inhibitor activity |
| EGR_09490 | 17 | 2 | 14 | 51 | spon-1 protein | 976 | 3 | F: serine-type endopeptidase inhibitor activity; C: proteinaceous extracellular matrix; C: extracellular region |
| EGR_03481 | 16 | 0 | 8 | 2 | trypsin inhibitor | 242 | 4 | F: peptidase inhibitor activity; P: multicellular organismal process; F: extracellular matrix structural constituent; C: proteinaceous extracellular matrix |
| EGR_08720 | 10 | 0 | 0 | 0 | kunitz bovine pancreatic trypsin inhibitor domain protein | 84 | 4 | F: serine-type endopeptidase inhibitor activity; F: peptidase activity; F: peptidase inhibitor activity; C: extracellular region |
| EGR_07944 | 9 | 0 | 22 | 14 | kunitz bovine pancreatic trypsin inhibitor domain protein | 539 | 2 | C: extracellular region; F: hydrolase activity |
| EGR_09269 | 8 | 0 | 0 | 1 | tissue factor pathway inhibitor 2-like | 92 | 2 | F: extracellular matrix structural constituent; C: proteinaceous extracellular matrix |
| EGR_05317 | 7 | 0 | 19 | 16 | Kunitz-like protease inhibitor precur | 1540 | ||
| EGR_08721 | 6 | 108 | 0 | 0 | serine protease inhibitor- with kunitz and wap domains 1 | 79 | 1 | C: acrosomal vesicle |
| EGR_04958 | 5 | 0 | 1 | 2 | elegans protein partially confirmed by transcript evidence | 135 | 2 | F: serine-type endopeptidase inhibitor activity; P: epidermis development |
| EGR_05483 | 5 | 12 | 0 | 2 | secreted protein with kunitz | 191 | 2 | C: extracellular region; F: hydrolase activity |
| EGR_09007 | 4 | 0 | 0 | 0 | mechanosensory abnormality family member (mec-1) | 86 | 2 | P: extracellular structure organization; P: mechanosensory behavior |
| EGR_09008 | 3 | 0 | 0 | 0 | acp24a4 | 102 | 2 | C: extracellular region; F: peptidase activity |
| EGR_01779 | 1 | 0 | 13 | 3 | isoform g | 878 | 1 | F: hydrolase activity |
| EGR_05482 | 1 | 2 | 0 | 0 | kunitz bovine pancreatic trypsin inhibitor domain containing protein | 130 | 1 | C: extracellular region |
| EGR_07266 | 1 | 0 | 0 | 3 | serine protease inhibitor | 129 | 3 | F: binding; F: serine-type endopeptidase inhibitor activity; C: extracellular region |
| EGR_05316 | 0 | 0 | 3 | 2 | kunitz-type protease inhibitor 3-like | 239 | 1 | F: peptidase inhibitor activity |
| EGR_08718 | 0 | 2 | 0 | 0 | kunitz domain-containing | 144 | 4 | F: peptidase activity; F: serine-type endopeptidase inhibitor activity; F: peptidase inhibitor activity; C: extracellular region |
| EGR_09006 | 0 | 0 | 0 | 0 | single kunitz protease inhibitor | 89 | 5 | F: peptidase activity; F: serine-type endopeptidase inhibitor activity; F: peptidase inhibitor activity; C: extracellular region; P: multicellular organismal development |
Note: Adt, adult worms; Onc, oncospheres; PSC, protoscoleces; CGM, Cyst Germinal membrane
Fig. 2Phylogram constructed to compare the sequences of E. granulosus s.s.- and E. multilocularis-KDPIs with KDPIs from Bos taurus, Homo sapiens and other species. These inhibitor protein sequences are also compared with the sequences from Dendroaspis including Dendroaspis angusticeps (P00982.1(Da)), Dendroaspis polylepis (P00981(Dp)). The phylogenetic tree was constructed using MEGA 7.0 and Interactive Tree of Life iTOL v6 with the maximum likelihood method. Bootstrapping analysis was performed and the bootstrap values from 1000 replicate iterations are shown on the nodes. The clades are represented by different color
The secondary structure prediction of the single Eg-KDPIs and Em-KDPIs
| Accession number | Alpha helix(aa/%) | Extended strand(aa/%) | Beta turn(aa/%) | Random coil(aa/%) |
|---|---|---|---|---|
| EGR_03480 | 32/36.36 | 19/21.59 | 3/3.41 | 34/38.64 |
| EGR_03481 | 46/19.01 | 57/23.55 | 19/7.85 | 120/49.59 |
| EGR_04958 | 33/24.44 | 28/20.74 | 11/8.15 | 63/46.67 |
| EGR_05316 | 92/38.17 | 54/22.41 | 17/7.05 | 78/32.37 |
| EGR_05482 | 54/41.54 | 18/13.85 | 8/6.15 | 50/38.46 |
| EGR_05483 | 66/34.55 | 41/21.47 | 10/5.24 | 74/38.74 |
| EGR_07242 | 25/30.12 | 13/15.66 | 9/10.84 | 36/43.37 |
| EGR_07243 | 20/26.67 | 20/26.67 | 5/6.67 | 30/40 |
| EGR_07244 | 41/38.68 | 23/21.7 | 2/1.89 | 40/37.74 |
| EGR_07266 | 68/52.71 | 29/22.48 | 8/6.2 | 24/18.6 |
| EGR_07944.1 | 138/25.6 | 91/16.88 | 34/6.31 | 276/51.21 |
| EGR_08716 | 31/36.9 | 11/13.1 | 3/3.57 | 39/46.43 |
| EGR_08718 | 41/28.47 | 34/23.61 | 8/5.56 | 61/42.36 |
| EGR_08720 | 25/29.76 | 18/21.43 | 4/4.76 | 37/44.05 |
| EGR_08721 | 35/44.3 | 13/16.46 | 3/3.8 | 28/35.44 |
| EGR_09006 | 28/31.46 | 22/24.72 | 9/10.11 | 30/33.71 |
| EGR_09007 | 23/26.74 | 13/15.12 | 7/8.14 | 43/50 |
| EGR_09008 | 43/42.16 | 22/21.57 | 7/6.86 | 30/29.41 |
| EGR_09269 | 37/40.22 | 15/16.3 | 4/4.35 | 36/39.13 |
| EGR_09490 | 203/20.8 | 145/14.86 | 39/4.0 | 589/60.35 |
| EGR_10096 | 20/20.41 | 23/23.47 | 3/3.06 | 52/53.06 |
| EmuJ_000077700.1 | 65/30.95 | 42/20 | 17/8.1 | 86/40.95 |
| EmuJ_000077800.1 | 33/24.26 | 39/28.68 | 10/7.35 | 54/39.71 |
| EmuJ_000302900.1 | 128/23.97 | 76/14.23 | 31/5.81 | 299/55.99 |
| EmuJ_000419200.1 | 37/40.22 | 15/16.30 | 0/0.00 | 40/43.48 |
| EmuJ_000534800.1 | 24/32.00 | 6/8.00 | 5/6.67 | 40/53.33 |
| EmuJ_000548800.1 | 23/23.23 | 21/21.21 | 4/4.04 | 51/51.52 |
| EmuJ_000549400.1 | 19/19.00 | 27/27.00 | 8/8.00 | 46/46.00 |
| EmuJ_001136500.1 | 36/40.45 | 16/17.98 | 2/2.25 | 35/39.33 |
| EmuJ_001136600.1 | 31/36.90 | 13/15.48 | 6/7.14 | 34/40.48 |
| EmuJ_001136700.1 | 20/25.64 | 18/23.08 | 8/10.26 | 32/41.03 |
| EmuJ_001136800.1 | 36/40.45 | 16/17.98 | 2/2.25 | 35/39.33 |
| EmuJ_001136900.1 | 21/23.33 | 17/18.89 | 4/4.44 | 48/53.33 |
| EmuJ_001137000.1 | 26/30.95 | 16/19.05 | 4/4.76 | 38/45.24 |
| EmuJ_001137100.1 | 22/26.19 | 20/23.81 | 9/10.71 | 33/39.29 |
| EmuJ_001137300.1 | 20/25 | 29/36.25 | 5/6.25 | 26/32.5 |
| EmuJ_001137400.1 | 26/30.59 | 16/18.82 | 4/4.71 | 39/45.88 |
Note: aa/%, number/percentage of amino acids in each secondary structure
Fig. 3Three dimensional structures of single Kunitz domain protease inhibitors in E. granulosus s.s. and E. multilocularis using SWISS-MODEL. a Structures of single Kunitz domain protease inhibitors in E. granulosus s.s. . b Structures of single Kunitz domain protease inhibitors in E. multilocularis
Fig. 4Transcription levels of KDPIs (EGR_08716, EGR_07244, EGR_07944, EGR_09490, EmuJ_001137000, EmuJ_000534800, EmuJ_000302900, and EmuJ_000929500) in different life cycle stages of E. granulosus s.s. and E. multilocularis. Bars indicate the mean value± standard deviation (SD) of three individual experiments. Differences among different groups were analyzed by one-way ANOVA and Two-tailed Student’s t test. “Adult”, “PSC”, and “CGM” represent adult worm, protoscoleces, and cyst germinal membrane three stages of the tapeworm, respectively
Sequences of forward and reverse primers for the genes analysed by qPCR
| Gene | Primer sequences(5’-3’) | Product size |
|---|---|---|
| EGR_08716 | F: CCAATCTCGCACTTCTACTCCTCATG R: ATCGACAGGGTCCTACCTCAATGG | 102 bp |
| EGR_07244 | F: TCCTCCTCGTGGTCATCAGCTATTC R: CCACAACCGCCGTAGATGAACTC | 148 bp |
| EGR_07944 | F: AAGTGCCTTCATCGCCTCCATTG R: CCGCCTCTGATTCTGCTTTCTCTG | 96 bp |
| EGR_09490 | F: CCAGTCATAGCGGGCATCAGTTG R: GTCATCGTCTCCGTGGCATTCC | 113 bp |
| EmuJ_001137000 | F: CTCCTCATGCTGCTCGGTGTTG R: ATCGACAGGGTCCTACCTCAATGG | 85 bp |
| EmuJ_000534800 | F: TTTCCTCCTTGTGGTCATCAGCTATTC R: CACAACCGCCGTAGATGAACTCC | 149 bp |
| EmuJ_000302900 | F: TACAATGGTCACGGCGGCAATG R: TCTGGCGTTCTGGCACAAATCG | 99 bp |
| EmuJ_000929500 | F: GCACATCTTCAGTGGCAGGTATCC R: TGGCGGTTGGCATTTGGACTAC | 87 bp |
| Eif3 | F: GTTACATCCCTCCGACCTTG R: AAGCAGCCTCCTCTTGAGTG | 243 bp |
Note: F, forward primer; R, reverse primer