Literature DB >> 24864215

Cloning, Sequencing, and In Silico Analysis of β-Propeller Phytase Bacillus licheniformis Strain PB-13.

Vinod Kumar1, Gopal Singh2, Punesh Sangwan3, A K Verma2, Sanjeev Agrawal2.   

Abstract

β -Propeller phytases (BPPhy) are widely distributed in nature and play a major role in phytate-phosphorus cycling. In the present study, a BPPhy gene from Bacillus licheniformis strain was expressed in E. coli with a phytase activity of 1.15 U/mL and specific activity of 0.92 U/mg proteins. The expressed enzyme represented a full length ORF "PhyPB13" of 381 amino acid residues and differs by 3 residues from the closest similar existing BPPhy sequences. The PhyPB13 sequence was characterized in silico using various bioinformatic tools to better understand structural, functional, and evolutionary aspects of BPPhy class by multiple sequence alignment and homology search, phylogenetic tree construction, variation in biochemical features, and distribution of motifs and superfamilies. In all sequences, conserved sites were observed toward their N-terminus and C-terminus. Cysteine was not present in the sequence. Overall, three major clusters were observed in phylogenetic tree with variation in biophysical characteristics. A total of 10 motifs were reported with motif "1" observed in all 44 protein sequences and might be used for diversity and expression analysis of BPPhy enzymes. This study revealed important sequence features of BPPhy and pave a way for determining catalytic mechanism and selection of phytase with desirable characteristics.

Entities:  

Year:  2014        PMID: 24864215      PMCID: PMC4017775          DOI: 10.1155/2014/841353

Source DB:  PubMed          Journal:  Biotechnol Res Int        ISSN: 2090-3146


1. Introduction

Phytases (myo-inositol 1,2,3,4,5,6-hexakisphosphate phosphohydrolase) belongs to a special group of phosphatases which can hydrolyse phytate (myo-inositol 1,2,3,4,5,6-hexakisphosphate, IP6) to inositol phosphates, inorganic phosphorus, and myo-inositol [1]. Phytate is synthesized by plants and represents a very significant amount of organic phosphorus (60–80%) in soil [2]. Phytase lowers down affinity of phytate to associate minerals and proteins [3] and its additions to animal feed liberate the inorganic phosphorus from bound phytate-phosphorus and make it available to the monogastric animals [4, 5]. Phytases are widely distributed among plants and microbial cells [6, 7]. Based on the specific consensus sequence, catalytic mechanism, and three dimensional structures, four classes of phytases, which have been characterized so far, are histidine acid phosphatase (HAPhy), cysteine phytase (CPhy), purple acid phosphatase (PAPhy), and beta-propeller phytase (BPPhy) [8, 9]. Alternatively, according to the initiation site of dephosphorylation of the phytate, the ENZYME database (available through the ExPASy Proteomics Server: http://enzyme.expasy.org/) classifies phytases into three groups: 3-phytase (alternative name, 1-phytase; EC 3.1.3.8), 4-phytase (alternative name, 6-phytase; EC 3.1.3.26), and 5-phytase (EC 3.1.3.72). Among them, BPPhy is widely distributed in nature and plays a major role in phytate-phosphorus cycling in both soil and aquatic microbial communities [8]. BPPhy has a six-bladed beta-propeller folding architecture [10] and dephosphorylate phytate in a stereospecific way by sequential removal of every second phosphate group. These exhibit both unique Ca2+-dependent catalytic property and highly strict substrate specificity for the calcium-phytate complex [11]. Bioinformatics analysis of genes and genomes from different species makes possible the identification of new genes including orthologs or paralogs [12] and also facilitates the establishment of phylogenetic relationships between genes and evolutionary molecular mechanisms [13]. Large numbers of phytase gene sequences are available in various databases providing further opportunity to study detailed mechanistic and sequential diversity of this class of enzymes. It has been utilized for formation of consensus phytase sequence [14], in silico analysis of HAP sequences [15], and motif analysis of different phytases [16]. However, no such study has been conducted to assess sequence diversity of BPPhy. The sequence information and further analysis of superfamily will help in understanding the underlying mechanisms and also helps to develop and/or implement a range of alternate effectors for enzyme activity. The in silico characterization of protein sequences of other industrially important enzymes has also been reported recently [17-19]. In the present study, a phytase producing Bacillus licheniformis strain was used for the isolation, cloning, and sequencing of BPPhy gene in pET32a vector and expression in E. coli BL21. The phytase sequence was characterized in silico. Simultaneously, in order to better understand the structural, functional, and evolutionary aspects of BPPhy, we exploited the reference protein sequences of BPPhy in NCBI and ExPASy databases for in silico study of their biochemical features, multiple sequence alignment and identity search, phylogenetic tree construction, and distribution of motifs and superfamilies using various bioinformatics tools. We provide here information regarding conserved and variable amino acids and protein motifs that might have an impact on function. In addition, we analyzed other structural aspects including the position of conserved residues and the cleavage site of the zymogen and presented a preliminary phylogenetic analysis of selected members of this subfamily.

2. Material and Methods

2.1. Chemicals and Bacterial Strains

All the chemicals, solvents, and antibiotics used in this study were of molecular biology and analytical grade and procured from standard manufacturers as GeNei, Sigma, Merck, and HiMedia Pvt. Ltd. Phytase producing Bacillus licheniformis strain PB-13 (identified using 16S rRNA gene sequencing, GenBank Accession number JX406744.1) isolated in our laboratory was used for isolation of phytase gene [20]. E. coli DH5α and E. coli BL21 (DE3) (Novagen) were used as cloning host and expression host, respectively. Plasmid vector pET32a(+) (Novagen) was used for cloning and expression studies. E. coli strains and plasmid were kindly provided by Dr. S. P. Singh, Department of Veterinary Public Health, College of Veterinary and Animal Sciences, G. B. Pant University of Agriculture and Technology, Pantnagar.

2.2. PCR Cloning and Expression of the Phytase Gene

Phytase gene sequence (GenBank accession number BL018) was retrieved from complete genome sequence of Bacillus licheniformis ATCC 14580 from KEGG genome database (http://www.genome.jp/kegg-bin/show_organism?org=bli). Primers were designed from end regions of complete ORF. For the directional cloning, restriction sites for HindIII and XhoI were introduced at 5′ ends of forward primer, PhyL F11 “CGAAGCTTATCATATGAACTTTTACAAAACG,” and reverse primer, PhyL R “GTGCTCGAGCCTTATTTGGCTCGTTTTTTCA,” respectively. The primers were custom-synthesized by SBS Gentech Co. Ltd. The PCR amplification was carried out using Pfu polymerase (Fermentas) for 30 cycles at 94°C for 45 sec, 50°C for 45 sec, and 72°C for 1 min with genomic DNA of Bacillus licheniformis strain PB-13 as template. For directional cloning of PCR product into pET32a(+), the amplified PCR fragment was restriction-digested with HindIII/XhoI and separated on agarose gel. The separated fragment was cut from the gel and purified with the QIAquick DNA purification kit (Qiagen). Purified HindIII/XhoI fragment was cloned into an HindIII/XhoI-cut pET32a(+) E. coli expression vector harbouring C-terminal His6 tag. The E. coli DH5α cells were transformed with recombinant plasmid. Recombinant plasmid from positive clone for phytase gene was isolated and transformed into expression host E. coli BL21- (DE3-) pLysS as per standardized protocol [21]. A colony was randomly picked from among the colonies observed on ampicillin selection plate. This was tested for presence of recombinant plasmid containing phytase gene using gene specific primers (PhyL F11 and PhyL R). The transformants were grown in LB broth containing ampicillin (100 μg/mL), induced with the different amount of IPTG to optimize expression. For production analysis, samples were withdrawn at various times after induction and cells were pelleted, resuspended into 50 mM Tris-HCl buffer (pH 7.0, containing 1 mM CaCl2) were sonicated on ice for 5 min with a pulse rate of 30 sec and a gap of 10 sec. Cell debris were removed by centrifugation at 10000 rpm for 30 min, 4°C. The recombinant protein from supernatant was assayed for crude phytase activity.

2.3. Phytase Assay

Crude phytase activity was determined using 5 mM sodium phytate as substrate in 0.1 M sodium acetate buffer, with pH 5.5 following the method of Engelen et al. [22]. One unit was defined as the amount of enzyme that released 1 μM of inorganic phosphate in 1 min. The amount of phosphate released was calculated based on standard curve of KH2PO4.

2.4. In Silico Analysis of B. licheniformis PB-13 Phytase Sequence

Amplified PCR products were sequenced by automated DNA sequencer at DNA Sequencing Facility, University of Delhi (South Campus), New Delhi, India. The sequence analysis was done using MEGA5 (http://www.megasoftware.net/) and NCBI database by employing BLASTN algorithm (http://blast.ncbi.nlm.nih.gov/Blast.cgi). The sequences obtained were deposited in NCBI GenBank (http://www.ncbi.nlm.nih.gov/genbank/submit/). ORF Finder (http://www.ncbi.nlm.nih.gov/projects/gorf/) was used for identifying open reading frame into gene sequence. Nucleotide sequence represented complete true ORF was translated into protein sequence using ExPASy translate tool (http://web.expasy.org/translate/) and used for in silico characterization. The signal peptide was predicted using SignalP (http://www.cbs.dtu.dk/services/SignalP/). The tertiary structure of rPhyPB13 was predicted using the homology modeling approach at SwissModel Workspace (http://swissmodel.expasy.org/) with the β-propeller phytase TS-Phy from Bacillus amyloliquefaciens (PDB code 1H6L) as the template [23, 24]. The evolutionary history was inferred using the neigbour-joining method [25]. The evolutionary distances were computed using the maximum composite likelihood method and are in the units of the number of base substitutions per site [26]. Evolutionary analyses were conducted in MEGA5.

2.5. In Silico Characterization of β-Propeller Phytase Sequence

PhyPB13 β-propeller phytases sequence was used as probe NCBI protein database (http://www.ncbi.nlm.nih.gov/; accessed in June, 2012) to retrieve the 44 reference protein sequences of BPPhy used in this study (Table 1). The protein sequences in FASTA format from RefSeq entries, which were shown to exhibit phytase activities, were selected for further in silico study. The sequences were characterized for homology, phylogenetic relationship, functional domain, and biophysical characteristics using available bioinformatic tools following methodology as adapted by Kumar et al. [15].
Table 1

List of source organisms of retrieved BPPhy protein sequences (with accession number).

S. numberSource organismAccession number Total sequences
1 Sphingobacterium  spiritivorum ZP_03969865.1, ZP_07083876.12
2 Desulfuromonas  acetoxidans ZP_01312505.11
3 Capnocytophaga  canimorsus YP_004741572.11
4 Chlorobium  phaeobacteroides YP_001959943.11
5 Prosthecochloris  aestuarii YP_002014808.11
6 Myroides  odoratus ZP_09672975.11
7 Riemerella  anatipestifer YP_004046143.11
8 Flavobacteria  bacterium ZP_01734242.11
9 Chlorobium  limicola YP_001943170.11
10 Zobellia  galactanivorans YP_004735798.11
11 Chryseobacterium  gleum ZP_07088398.11
12 Cellulophaga  lytica YP_004261716.11
13 Mesoflavibacter  zeaxanthinifaciens ZP_09499218.11
14 Zunongwangia  profunda YP_003586972.11
15 Cyanothece  sp.   YP_002374284.11
16 Paenibacillus  mucilaginosus YP_004643897.1, YP_004639353.12
17 Paenibacillus  curdlanolyticus ZP_07387906.1, ZP_07387907.12
18 Paenibacillus  polymyxa YP_003868637.11
19 Paenibacillus  sp.   ZP_08507024.1, ZP_09771671.12
20 Bacillus  pseudomycoides ZP_04154570.11
21 Bacillus  mycoides ZP_04160523.11
22 Singulisphaera  acidiphila ZP_09566405.11
23 Bacillus  subtilis YP_004877642.1, ZP_06871959.1, NP_389861.13
24 Bacillus  licheniformis YP_090097.1, AFQ59979.1 2
25 Streptomyces  roseosporus ZP_06588929.1, ZP_04713225.12
26 Ajellomyces  dermatitidis XP_002627863.11
27 Deinococcus  proteolyticus YP_004255627.11
28 Bacillus  sp.   ZP_08003013.11
29 Microscilla  marina ZP_01694652.11
30 Paracoccidioides  brasiliensis XP_002790172.11
31 Caulobacter  segnis YP_003593415.11
32 Bacillus  amyloliquefaciens YP_001421557.1, YP_005130694.22
33 Methylophaga  aminisulfidivorans ZP_08535745.11
34 Glaciecola  sp.   YP_004432278.11
35 Thiorhodococcus  drewsii ZP_08825440.11

3. Result and Discussion

3.1. Cloning and Expression of Phytase

E. coli expression system is one of the simplest, cost-effective, and suitable systems for large scale production of recombinant proteins [27]. In the present study, we have used a soluble recombinant proteins expression system to express phytase from B. licheniformis PB-13. PCR amplification for the isolation of phytase gene resulted into an amplified PCR product of ~1,150 bp as observed after electrophoresis on 1% agarose gel. Appearance of single band on gel revealed specific amplification of phytase gene using end-specific primers. This good quality PCR product was taken for restriction digestion using HindIII and XhoI restriction enzymes. E. coli DH5α was transformed with recombinant vector (pET32a + PhyPB13 phytase gene). E. coli BL21 (DE3) was used as an expression host, as it encodes the T7 RNA polymerase under the control of lacUV5 promoter [28]. Transformation of plasmid from positive clone to E. coli BL21 competent cells followed by induction with IPTG for 4 h resulted in expression of recombinant phytase by SDS-PAGE as an intense band of ~66 kDa while no such band was observed in uninduced culture. The size of induced protein was consistent with the calculated value for the fusion protein (~63 kDa), which includes an additional peptide sequence of about 20 kDa (175 amino acids) along with encoded phytase sequence of 381 amino acids (theoretical molecular weight ~42 kDa). The additional sequence includes Trx-tag (109 amino acids; which increases solubility of expressed protein), S-tag (used in purification of recombinant proteins), His6-tag (role in purification), and linker sequence [28]. Despite the presence of this additional amino acid stretch, the recombinant phytase was found to be catalytically active. The recombinant phytase was designated as “rPhyPB13.” Transformed E. coli BL21 cells produced rPhyPB13 with an enzyme activity of 1.15 U/mL and specific activity of 0.92 U/mg proteins. It was comparable to wild type B. licheniformis PB-13 phytase in production media. B. licheniformis PB-13 produced 0.99 U/mL phytase in PSMWB media (phytase screening media supplemented with 10% wheat bran) with a specific activity of 0.70 U/mg proteins [20].

3.2. Sequencing and Characterization B. licheniformis PB-13 Phytase Gene Sequence

Sequencing of target insert from positive clone by automated DNA sequencer at Department of Biochemistry, University of Delhi (South Campus), New Delhi, resulted in a nucleotide sequence of 1,149 bp (GenBank accession number JX187608.1). Analysis of sequence using BlastN resulted into 99% identity of sequence with B. licheniformis phytase L precursor gene (GenBank accession number AF469936.1). The phylogenetic tree constructed using neighbor-joining method also showed similar classification. The nucleotide sequence was searched for open reading frame (ORF) using ORF Finder. Ten (10) ORFs of varying length starting from different frames were obtained. The largest sequence was present in frame +1 which corresponded to the true ORF for phytase gene as it was, which started from first nucleotide and ended with a stop codon. Also, it showed 99% similarity to phytase sequences present in GenBank database. This full length ORF designated as “PhyPB13” encoded a protein of 381 amino acid residues with a calculated molecular mass of 42.1 kDa. The nucleotide sequence along with translated protein sequence (GenBank accession number AFQ59979.1) using ExPASy translation tool contained a putative signal peptide of 29 amino acid residues starting from amino acid residue 1 to 29. A cleavage site was present between residues 29 and 30 (Figure 1). Wang et al. [29] isolated a phyC gene of 1,146 bp from B. licheniformis encoding a peptide of 381 amino acids. The length of signal peptide in phyC was 31 amino acids. A BPPhy gene with an ORF of 1,074 bp (357 amino acid residues) and a signal peptide of 27 amino acid residues was isolated from P. nyakensis [30]. The amino acid composition of PhyPB13 protein sequence determined using ProtParam server revealed that Asp, Gly, Lys, and Ala were major amino acids constituting about 36% of PhyPB13. Cysteine was not observed in the sequence indicating that PhyPB13 did not bear disulfide bonds, which were believed to be essential for conformational stability and catalytic activity in several fungal phytases [29, 31, 32]. It was consistent with absence of cysteine in phytase from B. licheniformis [29].
Figure 1

Translated protein sequence from PhyPB13 nucleotide sequence (1146 bp). Signal peptide sequence is present from amino acid residues 1–29 (sequence underlined); ↓ indicates cleavage site of signal peptide;  *asterisk indicates stop codon; conserved sequences are highlighted.

Alignment of homologous sequences with Mega5 revealed presence of two conserved motifs, namely, “D-A-[A/T/E]-D-D-P-A-[I/L/V]-W” (amino acids 51–59) and “N-N-[V/I]-D-[I/L/V]-R-[Y/D/Q]” (amino acids 98–104), in PhyPB13 and other homologous sequences (Figure 1). Similar motifs were reported in multiple sequence alignments of 66 BPPhy sequences by Huang et al. [30]. Like other Bacillus phytases, PhyPB13 did not show sequence homology with HAPhys. The conserved regions “RHGXRXP” and “HD” of HAPhys [33] were absent in PhyPB13. Functional domain analysis using pfam (http://www.sanger.ac.uk/resources/software/) showed that the complete sequence (residues 1–381) was encoding a phytase enzyme. The sequence (residues 34–378) belongs to a thermostable phytase (3-phytase) superfamily (ID 50956) as indicated by Superfam (http://supfam.cs.bris.ac.uk/SUPERFAMILY/hmm.html) analysis. This superfamily includes thermostable phytases such as phytase from B. amyloliquefaciens and the other Bacillus sp. with 6-bladed beta-propeller fold structure. A putative conserved domain of phytase superfamily has been detected while performing a BlastP (http://blast.ncbi.nlm.nih.gov/Blast.cgi?PAGE=Proteins) similarity search analysis of PhyPB13 protein sequence. Further, the sequence appeared to be 99% identical to phyL precursor from B. licheniformis (GenBank accession number AAM74021.1). Alignment of PhyPB13 with phyL precursor sequence revealed that the sequences were different at three positions (PhyPB13 contains Leu, Lys, and Asn in place of Lys, Asp, and Asp at 33rd, 67th, and 281st positions, resp.).

3.3. Prediction of Three-Dimensional Structure of PhyPB13

Analysis of suitable template for 3D structure model of PhyPB13 using Phyre2 server (http://www.sbg.bio.ic.ac.uk/phyre2/html/page.cgi?id=index) revealed B. amyloliquefaciens phytase (TS-Phy, PDB ID—1H6L) as the best template for 3D modeling based on number of aligned residues and quality of alignment, with a “confidence” score of 100% which indicated the probability that a match between PhyPB13 and TS-Phy was based on homology. A match with “confidence >90%” represents similar fold and high accuracy in the modeling of core protein. The identity between target sequence and template was ~68%, which revealed accuracy of model; as for extremely high accuracy models this number should be above 30–40% (http://www.sbg.bio.ic.ac.uk/phyre2/html/help.cgi?id=help). Tridimensional structure of TS-Phy was downloaded from PDB (PDB ID 1H6L) and its PDB ID was provided as template for 3D structure prediction of PhyPB13 protein sequence using SWISS-Model server. It features automated modeling of homooligomeric assemblies and modeling of essential metal ions and cofactors in protein structures [23, 24]. Small E-value in sequence identity indicates that the TS-Phy and rPhyPB13 have a very similar sequence and good reliability of the alignment. The model has a six-bladed-propeller folding architecture [10] and 7 calcium binding sites in protein sequence predicted by 3DLigandSite [34]. Oh et al. [35] reported that an electronegative central channel accessible to solvent binds seven Ca2+ ions and these Ca2+ ions have been found important in catalytic activity and substrate binding of BPPhy. Valine at 100th position was found to be a putative ligand binding site with 4 contacts as predicted by 3DLigandSite [34]. It is present inside of the conserved region of BPPhys (residues 98–104) and might play an important role in the binding of substrate for enzyme catalysis.

3.4. In Silico Analysis and Characterization of BPPhy

The accession numbers along with source organisms of 44 reference protein sequences of BPPhy are given in Table 1. The majority of the sequences were reported to be from bacterial species dominated by Bacillus and Paenibacillus species (17 sequences). Analysis of multiple sequence alignment revealed the presence of conserved regions throughout the sequences. In all sequences, a conserved site “[D/A][STA]DDPA[I/V]W[I/V/L]T[N/D/L]K” was observed toward their N-terminus, followed by one more sequence “NN[F/V]D[I/V/L].” Huang et al. [30] reported the presence of similar sequences “DA[A/T/E]DDPA[I/L/V]W” and “NN[V/I] D[I/L/V]R[Y/D/Q]” with minor differences (sequence information was not given) during analysis of several BPPhy sequences. In the present study, we have also observed the presence of highly conserved sequence “DG” towards its C-terminus. Aspartic acid at conserved C-terminal “DG” sequence in these BPPhy sequences might act as a proton donor to the oxygen atom of the scissile phosphomonoester bond and may play a role in catalytic mechanism of these enzymes. Similar role has been suggested for aspartic acid in conserved “HD” residues towards C-terminal in HAPhy sequences [36, 37]. Evolutionary relationship among different sequences was studied using phylogenetic tree constructed by neighbor-joining method (Figure 2). Overall, three major clusters were observed in phylogenetic tree. Cluster “1” represented sequences of Bacillus with Paenibacillus species. The amino acid residues in sequences of this cluster were 380 ± 10 except for three sequences from Paenibacillus sp., that is, Y_003868637.1, YP_004639353.1, and ZP_07387907.1 which have length of 465, 461, and 469 amino acid residues, respectively. Cluster “2” represents BPPhy with the largest protein sequence in the range of 436–769, while cluster “3” had the smallest sequence with 331 to 364 amino acid residues (Table 2).
Figure 2

Phylogenetic tree of PhyPB13 with BPPhy protein sequences constructed by Neighbor-Joining method.

Table 2

Biochemical characteristics of BPPhy protein sequences determined by ProtParam server.

S. numberAccession numberSource organismsNumber of amino acidsMolecular weightTheoretical pIInstability indexAliphatic index
1ZP_03969865.1 Sphingobacterium  spiritivorum 36240320.55.7430.3581.88
2ZP_07083876.1 Sphingobacterium  spiritivorum 36240216.45.7430.7981.35
3ZP_01312505.1 Desulfuromonas  acetoxidans 36439756.64.824.8483.08
4YP_004741572.1 Capnocytophaga  canimorsus 34338361.55.0228.9886.41
5YP_001959943.1 Chlorobium  phaeobacteroides 35639458.35.3441.0884.1
6YP_002014808.1 Prosthecochloris  aestuarii 35238123.95.0527.6785.65
7ZP_09672975.1 Myroides  odoratus 35539587.85.0433.983.58
8YP_004046143.1 Riemerella  anatipestifer 34738778.86.3428.883.4
9ZP_01734242.1 Flavobacteria  bacterium 35539803.36.4824.0589.48
10YP_001943170.1 Chlorobium  limicola 35238025.15.6226.488.86
11YP_004735798.1 Zobellia  galactanivorans 33837881.94.8927.2781.04
12ZP_07088398.1 Chryseobacterium  gleum 35039037.25.4628.9284.03
13YP_004261716.1 Cellulophaga  lytica 33937698.96.2425.5982.45
14ZP_09499218.1 Mesoflavibacter  zeaxanthinifaciens 33737423.44.8328.0780.68
15YP_003586972.1 Zunongwangia  profunda 33137122.44.629.7270.06
16YP_002374284.1 Cyanothece  sp.  PCC  8801 43646836.34.2223.7591.88
17AFQ59979.1 Bacillus  licheniformis  PB-13 38142131.54.7425.9469.95
18YP_004643897.1 Paenibacillus  mucilaginosus 39041788.14.2122.4286.85
19ZP_07387906.1 Paenibacillus  curdlanolyticus 37140205.94.121.381.75
20YP_003868637.1 Paenibacillus  polymyxa 46550676.94.9322.8181.83
21YP_004639353.1 Paenibacillus  mucilaginosus 46149436.74.3430.4283.45
22ZP_08507024.1 Paenibacillus  sp.  HGF7 46249590.44.9217.0782.19
23ZP_04154570.1 Bacillus  pseudomycoides 39042684.75.3418.578.26
24ZP_04160523.1 Bacillus  mycoides 39042698.75.3418.2878.51
25ZP_09566405.1 Singulisphaera  acidiphila 36639065.55.1932.1980.49
26ZP_07387907.1 Paenibacillus  curdlanolyticus 46951012.55.2222.0988.44
27YP_004877642.1 Bacillus  subtilis  subsp.  Spizizenii 38241965.45.1916.2774.55
28YP_090097.1 Bacillus  licheniformis  ATCC  14580 38142040.64.8126.1470.73
29ZP_06871959.1 Bacillus  subtilis 38241896.45.215.8983.72
30ZP_09771671.1 Paenibacillus  sp.  Aloe-11 46550835.15.1321.2982.04
31ZP_06588929.1 Streptomyces  roseosporus 43646575.84.2429.1776.31
32XP_002627863.1 Ajellomyces  dermatitidis 76881904.94.8136.4880.01
33ZP_04713225.1 Streptomyces  roseosporus 44247136.54.2429.9176.61
34YP_004255627.1 Deinococcus  proteolyticus 38140092.94.6135.1689.82
35NP_389861.1 Bacillus  subtilis  str.  168 38241946.45.120.2474.55
36ZP_08003013.1 Bacillus  sp.BT1B  CT2 38142245.84.8128.4671.23
37ZP_01694652.1 Microscilla  marina 39243056.25.0924.7675.61
38XP_002790172.1 Paracoccidioides  brasiliensis 76981961.45.6428.6585.18
39YP_003593415.1 Caulobacter  segnis 67370502.55.2526.8891.62
40YP_001421557.1 Bacillus  amyloliquefaciens 38341723.35.0223.771.91
41ZP_08535745.1 Methylophaga  aminisulfidivorans 640707165.0629.5890.81
42YP_004432278.1 Glaciecola  sp.65671676.74.7833.1396.33
43YP_005130694.1 Bacillus  amyloliquefaciens 38341812.35.0724.8769.87
44ZP_08825440.1 Thiorhodococcus  drewsii 76282173.34.2234.5588.82
Other biophysical features of all protein sequences are also given in Table 2. Molecular weight of sequences varied according to length of protein sequences in the range of 37–82 kDa. Isoelectric point (pI) was found between 4.1 and 6.4 with the majority of sequences having a pI value above 5. The pI values for the sequences were highest in cluster 2, followed by cluster 1 and 3, respectively. The instability index was used to measure in vivo half-life of a protein [38]. Analysis of instability index indicated uniformity among all sequences of BPPhy and was predicted to be below 40 for all sequences except phytase from C. phaeobacteroides (YP_001959943.1). Further, a majority of sequences have instability index less than 30, suggesting that these proteins exhibited good in vivo stability [38]. Aliphatic index of reported protein sequences ranged from 69 to 90, indicating the high thermostability of BPPhy enzymes. Aliphatic index of protein measures the relative volume occupied by aliphatic side chains of the amino acids: alanine, valine, leucine, and isoleucine. Globular proteins with high aliphatic index have high thermostability and an increase in aliphatic index suggests an increase in protein thermostability [39]. Superfam server based analysis of protein sequences revealed their similarity to thermostable phytase (3-phytase) superfamily (Table 3). This family represents phytases which are thermostable at high temperatures and have a distinct catalytic mechanism with removal of initial phosphorus from 3rd carbon of phytate ring. A total of 10 motifs with given parameters were reported by MEME analysis. The 29 amino acid residues long motif “1” “DDPAIWVHPHDPEKSRIIGTNKKSGLAVY” was observed in all 44 protein sequences, with a conserved sequence “DDPAIW[VI][HN]PK[DN]P[ESA]KS.” This sequence might be used for diversity and expression analysis of BPPhy enzymes. Functional domain analysis using BlastP search for this motif revealed that the sequence belongs to phytase superfamily (Table 4).
Table 3

Distribution of superfamily among BPPhy determined using superfam server.

SuperfamilyFamilyAccession number (range of amino acids residues)
Thermostable phytase(3-phytase)Thermostable phytase(3-phytase)YP_004767129.1 (35–378), AFQ59979.1 (PhyPB13) (34–375), YP_001421557.1 (31–379), YP_005130694.1 (31–379), ZP_08003013.1 (35–378), YP_004877642.1 (34–378), NP_389861.1 (35–378), ZP_06871959.1 (34–378), ZP_04154570.1 (44–383), ZP_04160523.1 (44–383), ZP_01694652.1 (56–392), YP_003868637.1 (120–461), ZP_09771671.1 (120–461), ZP_08507024.1 (118–457), ZP_07387906.1 (28–368), YP_004639353.1 (119–445), ZP_08535745.1 (59–281), YP_003593415.1 (31–339), ZP_07387907.1 (121–465), YP_004643897.1 (40–384), YP_004432278.1 (60–281), XP_002790172.1 (402–735), YP_002374284.1 (23–402), XP_002627863.1 (402–734), YP_004741572.1 (25–342), ZP_09566405.1 (28–354), YP_001959943.1 (28–352), YP_002014808.1 (32–350), ZP_01312505.1 (44–360), ZP_03969865.1 (49–355), ZP_07083876.1 (49–355), ZP_07088398.1 (36–337), ZP_01734242.1 (31–351), YP_004046143.1 (33–337), YP_001943170.1 (30–346), ZP_08825440.1 (409–761), ZP_09499218.1 (22–331), YP_003586972.1 (11–320), ZP_06588929.1 (21–284, 312–434), ZP_04713225.1 (27–290, 318–440), YP_004735798.1 (20–332), YP_004255627.1 (40–378), YP_004261716.1 (25–333), ZP_09672975.1 (26–345)
Table 4

Distribution of commonly observed motifs in different BPPhy protein sequences along with their functional domains.

Motifs Motif width Motif present in number of sequence Amino acid sequence Conserved region for degenerate primers Domain
1 29 44 DDPAIWVHPHDPEKSRIIGTNKKSGLAVY DDPAIW[VI][HN]PK[DN]P[ESA]KS Phytase superfamily
2 30 43 QIEGCVADDEYGYMYIAEEQHCIWKYYAEP [AV]DDE[YL]GY[LIV]Y Phytase superfamily
3 24 43 GYLMVSSQGNNSYAIYERQGNNRY GY[LI][IL][AV]SSQ Local conserved
4 30 33 IDGTSETDGIDVMGFGLGPKFPHGIFVAQD IDG[TV]S[DE][TS]DGIDV Local conserved
5 33 21 EVYGFCLYHSQKTGKFYAMVTGKEGEFEQYELF EVYGFSLYHS[QL]KTGK[FY]YA[LM]V[TL]GKEGEFEQYELF Local conserved
6 16 44 RMNNVDVRYGFPLNGK NNVD[VLI]RY[GD]F Local conserved
7 21 44 FDGEHFTADHEGLTIYYGPDG GEH[LF]TAD[IV]EG[LI]Local conserved
8 29 22 GENMDHGQKVNQNFKMVPWERIAQHFPRP K[AV]NQNFK[IM]V Local conserved
9 15 43 KIDIAAATNRSTNKI K[VIT]D[IL]A[AV][AV][TS][NE]RST[NG][KT][ILV]Local conserved
10 15 42 GQITGKLVREFKMWS G[KQ][VI]T[GA][KT][LK]VR[EK]F[KG]Local conserved

4. Conclusion

In conclusion, a β-propeller phytase of 3-phytase family from B. licheniformis strain PB-13 was successfully expressed in E. coli BL21. Phylogenetic clustering, conserved motifs sequences, and variation among biochemical features of different BPPhy phytases in this study could be key information for screening of novel phytases and comparison with other classes of phytases, which might contribute in further classification and application of diverse BPPhys. Functional characterization of amino acid residues in conserved regions of BPPhy is required for determining their role in enzyme catalysis. Overall, this in silico analysis will be important for future genetic engineering of this most diverse and important class of phytase.
  32 in total

1.  Microbial genes in the human genome: lateral transfer or gene loss?

Authors:  S L Salzberg; O White; J Peterson; J A Eisen
Journal:  Science       Date:  2001-05-17       Impact factor: 47.728

2.  In silico characterization of alkaline proteases from different species of Aspergillus.

Authors:  Vivek Kumar Morya; Sangeeta Yadav; Eun-Ki Kim; Dinesh Yadav
Journal:  Appl Biochem Biotechnol       Date:  2011-11-10       Impact factor: 2.926

3.  Crystal structures of a novel, thermostable phytase in partially and fully calcium-loaded states.

Authors:  N C Ha; B C Oh; S Shin; H J Kim; T K Oh; Y O Kim; K Y Choi; B H Oh
Journal:  Nat Struct Biol       Date:  2000-02

4.  The SWISS-MODEL workspace: a web-based environment for protein structure homology modelling.

Authors:  Konstantin Arnold; Lorenza Bordoli; Jürgen Kopp; Torsten Schwede
Journal:  Bioinformatics       Date:  2005-11-13       Impact factor: 6.937

5.  Age-related influence of a cocktail of xylanase, amylase, and protease or phytase individually or in combination in broilers.

Authors:  O A Olukosi; A J Cowieson; O Adeola
Journal:  Poult Sci       Date:  2007-01       Impact factor: 3.352

6.  The neighbor-joining method: a new method for reconstructing phylogenetic trees.

Authors:  N Saitou; M Nei
Journal:  Mol Biol Evol       Date:  1987-07       Impact factor: 16.240

7.  Thermostability and aliphatic index of globular proteins.

Authors:  A Ikai
Journal:  J Biochem       Date:  1980-12       Impact factor: 3.387

8.  Influence of disulfide bonds on the conformational changes and activities of refolded phytase.

Authors:  G Y Song; X Y Wang; M Wang
Journal:  Protein Pept Lett       Date:  2005-08       Impact factor: 1.890

9.  3DLigandSite: predicting ligand-binding sites using similar structures.

Authors:  Mark N Wass; Lawrence A Kelley; Michael J E Sternberg
Journal:  Nucleic Acids Res       Date:  2010-05-31       Impact factor: 16.971

10.  Biochemical characterization of fungal phytases (myo-inositol hexakisphosphate phosphohydrolases): catalytic properties.

Authors:  M Wyss; R Brugger; A Kronenberger; R Rémy; R Fimbel; G Oesterhelt; M Lehmann; A P van Loon
Journal:  Appl Environ Microbiol       Date:  1999-02       Impact factor: 4.792

View more
  4 in total

1.  Comparative Analysis of Peripheral Alkaline Phytase Protein Structures Expressed in E. coli.

Authors:  Mohammadreza Nassiri; Hamid Ariannejad
Journal:  Rep Biochem Mol Biol       Date:  2015-10

2.  Computational-based structural, functional and phylogenetic analysis of Enterobacter phytases.

Authors:  Krishnendu Pramanik; Shreyasi Kundu; Sandipan Banerjee; Pallab Kumar Ghosh; Tushar Kanti Maiti
Journal:  3 Biotech       Date:  2018-05-19       Impact factor: 2.406

Review 3.  Research status of Bacillus phytase.

Authors:  Ting Zhao; Xihao Yong; Ziming Zhao; Vincenza Dolce; Yuan Li; Rosita Curcio
Journal:  3 Biotech       Date:  2021-08-19       Impact factor: 2.893

Review 4.  Characterization of the Catalytic Structure of Plant Phytase, Protein Tyrosine Phosphatase-Like Phytase, and Histidine Acid Phytases and Their Biotechnological Applications.

Authors:  Alex Sander Rodrigues Cangussu; Deborah Aires Almeida; Raimundo Wagner de Souza Aguiar; Sidnei Emilio Bordignon-Junior; Kelvinson Fernandes Viana; Luiz Carlos Bertucci Barbosa; Edson Wagner da Silva Cangussu; Igor Viana Brandi; Augustus Caeser Franke Portella; Gil Rodrigues Dos Santos; Eliane Macedo Sobrinho; William James Nogueira Lima
Journal:  Enzyme Res       Date:  2018-03-11
  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.