Literature DB >> 33694107

Structure and dynamics analysis of multi-domain putative β-1,4-glucosidase of family 3 glycoside hydrolase (PsGH3) from Pseudopedobacter saltans.

Priyanka Nath1, Arun Goyal2.   

Abstract

Structure and conformational behaviour of a putative β-1,4-glucosidase of glycoside hydrolase family 3 (PsGH3) from Pseudopedobacter saltans was predicted by using in-silico tools. PsGH3 modeled structure constructed using Phyre2 displayed multidomain architecture comprising an N-terminal (β/α)8-fold domain followed by (α/β)6-sandwich domain, PA14 domain, and a C-terminal domain resembling an immunoglobulin fold. Ramachandran plot displayed 99.3% of amino acids in the allowed region and 0.7% residues in the disallowed region. Multiple sequence alignment (MSA) and structure superposition of PsGH3 with other homologues from GH3 family revealed the conserved residues, Asp274 and Glu624 present in loops LA and LB, respectively originating from N-terminal domain act as catalytic residues. The volume and area calculated for PsGH3 displayed a deep active-site conformation comparable with its homologues, β-1,4-glucosidases (GH3) of Kluyveromyces marxianus and Streptomyces venezuelae. Molecular dynamic (MD) simulation of PsGH3 structure for 80 ns suggested stable and compact structure. Molecular docking studies revealed deeper active site conformation of PsGH3 that could house larger cellooligosaccharides up to 7° of polymerization (DP7). The amino acid residues, Ala86, Leu88, Cys275, Pro483, Phe493, Asn417, Asn491, Pro492, and Leu495 created a binding pocket near the catalytic cleft, crucial for ligand binding. MD simulation of PsGH3 in the presence of cellooligosaccharides, viz., cellobiose and celloheptaose showed stability in terms of RMSD, Rg, and SASA values till 80 ns. The calculation of average number of hydrogen bond (H-bond), interaction energy, and binding free energy confirmed the stronger binding affinity of the larger cellooligosaccharides such as celloheptaose in the binding cavity of PsGH3.

Entities:  

Keywords:  3D modeling; Family 3 glycoside hydrolase; Molecular docking; Pseudopedobacter saltans; β-1,4-glucosidase

Year:  2021        PMID: 33694107      PMCID: PMC7945971          DOI: 10.1007/s00894-021-04721-4

Source DB:  PubMed          Journal:  J Mol Model        ISSN: 0948-5023            Impact factor:   1.810


Introduction

The extensive consumption of fossil fuels has caused the enhancement in the rate of global warming; therefore, the research and development in the field of renewable energy production have also increased. One of the forms of renewable energy is bioethanol. Bioethanol can be produced from plant-based lignocellulosic biomass. This lignocellulosic biomass is made up of cellulose, hemicellulose, and lignin. Cellulose is the most abundant polysaccharide present on earth and is composed of glucose units forming a parallel and consecutive linear chain [1]. Cellobiose, a disaccharide of glucose in the repeating unit, is present in cellulose [2]. The hydrolysis of cellobiose can be achived by β-1,4-glucosidase that gives two molecules of glucose [3]. The cellulose content in the plant biomass is more than 50%, and it is present in the crystalline fibre form which is resistant to hydrolysis [4]. The complete hydrolysis of cellulosic chain can be brought about by the combined action of endoglucanase, cellobiohydrolase, and β-glucosidase, collectively called as cellulases, converting cellulose polymer into cello-oligosaccharides, cellobiose, and glucose, respectively [5]. Cellulolytic enzymes belong to different glycoside hydrolase families (GH) (www.cazy.org). They act synergistically, in which endo-β-1,4-glucanase (EC 3.2.1.4) acts randomly on cellulosic chains and yields larger cellooligosaccharides as a hydrolysed products [6]. Cellobiohydrolase (EC 3.2.1.91) acts at the non-reducing end of the larger cellooligosaccharides and generates cellobiose as a primary product after the hydrolysis [7]. Finally, β-1,4-glucosidase (EC 3.2.1.21) hydrolyzes the released cellobiose moiety to form two glucose molecules [1]. Several studies suggested that cellobiose formed during the hydrolysis of lignocellulosic material for bioethanol production inhibits the cellulases during the enzymatic hydrolysis process [8-10]. Thus, the instant removal of cellobiose becomes essential for increasing the efficiency of the process. The commercial cellulolytic enzymes mixture lacks the sufficient β-1,4-glucosidase activity. Therefore, efforts were made for the production of enzyme cocktails rich in β-1,4-glucosidase activity [11, 12]. The presence of adequate β-1,4-glucosidase activity in the enzyme cocktail displayed an increase in hydrolysis performance, resulting in a 20–40% increase in total bioethanol production [13]. Therefore, the identification, production, functional, and structural characterisation of an efficient β-1-4-glucosidase, from a bacterial source, Thermoanaerobacterium aotearoense, was carried out [14]. The enzyme, β-1,4-glucosidase belongs to GH families 1, 3, 5, 9, 30, and 116 (www.cazy.org) [15-17]. GH3 is one of the largest families in the CAZy database [18]. It contains around 44 of β-glucosidases from bacterial, mold, and yeast origin [19]. Moreover, the GH3 family is known to display β-1,4-glucosidase, β-xylosidase, and exo-1,3-1,4-β glucanase activities [18]. The most studied fungal β-1,4-glucosidase belongs to GH3 family [19]. However, the structural data of GH3 enzymes are still limited. Therefore, for understanding, the functional aspects of family GH3 enzymes, their structural characterisation is required. In this study, computational approaches were used for modeling of β-1,4-glucosidase of family GH3 (PsGH3) from Pseudopedobacter saltans and understanding its structural characteristics. The stability and compactness of the developed modeled structure of PsGH3 was analyzed by Molecular Dynamics (MD) simulation. Furthermore, the binding interaction of PsGH3 with most probable ligands, the cellooligosaccharides, was studied by molecular docking approach for identification of crucial amino acid residues involved in substrate binding and catalysis. Furthermore, the cellooligosaccharides-bound PsGH3 complexes obtained from molecular docking analysis were subjected to protein-ligand complex MD simulation for analyzing the conformation and stability of PsGH3 in the presence of these ligands. Moreover, the affinity of cellooliogosaccharides towards PsGH3 was also determined and compared.

Materials and methods

Sequence and evolutionary investigation of PsGH3

The amino acid sequence of β-1,4-glucosidase (PsGH3) belonging to family 3 GH from P. saltans having Genbank ID-ADY51046 and UniProt ID F0S5X5 was fetched from the NCBI database (http://www.ncbi.nlm.nih). The preserved domain within the amino acid sequence of PsGH3 was explored by feeding the amino acid sequence of PsGH3 in the conserved domain database (http://www.ncbi.nlm.nih.gov/cdd/). The signal peptide existing in the amino acid sequence of PsGH3 was recognized by utilizing SignalP 4.1 server [20]. Furthermore, to determine the homologous sequences of PsGH3 from the PDB database, the amino acid sequence of PsGH3 was analyzed by using BLASTp tool [21]. The homologous sequences, thus obtained after BLASTp analysis, were further aligned by using Clustal Omega [22] and were envisioned by employing the web-based tool, ESPript 3 [23]. The evolutionary correlation between PsGH3 and its homologues was studied by the phylogenetic tree created in Mega 7 software by using the neighbor-joining method [24].

3-dDimensional model prediction and validation of PsGH3

The 3D model structure of PsGH3 was predicted by using Phyre2 web server (http://www.sbg.bio.ic.ac.uk/phyre2/html/page.cgi?id=index). Phyre 2 is web-based server used for the prediction of protein structure and function [25]. This online program functions statistically for the template-based modeling of proteins. The modeled structure of PsGH3 thus obtained was further energy minimized by using YASARA energy minimization tool (http://www.yasara.org/minimizationserver.php). The energy minimized modeled PsGH3 structure was analyzed by PyMol 2.0 software [26] and further validated by SAVES web server (http://servicesn.mbi.ucla.edu/SAVES/). The stereo-chemical properties of the energy-minimized PsGH3 structure were further explored by the PROCHECK [27]. An integrated web-based server, the ProSA (https://prosa.services.came.sbg.ac.at/prosa.php), was used to analyze the deviation of statistical Z-score for the modeled PsGH3 structure from the high-resolution PDB deposited structures. The ERRAT plot [28] and VERIFY-3D [29] were further used to validate the energy-minimized modeled PsGH3 structure to analyze the compatibility of the atomic model with the amino acid sequence. The illustration of the topology for PsGH3 structure was generated by employing PDBSum (http://www.ebi.ac.uk/thornton-srv/databases/pdbsum/Generate.html). The conserved active site amino acid residues of PsGH3 involved in catalysis were recognized by the superposition of the energy-minimized PsGH3-modeled structure with its template structure of β-1,4-glucosidase from Kluyveromyces marxianus (PDB ID: 3ABZ) by using PyMOL 2.0 [26]. β-1,4-glucosidase from Kluyveromyces marxianus (PDB ID: 3ABZ) was employed for structure superposition based on structural comparison obtained from DALI server.

Molecular dynamics simulation of PsGH3 modeled structure

The molecular dynamics simulation of modeled PsGH3 structure was performed by employing Gromacs v 5.14 [30] equipped in a supercomputing facility (Param-Ishan), available at Indian Institute of Technology Guwahati, Assam, India. The GROMOS9653a6 FF force field [31] was utilized for the calculation of the protein forces. The PsGH3 modeled structure was positioned inside a box with a cubic shape of volume 1405.23 nm3 and dimensions 6.43 × 7.12 × 8.97. The system was then solvated with a simple point charges water model (SPC) to maintain the equilibrium. The MD simulation contained 42,563 water molecules, in which 6 Cl− counter ions were added to neutralize the charges present on PsGH3. Solvated systems were then minimized at a maximum force of 1000.0 kJ/mol/nm by using 50,000 steps cut-off. Furthermore, the system was equilibrated in NVT (number of atoms, volume, and temperature of the system) ensemble at 300 K over 500 ps at constant volume and temperature, by coupling with a modified Berendsen thermostat with a time constant of 0.1 ps [30]. Additionally, another 500 ps NPT (normal pressure and temperature) equilibration was performed at constant temperature (300 K) and pressure (1 atm) by coupling with modified Berendsen thermostat with a time constant of 0.1 ps and Parrinello-Rahman barostat [32] with a time constant of 2 ps, respectively. Additionally, all bonds were constrained via LINCS algorithm [33]. Finally, the 80-ns production run was setup at a constant temperature of 300 K by using modified Berendsen thermostat and pressure 1 atm by using Parrinello-Rahman barostat with a time step of 2 fs, The LINCS algorithm was used to constrain the lengths of the hydrogen atom containing bonds. Additionally, the PsGH3 modeled structure was analyzed throughout the process of simulation, as a time-dependent function to validate its stability in the solvent system. After 80 ns of simulation, the root mean square deviation (RMSD) and root-mean-square fluctuation (RMSF) of the simulated system was analyzed by using gmx rms and gmx rmsf programs, respectively. The radius of gyration (Rg), intramolecular hydrogen bonds (H-bond), and solvent accessible surface area (SASA) were determined by using the program gmx gyrate, gmx h-bond, and gmx sasa, respectively. The simulated structure after 80 ns was extracted and visualized by ChimeraX [34].

The active site investigation of PsGH3

The existence of the accessible pockets for ligand binding on the surface of PsGH3 was determined by CASTp, a web-based server (http://cast.engr.uic.edu) [35]. The area and volume of those accessible pockets for ligand binding present on the surface of PsGH3 were calculated by employing a probe radius with a default value of 1.4 Å, by using the molecular surface model (Connolly’s surface) method and accessible surface model (Richards surface) as described previously [36]. Additionally, the dissemination of electrostatic potential over the substrate binding pocket and surface of PsGH3 structure was calculated by using a tool known as Adaptive Poisson-Boltzmann Solver (APBS) equipped PyMOL 2.0 software [26]. The structures were then further envisaged and examined by PyMOL 2.0 software [26].

Ligand-binding analysis of PsGH3 by molecular docking approach

The molecular docking of PsGH3 with the most probable ligands (cellooligosaccharides) was executed in a web-based server, SwissDock, (http://www.swissdock.ch/docking). In the initial step, the PsGH3 structure was prepared for docking, by employing the Dock Prep tool available in UCSF Chimera 1.10.1 [37]. In this step, the hydrogen atoms were added, and partial charges were allocated by using AMBER99 FF force field. Moreover, the cellooligosaccharides were designed and downloaded from the GLYCAM server [38]. The docking analysis of PsGH3 modeled structure with cellooligosaccharides in the SwissDock was executed by submitting the protein in PDB and ligand in Mol2 file format in the server. The SwissDock utilizes the EADock DSS software in which after accomplishment of each docking run, the output clusters are attained and ranked by a fixed SwissDock algorithm [39, 40]. This algorithm ranks the cluster based on scoring function known as FullFitness (FF). Moreover, the distinct conformer obtained from each cluster was further arranged according to the FF score. A more favorable binding mode has a higher negative FF score and hence, a better fit. The UCSF-Chimera 1.10.1 was used for analyzing the results obtained from SwissDock [37]. Furthermore, different docking poses were screened based on binding free energy value (deltaG, kcal/mol). The best docked protein-ligand complex displaying the highest negative binding free energy was considered and was further selected. The selected ligand-bound protein structure of PsGH3 was extracted using ChimeraX [34] and envisioned in PyMol 2.0 [26]. The illustration of amino acid residues interacting with ligand in the protein-ligand complex was created by using the Ligplot software [41].

Molecular dynamics of protein-ligand complexes

The two docked complexes (PsGH3 + cellobiose and PsGH3 + celloheptaose) with the highest negative binding free energy obtained from the molecular docking study were further subjected to MD simulation to understand the stability and conformational changes of the protein in the presence of ligands. The MD simulations were performed by using GROMACSv 5.14 [30], while the protein forces were calculated by GROMOS96 43A1 FF force field [31]. The ligand topologies for cellobiose and celloheptaose were generated by using PRODRG server [42]. The two MD simulations systems were solvated with a simple point charges water model (SPC) with a dodecahedron box configuration with a volume 555 nm3. The MD simulation system, PsGH3 + cellobiose contained 33,671 water molecules, in which 8 Cl− counter ions were added to neutralize the charges while PsGH3 + celloheptaose contained 33,647 water molecules and 11 Cl− counter ions. The MD systems were further subjected to energy minimization with the steepest descent algorithm for the removal of any steric clashes and bad contacts and at maximum force of 1000.0 kJ/mol/nm by using 50,000 steps cutoff. After energy minimization, the equilibration with position restraint was carried out under NVT at 300 K over 500 ps at constant volume and temperature, by coupling with a modified Berendsen thermostat with a time constant of 0.1 ps. Additionally, another 500 ps NPT equilibration was performed by using modified Berendsen thermostat coupling (300 K) and isotropic Brendsen pressure coupling (1 atm) [30]. For both equilibration steps, the LINCS algorithm was used to constrain the lengths of the hydrogen-containing bonds [33]. Finally, the 80- ns production run at a constant temperature of 300 K by coupling with a modified Berendsen thermostat and pressure 1 atm with a time step of 2 fs employing Parrinello-Rahman for isotropic pressure coupling [32] using LINCS constraints for H-bonds were carried out in a supercomputing facility described in the previous section (Molecular dynamics simulation of PsGH3 modeled structure). Thereafter, the stability of the protein-ligand complexes during the course of simulation was analysedanalyzed by using various parameters by employing in-built “gmx” commands of GROMACS. The plots for root-mean-square deviation (RMSD), root mean square fluctuation (RMSF), radius of gyration (Rg), number of hydrogen bonds (H-bond), and solvent accessible surface area (SASA) of protein-ligand complexes upon simulation were generated using GraphPad prism 9.0.0. Moreover, the average energies comprising of short-range interactions between protein and the ligand represented in the terms of Lennard Jones and coulombic interaction energy were also analyzed using gmx energy command. Post MD simulations, the protein-ligand complexes were further subjected to the total binding free energy, potential energy (electrostatic and Van der Waals interactions), and solvation free energy (polar and non-polar solvation energies) calculations by using the Molecular Mechanics Poisson-Boltzmann Surface Area (MM-PBSA) method by using g_mmpba tool [43].

Calculation of binding free energy using Poisson-Bolzmann surface area (MM-PBSA)

The Poisson-Boltzmann or generalized Born and surface area continuum solvation (MM-PBSA or MM-GBSA) is a method for the calculation of the ligand affinity for proteins [44]. The binding free energy of the protein-ligand complexes (PsGH3 + cellobiose and PsGH3 + celloheptaose), was calculated from their last stable 20 ns trajectories. The tool “g_mmpbsa” with default parameters [43] was utilized for potential energy and solvation free energy calculations.

Results and discussion

The blastp analysis of amino acid sequence of PsGH3 having GenBank ID-ADY51046 from Pseudopedobacter saltans exhibited the sequence identity for amino acid of PsGH3 with earlier biochemically characterized β-1,4-xylosidase, viz., Xyl3A (38.9%) from Trichoderma reesei and XlnD (37.8%) from Aspergillis nidulans, respectively (Table 1). However, PsGH3 also exhibited the sequence identity with earlier biochemically characterized β-1,4-glucosidase,viz., EmGH1 (34.4%) from Erythrobacter marinus, KmBglI (33.5%) from Kluyveromyces marxianus, DesR (33.2%) from Streptomyce venezuelae, and BlBG3 (30.9%) from Bifidobacterium longum, respectively (Table 1). Moreover, the SignalP 4.1 server inferred the presence of an N-terminal signal peptide sequence comprising 20 amino acids with a cleavage site between Ala19 and Gln20. The sequence analysis of PsGH3 by BLAST and conserved domain database displayed that the N-terminal domain of PsGH3 comprises 375 amino acids for β-glucosidase (BG1X) and 434–747 amino acid for Glyco Hydro 3_C superfamily domain (GH3-C domain) that also contained an inserted PA14 domain between amino acids, 460–581 (Fig. 1a). The PA14 domain was named after its location that was identified in the protective antigen of anthrax toxin [45]. The PA14 domain has a binding function rather than a catalytic role though it is shown to be located in the catalytic core [18]. The rest of the amino acids 747–854 constituted the C-terminal domain. The amino acid residues position 20–62 and 375–434 were not assigned in the conserved domains search for PsGH3 sequence; therefore, they were considered as unknown (Fig. 1a). This distinct domain architecture was also reported for β-1,4-glucosidase (GH3) structure from K. marxianus [18].
Table 1

BLASTp investigation of PsGH3 amino acid residue sequence against PDB database

Name of organismName of enzymeQuery cover (%)Identity (%)E-valueTotal scoreResolution (Å)PDB ID
T. reeseiXyl3A8838.98e-441581.85A7M
A. nidulansXlnD8937.81e-431581.46Q7I
B. longumBlBG39130.91e-271122.35Z9S
S. venezuelaeDesR8733.22e-271121.44I3G
E. marinusEm GH18034.46e-271102.35Z87
K. marxianusKmBglI8433.52e-261082.53 AC0
Fig. 1

a Molecular architecture of PsGH3 amino acid sequence. b Phylogenetic analysis for elucidating the evolutionary correlation between PsGH3 and its homologues by utilizing the neighbor-joining method by using MEGA 7

BLASTp investigation of PsGH3 amino acid residue sequence against PDB database a Molecular architecture of PsGH3 amino acid sequence. b Phylogenetic analysis for elucidating the evolutionary correlation between PsGH3 and its homologues by utilizing the neighbor-joining method by using MEGA 7 The phylogenetic tree analysis gave the relatedness of PsGH3 with other homologous proteins from different organisms. The PsGH3 sequence was closely related to an exo- β-1-4-xylosidase of GH3 family from A. nidulans, whereas the GH3 enzyme from Bacteroides intestinalis was found distantly related to PsGH3 (Fig. 1b). Furthermore, the alignment of sequence of PsGH3 with T. reesei (PDB ID-5A7M), A. nidulans (PDB ID-6Q7I), B. longum (PDB ID-5Z9S), S. venezuelae (PDB ID- 4I3G), E. marinus (PDB ID-5Z87), and K. marxianus (PDB ID-3 AC0) by using MSA, suggested that Asp274 and Glu624 are the catalytic amino acid residues, while the residues Lys193, His194, and Tyr242 might be involved in binding and were found conserved in family 3 GH proteins (Fig. 2).
Fig. 2

Multiple sequence alignment of PsGH3 with its homologous proteins, viz., Xyl3A (PDB ID-5AE6), XlnD (PDB-ID 6Q7I), EmGH1 (PDB ID 5Z87), KmBglI (PDB ID 3 AC0), , and BlBG3 (PDB ID-5Z9S) as also described in Table 1. In MSA, the conserved amino acid residues are shown in black with the red background and semi-conserved amino acid residues are depicted in red with white background

Multiple sequence alignment of PsGH3 with its homologous proteins, viz., Xyl3A (PDB ID-5AE6), XlnD (PDB-ID 6Q7I), EmGH1 (PDB ID 5Z87), KmBglI (PDB ID 3 AC0), , and BlBG3 (PDB ID-5Z9S) as also described in Table 1. In MSA, the conserved amino acid residues are shown in black with the red background and semi-conserved amino acid residues are depicted in red with white background

3-Dimensional structure modeling of PsGH3 and structural analysis

The 3-dimensional modeled structure of PsGH3 protein was created by Phyre2 by employing multiple templates, of which the β-1,4-xylosidase (GH3) structure of Hypocrea jecorina showed 100% confidence with a percentage identity of 40%, while β-1,4-glucosidase (GH3) structures of K. marxianus (PDB ID: 3 AC0) and S. venezuelae (PDB ID-4I3G) showed 100% confidence with percentage identity of 38% and 34%, respectively. The PsGH3 modeled structure generated was energy minimized by the energy minimization tool, YASARA. The energy of PsGH3 modeled structure, initially was 1.5 × 105 kJmol−1, which after minimization, showed lower energy, − 4.96 × 105 kJmol−1. Thus, the energy-minimized PsGH3 structure was obtained from the server and used for further structure characterization. The modeled structure of PsGH3 was composed of four distinct domains, viz., an N-terminal (β/α)8-fold-like domain 1 (grey) comprising 357 amino acids from (21–378), an N- terminal (α/β)6-sandwich domain 2 (yellow) comprising 341 amino acids from (382–723), the domain 1 and domain 2 were connected by a short linker C1, comprising amino acids Ser 379, Ile380, and Thr 381 (Fig. 3a). In a previous study, it was reported that the two N-terminal domains of β-1,4-glucosidase (GH3) from Hypocrea jecorina were connected by a 16 residue linker [46]. In domain 2 of PsGH3, a PA14 i.e. domain 3 (blue) was found to be inserted between the α helix D2α4 and D2α6 by linkers LC and LD, respectively (Fig. 3a). The domain 3 (PA14) comprising 133 amino acids from (459–592) have a jelly-roll fold with 10 antiparallel β-strands forming a two-layered β-sheet (Fig. 3a). In an earlier study, a similar fold of PA14 domain was reported to be inserted between a β-strand and an α-helix of domain 2 in the β-glucosidase (GH3) structure of K. marxianus [18]. The N-terminal domain was followed by a C-terminal domain 4 (green) (Fig. 3a). The C-terminal domain 4 comprised 124 amino acids, (735–859). The C-terminal domain 4 was linked to domain 2 through a linker C2 comprising, 12 amino acids (722–734) (Fig. 3a). The domain 4, forms 9 β-strands resembling an immunoglobin fold. A similar C-terminal domain exhibiting immunoglobin fold was reported earlier in β-1,4-glucosidase KmBglI and DesR structures of K. marxianus [18] and S. venezuelae [47], respectively. The overall topology diagram of multi-domain PsGH3 is shown in Fig. 3c. The overall fold of PsGH3 strongly resembles the KmBglI, β-1,4-glucosidase (GH3) structure from K. marxianus [18]. The structure comparison using DALI server also demonstrated that PsGH3 is closely related to β-1,4-glucosidase (GH3) structures from K. marxianus (PDB ID: 3 AC0) and S. venezuelae (PDB ID- 4I3G) with RMSD values of 0.3 Å and 1.1 Å, respectively. The PsGH3 active site yields a deep pocket conformation (Fig. 3b). A similar pattern of the active site was previously reported in β-1,4-glucosidase (GH3) structure from K. marxianus [18] and H. jecorina [46]. In PsGH3, near the active-site, the loops LA and LB, originating from domains 1 and 2, respectively are present (Fig. 3a, b). The loop LA comprises 14 amino acids (274–288), and LB is comparatively longer, comprising 22 amino acids (624–646). The longer active-site loop LB, where the catalytic acid/base residue Glu624 is present, causes the formation of a deeper active-site pocket in PsGH3 structure. Similarly, the deeper conformation of active site was reported due to longer and structurally flexible loops near the active site in β-1-4-glucosidase (GH3) structure of H. jecorina [46] and B. longum [48]. The superposition of PsGH3 structure with its homologue, β-1,4-glucosidase (PDB ID-3 AC0) structure from K. marxianus revealed that the key active-site residues are spatially aligned and found to be conserved (Fig. 3d, e). Asp274 and Glu624 are the nucleophile and acid/base catalytic amino acid residues, respectively (Fig. 3e), which are also conserved in family 3 GH proteins. The hydrolytic mechanism for catalysis (Retaining or Inverting) can be speculated by measuring the distance between the carboxyl group of catalytic residues through in silico process as described earlier [49]. In an earlier study, these mechanisms for hydrolysis were experimentally determined, which suggested that the average distances for inverting and retaining mechanism should be 10.5 Å (± 2 Å) and 5.5 Å, respectively [50]. The distance between the carboxyl group Asp274 (nucleophile) and Glu624 (acid/base) for PsGH3 determined by in silico approach was 5 Å (Fig. 3f). These results suggested the retaining type of hydrolytic mechanism for PsGH3. Additionally, the residues Lys193, His194, and Tyr242, in the substrate binding site were found highly conserved between PsGH3 and its homologous protein structure of β-1,4-glucosidase (PDB ID- 3 AC0) from K. marxianus (Fig. 3e). These residues were found conserved also in MSA analysis as described earlier in the previous section.
Fig. 3

3-Dimensional modeled structure analysis of PsGH3. a Cartoon view showing the different domains. b Surface view highlighting the catalytic site for binding. c Illustration topology diagram of PsGH3 modeled structure demonstrating the positioning of the secondary structures (cylinders signify α-helix and thick arrows signify the β-strand). d Superposition of PsGH3 structure (Cyan) with homologous GH3 structure (PDB ID: 3 AC0) from K. marxianus (Green). e The superposed binding cavity of PsGH3 structure with homologous GH3 structure (PDB ID: 3 AC0) from K. marxianus, the conserved active site shown in red, while sub-site residues are shown in blue for PsGH3 and f distance between acid/base and nucleophile residue

3-Dimensional modeled structure analysis of PsGH3. a Cartoon view showing the different domains. b Surface view highlighting the catalytic site for binding. c Illustration topology diagram of PsGH3 modeled structure demonstrating the positioning of the secondary structures (cylinders signify α-helix and thick arrows signify the β-strand). d Superposition of PsGH3 structure (Cyan) with homologous GH3 structure (PDB ID: 3 AC0) from K. marxianus (Green). e The superposed binding cavity of PsGH3 structure with homologous GH3 structure (PDB ID: 3 AC0) from K. marxianus, the conserved active site shown in red, while sub-site residues are shown in blue for PsGH3 and f distance between acid/base and nucleophile residue

Quality assessment of PsGH3

The energy minimized modeled PsGH3 structure was validated by the Ramachandran plot through PROCHECK server. The modeled PsGH3 structure showed 83.1% of amino acid residues in the most favored region, 14.1% in the additionally allowed region, 2.1% in the generously allowed region, and only 0.7% amino acid residues (Asp186, Ala445, and Asp589) in the disallowed region, out of non-proline and non-glycine residues (Fig. 4a). This result indicated that the PsGH3 modeled structure follows the backbone phi (ϕ) and psi (ψ) dihedral angles of the Ramachandran plot. ProSA results also showed that the PsGH3 modeled structure is accurate having a Z-score of − 8.7 and belongs to the X-ray zone (Fig. 4b). VERIFY 3D result showed that 80.7% amino acid residues of the modeled PsGH3 structure have 3D-1D score ≥ 0.2 indicating the compatibility of amino acid residues with the modeled structure (Fig. 4c). The ERRAT plot of modeled PsGH3 structure suggested an overall quality factor of 82.8% establishing that the structure has an acceptable quality (Fig. 4d), since a good 3D model should have a quality factor > 50% [51].
Fig. 4

The structure validation and exploration of 3D modeled structure PsGH3. a The Ramachandran plot examination by employing PROCHECK in web-based UCLA server, SAVES. b ProSA plot displaying Z-score. c VERIFY 3D. d ERRAT plot

The structure validation and exploration of 3D modeled structure PsGH3. a The Ramachandran plot examination by employing PROCHECK in web-based UCLA server, SAVES. b ProSA plot displaying Z-score. c VERIFY 3D. d ERRAT plot

Molecular dynamics simulation of PsGH3

Molecular dynamics simulations of the modeled structure of PsGH3 was performed to analyze its conformational behaviour, stability, as well as compactness at the nanosecond level. The values for RMSD for the complete trajectory were calculated by gmx rms program to analyze the deviation from the primary structure. The RMSD profile depicted that at 50 ns, the RMSD stretched a plateau phase of value 0.5 nm and continued to be stable up to 80 ns with less oscillations of ∼ 0.01 nm (Fig. 5a). These less oscillations and stable RMSD depicted that the PsGH3 structure achieved stable conformation during MD simulation. The estimation of the overall compactness of PsGH3 throughout the MD simulation was assessed by utilizing the gmx gyrate program. These analyses indicated the fluctuation of radius of gyration (Rg), between 2.75 and 2.89 nm till 30 ns due to the flexibility; however, after 30 ns, the PsGH3 attained stability as well as compactness at 2.7 nm and continued to be stable till 80 ns (Fig. 5b). These observations demonstrated a stable conformation of the PsGH3 modeled structure. The structural variations of PsGH3 after MD simulation were analyzed by root mean square fluctuation (RMSF) by using the gmx rms program. RMSF computes the movement of a specific atom, or group of atoms, comparative to the reference structure used in a MD simulation. The higher values of RMSF were observed for the flexible amino acid residues with maximum α-carbon flexibility between all the residues in a protein molecule. However, lower values of RMSF were observed for the rigid amino acid residues with the lowest flexibility. In PsGH3, the starting amino acid residues of N-terminal and the end residues of C-terminal displayed higher flexibility (Fig. 5c). The higher fluctuation in RMSF was also found evident among residues, 379–391(Fig. 5c). In this particular region, the linker (C1) was involved in connecting the domain 1 and domain 2 of PsGH3 as described in the previous section (3D modeling of PsGH3 and structural analysis). This higher fluctuation in RMSF for the region demonstrated the flexibility of the linker connecting the two domains of PsGH3. In an earlier study, the flexibility of the linker connecting two domains was described in cellulase (Cel45) from Humicola insolens [52]. Similarly, the loops L3 (435–458) and L4 (592–594), where domain 3 (PA14) was inserted in domain 2, showed comparatively higher fluctuation in RMSF (Fig. 5c) suggesting flexibility of these loops. In an earlier study, the flexibility of loops extending between two domains was reported in a multidomain β-1,4-glucosidase 3B from Thermotoga Neapolitana [53]. The amino acid residues of PsGH3 residing within the active site core formed between the interface of domain 1 and domain 2 and existing in loops L1 (274–289) and L2 (624–634), showed less fluctuation (Fig. 5c). A previous study also proposed that the catalytic core is the principle factor for enzyme activity and stability [54]. Therefore, the lower fluctuation in the catalytic core suggested stability of the PsGH3 for catalytic function. The gmx sas programme was used to calculate the solvent accessible surface area. The average SASA of the PsGH3 calculated was 350 nm2 and it remained stable till 80 ns (Fig. 5d). This also proposed that the global contact of PsGH3 towards the solvent area remained unaffected. Similarly, this also proposed that the availability of the substrate remained unaffected in the catalytic core of PsGH3. Similar observation for SASA was made for the catalytic cores in modular chimeric enzyme [55]. The gmx hbond program was used to estimate the intramolecular hydrogen bond analysis of PsGH3. The analysis showed that 581 average intramolecular hydrogen bond of PsGH3 were formed till 80 ns (Fig. 5e). The hydrogen bonds thus formed further aided the PsGH3 structure in attaining its stable state. However, the superposition of PsGH3 initial structure and MD simulated PsGH3 structure using ChimeraX (Fig. 5f) displayed the deviation in the loop areas (Fig. 5g) and gave a RMSD of 0.2 Å. The MD simulated PsGH3structure was found stable and thus utilized for further analysis.
Fig. 5

Molecular dynamics simulation of PsGH3 modeled structure. a RMSD plot. b Radius of gyration plot. c RMSF plot. d SASA plot and e h-bond plot, f PsGH3 simulated structure (magenta). g Structure superposition of simulated PsGH3 (magenta) with modeled PsGH3 structure (cyan)

Molecular dynamics simulation of PsGH3 modeled structure. a RMSD plot. b Radius of gyration plot. c RMSF plot. d SASA plot and e h-bond plot, f PsGH3 simulated structure (magenta). g Structure superposition of simulated PsGH3 (magenta) with modeled PsGH3 structure (cyan)

Active site examination of PsGH3 structure

The active site pocket of PsGH3 was examined by computing the area and volume by CASTp server and was compared with its homologues, viz., β-1,4-glucosidase (GH3) structure from K. marxianus (PDB ID: 3 AC0) and S. venezuelae (PDB ID- 4I3G). The computed active-site solvent accessible mouth surface area and volume of PsGH3 were 3810.2 Å2 and 5352.1 Å3 (Fig. 6a), respectively. A similar analysis performed for the homologous β-1,4-glucosidase structures of 3 AC0 and 4I3G revealed the computed active site solvent accessible mouth surface area of 3899.8 Å2 and 2461.3 Å2, respectively (Fig. 6d, g) and the volume 4820 Å3 and 2489.9 Å3, respectively (Fig. 6d, g). These analyses confirmed that the active site structure of PsGH3 possesses deeper conformation similar to its homologous β-1,4-glucosidase (GH3) structures from K. marxianus [18] and S. venezuelae [47]. The charge distribution for PsGH3 and its homologous β-glucosidase (GH3) structures 3 AC0 and 4I3G structures displayed that their active site is a mixture of positively and negatively charged amino acid residues (Fig. 6b, e, h). These patterns of mix charges in active site were demonstrated for GH3 β-1,4-glucosidase from the thermophilic fungus Chaetomium thermophilum and was found beneficial for structural stability [56]. Furthermore, the active site analysis of PsGH3 showed the presence of some conserved (Arg154, Lys192, His193, and Tyr242) and semi-conserved (Trp140 and Tyr416) amino acid residues which along with active site residues Asp274 and Glu624 might be important in substrate binding and catalysis (Fig. 6c). Correspondingly, in the active site analysis of β-1,4-glucosidase (GH3) structure of K. marxianus (PDB ID: 3 AC0) conserved residues, such as Arg113, Lys146, His147, and Tyr193 (Fig. 6f) are present for substrate binding along with active site residues Asp225 and Glu590 as described earlier [18]. Moreover, in the active site analysis of β-1,4-glucosidase (GH3) structure of S. venezuelae (PDB ID: 4I3G) along with conserved active site residues, Asp273 and Glu579, the amino acid residues, such as Gly99, Pro150, His196 and Ala 198 (Fig. 6i) are present, which were earlier described to be involved in substrate binding [47].
Fig. 6

Active-site pocket analysis of PsGH3 and its closest homologues, viz., KmBglI (PDB ID 3 AC0) from K. marxianus and DesR (PDB ID-4I3G). Analysis of active-site volume by employing the CASTp web-based server. a PsGH3. d 3 AC0. g 4I3G. The calculation of electrostatic potential distribution for b PsGH3, e 3 AC0, h 4I3G, and the active-site organization of c PsGH3, f 3 AC0, and i 4I3G

Active-site pocket analysis of PsGH3 and its closest homologues, viz., KmBglI (PDB ID 3 AC0) from K. marxianus and DesR (PDB ID-4I3G). Analysis of active-site volume by employing the CASTp web-based server. a PsGH3. d 3 AC0. g 4I3G. The calculation of electrostatic potential distribution for b PsGH3, e 3 AC0, h 4I3G, and the active-site organization of c PsGH3, f 3 AC0, and i 4I3G The molecular docking of PsGH3 with different cellooligosaccharides was performed for analysis of different interactions involved between the ligands (cellooligosaccharides) and the amino acid residue present in the catalytic site of PsGH3. The docking analysis performed on the server SwissDock displayed more than 24 probable docking resolutions for PsGH3. The docking results of PsGH3 with cellooligosaccharides displayed information about different amino acid residues participating in protein-ligand interactions (Table 2). PsGH3 exhibited binding with various cellooligosaccharides, viz., glucose, cellobiose, cellotriose, cellotetraose, cellopentaose, cellohexaose, and celloheptaose with free energy of binding (ΔG) of − 7.26, − 7.51, − 9.15, − 9.51, − 10.74, − 11.62, and − 14.04 kcal/mol, respectively (Table 2). The molecular docking analysis of xylooligosaccharides with catalytic clefts of the PsGH3 was also performed but no significant changes were observed in binding free energy upon docking (data not shown). Therefore, the binding interaction studies of PsGH3 with most probable ligands, i.e., cellooligosaccharides was continued. The binding studies of cellooligosaccharides with PsGH3 revealed that the catalytic cleft of PsGH3 efficiently binds glucose and cellobiose and can also house cellooligosaccharides up to 7 degree of polymerization (DP7) (Table 2). The possible accommodation of larger cellooligosaccharides in the catalytic cleft of PsGH3 could be because of the deep and large size of the catalytic pocket as described in the earlier section of active site examination of PsGH3 structure. Similarly, in an earlier study, it was demonstrated that a β-1,4-glucosidase from family 3 glycoside hydrolase (MtBgl3b) of Myceliophthora thermophile could accommodate large cellooligosaccharides such as cellotetraose in its catalytic cleft due to its large catalytic pocket [57]. The fitting of glucose, cellobiose and celloheptaose inside the catalytic cleft of the modeled PsGH3 is shown in Fig. 7a, d g.
Table 2

Molecular docking analysis of PsGH3 with cellooligosaccharides

Cello-oligosacharidesFree energy of binding (kcal/mol)Amino acid involved in hydrogen bond formationAmino acid involved in hydrophobic bond formation
Glucose− 7.26Gly87, Gly418, Thr419, Met489Glu78, Ala84, Ala86, Tyr416, Asn417, Pro420, Pro492
Cellobiose− 7.51Gln25, Lys27, Pro420, Thr485, Met489Thr419, Ala421, Pro492
Cellotriose− 9.15Glu78, Gly87 Tyr416, Met489Gln25, Ala84, Arg85, Leu88, Ala86, Asn417, Thr419, Pro420, Pro492
Cellotetraose− 9.51Ala86, Thr419, Pro492, Met489Gly87, Leu88, Ala421, Pro483, Ala484,Thr485, Phe493
Cellopentaose− 10.74Asp60, Cys309, Asn491, Asp583Trp75, Trp76, Glu78, Cys275, Tyr416, Asn417,Val507, Leu509, Pro492 Gly582, Met626
Cellohexaose− 11.62Ala84, Arg85, Ala86, Gly87, Thr419, Arg479, Ala482, Thr485, Ala494, Met489, Pro492Trp75, Leu88, Pro420, Ser490, Phe493
Celloheptaose− 14.04Glu25, Ala84, Glu78, Gly87, Tyr416, Gly418, Thr419, Thr485, Arg479, Met489, and Ala494Ala86, Leu88, Cys275, Pro483, Phe493, Asn417, Leu495, Pro492, and Asn491
Fig. 7

Molecular docking analysis of PsGH3 with cellooligosaccharides, viz., glucose, cellobiose and celloheptaose. Surface view by using ChimeraX. a Glucose. d Cellobiose. g Celloheptaose. Active site showing ligand orientation. b Glucose, e Cellobiose. h Celloheptaose and two-dimensional schematic presentation. c Glucose. f Cellobiose, i Celloheptaose

Molecular docking analysis of PsGH3 with cellooligosaccharides Molecular docking analysis of PsGH3 with cellooligosaccharides, viz., glucose, cellobiose and celloheptaose. Surface view by using ChimeraX. a Glucose. d Cellobiose. g Celloheptaose. Active site showing ligand orientation. b Glucose, e Cellobiose. h Celloheptaose and two-dimensional schematic presentation. c Glucose. f Cellobiose, i Celloheptaose The docking interaction of PsGH3 showed that the amino acid residues, Gly87, Gly418, Thr419, and Met489 are involved in polar interactions or building hydrogen bonds with the ligand glucose, while, Glu78, Ala84, Ala86, Tyr416, Asn417, Pro420, and Pro492 are making hydrophobic non-polar interactions with glucose (Table 2, Fig. 7b, c). The molecular docking interaction of PsGH3 with cellobiose displayed the involvement of amino acid residues Gln25, Lys27, Pro420, Thr485, Met489 in hydrogen bond formation or polar interaction, while, Thr419, Ala421, Pro492 were involved in hydrophobic non-polar interactions (Table 2, Fig. 7e, f). The docking interaction of PsGH3 with celloheptaose revealed amino acid residues Glu25, Ala84, Glu78, Gly87, Tyr416, Gly418, Thr419, Thr485, Arg479, Met489, and Ala494, involved in hydrogen bond formation or polar interactions, while, Ala86, Leu88, Cys275, Pro483, Phe493, Asn417, Asn491, Pro492, and Leu495 are involved in hydrophobic non-polar interactions (Table 2, Fig. 7h, i). Thus, from this docking analysis, the deduced amino acid residues found near the catalytic cleft of PsGH3 could be involved in binding the ligands and therefore crucial for catalysis. After the MD simulation of protein-ligand complexes (PsGH3 + cellobiose and PsGH3 + celloheptaose), the stability and conformational changes of the protein-ligand complexes were compared with the only protein (PsGH3) without the ligands by analysing the parameters, RMSD, RMSF, Rg, H-bond, and SASA. The Fig. 8 depicts that all MD simulation systems including the only PsGH3 as well as the protein-ligand complexes attained stability after 20 ns and remained stable till 80 ns (Fig. 8a). The RMSD plot of PsGH3 + cellobiose and PsGH3 + celloheptaose after attaining stability at 20 ns remained stable till 80 ns, with an average RMSD of 0.56 nm and 0.72 nm, respectively (Fig. 8a). Similarly, the RMSD plot of only PsGH3 also showed stability at 80 ns with an average RMSD of 0.45 nm (Fig. 8a). A similar pattern of RMSD was obtained for native as well as the protein-ligand complexes of peptide deformylase from Xanthomonas oryzae upon MD simulation [58]. Therefore, from the RMSD plot, it was evident that the protein as well as the protein-ligand complexes reached equilibrium at 80 ns and thus could be considered for further analysis. The structural compactness of only the protein and the protein-ligand complexes during MD simulation were analyzed by Radius of gyration (Rg) as shown in Fig. 8b. The fluctuation of Rg values for both only PsGH3 and PsGH3 + cellobiose were in the between 2.75 and 2.89 nm till 30 ns; however, after 30 ns, both the systems attained stability as well as compactness at 2.7 nm and continued to be stable till 80 ns. The fluctuation in the Rg values (2.87–2.77 nm) for PsGH3 + celloheptaose was observed till 40 ns, while after 40 ns, it achieved stability and compactness at 2.75 nm and remained stable till 80 ns (Fig. 8b). The fluctuation in Rg can be due to the flexibility in the protein-ligand complex upon MD simulation. Similar observation was reported earlier in the Rg during protein-ligand complex simulation of a chimeric enzyme [55]. Therefore, the only protein, as well as the protein-ligand complexes, showed a relatively consistent and similar value of Rg confirming that all the three systems have stable conformation. Similarly, in an earlier study, the stability and compactness were observed for both native protein as well as protein-ligand complexes of peptide deformylase from Xanthomonas oryzae upon MD simulation [58]. The RMSF plot indicated that most of the residues in the only PsGH3 and protein-ligand complexes showed a similar pattern of fluctuations throughout the simulation (Fig. 8c). The amino acid residues of PsGH3 residing within the active site core existing in the loops L1 (274–289) and L2 (624–634), showed less fluctuation (0.2–0.3 nm) depicting rigidity and intactness of the binding cavity. Similarly, in a previous study, the rigidity and compactness of the binding site of ligand-protein complexes upon MD simulation were reported for mitogen-activated protein kinase 4 in Leishmania sp. [59]. The Solvent Accessible Surface Area (SASA) parameter computes the fraction of the protein surface accessible to the water solvent. Thus, the range of conformational changes that occurred in the protein during these interactions could be predicted by calculation of SASA [59]. Figure 8d depicts the plot of SASA during the course of MD simulation for only the protein and the protein-ligand complexes. The average value of SASA for only PsGH3 was 350 nm2, and it remained stable till 80 ns (Fig. 8d), while the average value calculated for the protein-ligand complexes, PsGH3 + cellobiose and PsGH3 + celloheptaose were 324 nm2 and 333 nm2, respectively. The protein-ligand complexes showed a slight decrease in SASA values which indicated that in the protein-ligand complexes, the surface of the protein exposed to the aqueous solvent is reduced due to the ligand binding. Similarly, in a previous study on the remdesivir-protein system, a decrease in SASA values for protein-ligand complexes was observed [60]. The number of hydrogen bonds (H-bond) analysis indicates the strength and specificity of ligand for binding to the active site of protein as mentioned earlier [59]. Figure 8e depicts an average number of hydrogen bonds formed between protein and the ligands. The analysis showed that an average of 4 hydrogen bonds between PsGH3 and cellobiose till 80 ns (Fig. 8e), while an increase in the average number of hydrogen bond to 11 was observed between PsGH3 and celloheptaose. Hence, the average number of hydrogen bonds between the protein and ligand indicated that both the ligands, cellobiose, and celloheptaose were bound to PsGH3. However, celloheptaose bound more strongly to the active site of PsGH3. Additionally, for further validation of affinity of ligands for PsGH3, the average short-range interaction energies were estimated for protein-ligand complexes.
Fig. 8

Molecular dynamics simulation analysis of PsGH3 and PsGH3-ligand complexes, viz., PsGH3 + cellobiose and PsGH3 + celloheptaose. a RMSD plot, b Rg plot, c RMSF plot, d SASA plot, and e H-bond plot

Molecular dynamics simulation analysis of PsGH3 and PsGH3-ligand complexes, viz., PsGH3 + cellobiose and PsGH3 + celloheptaose. a RMSD plot, b Rg plot, c RMSF plot, d SASA plot, and e H-bond plot The average short-range Lennard-Jones energy calculated for PsGH3 + cellobiose and PsGH3 + celloheptaose were − 115.4 kJ/mol and − 181 kJ/mol, respectively. Moreover, the average short-range coulombic energy calculated for PsGH3 + cellobiose and PsGH3 + celloheptaose were − 90 kJ/mol and − 678 kJ/mol, respectively. Thus, the interaction energies calculated for both protein-ligand complexes confirmed the molecular docking results, where it suggested that the catalytic cleft of PsGH3 could accommodate larger cellooligosaccharides and efficiently act upon them.

Binding energy of protein-ligand complexes using MM-PBSA

The binding free energy is the overall energy comprising the polar, nonpolar or SASA energy, and non-bonded interaction energies (electrostatic and Van der Waals interaction) and can be calculated for the protein-ligand complexes by employing the MM-PBSA method [44]. The calculated binding free energies are presented in Table 3. The protein-ligand complex, PsGH3 + cellobiose displayed binding free energy of − 227.4 ± 12.5 kJ/mol (Table 3), while PsGH3 + celloheptaose displayed much higher binding free energy of − 1347.4 ± 38.8 kJ/mol (Table 3). Hence, from these values, it was found that, though, both the cellooligosaccharides showed binding affinity for PsGH3, but celloheptaose showed excellent binding affinity in terms of binding free energy. Thus, the affinity of celloheptaose calculated by MM-PBSA corroborated with the results of molecular docking (Table 2) and interaction energy calculations, as described in previous sections.
Table 3

Energy parameters calculated by MM-PBSA method for PsGH3-ligand complexes

Protein-ligand complexesVan der Waals energy (kJ/mol)Electrostatic energy (kJ/mol)Polar solvation energy (kJ/mol)SASA energy (kJ/mol)Binding free energy (kJ/mol)
PsGH3 + cellobiose− 130.1 ± − 8.7− 205.5 ± 31.5122.1 ± 43.3− 13.9 ± 1.1− 227.4 ± 12.5
PsGH3 + celloheptaose− 235.7 ± 23.1− 1727.9 ± 71.1644.6 ± 31.6− 28.3 ± 1.7− 1347.4 ± 38.8
Energy parameters calculated by MM-PBSA method for PsGH3-ligand complexes

Conclusions

The three-dimensional modeled structure of a putative β-1,4-glucosidase of family GH3 (PsGH3) from Pseudopedobacter saltans was developed by computational modeling. PsGH3 modeled structure demonstrated multi-domain structure comprising an N-terminal (β/α)8-fold-like domain followed by an (α/β)6-sandwich domain, a PA14 domain, and a C-terminal domain resembling an immunoglobulin fold, which is a distinctive feature of the GH3 family. The modeled structure showed 99.3% of amino acid residues in the favoured region and only 0.7% amino acid residues in the disallowed region. MSA revealed that Asp274 and Glu624 are the nucleophiles and acid/base catalytic amino acid residues and are conserved across the family 3 glycoside hydrolase. The distance between the carboxyl group of catalytic residues was 5 Å, which indicated the retaining type of hydrolytic mechanism of PsGH3. MD simulation of the modeled PsGH3 structure for 80 ns showed stable conformation. The active-site analysis of PsGH3 displayed a mixture of positively and negatively charged amino acid residues present in the catalytic cleft which is made up of some conserved amino acid residues, viz., Arg154, Lys192, His193, along with Asp274 and Glu624 active site residues. The molecular docking analysis revealed that the catalytic cleft of PsGH3 could house larger cellooligosaccharides up to DP7, because of its large and deep size catalytic pocket. The binding interaction of PsGH3 after docking analysis revealed the putative amino residues Ala86, Leu88, Cys275, Pro483, Phe493, Asn417, Asn491, Pro492, and Leu495, near active-site might be involved in substrate binding. The MD simulation of PsGH3 in presence of cellobiose and celloheptaose displayed stable RMSD, Rg and SASA values till 80 ns. Thus the results projected towards the comfortability of the cellooligosaccharides in the binding cavity of PsGH3. The low fluctuation of RMSF in presence of cellobiose and celloheptaose in the binding cavity indicated preserved rigidity and compactness of the binding site residues in the protein-ligand complexes. The calculations of average number of hydrogen bonds (H-bond), interaction energy, and binding free energy confirmed the stronger affinity of PsGH3 towards larger cellooligosaccharides.
  39 in total

1.  Inhibition of the Trichoderma reesei cellulases by cellobiose is strongly dependent on the nature of the substrate.

Authors:  Marju Gruno; Priit Väljamäe; Göran Pettersson; Gunnar Johansson
Journal:  Biotechnol Bioeng       Date:  2004-06-05       Impact factor: 4.530

2.  Methods for the isolation of cellulose-degrading microorganisms.

Authors:  James E McDonald; David J Rooks; Alan J McCarthy
Journal:  Methods Enzymol       Date:  2012       Impact factor: 1.600

3.  Enzymatic kinetic of cellulose hydrolysis: inhibition by ethanol and cellobiose.

Authors:  Rui M F Bezerra; Albino A Dias
Journal:  Appl Biochem Biotechnol       Date:  2005-07       Impact factor: 2.926

Review 4.  The path forward for biofuels and biomaterials.

Authors:  Arthur J Ragauskas; Charlotte K Williams; Brian H Davison; George Britovsek; John Cairney; Charles A Eckert; William J Frederick; Jason P Hallett; David J Leak; Charles L Liotta; Jonathan R Mielenz; Richard Murphy; Richard Templer; Timothy Tschaplinski
Journal:  Science       Date:  2006-01-27       Impact factor: 47.728

5.  Structural organization and a standardized nomenclature for plant endo-1,4-beta-glucanases (cellulases) of glycosyl hydrolase family 9.

Authors:  Breeanna R Urbanowicz; Alan B Bennett; Elena Del Campillo; Carmen Catalá; Takahisa Hayashi; Bernard Henrissat; Herman Höfte; Simon J McQueen-Mason; Sara E Patterson; Oded Shoseyov; Tuula T Teeri; Jocelyn K C Rose
Journal:  Plant Physiol       Date:  2007-08       Impact factor: 8.340

6.  Enhanced enzymatic hydrolysis of lignocellulose by optimizing enzyme complexes.

Authors:  Mingjia Zhang; Rongxin Su; Wei Qi; Zhimin He
Journal:  Appl Biochem Biotechnol       Date:  2009-03-14       Impact factor: 2.926

7.  Structure and biochemical characterization of glucose tolerant β-1,4 glucosidase (HtBgl) of family 1 glycoside hydrolase from Hungateiclostridium thermocellum.

Authors:  Kedar Sharma; Abhijeet Thakur; Rajeev Kumar; Arun Goyal
Journal:  Carbohydr Res       Date:  2019-07-17       Impact factor: 2.104

8.  Development of bi-functional chimeric enzyme (CtGH1-L1-CtGH5-F194A) from endoglucanase (CtGH5) mutant F194A and β-1,4-glucosidase (CtGH1) from Clostridium thermocellum with enhanced activity and structural integrity.

Authors:  Priyanka Nath; Arun Dhillon; Krishan Kumar; Kedar Sharma; Sumitha Banu Jamaldheen; Vijayanand Suryakant Moholkar; Arun Goyal
Journal:  Bioresour Technol       Date:  2019-03-12       Impact factor: 9.642

9.  Identification of two functionally different classes of exocellulases.

Authors:  B K Barr; Y L Hsieh; B Ganem; D B Wilson
Journal:  Biochemistry       Date:  1996-01-16       Impact factor: 3.162

10.  Structure of the beta-glucosidase gene bglA of Clostridium thermocellum. Sequence analysis reveals a superfamily of cellulases and beta-glycosidases including human lactase/phlorizin hydrolase.

Authors:  F Gräbnitz; M Seiss; K P Rücknagel; W L Staudenbauer
Journal:  Eur J Biochem       Date:  1991-09-01
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.