| Literature DB >> 31992772 |
Kazi Zubaida Gulshan Ara1, Anna Månberger2, Marek Gabriško3, Javier A Linares-Pastén2, Andrius Jasilionis2, Ólafur H Friðjónsson4, Guðmundur Ó Hreggviðsson4,5, Štefan Janeček3,6, Eva Nordberg Karlsson7.
Abstract
The genome of Rhodothermus marinus DSM 4253 encodes six glycoside hydrolases (GH) classified under GH family 3 (GH3): RmBgl3A, RmBgl3B, RmBgl3C, RmXyl3A, RmXyl3B and RmNag3. The biochemical function, modelled 3D-structure, gene cluster and evolutionary relationships of each of these enzymes were studied. The six enzymes were clustered into three major evolutionary lineages of GH3: β-N-acetyl-glucosaminidases, β-1,4-glucosidases/β-xylosidases and macrolide β-glucosidases. The RmNag3 with additional β-lactamase domain clustered with the deepest rooted GH3-lineage of β-N-acetyl-glucosaminidases and was active on acetyl-chitooligosaccharides. RmBgl3B displayed β-1,4-glucosidase activity and was the only representative of the lineage clustered with macrolide β-glucosidases from Actinomycetes. The β-xylosidases, RmXyl3A and RmXyl3B, and the β-glucosidases RmBgl3A and RmBgl3C clustered within the major β-glucosidases/β-xylosidases evolutionary lineage. RmXyl3A and RmXyl3B showed β-xylosidase activity with different specificities for para-nitrophenyl (pNP)-linked substrates and xylooligosaccharides. RmBgl3A displayed β-1,4-glucosidase/β-xylosidase activity while RmBgl3C was active on pNP-β-Glc and β-1,3-1,4-linked glucosyl disaccharides. Putative polysaccharide utilization gene clusters were also investigated for both R. marinus DSM 4253 and DSM 4252T (homolog strain). The analysis showed that in the homolog strain DSM 4252T Rmar_1080 (RmXyl3A) and Rmar_1081 (RmXyl3B) are parts of a putative polysaccharide utilization locus (PUL) for xylan utilization.Entities:
Mesh:
Substances:
Year: 2020 PMID: 31992772 PMCID: PMC6987092 DOI: 10.1038/s41598-020-58015-5
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Gene cluster analysis in Rhodothermus marinus DSM 4253. The gene cluster corresponding to GH3 loci in strains DSM 4252T and DSM 4253 is identical. (A) Downstream of Rmar_0536 are genes enconding: membrane proteins (yedZ and yedY); a membrane transporter protein (mfs); an arylesterase precursor (arylest.); an ABC transporter ATP-binding protein (yvcR); a hypothetical protein (hyp.). Upstream of Rmar_0536 are genes encoding: a DNA-binding protein (hiran); two hypothetical proteins (hyp.); a potential methyltransferase (duf43). (B) Downstream of Rmar_0925 are genes encoding: a methylmalonyl-CoA mutase (mm_coa_mut); a 1,4-dihydroxy-2-naphthoyl-CoA synthase (menB); a 4′-phosphopantetheinyl transferase (sfp); a superoxide dismutase (sodA); a hypothetical protein (hyp.). Upstream of Rmar_0925 are genes encoding: ATPase (atpase); a 23 S rRNA methyltransferase (rlmN); an O-acetylhomoserine aminocarboxypropyltransferase (acpt); a homoserine O-acetyltransferase (esterase). (C) Downstream of Rmar_1080 and Rmar_1081 are genes encoding: two hypothetical proteins (hyp.); a periplasmic ABC transporter substrate-binding protein (abc), a potential cobalamin binding protein (yvcR.); a cobalamin biosynthesis protein (cbiX); a heme-binding protein (hmuY). Upstream of Rmar_1080 and Rmar_1081 genes encoding: an α-Glucuronidase (α-glucuronidase) and a pair of susC (a TonB-dependent receptor) and susD homologes. (D) Downstream of Rmar_2069 are genes encoding: a tRNA-Gln-CTG,tRNA (gray arrow); a hypothetical protein similar to ABC transporter (hyp. abc); a hypothetical protein (hyp.); a riboflavin synthase (ribD); a phosphohydrolase (hd); a riboflavin. Upstream of Rmar_2069 are genes encoding: an asparagine synthase (asn_synth); a Fe-S oxidoreductase (rad sam); a FAD-dependent oxidoreductase (oxidoreduct.); a hypothetical protein (hyp.). (E) Downstream of Rmar_2616 are genes encoding: a glycoside hydrolase of the GH43_62_32_68 superfamily (gh43_62_32_68); a predicted glycosyl hydrolase of the GH43/DUF377 family (gh43); a protein similar to glycosyltransferase involved in cell wall biosynthesis (rfaB); a short-chain dehydrogenase (sdr); a saccharopine dehydrogenas (sacc). Upstream of Rmar_2616 are genes encoding: a RNA polymerase ECF-type sigma factor gene (sig70); a tRNA (cytosine-5-)-methyltransferase (met_tra); a Fmu domain protein (fmu); a flavin containing amine oxidoreductase (amine oxidase); a bacterioferritin (ferr.); an exodeoxyribonuclease III (exoIII).
Putative locus for degradation and uptake of xylan in Rhodothermus marinus DSM 4252T.
| Gene loci | Annotation | Potential cellular locationa | Potential co-transcriptionb |
|---|---|---|---|
| LacI family transcriptional regulator | IC | 1 | |
| GH43, CBM6 | EC and CA | 2 | |
| CBM4, CBM4, GH10- | EC and CA | 2 | |
| TonB-dependent receptor | OM | 2 | |
| Hypothetical protein | EC and CA | 2 | |
| Hypothetical protein | PS or EC | 3 | |
| Hypothetical protein | EC and CA | 3 | |
| GH10 | PS or EC | 4 | |
| RNA polymerase σ70/24-factor | IC | 4 | |
| anti-FecI σ-factor, FecR | IM | 4 | |
| TonB-dependent receptor, SusC/RagA domain protein | OM | 4 | |
| SusD/RagB domain protein | EC and CA | 4 | |
| α-glucuronidase, GH67 | PS or EC | 5 | |
| GH3 - | PS or EC | 5 | |
| GH3 - | IC | 5 |
aEC - extracellular, CA - cell attached, OM - outer membrane, PS - periplasmic space, IC – intracellular, IM – inner membrane. The cellular location is predicted based on prediction of signal peptides and putative por secretion system C-terminal sorting domain.
bCo-transcription is based on predicted operons, Rho-independent translational termination sites and promotors.
Kinetic parameters on aryl substrates. Data presented as means ± standard error from three independent experiments.
| Enzyme | Substrate | Km (mM) | kcat (s−1) | kcat/KM (s−1 mM−1) |
|---|---|---|---|---|
| 0.1 ± 0.0 | 79.9 ± 3.4 | 754.9 ± 28.1 | ||
| 11.8 ± 1.0 | 140.8 ± 0.6 | 12.0 ± 0.0 | ||
| 0.1 ± 0.0 | 49.5 ± 4.0 | 511.4 ± 66.1 | ||
| 1.2 ± 0.1 | 65.8 ± 0.4 | 56.2 ± 0.7 | ||
| 1.9 ± 0.1 | 47.7 ± 0.8 | 25.0 ± 1.2 | ||
| 1.6 ± 0.9 | 67.2 ± 0.5 | 42.9 ± 2.2 | ||
| 0.4 ± 0.0 | 50.1 ± 1.9 | 140.17 ± 15.7 | ||
| 6.5 ± 0.8 | 29.2 ± 0.5 | 4.5 ± 0.3 | ||
| 8.8 ± 1.0 | 94.2 ± 11.2 | 10.7 ± 1.0 | ||
| 0.3 ± 0.0 | 160.1 ± 0.6 | 481.9 ± 21.5 | ||
| 1.6 ± 0.4 | 57.5 ± 0.6 | 36.1 ± 0.9 | ||
| 2.0 ± 0.2 | 556.8 ± 8.6 | 280.9 ± 2.4 | ||
| 0.1 ± 0.0 | 102.3 ± 3.0 | 1314.4 ± 50.3 |
aSubstrate inhibition was observed for pNP-β-Glc at concentration above 2 mM.
Kinetic parameters for aryl substrate hydrolysis by RmNag3.
| Phosphate (mM) | Substrate | KM (mM) | kcat (s−1) | kcat/KM (s−1 mM−1) |
|---|---|---|---|---|
| 0 | 0.2 ± 0.1 | 144 ± 4 | 920 ± 50 | |
| 50 | 0.3 ± 0.1 | 223 ± 4 | 740 ± 20 | |
| 100 | 0.3 ± 0.1 | 189 ± 3 | 630 ± 10 | |
| 200 | 0.3 ± 0.1 | 232 ± 1 | 860 ± 30 |
All the reactions were performed in 50 mM HEPES buffer pH 6.0 in absence or presence of sodium phosphate. Data presented as means ± standard error from three independent experiments.
Hydrolysis of oligosaccharides.
| Substrate | Specific activity (µmol min−1 mg−1) | |||||
|---|---|---|---|---|---|---|
| Cellobiose (β-1,4) | 333 ± 1 | 312 ± 0 | 301 ± 0 | 6 ± 2 | 10 ± 2 | −a |
| Xylobiose (β-1,4) | 67 ± 0 | — | — | 94 ± 1 | 244 ± 1 | — |
| Laminaribiose (β-1,3) | — | — | 552 ± 0 | — | — | — |
| Cellohexaose (β-1,4) | 197 ± 0 | — | 142 ± 1 | 22 ± 0 | 86 ± 1 | — |
| Xylohexaose (β-1,4) | — | — | — | 4 ± 0 | 98 ± 1 | — |
| Xylan (β-1,4) | — | — | — | — | 50 ± 1 | — |
| Chitobiose (β-1,4) | — | — | — | — | — | 132 ± 1 |
| Chitopentose (β-1,4) | — | — | — | — | — | 35 ± 0 |
Type of linkage for each oligosaccharide is indicated in parentheses and activity were measured based on release of monosaccharides.
aNo activity was detected.
Figure 2Phylogenetic relationship between proteins in GH3. The maximum likelihood phylogenetic tree was calculated using amino acid sequences of biochemically characterized proteins from GH3 together with six protein sequences from Rhodothermus marinus (highlighted in colored boxes). Sequences belonging to particular group are colored in the same color and the schematic representation of domain organization is shown in a circle above the tree. If domain organization of particular protein differs from prevailing domain organization of the group to which it belongs, its domain organization is shown separately next to the protein.
Figure 3Homology models of the six GH3 enzymes from Rhodothermus marinus DSM 4253. Ribbon representation of (A) RmBgl3A, (B) RmBgl3B, (C) RmBgl3C, (D) RmXyl3A, (E) RmXyl3B and (F) RmNag3. Domain 1 is colored in gray, domain 2 in purple, FnIII in blue, PA14 in green and the linker between domain 2 and FnIII in yellow. RmBgl3A, RmBgl3B and RmBgl3C were modelled as dimers, one chain in each dimer is colored in light colors while the other is colored in dark colors. The β-lactamase domain of RmNag3 was not modelled and is not represented in the figure.
Validation of the homology models of the six GH3 enzymes from Rhodothermus marinus.
| Enzymes | Main template | Verify 3D | PROCHECK | ProSA | ERRAT | ||||
|---|---|---|---|---|---|---|---|---|---|
| PDB-entry | Resolution (Å) | Sequence identity (%) | Average 3D-1D score ≥0.2a (%) | Ramachandran plotb | Z-score | Overall quality factorc (%) | |||
| 3U4A | 2.20 | 47 | 94.9 | 91.9;7.6;0.2;0.2 | −11.3 | −10.9 | 93.8 | 93.9 | |
| 4I3G | 1.40 | 44 | 94.7 | 89.5;9.9;0.4;0.1 | −11.6 | 92.0 | |||
| 3WLH | 1.65 | 48 | 95.0 | 90.6;8.6;0.4;0.4 | −11.4 | 98.4 | |||
| 3U48 | 2.20 | 34 | 88.3 | 90.0;8.8;0.9;0.4 | −11.0 | −10.4 | 96.3 | 94.3 | |
| 3U48 | 2.20 | 36 | 93.0 | 89.2;9.9;0.5;0.4 | −11.8 | −11.1 | 96.0 | 94.3 | |
| 3NVD | 1.84 | 30 | 93.4 | 92.8;6.8;0.2;0.2 | −10.4 | 97.2 | |||
RmBgl3A, RmXyl3A and RmXyl3B were modelled as dimers, result for chain A and B are showed separately for ProSA and ERRAT analysis.
a80% of the residues in a structure is required at an averaged 3D-1D score of 0.2 to be considered a good model.
bNumbers corresponds to percentage of residues in: Most favorable regions; additionally allowed regions; generously allowed regions; disallowed regions.
cFor high-resolution X-ray structures an overall quality factor of 95% is considered a good protein structure and above 91% is expected for X-ray structures with a resolution between 2.5 and 3 Å.
Comparison of potential substrate interacting residues with closely related characterized GH3 enzymes based on sequence alignment and structure.
| Enzyme | Subsite−1 | Subsite +1 | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| D110 | G218*a | A428* | Y287 | Y614 | G431 | −b | R606 | F254 | ||
| D71 | G179 | S387 | Y248 | E587 | S390 | — | R572 | F215 | ||
| D71 | R178 | S380 | W246 | Y703 | V383 | — | S682* | Y213 | ||
| D98 | R206 | S410 | W274 | Y691 | V413* | — | D670* | Y241 | ||
| D121 | G234* | T450* | Y311 | — | W453 | — | — | Y278 | ||
| D136 | G252 | S460 | W321 | — | W463 | — | — | F288 | ||
| D95 | G217 | T431 | W286 | — | W434 | — | — | Y253 | ||
| E123 | G237* | S446* | W307 | — | P449* | Y545 | E648* | Y274 | ||
| E125 | G239* | S448* | W309 | — | P451* | W547 | E650* | Y276 | ||
| E108 | G218 | A429 | Y288 | W631 | C432 | C524 | G627 | Y255 | ||
| Enzyme | Subsite−1 | Subsite +1 | ||||||||
| D118 | R186 | D227 | H229 | R60 | R126 | R306 | ||||
| D164 | R232 | D274 | H276 | R80 | R172 | G352 | ||||
The identities between the enzymes are indicated in parenthesis. Strictly conserved resides presented in Supplementary Fig. 2 are excluded from the table. The table is broken before the β-N-acetyl-glucosidases and the columns are not comparable above and under this line.
Gap in the sequence alignment.
bThe residue is not in proximity for substrate interaction based on structures.
Figure 4Comparison of active sites of the β-glucosidase and β-xylosidases showing residues involved in (A) subsite −1 and (B) subsite +1. Superimposition of homology models of RmBgl3A (pink), RmBgl3B (blue), RmBgl3C, RmXyl3A (yellow) and RmXyl3B (orange). Cellobiose (gark gray) and laminaribiose (light gray) in subsite −1 and +1 from PDB entries 1IEX and 1J8V respectively, both solved in complex with ExoI from barley. RmXyl3A and RmXyl3B have similar −1 and +1 subsites and RmBgl3C have a similar −1 subsite as RmBgl3A in terms of possible interacting residues.