| Literature DB >> 34817219 |
Viviana G Correia1,2, Filipa Trovão1,2, Benedita A Pinheiro1,2, Joana L A Brás3, Lisete M Silva4, Cláudia Nunes5, Manuel A Coimbra6, Yan Liu4, Ten Feizi4, Carlos M G A Fontes3,7, Barbara Mulloy4, Wengang Chai4, Ana Luísa Carvalho1,2, Angelina S Palma1,2.
Abstract
A multigene polysaccharide utilization locus (PUL) encoding enzymes and surface carbohydrate (glycan)-binding proteins (SGBPs) was recently identified in prominent members of Bacteroidetes in the human gut and characterized in Bacteroides ovatus. This PUL-encoded system specifically targets mixed-linkage β1,3-1,4-glucans, a group of diet-derived carbohydrates that promote a healthy microbiota and have potential as prebiotics. The BoSGBPMLG-A protein encoded by the BACOVA_2743 gene is a SusD-like protein that plays a key role in the PUL's specificity and functionality. Here, we perform a detailed analysis of the molecular determinants underlying carbohydrate binding by BoSGBPMLG-A, combining carbohydrate microarray technology with quantitative affinity studies and a high-resolution X-ray crystallography structure of the complex of BoSGBPMLG-A with a β1,3-1,4-nonasaccharide. We demonstrate its unique binding specificity toward β1,3-1,4-gluco-oligosaccharides, with increasing binding affinities up to the octasaccharide and dependency on the number and position of β1,3 linkages. The interaction is defined by a 41-Å-long extended binding site that accommodates the oligosaccharide in a mode distinct from that of previously described bacterial β1,3-1,4-glucan-binding proteins. In addition to the shape complementarity mediated by CH-π interactions, a complex hydrogen bonding network complemented by a high number of key ordered water molecules establishes additional specific interactions with the oligosaccharide. These support the twisted conformation of the β-glucan backbone imposed by the β1,3 linkages and explain the dependency on the oligosaccharide chain length. We propose that the specificity of the PUL conferred by BoSGBPMLG-A to import long β1,3-1,4-glucan oligosaccharides to the bacterial periplasm allows Bacteroidetes to outcompete bacteria that lack this PUL for utilization of β1,3-1,4-glucans. IMPORTANCE With the knowledge of bacterial gene systems encoding proteins that target dietary carbohydrates as a source of nutrients and their importance for human health, major efforts are being made to understand carbohydrate recognition by various commensal bacteria. Here, we describe an integrative strategy that combines carbohydrate microarray technology with structural studies to further elucidate the molecular determinants of carbohydrate recognition by BoSGBPMLG-A, a key protein expressed at the surface of Bacteroides ovatus for utilization of mixed-linkage β1,3-1,4-glucans. We have mapped at high resolution interactions that occur at the binding site of BoSGBPMLG-A and provide evidence for the role of key water-mediated interactions for fine specificity and affinity. Understanding at the molecular level how commensal bacteria, such as prominent members of Bacteroidetes, can differentially utilize dietary carbohydrates with potential prebiotic activities will shed light on possible ways to modulate the microbiome to promote human health.Entities:
Keywords: Bacteroides ovatus; SusD-like proteins; X-ray crystallography; carbohydrate microarrays; polysaccharide utilization loci; protein-carbohydrate interactions; β-glucan
Mesh:
Substances:
Year: 2021 PMID: 34817219 PMCID: PMC8612152 DOI: 10.1128/Spectrum.01826-21
Source DB: PubMed Journal: Microbiol Spectr ISSN: 2165-0497
FIG 1Mixed-linkage β1,3-1,4-glucans and Bacteroides ovatus polysaccharide utilization locus (PUL) targeting these carbohydrates, designated MLGULs (14). (A) Representative repeating units of the chemical structure of β1,3-1,4-glucans, which consist of a linear chain of β1,4-linked cellotriosyl or cellotetraosyl units spaced by β1,3-linked glycosidic linkages. Arrows indicate the specific sites of hydrolysis by the surface endoglucanase GH16 (BO2741) of the MLGUL. (B) Chemical structure and symbol nomenclature of the nonasaccharide (DP9) derived from barley β-glucan used in this study. Glucose units (Glc) are numbered from the reducing to the nonreducing end. Monosaccharide symbol representation follows the symbol nomenclature for glycans (SNFG) (52). (C) MLGUL system from B. ovatus strain ATCC 8483. (D) Molecular architecture of SusD-like protein BoSGBPMLG-A and diagram of the recombinant versions used in this study. Amino acid notation follows the numbering of the native protein, from N to C terminus. Gene or recombinant vector is in parentheses. SP, signal peptide; His6, recombinant tag with six histidine residues; ZB tag, recombinant Zbasic tag; aa, amino acids.
FIG 2Carbohydrate binding by BoSGBPMLG-A using a structurally diverse saccharide microarray. Heatmap comparing the binding patterns of BoSGBPMLG-A and selected control proteins. The microarray comprised soluble saccharides of different origins (fungal, bacterial, plant, and microalgal polysaccharides or glycoproteins) (Table S1); the major backbone sequences are depicted at the bottom. The heatmap represents the relative binding intensities calculated as the percentage of the fluorescence signal intensity at 150 pg (0.5 mg/ml)/spot given by the saccharide probe most strongly bound by each protein (normalized as 100%). Results are detailed in Table S2. S. cerevisiae, Saccharomyces cerevisiae; N. oculata, Nanochloropsis oculata; P. palmata, Palmaria palmata; C. albicans, Candida albicans; M. tuberculosis, Mycobacterium tuberculosis; hMalectin, human malectin; TmCBM41, CBM41 of Thermotoga maritima; mDectin-1, murine dectin-1; CtCBM11, CBM11 of Clostridium thermocellum; ConA, concanavalin A; AAL, Aleuria aurantia lectin.
FIG 3Determination of BoSGBPMLG-A oligosaccharide binding specificity and chain length requirements using sequence-defined oligosaccharides. (A) “Glucome” microarray comprising 153 sequence-defined gluco-oligosaccharides prepared as neoglycolipid (NGL) probes. CtCBM11 (CBM11 of Clostridium thermocellum) and CmCBM6-2 (CBM6-2 of Cellvibrio mixtus) were used as control proteins. The degree of polymerization (DP) and glucose linkages are indicated at the top of the panels. Some relevant carbohydrate probe sequences for binding to CtCBM11 are depicted in panel B. G, glucose; AO, NGLs were prepared from reducing oligosaccharides by oxime ligation with an aminooxy (AO)-functionalized lipid (39). The binding signals are depicted as mean values of fluorescence intensities of duplicate spots for each probe arrayed at 5 fmol/spot (with error bars) and are representative of at least two independent experiments (details are in Table S4). (B) Microscale thermophoresis analysis of the interaction of BoSGBPMLG-A with sequence-defined gluco-oligosaccharides. Dose-response curves were fitted to a one-site binding model to obtain K values. Error bars indicate the standard deviations from triplicate experiments (n = 3). Quality of the fitting is given by the standard error of regression and the reduced chi-square (Red χ2) parameters. S/N, signal-to-noise ratio. (C) Sequences of the gluco-oligosaccharides depicted from the nonreducing to the reducing end. Monosaccharide symbol representations follow the symbol nomenclature for glycans (SNFG) (52). The 3-linkages are underscored.
FIG 4Three-dimensional structure of BoSGBPMLG-A in complex with barley-9 (G4G3G4G4G3G4G4G3G) at 1.43-Å resolution. (A) Cartoon representation of the overall canonical BoSGBPMLG-A SusD structure (in blue ribbon) showing two distinct regions: tetratricopeptide repeats (TPR) with four helix-turn-helix pairs, and a variable region with the bound barley-9 (G4G3G4G4G3G4G4G3G, in stick representation with blue carbon atoms and red oxygen atoms). The BoSGBPMLG-A–barley-9 complex structure is superposed with the structure of BoSGBPMLG-A (pink ribbon) in complex with barley-7 (G4G4G3G4G4G3G, in pink stick representation) (PDB ID 6E61) (15). (B) Close-up view of the binding site of BoSGBPMLG-A showing an overall match of sugar units 1 to 7. In both complexes, the three tryptophan residues form a platform that accommodates the carbohydrate twisted chain. The side chains of the protein residues involved in binding are shown as a stick model and labeled according to the respective model. Carbon atoms in barley-7 are colored in pink and barley-9 in yellow, nitrogen atoms are colored in dark blue, and oxygen atoms in red. Amino acids are numbered according to the native protein. Glucose units (Glc) are numbered from the reducing to the nonreducing end. (C) Schematic representation of the direct interactions between BoSGBPMLG-A and barley-9: curved surfaces, CH-π interactions; dashed lines, hydrogen bonding. (D) Ordered water network that extended intermolecular hydrogen bonding with different amino acids along the binding site and supported interaction with the entire oligosaccharide ligand (colored yellow). Contact distances are shown in Å and marked as dashed lines. The complete list of contacts is in Table S5.
X-ray diffraction data collection and 3-D structure refinement statistics
| Parameter | Value(s) for BoSGBPMLG-A-ZB–barley-DP9 |
|---|---|
| Data collection and processing | |
| Beamline | I04 (Diamond Light Source) |
| Wavelength (Å) | 0.9795 |
| Space group | |
| Unit cell parameters | |
| 87.84, 88.44, 156.15 | |
| 90, 90, 90 | |
| Resolution range (Å) | 62.32–1.43 (1.55–1.43) |
| Solvent content (%) | 45 |
| Protein molecules in the asymmetric unit | 2 |
| Matthews coefficient (Å3 · Da−1) | 2.24 |
| 1.6 (19.1) | |
| Wilson B factor (Å2) | 13.1 |
|
| 0.05 (0.78) |
|
| 0.02 (0.35) |
| CC1/2 | 1.00 (0.74) |
| Multiplicity | 9.0 (5.6) |
| Total no. of reflections | 1,569,176 (48,717) |
| No. of unique reflections | 175,289 (8,764) |
| Completeness (%) | 96.5 (69.0) |
| Refinement statistics | |
| Resolution range (Å) | 44.86–1.43 |
| No. of: | |
| Protein atoms | 8,337 |
| Carbohydrate atoms | 216 |
| Water molecules | 974 |
| Other solvent atoms | 168 |
|
| 0.1638 |
|
| 0.1897 |
| RMSD | |
| Bond length (Å) | 0.010 |
| Bond angle (°) | 1.140 |
| Avg B factor (Å2) of | |
| Main chain (A, B) | 18.39, 21.47 |
| Side chain (A, B) | 21.50, 24.38 |
| G4G3G4G4G3G4G4G3G (A, B) | 35.95, 35.00 |
| Magnesium ions (A, B) | 16.94, 16.33 |
| Sodium ions | 25.83 |
| TAM | 56.83 |
| Azide ions | 42.92 |
| GOL | 35.5 |
| PEG | 49.04 |
| Water molecules (974) | 31.10 |
| Ramachandran statistics (%) | |
| Favored | 97.78 |
| Allowed | 2.22 |
| Outliers | 0.00 |
Values for the outer shell are given in parentheses. The data have been deposited in the Protein Data Bank under PDB ID 7NOX.
, where is the observed intensity and is the statistically weighted average intensity of multiple observations.
, a redundancy-independent version of .
CC1/2, correlation coefficient (CC) between intensities from random half-datasets.
, where and are the calculated and observed structure factor amplitudes, respectively.
Rfree is calculated for a randomly chosen 5% of the reflections.
TAM, Tris(hydroxyethyl)aminomethane; GOL, glycerol; PEG, polyethylene glycol; (A, B), indicates the corresponding values for chain A and chain B, respectively, in the asymmetric unit.
FIG 5Interactions of BoSGBPMLG-A and its site-directed mutants with barley β-glucan and xyloglucan polysaccharides measured using isothermal titration calorimetry (ITC). Titrations were conducted at 25°C in 50 mM HEPES buffer (pH 7.5) with 100 mM NaCl, 5 mM CaCl2, and 5 mM TCEP. The top half of each panel shows the raw heats of binding, whereas the bottom shows the integrated heats of dilution-corrected data. The curves represent the best fits to a single-site binding model. NB, no binding observed; weak, value was too low to be quantified.