| Literature DB >> 20944224 |
Andrew P Yeh1, Polat Abdubek, Tamara Astakhova, Herbert L Axelrod, Constantina Bakolitsa, Xiaohui Cai, Dennis Carlton, Connie Chen, Hsiu Ju Chiu, Michelle Chiu, Thomas Clayton, Debanu Das, Marc C Deller, Lian Duan, Kyle Ellrott, Carol L Farr, Julie Feuerhelm, Joanna C Grant, Anna Grzechnik, Gye Won Han, Lukasz Jaroszewski, Kevin K Jin, Heath E Klock, Mark W Knuth, Piotr Kozbial, S Sri Krishna, Abhinav Kumar, Winnie W Lam, David Marciano, Daniel McMullan, Mitchell D Miller, Andrew T Morse, Edward Nigoghossian, Amanda Nopakun, Linda Okach, Christina Puckett, Ron Reyes, Henry J Tien, Christine B Trame, Henry van den Bedem, Dana Weekes, Tiffany Wooten, Qingping Xu, Keith O Hodgson, John Wooley, Marc André Elsliger, Ashley M Deacon, Adam Godzik, Scott A Lesley, Ian A Wilson.
Abstract
BT2081 from Bacteroides thetaiotaomicron (GenBank accession code NP_810994.1) is a member of a novel protein family consisting of over 160 members, most of which are found in the different classes of Bacteroidetes. Genome-context analysis lends support to the involvement of this family in carbohydrate metabolism, which plays a key role in B. thetaiotaomicron as a predominant bacterial symbiont in the human distal gut microbiome. The crystal structure of BT2081 at 2.05 Å resolution represents the first structure from this new protein family. BT2081 consists of an N-terminal domain, which adopts a β-sandwich immunoglobulin-like fold, and a larger C-terminal domain with a β-sandwich jelly-roll fold. Structural analyses reveal that both domains are similar to those found in various carbohydrate-active enzymes. The C-terminal β-jelly-roll domain contains a potential carbohydrate-binding site that is highly conserved among BT2081 homologs and is situated in the same location as the carbohydrate-binding sites that are found in structurally similar glycoside hydrolases (GHs). However, in BT2081 this site is partially occluded by surrounding loops, which results in a deep solvent-accessible pocket rather than a shallower solvent-exposed cleft.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20944224 PMCID: PMC2954218 DOI: 10.1107/S1744309110028228
Source DB: PubMed Journal: Acta Crystallogr Sect F Struct Biol Cryst Commun ISSN: 1744-3091
Summary of crystal parameters, data-collection and refinement statistics for BT2081 (PDB code 3hbz)
Values in parentheses are for the highest resolution shell.
| Space group | |
| Unit-cell parameters (Å) | |
| Data collection | |
| Wavelength (Å) | 0.9785 |
| Resolution range (Å) | 29.8–2.05 (2.10–2.05) |
| No. of observations | 268747 |
| No. of unique reflections | 35437 |
| Completeness (%) | 99.9 (99.9) |
| Mean | 16.7 (1.8) |
|
| 0.091 (0.75) |
| Model and refinement statistics | |
| Resolution range (Å) | 29.8–2.05 |
| No. of reflections (total) | 35400 |
| No. of reflections (test) | 1773 |
| Completeness (%) | 99.9 |
| Cutoff criterion | | |
|
| 0.159 |
|
| 0.191 |
| Stereochemical parameters | |
| Restraints (r.m.s.d. observed) | |
| Bond angles (°) | 1.60 |
| Bond lengths (Å) | 0.018 |
| Average isotropic | 42.9 |
| ESU | 0.121 |
R merge = .
Typically, the number of unique reflections used in refinement is slightly less than the total number that were integrated and scaled. Reflections are excluded owing to systematic absences, negative intensities and rounding errors in the resolution limits and unit-cell parameters.
R cryst = , where F calc and F obs are the calculated and observed structure-factor amplitudes, respectively.
R free is the same as R cryst but for 5.0% of the total reflections chosen at random and omitted from refinement.
This value represents the total B that includes TLS and residual B components.
Estimated overall coordinate error (Collaborative Computational Project, Number 4, 1994 ▶; Cruickshank, 1999 ▶).
Figure 1Crystal structure of BT2081 from B. thetaiotaomicron. (a) Ribbon diagram of the BT2081 monomer, showing the distinct N-terminal (green) and C-terminal (purple) domains. Helices (H1–H7) and β-strands (β1–β28) are indicated. (b) Diagram showing the secondary-structure elements of BT2081 superimposed on its primary sequence. The labeling of secondary-structure elements [colored by domain as in (a)] is in accord with PDBsum (http://www.ebi.ac.uk/pdbsum), in which α-helices (H1, H3, H5 and H6), 310-helices (H2, H4 and H7) and β-strands (β1–β28) are labeled sequentially, β-turns and γ-turns are designated by Greek letters (β, γ) and β-hairpins by red loops.
Figure 2Structural comparison of BT2081 with PKD-CBM44 of C. thermocellum CtCel9D-Cel44A. Superposition of BT2081 (orange/yellow) with PKD-CBM44 (lilac/pale blue; PDB code 2c26), which also contain both the Ig-like and β-jelly-roll domains. Calcium ions are represented as spheres. Protein regions which were used for alignment by FATCAT are shown in darker shades (orange for BT2081 and lilac for PKD-CBM44) to highlight similarities in the protein cores. Please note that two twists have been introduced by FATCAT into the structure of PKD-CBM44 in order to obtain optimal full-length alignments of the structures.
Figure 3Molecular surface of the BT2081 N-terminal domain colored according to residue conservation by ConSurf (http://consurf.tau.ac.il; Landau et al., 2005 ▶). The most conserved residues are shown in magenta, the least conserved residues are shown in cyan and those with insufficient data to determine the conservation level are shown in yellow. The molecular surface of the C-terminal domain is shown in gray. (a) and (b) are views of opposite surfaces of the N-terminal domain, showing that residue conservation is predominantly on one side. The calcium ion in (b) is represented as a green sphere.
Figure 4Superposition of the BT2081 C-terminal domain (yellow) with two representative top structural neighbors as assessed by FATCAT: the catalytic module of a Bacillus 1,3–1,4-β-glucanase (red; PDB code 1byh) and CBM29 of a mannanase from T. polysaccharolyticum (green; PDB code 2zew). (a) View from above the convex surface of the β-jelly-roll core, highlighting the conserved calcium ion represented as spheres and circled. (b) View from above the concave surface of the β-jelly-roll core, highlighting the extended loop regions of BT2081 (circled) which help to form a pocket on the concave surface of the core.
Figure 5Coordination of the conserved C-terminal calcium ion of BT2081. Electron density from 2F o − F c (contoured at 2.5σ level) and F o − F c (contoured at 3.0σ level) maps is represented as green and blue mesh, respectively. A 13σ level anomalous signal obtained from data collected below the nickel edge was seen for the calcium and is shown here contoured at the 6.0σ level as magenta mesh. Distances between the calcium ion and its ligands are indicated in Å.
Figure 6Putative carbohydrate-binding pocket as seen from above the concave surface of the jelly-roll fold. An arrow indicates the pocket entrance. The residues that line the pocket are highlighted in stick representation and are color-coded according to type as follows: aromatic, green; hydrophobic, light green; polar, lilac; acidic, red; basic, blue.
Figure 7Cutaway side view of the putative carbohydrate-binding pocket highlighting the nature of the residues lining the pocket, including several that may play a role in catalysis. The color scheme of the residues is the same as in Fig. 6 ▶.
Figure 8Sequence alignment between BT2081 and its top 20 PSI-BLAST homologs (the alignment is only shown for residues 255–361 of BT2081 for clarity). Residues in white characters on a red background are strictly conserved, while those in red characters with a white background are highly conserved. Red arrows indicate the two residues of BT2081 (Asp259 and Asp338) which may play a role in catalysis. Sequences with aspartates at these positions are highlighted in yellow.
Superposition of the BT2081 N-terminal domain with CBM families that adopt Ig-like folds.
| PDB code | CBM family | Optimized r.m.s.d. (Å) | Equivalent positions (No. of Cα atoms) | Sequence identity (%) | |
|---|---|---|---|---|---|
| CBM9 | 3.83 | 67 | 3.3 | 2.94 × 10−1 | |
| CBM20 | 3.39 | 43 | 9.1 | 8.34 × 10−1 | |
| CBM25 | 3.01 | 50 | 6.5 | 1.29 × 10−1 | |
| CBM26 | 3.45 | 56 | 3.6 | 2.05 × 10−1 | |
| CBM31 | 3.43 | 65 | 5.5 | 7.31 × 10−2 | |
| CBM33 | 3.09 | 72 | 4.7 | 2.71 × 10−1 | |
| CBM34 | 3.14 | 59 | 2.6 | 7.30 × 10−1 |
Representative closest structural neighbors of the BT2081 C-terminal domain.
| PDB code | CBM/GH family | Optimized r.m.s.d. (Å) | Equivalent positions (No. of Cα atoms) | Sequence identity (%) | |
|---|---|---|---|---|---|
| CBM29 | 3.08 | 128 | 6.8 | 3.22 × 10−4 | |
| CBM16 | 3.01 | 136 | 10.4 | 9.93 × 10−4 | |
| CBM11 | 3.05 | 130 | 9.8 | 1.78 × 10−3 | |
| GH16 | 3.04 | 137 | 5.0 | 1.79 × 10−3 | |
| CBM6 | 3.05 | 115 | 4.6 | 1.95 × 10−3 |
The FATCAT P-value measures the probability of obtaining a similar result between two random structures. This P-value is calculated based on empirical fitting of the extreme value distribution to the FATCAT similarity score (Ye & Godzik, 2004 ▶). The smaller the P-value, the more statistically significant the similarity between corresponding structures (P-values of <0.05 are considered to be significant).