| Literature DB >> 20944232 |
Qingping Xu1, Polat Abdubek, Tamara Astakhova, Herbert L Axelrod, Constantina Bakolitsa, Xiaohui Cai, Dennis Carlton, Connie Chen, Hsiu Ju Chiu, Michelle Chiu, Thomas Clayton, Debanu Das, Marc C Deller, Lian Duan, Kyle Ellrott, Carol L Farr, Julie Feuerhelm, Joanna C Grant, Anna Grzechnik, Gye Won Han, Lukasz Jaroszewski, Kevin K Jin, Heath E Klock, Mark W Knuth, Piotr Kozbial, S Sri Krishna, Abhinav Kumar, Winnie W Lam, David Marciano, Mitchell D Miller, Andrew T Morse, Edward Nigoghossian, Amanda Nopakun, Linda Okach, Christina Puckett, Ron Reyes, Henry J Tien, Christine B Trame, Henry van den Bedem, Dana Weekes, Tiffany Wooten, Andrew Yeh, Keith O Hodgson, John Wooley, Marc André Elsliger, Ashley M Deacon, Adam Godzik, Scott A Lesley, Ian A Wilson.
Abstract
Dipeptidyl-peptidase VI from Bacillus sphaericus and YkfC from Bacillus subtilis have both previously been characterized as highly specific γ-D-glutamyl-L-diamino acid endopeptidases. The crystal structure of a YkfC ortholog from Bacillus cereus (BcYkfC) at 1.8 Å resolution revealed that it contains two N-terminal bacterial SH3 (SH3b) domains in addition to the C-terminal catalytic NlpC/P60 domain that is ubiquitous in the very large family of cell-wall-related cysteine peptidases. A bound reaction product (L-Ala-γ-D-Glu) enabled the identification of conserved sequence and structural signatures for recognition of L-Ala and γ-D-Glu and, therefore, provides a clear framework for understanding the substrate specificity observed in dipeptidyl-peptidase VI, YkfC and other NlpC/P60 domains in general. The first SH3b domain plays an important role in defining substrate specificity by contributing to the formation of the active site, such that only murein peptides with a free N-terminal alanine are allowed. A conserved tyrosine in the SH3b domain of the YkfC subfamily is correlated with the presence of a conserved acidic residue in the NlpC/P60 domain and both residues interact with the free amine group of the alanine. This structural feature allows the definition of a subfamily of NlpC/P60 enzymes with the same N-terminal substrate requirements, including a previously characterized cyanobacterial L-alanine-γ-D-glutamate endopeptidase that contains the two key components (an NlpC/P60 domain attached to an SH3b domain) for assembly of a YkfC-like active site.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20944232 PMCID: PMC2954226 DOI: 10.1107/S1744309110021214
Source DB: PubMed Journal: Acta Crystallogr Sect F Struct Biol Cryst Commun ISSN: 1744-3091
Figure 1Proposed metabolic pathways for murein peptides in E. coli and B. subtilis.
Data-collection, phasing and refinement statistics (PDB code 3h41)
Values in parentheses are for the highest resolution shell. The high-resolution cutoff was chosen such that the mean I/σ(I) in the highest resolution shell was around 2.
| λ1 MADSe, peak | λ2 MADSe, remote | λ3 MADSe, inflection | |
|---|---|---|---|
| Space group | |||
| Unit-cell parameters (Å, °) | |||
| Data collection | |||
| Wavelength (Å) | 0.9786 | 0.9184 | 0.9799 |
| Resolution range (Å) | 41.5–1.79 (1.88–1.79) | 41.5–1.84 (1.94–1.84) | 41.5–1.86 (1.96–1.86) |
| No. of observations | 116822 | 107729 | 104295 |
| No. of unique reflections | 31085 | 28629 | 27652 |
| Completeness (%) | 98.0 (95.1) | 98.6 (98.1) | 98.5 (97.9) |
| Mean | 11.3 (2.1) | 12.5 (2.8) | 13.1 (2.8) |
|
| 11.1 (71) | 10.0 (54) | 10.0 (56) |
| MAD phasing | |||
| Resolution | 41.5–1.79 | ||
| No. of Se sites | 5 | ||
| Mean figure of merit | 0.36 | ||
| Model and refinement statistics | |||
| Resolution range (Å) | 41.5–1.79 | ||
| No. of reflections (total) | 31083 | ||
| No. of reflections (test) | 1564 | ||
| Completeness (%) | 98.0 | ||
| Data set used in refinement | λ1 MADSe | ||
| Cutoff criterion | | | ||
|
| 16.3 | ||
|
| 19.7 | ||
| Stereochemical parameters | |||
| Restraints (r.m.s.d. observed) | |||
| Bond lengths (Å) | 0.015 | ||
| Bond angles (°) | 1.50 | ||
| Average isotropic | 19.9 | ||
| ESU | 0.11 | ||
| Protein residues/atoms | 305/2463 | ||
| Waters/peptide/other ligands | 265/1/7 | ||
|
| |||
| All-atom clash score | 5.07 | ||
| Ramachandran favored (%) | 97.7 | ||
| No. of Ramachandran outliers | 1 | ||
| No. of rotamer outliers | 1 | ||
R merge = .
R cryst = − , where F calc and F obs are the calculated and observed structure-factor amplitudes, respectively.
R free is the same as R cryst but for 5.0% of the total reflections chosen at random and omitted from refinement.
Estimated standard uncertainty in atomic coordinates.
Figure 2Genomic context of the ykfB and ykfC genes in B. subtilis and B. cereus.
Figure 3Crystal structure of YkfC from B. cereus in complex with l-Ala-γ-d-Glu. (a) Ribbon representation of BcYkfC, highlighting its domain organization. SH3b1 is depicted in blue, SH3b2 in green and NlpC/P60 in red. The bound l-Ala-γ-d-Glu is shown as a stick model. (b) Ribbon representations of individual domains, showing the secondary-structure elements. (c) Molecular surface of YkfC colored by sequence conservation. The surface color gradient indicates the level of sequence conservation from the most conserved residues (deep red) to nonconserved residues (white).
Structural comparisons of BcYkfC and other bacterial proteins that share at least one common domain
The alignment was performed by the DALI structural comparison server using full-length BcYkfC and individual domains (SH3b1 and NlpC/P60) as search probes. For proteins with multiple SH3-like domains (PDB codes 1m9s and 1xov) or multiple chains, only the best match is shown.
| Aligned substructure(s) | PDB code | Reference | R.m.s.d. (Å) | Aligned length | No. of residues | Sequence identity (%) | Comments | |
|---|---|---|---|---|---|---|---|---|
| SH3b + NlpC/P60 | Xu | 2.1 | 193 | 221 | 21.3 | 26 | Cyanobacterial NpPCP | |
| Xu | 2.3 | 193 | 220 | 21.2 | 29 | Cyanobacterial AvPCP | ||
| SH3b | Marino | 1.9 | 60 | 523 | 6.0 | 12 | TD | |
| Lu | 2.3 | 57 | 103 | 4.7 | 12 | TD | ||
| Korndorfer | 3.0 | 51 | 317 | 3.5 | 10 | TD | ||
| Srisailam | 1.8 | 44 | 74 | 2.7 | 7 | PhnA-like protein | ||
| Nasertorabi | 1.9 | 54 | 164 | 5.7 | 13 | Eukaryotic SH3 | ||
| NlpC/P60 | Aramini | 1.9 | 119 | 129 | 17.8 | 29 | ||
| NYSGXRC | 2.5 | 110 | 230 | 9.6 | 18 | Uncharacterized | ||
| Pai | 3.2 | 106 | 589 | 7.3 | 9 | CHAP domain of GspS |
Number of residues present in the model used for comparison.
Targeting domain.
New York SGX Research Center for Structural Genomics (unpublished work).
Glutathionylspermidine synthetase/amidase.
Figure 4Active site and recognition of l-Ala-γ-d-Glu by BcYkfC. (a) Stereoview of a 2F o − F c OMIT map, where l-Ala-γ-d-Glu and OCS238 were omitted from phasing/refinement, contoured at 1.5σ. (b) The extensive hydrogen-bond network in the active site of BcYkfC. Hydrogen bonds and distances are shown as dashed lines. (c) l-Ala-γ-d-Glu (stick representation; yellow C atoms) is located in the active site at the interface of the SH3b1 domain (blue) and the NlpC/P60 domain (red). (d) The interaction between l-Ala-γ-d-Glu and the active site of YkfC. This figure was generated using the program MOE 2008.10 (Chemical Computing Group Inc.).
Figure 5Sequence alignment of YkfC from B. subtilis and B. cereus, dipeptidyl-peptidase VI (DPP VI) from B. sphaericus and a γ-d-glutamyl-l-diamino acid endopeptidase from A. variabilis (AvPCP). The sequence numbering and secondary-structure elements of YkfC from B. cereus and AvPCP are indicated at the top and bottom, respectively. The alignment was generated by merging and manually editing the structure-based sequence alignment of BcYkfC and AvPCP with the sequence alignment of the top three sequences. The active-site residues are marked with colored dots at the bottom (blue, S2; orange, S1; red, catalytic triad; green, potential S′ sites).
Figure 6Models of substrate recognition by BcYkfC. (a) l-Ala-γ-d-Glu-DAP was docked into the active site of BcYkfC. The protein surface is colored according to a gradient in electrostatic potential from negative (red) to positive (blue) (MOE 2008.10; Chemical Computing Group Inc.). (b) Stereoview of the specific interactions (four polar, one nonpolar) of γ-d-Glu (cyan) in the context of the tripeptide by five residues of YkfC (Tyr226, Trp228, Asp237, Ser239 and Ser257). The protein residues are colored according to subsite (S1, orange; catalytic triad, magenta; S1′, green). (c) Sequence conservation of the active sites in NlpC/P60 domains based on 2277 NlpC/P60 domains with an intact catalytic dyad (Cys238 and His291) and a conserved Tyr226 (blue, S2; orange, S1; red, catalytic triad; green, S′ sites). (d) Sequence conservation of the active sites in the YkfC subfamily of NlpC/P60 enzymes based on 282 sequences selected based on the presence of a conserved aspartate residue at position 256 of BcYkfC. The conservation of this residue is highly correlated with a conserved tyrosine in the βD region of the SH3b domain (Tyr118 of BcYkfC); both of these residues interact with the free amine of l-Ala of the substrate.
Figure 7Structural comparisons between BcYkfC and the cyanobacterial NlpC/P60 endopeptidase AvPCP. (a) BcYkfC and AvPCP (PDB code 2hbw; Xu et al., 2009 ▶) are shown with the same orientation of their common SH3b1 and NlpC/P60 domains. The structurally equivalent residues in each are shown in red. (b) Stereoview of the Cα traces of the two proteins shown in (a) [the coloring is the same as in (a)]. (c) The S binding sites of the two proteins are nearly identical. The corresponding residues of AvPCP are labeled in parentheses. (d) Comparison of the active-site cavities and their environments. The catalytic cysteine is shown in white. A stick model of a docked murein tripeptide is shown in the active site of BcYkfC.