| Literature DB >> 25010333 |
Alexander Eletsky1, Karolina Michalska2, Scott Houliston3, Qi Zhang1, Michael D Daily4, Xiaohui Xu5, Hong Cui5, Adelinda Yee3, Alexander Lemak3, Bin Wu3, Maite Garcia3, Meagan C Burnet4, Kristen M Meyer4, Uma K Aryal4, Octavio Sanchez4, Charles Ansong4, Rong Xiao6, Thomas B Acton6, Joshua N Adkins4, Gaetano T Montelione6, Andrzej Joachimiak2, Cheryl H Arrowsmith3, Alexei Savchenko7, Thomas Szyperski1, John R Cort4.
Abstract
Bacterial species in the Enterobacteriaceae typically contain multiple paralogues of a small domain of unknown function (DUF1471) from a family of conserved proteins also known as YhcN or BhsA/McbA. Proteins containing DUF1471 may have a single or three copies of this domain. Representatives of this family have been demonstrated to play roles in several cellular processes including stress response, biofilm formation, and pathogenesis. We have conducted NMR and X-ray crystallographic studies of four DUF1471 domains from Salmonella representing three different paralogous DUF1471 subfamilies: SrfN, YahO, and SssB/YdgH (two of its three DUF1471 domains: the N-terminal domain I (residues 21-91), and the C-terminal domain III (residues 244-314)). Notably, SrfN has been shown to have a role in intracellular infection by Salmonella Typhimurium. These domains share less than 35% pairwise sequence identity. Structures of all four domains show a mixed α+β fold that is most similar to that of bacterial lipoprotein RcsF. However, all four DUF1471 sequences lack the redox sensitive cysteine residues essential for RcsF activity in a phospho-relay pathway, suggesting that DUF1471 domains perform a different function(s). SrfN forms a dimer in contrast to YahO and SssB domains I and III, which are monomers in solution. A putative binding site for oxyanions such as phosphate and sulfate was identified in SrfN, and an interaction between the SrfN dimer and sulfated polysaccharides was demonstrated, suggesting a direct role for this DUF1471 domain at the host-pathogen interface.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25010333 PMCID: PMC4092069 DOI: 10.1371/journal.pone.0101787
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1DUF 1471 sequences.
A: Multiple sequence alignment of DUF1471 paralogues from S. Typhimurium, as well as E. coli YbiM, for which there is no close homolog in Salmonella. Alignment of SrfN, YahO, SssB-I and SssB-III is structure-based over the entire structured sequence (SrfN residues 22–91), other alignments are sequence-based and are between core regions only (SrfN residues 35–91) because sequence identity to SrfN residues 22–38 is low and alignments in this region are uncertain. Secondary structure in SrfN, YahO, SssB-I, and SssB-III is indicated above: E = extended (β-sheet) structure, H = helix. The core residues of the sulfate-binding motif in SrfN are indicated with asterisks. Conserved sequence motifs identified by Rudd [1] are underlined. Other conserved residues are highlighted in green or dark grey. Two notable loop regions in the structure are also indicated. SrfN and YahO both have C-terminal tag sequences LEHHHHHH that are not shown. Light grey highlighted portions indicate likely signal sequences for periplasmic localization that are known or likely to be cleaved by a signal peptidase. In the case of SrfN and YahO, the signal sequence was proven experimentally to be cleaved during heterologous expression in E. coli. Inter-domain regions of SssB are not shown. Lower case letters in SssB-III (C-terminal domain) indicate residues with missing electron density in the X-ray structure. Highly conserved residues are indicated by highlighting (blue = hydrophobic, green = polar), somewhat conserved residues are indicated with grey highlighting. The following sequences are listed (S. Typhimurium LT2 locus and UniProt/TrEMBL numbers in parentheses): SrfN (STM0082/Q7CR88), YjfY (STM4389/Q8ZK84), YhcN (STM3361/Q8ZLP6), YcfR seq. I (STM1214/Q8ZQ03), YcfR seq. II (STM3362/Q7CPN0), YahO (STM0366/Q7CR49), YbiJ (STM0823/Q7CQW3), YkgI (STM0565/Q7CR04), YjfO (STM4379/Q8ZK92), YjfN (STM4378/Q8ZK93), SssB (STM1478/Q8ZPL1), YbiM/McbA (E. coli, P0AAX6). B: Unrooted phylogenetic tree (phylogram) constructed from ten diverse genera from the Enterobacteriaceae. Major branches containing Salmonella and E. coli subfamily members are indicated. C: Multiple sequence alignment of SrfN homologues: a subfamily of DUF1471 proteins. For each sequence and abbreviated organism name listed, the full genus and species name, protein/ORF name, database accession number, and similarity to SrfN, excluding the signal sequence, are as follows: Sty, Salmonella enterica Typhimurium, STM0082 (SrfN), NP_459087 and many other Salmonella strains; Sbo, Salmonella bongori, SBG_0068, YP_004728986 (93%); Cro, Citrobacter rodentium, ROD_12311, YP_003364817 (80%); Eho, Enterobacter hormaechei, HMPREF9086_0329, ZP_08496071 (65%); Eae, Enterobacter aerogenes EAE_13230, YP_004592839 (70%); Kpn, Klebsiella pneumonia, KPK_4095, YP_002239898 (68%); Pan, Pantoea sp., Pat9b_3745, YP_004117591 (61%). Notes: Other Salmonella, Klebsiella, and Enterobacter species and strains contain identical or nearly identical sequences to the representatives shown here. However, some Pantoea species do not contain homologues that fall within this DUF1471 subfamily.
Summary of NMR and structural statisticsa for Salmonella SrfN, YahO, and SssB-I.
| SrfN | YahO | SssB-I | ||||
|
| ||||||
| Backbone (%) | 100.0 | 100.0 | 92.1 | |||
| Side chain (%) | 97.6 | 99.1 | 89.3 | |||
| Stereospecific methyl (%) | 100.0 | 100.0 | 0.0 | |||
|
| ||||||
| Distance restraints | ||||||
| Total | 1876 | 1622 | 1212 | |||
| intra-residue ( | 264 | 269 | 157 | |||
| sequential (| | 362 | 411 | 348 | |||
| medium range (1< | | 378 | 335 | 207 | |||
| long range (| | 872 | 607 | 500 | |||
| intermolecular | 34 | – | – | |||
| Dihedral angle restraints | 182 | 100 | 78 | |||
| Hydrogen bond restraints | 78 | 60 | 0 | |||
| Total number of restricting restraints | 2210 | 1782 | 1290 | |||
| Restricting NOE restraints per residue | 14.8 | 24.4 | 19.0 | |||
| Long-range restraints per residue | 6.4 | 8.7 | 7.4 | |||
|
| ||||||
| Average distance restraint violations per structure | ||||||
| 0.1–0.2 Å | 0.1 | 8 | 4.3 | |||
| 0.2–0.5 Å | 0 | 1.25 | 0.4 | |||
| >0.5 Å | 0 | 0 | 0 | |||
| RMS violation per restraint/max viol. (Å) | 0.00/0.15 | 0.02/0.38 | 0.01/0.34 | |||
| Average dihedral restraint violations per structure | ||||||
| 1–10° | 0 | 6.75 | 3.1 | |||
| >10° | 0 | 0 | 0 | |||
| RMS viol. per restraint/max viol. (°) | 0.04/0.50 | 0.66/6.90 | 0.47/4.20 | |||
|
| ||||||
| chain A | A&B | |||||
| backbone atoms (C,Cα,N) | 0.5/0.6 | 0.7/0.8 | 0.4/0.7 | 0.5/0.8 | ||
| heavy atoms | 0.9/1.1 | 1.0/1.2 | 0.8/1.1 | 0.9/1.3 | ||
|
| 0.87/0.92/0.75 | 0.98/0.93/0.88 | 0.96/0.94/0.86 | |||
|
| ||||||
| most favored regions (%) | 96.3/94.2 | 98.1/92.1 | 96.0/94.5 | |||
| allowed regions (%) | 3.7/5.5 | 1.7/5.9 | 4.0/5.4 | |||
| disallowed regions (%) | 0.0/0.3 | 0.3/1.2 | 0.0/0.1 | |||
|
| raw | Z | raw | Z | raw | Z |
| Verify3D | 0.33 | −2.09 | 0.40 | −0.96 | 0.37 | −1.44 |
| ProsaII | 0.45 | −0.83 | 0.61 | −0.17 | 0.56 | −0.37 |
| Procheck G-factor (phi-psi) | −0.31 | −0.90 | −0.09 | −0.04 | −0.19 | −0.43 |
| Procheck G-factor (all dihedrals) | −0.23 | −1.36 | −0.12 | −0.71 | −0.19 | −1.12 |
| MolProbity clashscore | 16.28 | −1.27 | 12.04 | −0.54 | 13.26 | −0.75 |
Structural statistics were computed for ensembles of 20 deposited structures (PDB entries, SrfN: 2MA8, YahO: 2MA4, SssB-I: 2M2J2) using PSVS 1.4 [51], except as noted otherwise.
Computed the expected number of typically observed resonance peaks, excluding: highly exchangeable protons (N-terminal, Lys, and Arg amino groups, hydroxyls of Ser, Thr, Tyr), carbonyl carbons of Asp, Glu, Asn, and Gln side chains, non-protonated aromatic carbons, and the C-terminal His6 tag.
Average distance violations were calculated using the sum over r −6.
Ordered residue ranges [S(phi)+S(psi) >1.8] : SrfN:23–45,49–92; YahO: 23–45,49–73,84–90; SssB-I: 23–43,48–75,81–90.
All (excluding tags) residue ranges: SrfN: 22–96; YahO: 22–91; SssB-I: 22–91.
X-ray data collection and refinement statistics for SssB-III.
| Data collection | |
| Space group |
|
| Cell dimensions [Å],[°] |
|
|
| |
|
| |
| β = 109.4 | |
| Temperature [K] | 100 |
| Radiation source | APS, ID-19 |
| Wavelength [Å] | 0.9793 |
| Resolution [Å] | 50.0–1.45 (1.48–1.45) |
| Unique reflections | 26,374 (1279) |
| Rmerge
| 0.079 (0.576) |
| <I >/<σI> | 25.5 (2.2) |
| Completeness [%] | 99.8 (100) |
| Redundancy | 3.7 (3.6) |
|
| |
| Resolution [Å] | 36.92–1.45 |
| Reflections work/test set | 25,059/1,271 |
| Rwork/Rfree
| 0.133/0.172 |
| Twin fraction | 0.204 |
| No. of atoms protein/ligands/water | 1116/11/144 |
| Average B factor [Å2] | |
| protein/ligands/water | 19.0/32.5/33.1 |
| bond lengths [Å] | 0.016 |
| bond angles [°] | 1.45 |
| most favored | 97.64 |
| outliers | 0 |
| PDB entry | 4EVU |
Values in parentheses correspond to the highest resolution shell.
Rmerge = ΣhΣj|Ihj–
R = Σh|Fo|–|Fc|/Σh|Fo| for all reflections, where Fo and Fc are observed and calculated structure factors, respectively. Rfree is calculated analogously for the test reflections, randomly selected and excluded from the refinement.
Figure 2Structures and surface electrostatic characteristics of DUF1471 proteins.
A: SrfN (dimer), B: YahO, C: SssB-I, D: SssB-III. In addition to ribbon cartoons colored by secondary structure, NMR structures show 20-member ensembles as Cα traces superimposed over the ordered residues determined by PSVS [51]. Electrostatic surface calculations were calculated with APBS using the PyMol plugin, with default parameters (0.15 M salt, kT/e = −1 to 1, pH 7.0) for the first members of the SrfN, SssB-I, and YahO ensembles and for SssB-III after conversion with PDB2PQR. The van der Waals surfaces are shown and are colored according to the charge (red for negative, blue for positive) on the water-accessible surface. Hydrogen atoms were added to SssB-III. The C-terminal 6x-His tags were removed and replaced with carboxylates for the calculation. N-termini for SrfN and YahO did not have the cleaved signal sequence, the N-terminus of SssB-III began at residue Lys 247.
Comparison of DUF1471 structures. Pairwise Dali [20] was used to align structures and calculate RMSD. The average NMR structures were used for purposes of structure comparison.
| SssB-I_ave | SssB-III | YahO_ave | ||
|
| rmsd | 1.80 | 1.49 | 2.73 |
| #res aligned | 66 | 63 | 68 | |
| %id | 21% | 22% | 26% | |
|
| rmsd | 1.51 | 2.30 | |
| #res aligned | 68 | 63 | ||
| %id | 28% | 21% | ||
|
| rmsd | 2.46 | ||
| #res aligned | 70 | |||
| %id | 26% |
Protein structures from the Protein Data Bank that are similar (Z score >5.0) to DUF1471 proteins, determined with Dali [20]. Structures are X-ray structures except as noted.
| protein | PDBentries | domain length(residues) | aligned length(residues) | oligomer | rmsd toSssB-III (Å) | Z-score |
| periplasmic domain of RcsF outer | 2l8y (NMR) | 86 | 71 | monomer | 2.2 | 6.3 |
| membrane protein ( | 2y1b | 86 | 73 | 2.2 | 6.8 | |
| PFAM 11524 Archaeal Se- | 2jz7 | 81 | 73 | pentamer | 2.8 | 5.8 |
| binding protein | ||||||
| UPF0145/YbjQ superfamily | ||||||
| ( | 1vr4 | 103 | 68 | pentamer | 2.7 | 6.1 |
| 2gtc | 103 | 69 | 2.7 | 6.1 | ||
| ( | 1y2i | 107 | 69 | 2.4 | 5.3 | |
| DUF74 protein PCPN_1048 | 3qkb | 92 | 67 | pentamer | 2.2 | 6.6 |
| ( |
Figure 3Proteins structurally similar to DUF1471 proteins.
Similar structures were identified with Dali [20]. Additional details are presented in Table 4.
Figure 4Addition of ligands to SrfN.
A: SrfN–sucrose octasulfate titration monitored with 2-D 1H-15N HSQC. Superimposed spectra: blue, SrfN only; green and red, SrfN +5x and 10x molar excess sucrose octasulfate. B: Chemical shift perturbations following sucrose octasulfate (shown at right) addition mapped onto SrfN surface; the perspective is the same as in Fig. 2 where the positively-charged surface is shown (blue). Ribbon cartoon of SrfN from the same perspective is shown adjacent to the surface depiction. Side chains colored violet have >1 linewidth shift with sucrose octasulfate and similar shifts with heparin & high [SO4 2−] but are not conserved in SssB-III (Q24, Q28, A76). Side chains colored magenta have >1 linewidth shift and are conserved in SssB-III (K27/253, H73/294, E89/D311). A fourth conserved residue at the SO4-binding position (Y91/313) from the SssB crystal structure does not show chemical shift perturbation upon sucrose octasulfate addition to SrfN. The sulfate ion was positioned by a superposition of the SssB-III crystal structure on SrfN. C: Chemical shift perturbations in SssB-III upon titration of sucrose octasulfate, showing that interactions occur not at the sulfate-binding site common to SrfN, but at a patch of basic residues some distance away.
Figure 5Ligand binding sites predicted by coarse-grained simulations.
Predicted sucrose octasulfate interactions with (A) SrfN and with (B) SssB-III, and (C) predicted maltohexaose dodecasulfate interactions with SrfN. Red, yellow, and green indicate binding score levels of >0.1, >0.05, and >0.02, respectively. D and E: Low energy structures of models shown in panels A and B after being reverse-mapped to atomistic sucrose octasulfate in the space occupied by the coarse-grained equivalent, followed by 1000 steps of vacuum minimization in GROMACS [73] using the CHARMM force field to eliminate clashes, with ligand parameters were derived using SwissParam [74]. The all-atom models are shown for illustrative purposes and do not necessarily indicate global free energy minima at all-atom resolution.