Literature DB >> 20836087

The crystal structure of a bacterial Sufu-like protein defines a novel group of bacterial proteins that are similar to the N-terminal domain of human Sufu.

Debanu Das1, Robert D Finn, Polat Abdubek, Tamara Astakhova, Herbert L Axelrod, Constantina Bakolitsa, Xiaohui Cai, Dennis Carlton, Connie Chen, Hsiu-Ju Chiu, Michelle Chiu, Thomas Clayton, Marc C Deller, Lian Duan, Kyle Ellrott, Carol L Farr, Julie Feuerhelm, Joanna C Grant, Anna Grzechnik, Gye Won Han, Lukasz Jaroszewski, Kevin K Jin, Heath E Klock, Mark W Knuth, Piotr Kozbial, S Sri Krishna, Abhinav Kumar, Winnie W Lam, David Marciano, Mitchell D Miller, Andrew T Morse, Edward Nigoghossian, Amanda Nopakun, Linda Okach, Christina Puckett, Ron Reyes, Henry J Tien, Christine B Trame, Henry van den Bedem, Dana Weekes, Tiffany Wooten, Qingping Xu, Andrew Yeh, Jiadong Zhou, Keith O Hodgson, John Wooley, Marc-André Elsliger, Ashley M Deacon, Adam Godzik, Scott A Lesley, Ian A Wilson.   

Abstract

Sufu (Suppressor of Fused), a two-domain protein, plays a critical role in regulating Hedgehog signaling and is conserved from flies to humans. A few bacterial Sufu-like proteins have previously been identified based on sequence similarity to the N-terminal domain of eukaryotic Sufu proteins, but none have been structurally or biochemically characterized and their function in bacteria is unknown. We have determined the crystal structure of a more distantly related Sufu-like homolog, NGO1391 from Neisseria gonorrhoeae, at 1.4 Å resolution, which provides the first biophysical characterization of a bacterial Sufu-like protein. The structure revealed a striking similarity to the N-terminal domain of human Sufu (r.m.s.d. of 2.6 Å over 93% of the NGO1391 protein), despite an extremely low sequence identity of ∼15%. Subsequent sequence analysis revealed that NGO1391 defines a new subset of smaller, Sufu-like proteins that are present in ∼200 bacterial species and has resulted in expansion of the SUFU (PF05076) family in Pfam.

Entities:  

Mesh:

Substances:

Year:  2010        PMID: 20836087      PMCID: PMC3005784          DOI: 10.1002/pro.497

Source DB:  PubMed          Journal:  Protein Sci        ISSN: 0961-8368            Impact factor:   6.725


Introduction

The Hedgehog (Hh) and Wingless (Wnt) signaling pathways are critical for animal development.1–4 They are important in embryonic growth in vertebrates and stem cell differentiation. Disruption of these pathways can lead to disruption of stem cell function and cancer, for example.5,6 Recent studies have addressed how these pathways may have evolved.7–9 The human Sufu (Suppressor of Fused) protein is a key transcriptional regulator in the Hh pathway. Until recently, 20 eukaryotic, two-domain proteins of ∼500 residues [N-terminal domain: Pfam SUFU family (PF05076) and C-terminal domain: Pfam SUFU_C family (PF12470)] and 13 bacterial Sufu-like single-domain proteins of unknown function (∼250 residues, with ∼25–40% sequence identity to the N-terminal Sufu domain of eukaryotic proteins) were classified in Pfam (version 23.0).10 The Sufu_C domain interacts with the N-terminal domain of Gli transcription factors, whereas the Sufu domain is believed to interact with the C-terminal tail of Gli.11,12 Apart from general classification of these bacterial proteins as Sufu-like, none had been investigated experimentally and their roles were consequently unclear, since bacteria do not use Hedgehog signaling. As part of our efforts to characterize novel protein sequence space, we identified a set of proteins in Neisseria species that are distantly related to members of PF05076. This similarity was only detectable using profile–profile sequence comparison methods.13 NGO1391 (UniProt id Q5F6Z8) from Neisseria gonorrhea FA1090 was selected as a representative of this set for structure determination, using the semiautomated, high-throughput pipeline of the Joint Center for Structural Genomics (JCSG; http://www.jcsg.org) as part of the National Institute of General Medical Sciences' Protein Structure Initiative.

Results and Discussion

Overall structure

NGO1391 from Neisseria gonorrhoeae FA 1090 was cloned, expressed, purified, and crystallized according to JCSG protocols as described in Materials and Methods. The crystal structure (Fig. 1) was determined by single-wavelength anomalous diffraction (SAD) phasing to a resolution of 1.40 Å. Data collection, model, and refinement statistics are summarized in Table I.14 The final model contains one monomer consisting of Gly0 (from the purification tag) and residues 1–181 of NGO1391 (the full-length protein is 182 residues), 265 waters, 4 sulfates, and 1 glycerol molecule in the asymmetric unit (ASU). Residue 182 was disordered and was not modeled. A monomer is the likely oligomeric form in solution as judged from crystal lattice packing and assembly analysis using PISA, and the monomeric form is supported by analytical size-exclusion chromatography. The Matthews' coefficient (VM)15 is 2.5 Å3/Da, with an estimated solvent content of 50%. The Ramachandran plot produced by Molprobity16 shows that 97.8% of the amino acids are in the favored regions with one outlier, Asp132, which is in a region of good density, but with alternate main-chain and side-chain conformations.
Figure 1

Crystal structure of NGO1391 from Neisseria gonorrhoea FA1090. (a) Stereo ribbon diagram of the NGO1391 monomer color-coded from N-terminus (yellow) to C-terminus (magenta). Helices H1–H8 (helix H5 is a 310 helix) and β-strands β1-β7 are indicated. (b) Diagram showing the secondary structural elements of NGO1391 superimposed on its primary sequence. The α-helices, 310 helix, and β-strands are indicated.

Table I

Summary of Crystal Parameters, Data Collection, and Refinement Statistics for PDB 3k5j

Space groupP3221
Unit cell parametersa = 50.72 Å, b = 50.72 Å, c = 143.60 Å
Data collectionλ1 SAD-Se
 Wavelength (Å)0.9789
 Resolution range (Å)27.7–1.40
 Number of observations365,215
 Number of unique reflections43,173
 Completeness (%)99.7 (98.6)a
 Mean I/σ (I)17.2 (1.8)a
Rsym on I (%)6.0 (76.1)a
 Highest resolution shell (Å)1.45–1.40
Model and refinement statistics
 Resolution range (Å)27.7–1.40Data set used in refinementλ1 SAD-Se
 Number of reflections (total)43,107bCutoff criteria|F|>0
 Number of reflections (test)2169Rcryst0.134
 Completeness (% total)99.8Rfree0.163
Stereochemical parameters
 Restraints (RMSD observed)
  Bond angle (°)1.60
  Bond length (Å)0.015
  Average protein isotropic B-value (Å2)20.3c
  ESU based on Rfree (Å)0.05
  No. of protein residues/atoms182/1593
  No. of water/sulfate/glycerol molecules265/4/1

ESU = Estimated overall coordinate error.14

Rsym = Σ|Ii−〈Ii〉|/Σ|Ii|, where Ii is the scaled intensity of the ith measurement and 〈Ii〉 is the mean intensity for that reflection.

Rcryst = Σ||Fobs|−|Fcalc||/Σ|Fobs|, where Fcalc and Fobs are the calculated and observed structure factor amplitudes, respectively.

Rfree = as for Rcryst, but for 5.0% of the total reflections chosen at random and omitted from refinement.

Highest resolution shell.

Typically, the number of unique reflections used in refinement is slightly less that the total number that were integrated and scaled. Reflections are excluded due to negative intensities and rounding errors in the resolution limits and cell parameters.

This value represents the total B.

Crystal structure of NGO1391 from Neisseria gonorrhoea FA1090. (a) Stereo ribbon diagram of the NGO1391 monomer color-coded from N-terminus (yellow) to C-terminus (magenta). Helices H1–H8 (helix H5 is a 310 helix) and β-strands β1-β7 are indicated. (b) Diagram showing the secondary structural elements of NGO1391 superimposed on its primary sequence. The α-helices, 310 helix, and β-strands are indicated. Summary of Crystal Parameters, Data Collection, and Refinement Statistics for PDB 3k5j ESU = Estimated overall coordinate error.14 Rsym = Σ|Ii−〈Ii〉|/Σ|Ii|, where Ii is the scaled intensity of the ith measurement and 〈Ii〉 is the mean intensity for that reflection. Rcryst = Σ||Fobs|−|Fcalc||/Σ|Fobs|, where Fcalc and Fobs are the calculated and observed structure factor amplitudes, respectively. Rfree = as for Rcryst, but for 5.0% of the total reflections chosen at random and omitted from refinement. Highest resolution shell. Typically, the number of unique reflections used in refinement is slightly less that the total number that were integrated and scaled. Reflections are excluded due to negative intensities and rounding errors in the resolution limits and cell parameters. This value represents the total B. NGO1391 is a compact molecule with approximate dimensions of 50 Å × 40 Å × 30 Å and has a tapered appearance due to a bulkier N-terminal region. It consists of a central, slightly S-shaped, antiparallel β-sheet (β1–β7) flanked by α- and 310-helices (H1–H8) (Fig. 1). Some of the loops are long and extend out from the protein core. The C-terminus consists of a long tail with no secondary structure (residues 165–181). A glycerol molecule from the cryoprotectant binds in a relatively hydrophobic cleft formed by Ser119, Tyr122, Trp138, Leu140, Glu159, Phe162, Asp163, Ile167, and Tyr169 near the C-terminus. Most of these residues are conserved in the newly identified Sufu-like proteins (see Sequence Analysis) and three are also conserved in human Sufu (Ser119, Leu140 and Ile167 correspond to human Sufu Thr180, Val201, and Ile228, respectively).

Comparison to human Sufu

A search for other proteins of similar structure using sequence and structure-based methods was carried out using FFAS,13 DALI,17 FATCAT,18 and SSM.19 In all cases, the only significant hit (for example, the e-value of the top FATCAT hit is 3.95e-10 and the next hit is 4.35e-03; the top DALI hit has a Z-score of 16.7 and the next hit has 5.2, the top FFAS hit has a score of −49.4 and the next hit is −5.7) was to the N-terminal domain (NTD, 236 out of 484 residues) of the human Sufu protein (PDB accession code 1m1l,12 UniProt Q9UMX1). NGO1391 can be superimposed onto the NTD of Sufu with an r.m.s.d. of 2.6 Å over 170 Cα atoms (93% of the NGO1391 protein) with structure-based (DALI) and sequence-based (FFAS) sequence identities of 15% (Fig. 2). The notable structural differences between NGO1391 and the NTD of Sufu include: short helices H2 and H3 in NGO1391 correspond to long loops in Sufu; longer loops in Sufu occur between strands β2 and β3, β3 and H3, and β4 and H4; and additional helices are present near the C-terminus.
Figure 2

Structural comparison of NGO1391 and human Sufu protein. The structures of NGO1391 and the N-terminal domain (NTD) of human Sufu protein (gray) are highly similar and superimpose with an r.m.s.d. of 2.6 Å over 170 Cα atoms.

Structural comparison of NGO1391 and human Sufu protein. The structures of NGO1391 and the N-terminal domain (NTD) of human Sufu protein (gray) are highly similar and superimpose with an r.m.s.d. of 2.6 Å over 170 Cα atoms. Human full-length Sufu was found to be a monomer in solution (n.b. the NTD of Sufu forms a tetramer in the crystal structure and only the NTD 3D structure of human Sufu is available). In its inactive signaling state, the NTD of Sufu interacts with the C-terminal region of Gli transcription factors and prevents their transport into the nucleus. In the active form, which is phosphorylation dependent, Sufu forms a complex with Fused, thereby releasing Gli. Asp159 of Sufu was found to be important in the interaction with Gli.12 This residue corresponds to Lys99 in NGO1391 (Fig. 3), which in itself is not highly conserved across the bacterial Sufu-like proteins. Seven residues contribute to a more negatively charged region on one side of Sufu: Glu106, Asp111, Glu152, Asp159, Glu181, Glu221, and Asp262 (Fig. 4). These residues are not conserved across bacterial Sufu-like proteins. Both sides of NGO1391 appear to have a similar number of negatively charged residues (Fig. 4). Some other residues are more strongly conserved between the eukaryotic and bacterial forms of the protein with several on the protein surface (NGO1391 Thr24, Glu49, Trp51, Tyr53, Pro106, Ser111, and His115) that could be functionally important.
Figure 3

Multiple sequence alignment of Sufu-like proteins. The Neisseria proteins with UniProt codes Q5F6Z8 (NGO1391) from N. gonorrhoeae FA1090, C1HYY6 from N. gonorrhoeae 1291, B4RNC6 from N. gonorrhoeae NCCP11945, C0EME3 from N. flavescens NRL30031/H210, A1KSR8 from N. meningitidis serogroup C/serotype 2a, C0DN13 from N. cinerea ATCC14685 and C0F5L2 from N. lactamica ATCC23970 share ∼92% sequence identity. More distant homologs with ∼33–39% sequence identity are B5HWZ3 from Streptomyces sviceus ATCC 29083, C2A7G8 from Thermomonospora curvata DSM 43183, C1YUI8 from Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111, and A6A6Q5 from Vibrio cholerae MZO-2. The alignment was generated using the CLUSTALW web-server.20

Figure 4

Electrostatic surface potential representations of human Sufu and NGO1391. (a) Surface for the NTD of human Sufu shows a greater accumulation of negatively charged residues on one side (left panel) of the protein due to surface residues Glu106, Asp111, Glu152, Asp159, Glu181, Glu221, and Asp262. (b) Surface for the NGO1391 indicates that charged residues are equally distributed on both sides of the protein. The color scale is in units of kT/e from −9 to +9. The figure in the right panel represents a 180° rotation around the vertical axis compared to the left panel.

Multiple sequence alignment of Sufu-like proteins. The Neisseria proteins with UniProt codes Q5F6Z8 (NGO1391) from N. gonorrhoeae FA1090, C1HYY6 from N. gonorrhoeae 1291, B4RNC6 from N. gonorrhoeae NCCP11945, C0EME3 from N. flavescens NRL30031/H210, A1KSR8 from N. meningitidis serogroup C/serotype 2a, C0DN13 from N. cinerea ATCC14685 and C0F5L2 from N. lactamica ATCC23970 share ∼92% sequence identity. More distant homologs with ∼33–39% sequence identity are B5HWZ3 from Streptomyces sviceus ATCC 29083, C2A7G8 from Thermomonospora curvata DSM 43183, C1YUI8 from Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111, and A6A6Q5 from Vibrio cholerae MZO-2. The alignment was generated using the CLUSTALW web-server.20 Electrostatic surface potential representations of human Sufu and NGO1391. (a) Surface for the NTD of human Sufu shows a greater accumulation of negatively charged residues on one side (left panel) of the protein due to surface residues Glu106, Asp111, Glu152, Asp159, Glu181, Glu221, and Asp262. (b) Surface for the NGO1391 indicates that charged residues are equally distributed on both sides of the protein. The color scale is in units of kT/e from −9 to +9. The figure in the right panel represents a 180° rotation around the vertical axis compared to the left panel.

Sequence analysis and genomic context

At the time of the NGO1391 structure determination, Pfam (version 23.0) SUFU family (PF05076) contained 33 proteins from 27 species, 20 of which were eukaryotic including the N-terminal domain of human Sufu and 13 were bacterial. The eukaryotic Sufu proteins (primarily from chordates and insects) all contained ∼500 residues, and consisted of two domains of approximately equal size (Sufu and Sufu_C). The bacterial proteins, on the other hand, were uncharacterized proteins of only around 250 residues. A search against the UniProt database using PSI-BLAST revealed that NGO1391 (Q5F6Z8) had several sequence homologs of ∼92% sequence identity in other Neisseria species: N. gonorrhoeae 1291 (C1HYY6), N. gonorrhoeae NCCP11945 (B4RNC6), N. flavescens NRL30031/H210 (C0EME3), N. meningitidis serogroup C/serotype 2a (A1KSR8), N. cinerea ATCC14685 (C0DN13), and N. lactamica ATCC23970 (C0F5L2). The next nearest hits were to proteins with ∼33–39% sequence identity (B5HWZ3 from Streptomyces sviceus ATCC 29083, C2A7G8 from Thermomonospora curvata DSM 43183, C1YUI8 from Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 and A6A6Q5 from Vibrio cholerae MZO-2). However, none of these proteins (Fig. 3) were included in PF05076. Three rounds of iterative PSI-BLAST, including sequence similarities down to a threshold of 0.1, identified ∼300 additional bacterial proteins, in addition to the 33 vertebrate and bacterial Sufu proteins already present in PF05076. Pairwise comparisons revealed that the bacterial proteins already included in PF05076 had sequence identities of ∼25–40% to the eukaryotic Sufu proteins, but less than 16% identity to the additional Sufu-like proteins identified here. A multiple sequence alignment of a representative set of these new Sufu-like proteins identified residues that are invariant or similar in these proteins (Fig. 3) and their distribution in the structure (Fig. 5). Based on this analysis, Pfam PF05076 was independently revised to include these proteins using the HMMER3 software (http://hmmer.janelia.org). Thus, PF05076 in the current release of Pfam (v. 24.0, October 2009) now contains 341 proteins from 31 eukaryotic and 220 bacterial species. Inspection of the residue conservation scores in the multiple sequence alignment for this revised protein family on the Pfam website (http://pfam.sanger.ac.uk/family/PF05076), reveals that the only residues that that are conserved across the entire protein family (341 proteins) are Val134, Phe136, Leu139, Ile142, Glu146, and Leu158 in NGO1391. Of these residues, only Val134 is surface exposed whereas the rest are in the protein core.
Figure 5

Sequence conservation plot of NGO1391. The Sufu-like proteins from Figure 3 have been used for the calculation of conservation scores21 (a) A stereo stick representation in which the most conserved residues are represented in dark pink, the light pink residues are less conserved and residues in cyan are least conserved. (b) and (c) Two views of some of the most conserved surface exposed residues in the same color scheme. The molecular orientations in the top and bottom panels are different to facilitate visualization using PyMOL.

Sequence conservation plot of NGO1391. The Sufu-like proteins from Figure 3 have been used for the calculation of conservation scores21 (a) A stereo stick representation in which the most conserved residues are represented in dark pink, the light pink residues are less conserved and residues in cyan are least conserved. (b) and (c) Two views of some of the most conserved surface exposed residues in the same color scheme. The molecular orientations in the top and bottom panels are different to facilitate visualization using PyMOL. The genomic neighbors of NGO1391 are putative MafB-like (NGO1392) and MafA2 (NGO1393) proteins that are predicted to be functional partners of NGO1391 with high scores of 0.87 and 0.86, respectively.22 The Maf proteins are believed to have adhesion capabilities.23,24 Interestingly, this predicted association is similar to that of the β-catenin in the Wnt signaling pathway, where β-catenin can form a complex with the cadherin cell-adhesion molecules. β-catenin also functions as a transcriptional regulator. However, no sequence or structural similarity is found between NGO1391 and β-catenin and this genomic context does not appear to be conserved across other members of this Pfam. The crystal structure and sequence analysis presented here provides the first molecular characterization of a bacterial Sufu-like protein that may be a prokaryotic homolog of modern Sufu proteins involved in embryonic development in vertebrates. This study has also led to identification of a large number of other bacterial Sufu-like proteins. Mechanistic and mutagenesis experiments can help to elucidate which residues are important for molecular function and substrate specificity and aid in understanding the function of NGO1391 and related proteins. Additional information about the proteins described in this study is available from TOPSAN25,26 at http://www.topsan.org/explore?PDBid=3k5j.

Materials and Methods

Protein expression, purification, and crystallization

Clones were generated using the Polymerase Incomplete Primer Extension (PIPE) cloning method.27 The gene encoding NGO1391 (GenBank: YP_208451, gi|59801739, UniProt: Q5F6Z8) was amplified by polymerase chain reaction (PCR) from Neisseria gonorrhoeae genomic DNA using PfuTurbo DNA polymerase (Stratagene) and I-PIPE (Insert) primers (forward primer: 5′-ctgtacttccagggc ATGGACTATAA CCAAACTGTTTTATCTC3′, reverse primer: 5′-aat taagtcgcgtta TCCTGCCTGCCAGACAGTACTCGC ACG-3′, target sequence in upper case) that included sequences for the predicted 5′ and 3′ ends. The expression vector, pSpeedET, which encodes an amino-terminal tobacco etch virus (TEV) protease-cleavable expression and purification tag (MGSDK IHHHHHHENLYFQ/G), was PCR amplified with V-PIPE (Vector) primers (forward primer: 5′-taacgc gacttaattaactcgtttaaacggtctccagc-3′, reverse primer: 5′-gccctggaagtacaggttttcgtgatgatgatgatgatg-3′). V-PIPE and I-PIPE PCR products were mixed to anneal the amplified DNA fragments together. Escherichia coli GeneHogs (Invitrogen) competent cells were transformed with the I-PIPE/V-PIPE mixture and dispensed on selective LB-agar plates. The cloning junctions were confirmed by DNA sequencing. Expression was performed in a selenomethionine-containing medium at 37°C. Selenomethionine was incorporated via inhibition of methionine biosynthesis, which does not require a methionine-auxotrophic strain.28 At the end of fermentation, lysozyme was added to the culture to a final concentration of 250 μg/mL, and the cells were harvested and frozen. After one freeze/thaw cycle the cells were sonicated in lysis buffer [50 mM HEPES pH 8.0, 50 mM NaCl, 10 mM imidazole, 1 mM Tris(2-carboxyethyl)phosphine-HCl (TCEP)] and the lysate was clarified by centrifugation at 32,500 × g for 30 minutes. The soluble fraction was passed over nickel-chelating resin (GE Healthcare) pre-equilibrated with lysis buffer, the resin was washed with wash buffer [50 mM HEPES pH 8.0, 300 mM NaCl, 40 mM imidazole, 10% (v/v) glycerol, 1 mM TCEP], and the protein was eluted with elution buffer [20 mM HEPES pH 8.0, 300 mM imidazole, 10% (v/v) glycerol, 1 mM TCEP]. The eluate was buffer exchanged with TEV buffer [20 mM HEPES pH 8.0, 200 mM NaCl, 40 mM imidazole, 1 mM TCEP] using a PD-10 column (GE Healthcare), and incubated with 1mg of TEV protease per 15 mg of eluted protein. The protease-treated eluate was passed over nickel-chelating resin (GE Healthcare) pre-equilibrated with HEPES crystallization buffer [20 mM HEPES pH 8.0, 200 mM NaCl, 40 mM imidazole, 1 mM TCEP] and the resin was washed with the same buffer. The flow-through and wash fractions were combined and concentrated for crystallization trials to 13 mg/mL by centrifugal ultrafiltration (Millipore). NGO1391 was crystallized by mixing 200 nL protein solution with 200 nL crystallization solution in a sitting drop format over a 50 μL reservoir volume using the nanodroplet vapor diffusion method29 with standard JCSG crystallization protocols.30 The crystallization reagent consisted of 1.6M ammonium sulfate, and 0.1M MES pH 6.0. Glycerol was diluted using the reservoir solution and then added 1:1 to the drop to a final concentration 20% (v/v) as a cryoprotectant prior to harvesting. A cubic-shaped crystal of approximate size 100 μm × 100 μm × 100 μm was harvested after 15 days at 277 K for data collection. Initial screening for diffraction was carried out using the Stanford Automated Mounting system (SAM)31 at the Stanford Synchrotron Radiation Lightsource (SSRL, SLAC National Accelerator Laboratory, Menlo Park, CA). The diffraction data were indexed in trigonal space group P3221. Protein concentrations were determined using the Coomassie Plus assay (Pierce). To determine its oligomeric state in solution, NGO1391 was analyzed using a 1 × 30 cm2 Superdex 200 size-exclusion column (GE Healthcare). The mobile phase consisted of 20 mM Tris pH 8.0, 150 mM NaCl, and 0.02% (w/v) sodium azide. The molecular weight was calculated using ASTRA 5.1.5 software (Wyatt Technology).

Data collection, structure solution, and refinement

SAD data were collected at 100 K using a MarMosaic 325 CCD detector (Rayonix) at the SSRL on beamline 11–1 at the selenium peak (λ1) wavelength using the BLU-ICE32 data collection environment. The SAD data were integrated and reduced using XDS33 and XSCALE, respectively. All other programs for data manipulation were from the CCP4 suite.34 The heavy atom substructure and phasing calculations were performed using SHELXD35 and autoSHARP36, respectively, and ARP/wARP37 was used for automatic model building to 1.40 Å resolution. Model completion and crystallographic refinement were performed using COOT38 and REFMAC5,39 respectively. The refinement protocol included the experimental phase restraints in the form of Hendrickson–Lattman coefficients from autoSHARP. Data and refinement statistics are summarized in Table I.

Validation and deposition

The quality of the crystal structure was analyzed using the JCSG Quality Control server (http://smb.slac.stanford.edu/jcsg/QC). This server automatically processes the coordinates and data through a variety of validation tools including AutoDepInputTool,40 MolProbity,16 WHATIF 5.0;41 agreement between the atomic model and the data using SFcheck 4.042 and RESOLVE,43 the protein sequence using CLUSTALW,44 atomic occupancies using MOLEMAN2,45 consistency of NCS pairs and evaluates difference in Rcryst/Rfree, expected Rfree/Rcryst and maximum/minimum B-factors by parsing the refinement log-file and PDB header. Protein quaternary structure analysis was performed using the PISA server,46 Figure 1(B) was adapted from an analysis using PDBsum,47 and all others were prepared with PyMOL.48 Figure 3 was prepared using the PDB2PQR49 server and the APBS49,50 module in PyMOL. Atomic coordinates and experimental structure factors for NGO1391 have been deposited in the PDB under the accession code 3k5j.
  47 in total

1.  SFCHECK: a unified set of procedures for evaluating the quality of macromolecular structure-factor data and their agreement with the atomic model.

Authors:  A A Vaguine; J Richelle; S J Wodak
Journal:  Acta Crystallogr D Biol Crystallogr       Date:  1999-01-01

2.  Electrostatics of nanosystems: application to microtubules and the ribosome.

Authors:  N A Baker; D Sept; S Joseph; M J Holst; J A McCammon
Journal:  Proc Natl Acad Sci U S A       Date:  2001-08-21       Impact factor: 11.205

3.  WHAT IF: a molecular modeling and drug design program.

Authors:  G Vriend
Journal:  J Mol Graph       Date:  1990-03

4.  Combining the polymerase incomplete primer extension method for cloning and mutagenesis with microscreening to accelerate structural genomics efforts.

Authors:  Heath E Klock; Eric J Koesema; Mark W Knuth; Scott A Lesley
Journal:  Proteins       Date:  2008-05-01

5.  Multiple sequence alignment using ClustalW and ClustalX.

Authors:  Julie D Thompson; Toby J Gibson; Des G Higgins
Journal:  Curr Protoc Bioinformatics       Date:  2002-08

6.  Features and development of Coot.

Authors:  P Emsley; B Lohkamp; W G Scott; K Cowtan
Journal:  Acta Crystallogr D Biol Crystallogr       Date:  2010-03-24

7.  Suppressor of fused regulates Gli activity through a dual binding mechanism.

Authors:  Mark Merchant; Felix F Vajdos; Mark Ultsch; Henry R Maun; Ulrich Wendt; Jennifer Cannon; William Desmarais; Robert A Lazarus; Abraham M de Vos; Frederic J de Sauvage
Journal:  Mol Cell Biol       Date:  2004-10       Impact factor: 4.272

8.  Atomic structures of the human immunophilin FKBP-12 complexes with FK506 and rapamycin.

Authors:  G D Van Duyne; R F Standaert; P A Karplus; S L Schreiber; J Clardy
Journal:  J Mol Biol       Date:  1993-01-05       Impact factor: 5.469

9.  ConSurf 2005: the projection of evolutionary conservation scores of residues on protein structures.

Authors:  Meytal Landau; Itay Mayrose; Yossi Rosenberg; Fabian Glaser; Eric Martz; Tal Pupko; Nir Ben-Tal
Journal:  Nucleic Acids Res       Date:  2005-07-01       Impact factor: 16.971

10.  Evolution of hedgehog and hedgehog-related genes, their origin from Hog proteins in ancestral eukaryotes and discovery of a novel Hint motif.

Authors:  Thomas R Bürglin
Journal:  BMC Genomics       Date:  2008-03-11       Impact factor: 3.969

View more
  3 in total

Review 1.  Resilience of biochemical activity in protein domains in the face of structural divergence.

Authors:  Dapeng Zhang; Lakshminarayan M Iyer; A Maxwell Burroughs; L Aravind
Journal:  Curr Opin Struct Biol       Date:  2014-06-19       Impact factor: 6.809

Review 2.  Mechanisms and functions of Hedgehog signalling across the metazoa.

Authors:  Philip W Ingham; Yoshiro Nakano; Claudia Seger
Journal:  Nat Rev Genet       Date:  2011-04-19       Impact factor: 53.242

3.  A novel immunity system for bacterial nucleic acid degrading toxins and its recruitment in various eukaryotic and DNA viral systems.

Authors:  Dapeng Zhang; Lakshminarayan M Iyer; L Aravind
Journal:  Nucleic Acids Res       Date:  2011-02-08       Impact factor: 16.971

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.