| Literature DB >> 20734145 |
Yuefeng Tang1, William M Schneider, Yang Shen, Srivatsan Raman, Masayori Inouye, David Baker, Monica J Roth, Gaetano T Montelione.
Abstract
Determination of high-quality small protein structures by nuclear magnetic resonance (NMR) methods generally requires acquisition and analysis of an extensive set of structural constraints. The process generally demands extensive backbone and sidechain resonance assignments, and weeks or even months of data collection and interpretation. Here we demonstrate rapid and high-quality protein NMR structure generation using CS-Rosetta with a perdeuterated protein sample made at a significantly reduced cost using new bacterial culture condensation methods. Our strategy provides the basis for a high-throughput approach for routine, rapid, high-quality structure determination of small proteins. As an example, we demonstrate the determination of a high-quality 3D structure of a small 8 kDa protein, E. coli cold shock protein A (CspA), using <4 days of data collection and fully automated data analysis methods together with CS-Rosetta. The resulting CspA structure is highly converged and in excellent agreement with the published crystal structure, with a backbone RMSD value of 0.5 Å, an all atom RMSD value of 1.2 Å to the crystal structure for well-defined regions, and RMSD value of 1.1 Å to crystal structure for core, non-solvent exposed sidechain atoms. Cross validation of the structure with (15)N- and (13)C-edited NOESY data obtained with a perdeuterated (15)N, (13)C-enriched (13)CH(3) methyl protonated CspA sample confirms that essentially all of these independently-interpreted NOE-based constraints are already satisfied in each of the 10 CS-Rosetta structures. By these criteria, the CS-Rosetta structure generated by fully automated analysis of data for a perdeuterated sample provides an accurate structure of CspA. This represents a general approach for rapid, automated structure determination of small proteins by NMR.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20734145 PMCID: PMC2970817 DOI: 10.1007/s10969-010-9095-6
Source DB: PubMed Journal: J Struct Funct Genomics ISSN: 1345-711X
800 MHz triple resonance data used for determining backbone resonance assignments
| 15N-HSQC | HNcoCA | HNCO | HNCA | HNCACB | HNcoCACB | HNcaCO | |
|---|---|---|---|---|---|---|---|
| No. of points | |||||||
| Collected | 1024, 256 | 1024, 40, 50 | 1024, 40, 40 | 1024, 40, 50 | 1024, 64, 100 | 1024, 64, 100 | 1024, 40, 40 |
| After LP | 1024, 512 | 1024, 72, 82 | 1024, 72, 72 | 1024, 72, 82 | 1024, 96, 164 | 1024, 96, 164 | 1024, 72, 72 |
| After zero filling | 1024, 512 | 1024, 128, 128 | 1024, 128, 128 | 1024, 128, 128 | 1024, 128, 256 | 1024, 128, 256 | 1024, 128, 128 |
| No. of scans | 8 | 4 | 4 | 4 | 16 | 16 | 16 |
| Spectral width (ω1, ω2, ω3; ppm) | 14, 28 | 14, 23, 32 | 14, 23, 24 | 14, 23, 32 | 14, 28, 72 | 14, 28, 72 | 14, 23, 24 |
| Recycle delay (s) | 1 | 1 | 1 | 1 | 1 | 1 | 1 |
| Collection time (h) | 0.6 | 2.2 | 2.0 | 2.2 | 33.2 | 33.6 | 8.6 |
Fig. 1Summary of backbone and 13Cβ resonances assignments for CspA derived from triple resonance NMR experiments. Red bars and yellow bars underneath the amino acid sequence represent the connectivity established between intra and sequential residues respectively. These data were obtained by analyzing six 2D and 3D NMR spectra, summarized in Table 1. Slowly exchanging backbone amides, used in the conventional structure analysis but not in the CS-Rosetta analysis, identified by 1H/2H exchange measurements, are represented by filled circles. Secondary structures of the β-barrel found in the final structure are indicated by arrows along the amino acid sequence
Summary of structural statistics for E. coli CspA NMR structures
| Sparse-constraint NMR structurea | 3mefb | Sparse-constraint CS-Rosetta structurec | |
|---|---|---|---|
| Conformationally-restricting constraintsd | |||
| Distance constraints | |||
| Total | 131 | 131 | |
| Intra-residue ( | 17 | 17 | |
| Sequential (| | 45 | 45 | |
| Medium range (1 < | | 8 | 8 | |
| Long range (| | 61 | 61 | |
| Distance constraints per residue | 2.0 | 2.0 | |
| Dihedral angle constraints | 68 | 68 | |
| Hydrogen bond constraints | |||
| Total | 22 | 22 | |
| Long range (| | 20 | 20 | |
| Number of constraints per residue | 3.3 | 3.3 | |
| Number of long range constraints per residue | 1.2 | 1.2 | |
| Residual constraint violationsd | |||
| Average number of distance violations per structure | |||
| 0.1–0.2 Å | 1.4 | 0.9 | |
| 0.2–0.5 Å | 0 | 1.9 | |
| >0.5 Å | 0 | 3.7 | |
| Average RMS distance violation/constraint (Å) | 0.02 | 0.17 | |
| Maximum distance violation (Å) | 0.18 | 1.74 | |
| Average number of dihedral angle violations per residue | |||
| 1–10° | 3.6 | 3 | |
| >10° | 0 | 0.8 | |
| Average RMS dihedral angle violation/constraint (°) | 0.45 | 1.73 | |
| Maximum dihedral angle violation (°) | 3.4 | 16.70 | |
| RMSD from average coordinates (Å)d,e | |||
| Backbone atoms | 1.2 ± 0.2 | 0.5 ± 0.1 | 0.8 ± 0.2 |
| Heavy atoms | 1.7 ± 0.2 | 1.1 ± 0.1 | 1.2 ± 0.2 |
| RMSD from X-ray structure (Å)d,f | |||
| Backbone atoms | 1.58 ± 0.38 | 0.95 ± 0.11 | 0.52 ± 0.12 |
| Heavy atoms | 2.24 ± 0.34 | 1.63 ± 0.16 | 1.17 ± 0.11 |
| Sidechain RMSD from X-ray structure (Å)d,g | |||
| Heavy atoms | 1.75 ± 0.20 | 1.59 ± 0.15 | 0.86 ± 0.11 |
| Heavy sidechain atoms | 1.81 ± 0.23 | 1.93 ± 0.22 | 1.14 ± 0.12 |
| Ramachandran statisticsd,e | |||
| Most favored regions (%) | 92.0 | 78.3 | 93.7 |
| Additional allowed regions (%) | 8.0 | 21.7 | 6.3 |
| Generously allowed (%) | 0.0 | 0.0 | 0.0 |
| Disallowed regions (%) | 0.0 | 0.0 | 0.0 |
Analysis for residues 1–70, excluding disordered N-terminal expression tag
aStructure obtained from sparse NMR constraints
bNMR structure determined by conventional methods (PDB id 3mef)
cStructure obtained from CS-Rosetta structure generation, compared with constraints; note that these distance constraints were not used in generating the CS-Rosetta structure
dGenerated using PSVS 1.3 program. Average distance violations were calculated using the sum over r −6. Note that the conformational constraints were not used in CS-Rosetta calculations except to validate the structure by providing the statistics listed in this table
eOrder residue ranges [S(phi) + S(psi) > 1.8]. NMR structure using minimum constraints: 4–24, 30–33, 35–36, 45–46, 51–55, 63–64, 67–69; Conventionally-determined NMR structure: 4–10, 20–23, 30–32, 48–51, 53–54, 68–69; CS-Rosetta generated structure: 4–27, 29–37, 40–60, 62–66
fWell-defined core region: 5–9, 19–22, 50–56, 63–69
gBuried hydrophobic residues: V9, I21, V30, V32, I37, L45, V51, F53, A64, V67
Fig. 2Stereoview of the superimposition of AutoStructure-CNS structure for [1H-13C]-I(δ1)LV, 13C, 15N, 2H-enriched CspA determined by conventional automated analysis methods (blue) with 1mjc (red). a Backbone line representations of the 10 lowest energy conformers obtained from AutoStructure-CNS structure compared with 1mjc. b Ribbon diagram of the lowest energy conformer of AutoStructure-CNS structure versus 1mjc. c The packing of the hydrophobic residues (viz, V9, I21, V30, V32, I37, L45, V51, F53, A64, and V67) for the lowest energy conformer of AutoStructure-CNS structure versus 1mjc. The disordered N-terminal hexaHis expression tag is excluded from the analysis
Fig. 3Stereoview of the superimposition of the CS-Rosetta structure for 2H,13C,15N-enriched CspA (blue) with 1mjc (red). a Backbone line representations of the 10 lowest energy conformers obtained from CS-Rosetta structure compared with 1mjc. b Ribbon diagram of the lowest energy conformer of CS-Rosetta structure versus 1mjc. c The packing of the core hydrophobic residues (viz, V9, I21, V30, V32, I37, L45, V51, F53, A64, and V67) for the lowest energy conformer of CS-Rosetta structure versus 1mjc. The disordered N-terminal expression tag is excluded from the analysis