| Literature DB >> 26121406 |
Yuefeng Tang1, Yuanpeng Janet Huang1, Thomas A Hopf2, Chris Sander3, Debora S Marks4, Gaetano T Montelione5.
Abstract
Accurate determination of protein structure by NMR spectroscopy is challenging for larger proteins, for which experimental data are often incomplete and ambiguous. Evolutionary sequence information together with advances in maximum entropy statistical methods provide a rich complementary source of structural constraints. We have developed a hybrid approach (evolutionary coupling-NMR spectroscopy; EC-NMR) combining sparse NMR data with evolutionary residue-residue couplings and demonstrate accurate structure determination for several proteins 6-41 kDa in size.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26121406 PMCID: PMC4521990 DOI: 10.1038/nmeth.3455
Source DB: PubMed Journal: Nat Methods ISSN: 1548-7091 Impact factor: 28.547
Experimental data and comparisons of EC-NMR structures with benchmark reference structures.
| Protein Name and | N | NOE | 15N-1H | No. | RMSD (Å) | PDB ID and |
|---|---|---|---|---|---|---|
| 64 / 6.3 | HN- HN only | None | 10,962 | 1.5 | 2K2P NMR | |
| 66 / 7.3 | HN- HN only | 2 | 4,410 | 2.2 | 2K5N NMR | |
| 84 / 9.7 | HN- HN only | 2 | 4,964 | 1.9 | 2KAN NMR | |
| 134 / 15.0 | HN- HN only | 1 | 2,620 | 2.0 | 2LCG NMR | |
| 141 / 15.0 | HN- HN only | 2 | 10,296 | 1.7 | 2K1S NMR | |
| 166 / 18.9 | HN- HN | None | 6,669 | 2.6 | 5P21 Xray | |
| Slr1183 P74712_SYNY3 | 194/ 21.3 | HN- HN, Me- | 2 | 45,708 | 2.1 | 3MER Xray |
| 370 / 40.7 | HN- HN | 1 | 12,416 | |||
| NTD (1–112; 259–329) | 1.6 | 1DMB Xray | ||||
| CTD (113–258; 330–370) | 1.9 | 1DMB Xray | ||||
| Full-length (1–370) | 2.8 | 1DMB Xray |
Number of residues (N) and molecular weight (MW) of the protein construct studied by NMR, excluding affinity purification tags.
HN-HN NOESY cross peak data include NOEs between backbone and sidechain amide HN resonances. For P74712_SYNY3 and MALE_ECOLI, additional HN-Me NOESY cross peak data obtained for uniformly 15N,13C,2H-enriched samples with 13CH3 labeling of Ile(δ), Leu, and Val methyls were also included. As only restraint lists are available for H-Ras oncogene protein p21, RASH_HUMAN, NOESY peak lists were back-calculated from the experimental NMR constraint list (2LCF) and chemical shift data (BMRB ID 17610).
All experimental 15N-1H RDC data were measured in the laboratory of James Prestegard.
Number of non-redundant sequences in multiple sequence alignment used to generate ECs
Residues ranges for superimpositions and rmsd calculations: 2–63
Residues ranges for superimpositions and rmsd calculations: 1–64
Residues ranges for superimpositions and rmsd calculations: 7–78
Residues ranges for superimpositions and rmsd calculations: 1–29, 36–58, 62–135
Residues ranges for superimpositions and rmsd calculations: 15–39,41–76,79–120,127–141
Residues ranges for superimpositions and rmsd calculations: 1–29, 39–60, 64–166
Residues ranges for superimpositions and rmsd calculations: 20–37, 41–134, 147–172, 185–196. Residues 1–15 and 175–183 are not observed in the crystal structure.
Residues ranges for superimpositions and rmsd calculations: 2–12,14–112,259–329
Residues ranges for superimpositions and rmsd calculations: 115–117,125–142,144–172, 175–218, 221–227, 247–258, 330–370. Interfacial residues 233–240 are exchange-broadened, precluding NMR assignments. The sugar binding site of MBP (1DMB) includes residues: K42, D65, E111, E153, Y155, E172, W230, W340, and R344
Residues ranges for superimpositions and rmsd calculations: 2–12,14–112,259–329, 115–117,125–142,144–172, 175–218, 221–227, 247–258, 330–370. Interfacial residues 233–240 are exchange-broadened, precluding NMR assignments.
Fig. 1The EC-NMR process
Top panel. EC information is interpreted together with ambiguous NOESY peak list data. Inconsistent ECs (dashed red contacts), NOESY noise peaks (dashed blue contacts), and ambiguous assignments of NOESY cross peaks (dotted blue contacts) are identified and/or resolved, and additional residue pair contacts consistent with the NOE and EC data are discovered. Performance is assessed by comparing the resulting EC-NMR structure (green) with a reference X-ray crystal or NMR structure (grey). Each of the lower three horizontal panels illustrates the process of EC-NMR analysis using sparse NMR data for proteins with MW of 19 – 41 kDa. Red contacts – initial EC residue-pair contacts. Blue contacts – contacts indicated by unambiguous NOESY peak assignments obtained by the ASDP program[29]. Green contacts – final Residue Pair Contacts (RPCs) resulting from simultaneous analysis of EC and NMR data. Grey contacts – contacts in the reference X-ray crystal structure. Green ribbon structures – final EC-NMR structures. Grey ribbons – reference X-ray crystal structures. Box plots show the RMS deviation to reference structures for backbone atoms of structures generated with EC data alone (red), sparse NMR data alone (blue), and the hybrid EC-NMR method (green). In box plots, the box in the middle indicates quartiles and median scores; the “whiskers” show the largest/smallest observation that falls within a distance of 1.5 times the nearest quartile; any additional points are shown as outliers. The EC-NMR protocol provides structures with backbone accuracy of ~ 2 Å (dashed grey line) relative to the corresponding X-ray crystal structures.
Fig. 2Performance of the EC-NMR method
(a) Number of long-range residue pair contacts (i.e., between residue pairs (i, j) where |i – j| ≥ 5) for the initial EC list (white histograms), the initial unambiguous sparse NOESY data (grey), and the final EC-NMR residue contact list (black). For smaller (< 150 residues, grey-open) proteins, the NMR data include only HN-HN NOEs, while for larger proteins (> 150 residues, gray-hashed) the NMR data also include NOEs to Val, Leu, and Ile(δ) methyl protons. Inset – The Precision of contacts, relative to the corresponding reference structures, is higher for final Residue Pair Contact list (solid histograms) than for the initial EC list (open histograms), as false-positives are identified and removed by the EC-NMR algorithm. (b,c) Comparison of buried sidechain conformations in EC-NMR structures and the corresponding X-ray crystal structure. (d) Comparison of backbone RMSD and buried sidechain χ1 rotamers, relative to crystal structures. EC-NMR structures were determined using exclusively the experimental NMR data (no RDC data for p21 H-Ras, two RDC alignment tensors for P74712, and one RDC alignment tensor for MBP, light green). Results obtained after adding additional RDC data calculated from the reference structure are also shown for comparison (EC-NMR*, two hydrodynamic alignments of p21 H-Ras, or a second hydrodynamic alignment for MBP, dark green). The size of the circles corresponds to the percentage of core sidechains with χ1 rotamers different from that observed in the crystal structure; smaller circles indicate a better match of sidechain conformations to the crystal structure.