| Literature DB >> 19074506 |
Dirk Stratmann1, Carine van Heijenoort, Eric Guittet.
Abstract
MOTIVATION: A prerequisite for any protein study by NMR is the assignment of the resonances from the (15)N-(1)H HSQC spectrum to their corresponding atoms of the protein backbone. Usually, this assignment is obtained by analyzing triple resonance NMR experiments. An alternative assignment strategy exploits the information given by an already available 3D structure of the same or a homologous protein. Up to now, the algorithms that have been developed around the structure-based assignment strategy have the important drawbacks that they cannot guarantee a high assignment accuracy near to 100%.Entities:
Mesh:
Substances:
Year: 2008 PMID: 19074506 PMCID: PMC2642640 DOI: 10.1093/bioinformatics/btn638
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.(a) Simplices and (simple) polytopes; (b) multiple neighbours; (c) generalized polytope.
Fig. 2.(a) Simple example of polytope matching possibilities: the experimental NOE-network (in blue) is matched onto the theoretical distance-network (in green) in different ways. (b) Example of an assignment table.
Fig. 3.Quality of the assignment represented by the SAR for all HSQC peaks of EIN. The peaks are ordered by increasing SARs. The crosses correspond to case 4 in Table 1. The overlap check (see Section 2) has been deactivated for this case, so that the runtime remained reasonable (18 h). The results of case 4 are also represented in Figures 4 and 5. The circles correspond to case 6 in Table 1. The activated overlap check resulted in a higher number of peaks with a low SAR value at the expense of a longer runtime (44 h).
Fig. 4.Assignment results on EIN. (a, b) The SAR values are mapped on the NMR structure using the correct assignment and the indicated color code. Unique assignments are shown in black [NMR structure 1EZA (Garrett et al., 1997b)]. (a) Without overlap check (case 4 in Table 1 and crosses in Fig. 3) (b) With overlap check (case 6 in Table 1 and circles in Fig. 3). Thanks to the overlap check, the SAR values of the peaks corresponding to the longest helix of EIN are reduced significantly. The NOE fragment of the longest helix is disconnected from the two largest fragments shown in (c). (c) The two largest disconnected NOE fragments (NMR structure 1EZA is shown). All assignment possibilities of one fragment are colored by the color of the fragment. (d) Theoretical contact graph. The 1034 theoretical contacts are represented on the X-ray structure 1ZYM (Liao et al., 1996) by blue, green and red lines corresponding to the three distance classes, short (d<4.5 Å), medium (d<6 Å) and long (d<7.5 Å), respectively. The NOE connectivities are represented by black lines in (a–c).
Tested T values on EIN using only NOE data
| No. | Δ | Check overlap | Runtime | Status | Errors | Accuracy | |||
|---|---|---|---|---|---|---|---|---|---|
| 1 | 5 | 1 | No | 4 days | Not finished | 5 | 24 | 100 | |
| 2 | 4 | " | No | 2.5 days | Not finished | 8 | 30 | 100 | |
| 3 | 3 | " | No | 6 days | Not finished | 11 | 32 | 100 | |
| 4* | 2 | " | No | 18 h | Finished | One swap:207↔208 | 28 | 53 | 99.2 |
| 5 | 1 | " | No | 1 h | Hole | ||||
| 6 | 2 | " | Yes | 44 h | Finished | One swap:207↔208 | 28 | 69 | 99.2 |
The optimization of the T parameter (see NOE outliers-part in Section 2) using only NOE data for EIN is shown in this table. The theoretical distance thresholds for the three NOE classes are here d=(4.5 Å,6 Å, 7.5 Å, yielding N=(391, 320, 323) distances in each class (short, medium, long) using the X-ray structure 1ZYM. The number of experimental NOEs is here N=(36, 208, 163) for strong, medium and weak NOEs, respectively. The number of HSQC peaks is here N=243. Columns: T: maximum number of permitted NOE outliers for an arbitrary matching. Δ d: the theoretical distance range [d−Δd, d] in Ångstrom for which a NOE is considered as outlier. Check overlap: indicates whether the overlap between the disconnected fragments is tested during the search. Runtime: the calculation time required for the presented result. Status: ‘hole’ indicates the presence of peaks that have no assignment possibility left in the assignment table; ‘finished’ and ‘not finished’ indicates whether the run converged or not for the given runtime. N, number of uniquely assigned peaks. N, number of peaks having a SAR-value below 10 Å including the uniquely assigned peaks. The optimized T parameter that has been retained is marked by a asterisk.
Fig. 5.EIN-Hpr interaction site. Only the X-ray structure 1ZYM (Liao et al., 1996) of EIN is shown here. (a) In red are shown the residues corresponding to the peaks with a significant chemical shift perturbation (CSP) due to the interaction EIN-Hpr (Garrett et al., 1997b). This plot require the knowledge of the correct assignment possibility for each peak. In comparison, the plot (b) is using the assignment ensemble shown in Figure 4a (case 4 in Table 1, without overlap check). The assignment possibilities of the same perturbed peaks as in (a) are plotted in (b). The unique assignments are plotted in black, while the assignment possibilities of all perturbed peaks with a SAR value below 30 Å are plotted in red.