| Literature DB >> 32002710 |
Takuma Kasai1,2, Shunsuke Ono3,4, Seizo Koshiba5,6, Masayuki Yamamoto5,6, Toshiyuki Tanaka7, Shiro Ikeda8, Takanori Kigawa9,10.
Abstract
Signal overlapping is a major bottleneck for protein NMR analysis. We propose a new method, stable-isotope-assisted parameter extraction (SiPex), to resolve overlapping signals by a combination of amino-acid selective isotope labeling (AASIL) and tensor decomposition. The basic idea of Sipex is that overlapping signals can be decomposed with the help of intensity patterns derived from quantitative fractional AASIL, which also provides amino-acid information. In SiPex, spectra for protein characterization, such as 15N relaxation measurements, are assembled with those for amino-acid information to form a four-order tensor, where the intensity patterns from AASIL contribute to high decomposition performance even if the signals share similar chemical shift values or characterization profiles, such as relaxation curves. The loading vectors of each decomposed component, corresponding to an amide group, represent both the amino-acid and relaxation information. This information link provides an alternative protein analysis method that does not require "assignments" in a general sense; i.e., chemical shift determinations, since the amino-acid information for some of the residues allows unambiguous assignment according to the dual selective labeling. SiPex can also decompose signals in time-domain raw data without Fourier transform, even in non-uniformly sampled data without spectral reconstruction. These features of SiPex should expand biological NMR applications by overcoming their overlapping and assignment problems.Entities:
Keywords: Combinatorial selective labeling; Non-uniform sampling (NUS); Relaxation analysis; Spectral deconvolution; Stable isotope encoding (SiCode); Tensor factorization
Mesh:
Substances:
Year: 2020 PMID: 32002710 PMCID: PMC7080692 DOI: 10.1007/s10858-019-00295-9
Source DB: PubMed Journal: J Biomol NMR ISSN: 0925-2738 Impact factor: 2.835
Fig. 1Encoding and decoding amino acid information for amide signals. a The labeling pattern (“codebook”) used in this study. Each amino acid is represented as a combination (a “codeword”) of the isotope labeling ratios of three labeled samples. The labeling ratios of 13C and 15N are indicated as percentages. b A set of 2D spectra to form a three-order tensor. A small region-of-interest (ROI) that contains the (E)V17 signal was extracted. c Loading vectors of the signal. From left to right, the loading vectors along the 1H, 15N, and SiCode dimensions are shown as black lines and circles. The best fits to the extracted amino-acid information (Eqs. 7and8) are shown as red triangles
Fig. 2Decomposition of simulated overlapping signals. a Preparation of the artificial dataset with overlapping signals. ROIs with the same size, including the (R)G75 (indicated by blue crosses) and (I)Q62 (indicated by red crosses) signals, were extracted and merged by element-wise tensor addition. Only the 15N HSQC spectrum of sample 1 is shown. b Illustration of four-order tensor formation with a set of 2D spectra. c Loading vectors along the 1H (left) and 15N (right) dimensions. The first (f = 1) component is shown as black lines and circles, and the second (f = 2) component is shown as blue lines and squares. d Loading vectors along the SiCode (left) and relaxation (right) dimensions. The first (top panels) and second (bottom panels) components are shown. Red triangles and lines indicate the best fits for the extraction of amino acid information and the exponential decays
Amino acids and relaxation parameters of Ub3A obtained from overlapping signals by SiPex, compared with those obtained by the conventional method with the uniformly labeled sample
| Residue | SiPex | Conventional | |||||||
|---|---|---|---|---|---|---|---|---|---|
| i | i-1 | R1 [s−1] | R2 [s−1] | NOE | R1 [s−1] | R2 [s−1] | NOE | ||
Simulated overlapping | (I)Q62 | Q | I | 1.50 | 5.83 | 0.63 | 1.50 | 6.46 | 0.67 |
| (R)G75 | G | R | 1.39 | 3.75 | 0.02 | 1.45 | 3.77 | 0.13 | |
Actual overlapping | (L)E16 | E | L | 1.59 | 7.08 | 0.74 | |||
| (N)V26 | N | V | 1.75 | 6.87 | 0.80 | ||||
Fig. 3Analysis of non-uniformly sampled time domain data. a Decomposition of the ROI in which the 15N dimension is the Fourier transformed frequency domain. Black and red lines show positive and negative contours, respectively. The leftmost panel is the observed data and the right five panels are the decomposed components. Only the 15N HSQC spectrum of sample 1 is shown. b Decomposition of the same ROI as in (a), but the 15N dimension is the time-domain raw data. Both the real and imaginary parts of the complex data are shown. c A simulation of NUS in the 15N dimension, by extracting 8 out of the 64 complex points used in (b). d Loading vectors along the SiCode (left) and relaxation (right) dimensions. From top to bottom, five decomposed components are shown. The markers and line styles are the same as in Fig. 2d
Amino acids and relaxation parameters of Ub3A obtained with frequency- or time-domain spectra
| Residue | i | i-1 | R1 [s−1] | R2 [s−1] | NOE | |
|---|---|---|---|---|---|---|
Conventional Frequency domain | (G)S-2 | 1.21 | 3.34 | -0.30 | ||
| (M)Q2 | 1.68 | 7.22 | 0.79 | |||
| (E)D52 | 1.49 | 7.90 | 0.79 | |||
| (T)L56 | 1.87 | 7.80 | 0.82 | |||
| (G)G76 | 1.23 | 3.33 | -0.15 | |||
SiPex Frequency domain | (G)S-2 | S | G | 1.09 | 3.04 | -0.53 |
| (M)Q2 | Q | M | 1.67 | 7.50 | 0.85 | |
| (E)D52 | D | E | 1.42 | 7.97 | 0.79 | |
| (T)L56 | L | T | 1.75 | 7.67 | 0.80 | |
| (G)G76 | G | G | 1.20 | 3.74 | -0.13 | |
SiPex Time domain Full sampling | (G)S-2 | S | G | 1.09 | 3.07 | -0.51 |
| (M)Q2 | Q | M | 1.67 | 7.42 | 0.84 | |
| (E)D52 | D | E | 1.42 | 7.92 | 0.79 | |
| (T)L56 | L | T | 1.75 | 7.58 | 0.80 | |
| (G)G76 | G | G | 1.20 | 3.71 | -0.13 | |
SiPex Time domain NUS | (G)S-2 | S | G | 1.11 | 3.09 | -0.44 |
| (M)Q2 | Q | M | 1.69 | 7.41 | 0.85 | |
| (E)D52 | D | E | 1.43 | 7.90 | 0.78 | |
| (T)L56 | L | T | 1.76 | 6.93 | 0.82 | |
| (G)G76 | G | G | 1.21 | 3.84 | -0.09 |
Fig. 4Analysis of a crowded spectral region of an intrinsically disordered protein. a15N HSQC spectrum of sample 1 of the analyzed spectral region. Four subregions for individual tensor decomposition runs are shown in the blue box and numbered. The decomposed signals are shown by red crosses with amino-acid information by SiPex. b–d Decomposition of subregion 1. b The observed spectrum (left panel) was decomposed into two components (right two panels). Only the 15N HSQC spectrum of sample 1 is shown. c Loading vectors along the 1H (left) and 15N (right) dimensions. The markers and line styles are the same as in Fig. 2c. b Loading vectors along the SiCode (left) and relaxation (right) dimensions. The markers and line styles are the same as in Fig. 2d
Amino acids and relaxation parameters of Nrf2 Neh2 obtained by SiPex
| Residue | i | i-1 | R1 [s−1] | R2 [s−1] | NOE | |
|---|---|---|---|---|---|---|
| Overlapping | (D)M17 | M | D | 1.28 | 9.2 | 0.44 |
| (Y)E47 | E | Y | 1.10 | 22.5 | 0.66 | |
| (K)E57 | E | K | 1.03 | 20.4 | 0.59 | |
| (R)Q59 | Q | R | 1.12 | 27.7 | 0.65 | |
| (E)Q66 | Q | E | 1.10 | 20.3 | 0.51 | |
| Isolated | (D)L19 | L | D | 1.40 | 8.2 | 0.45 |