| Literature DB >> 31705614 |
Nina Kubatova1, Dennis J Pyper1, Hendrik R A Jonker1, Krishna Saxena1, Laura Remmel1, Christian Richter1, Sabine Brantl2, Elena Evguenieva-Hackenberg3, Wolfgang R Hess4, Gabriele Klug3, Anita Marchfelder5, Jörg Soppa6, Wolfgang Streit7, Maxim Mayzel8, Vladislav Y Orekhov8,9, Monika Fuxreiter10, Ruth A Schmitz11, Harald Schwalbe1.
Abstract
Proteins encoded by small open reading frames (sORFs) have a widespread occurrence in diverse microorganisms and can be of high functional importance. However, due to annotation biases and their technically challenging direct detection, these small proteins have been overlooked for a long time and were only recently rediscovered. The currently rapidly growing number of such proteins requires efficient methods to investigate their structure-function relationship. Herein, a method is presented for fast determination of the conformational properties of small proteins. Their small size makes them perfectly amenable for solution-state NMR spectroscopy. NMR spectroscopy can provide detailed information about their conformational states (folded, partially folded, and unstructured). In the context of the priority program on small proteins funded by the German research foundation (SPP2002), 27 small proteins from 9 different bacterial and archaeal organisms have been investigated. It is found that most of these small proteins are unstructured or partially folded. Bioinformatics tools predict that some of these unstructured proteins can potentially fold upon complex formation. A protocol for fast NMR spectroscopy structure elucidation is described for the small proteins that adopt a persistently folded structure by implementation of new NMR technologies, including automated resonance assignment and nonuniform sampling in combination with targeted acquisition.Entities:
Keywords: NMR spectroscopy; proteomics; small proteins; structural biology; structure-activity relationships
Mesh:
Substances:
Year: 2020 PMID: 31705614 PMCID: PMC7217052 DOI: 10.1002/cbic.201900677
Source DB: PubMed Journal: Chembiochem ISSN: 1439-4227 Impact factor: 3.164
Figure 1A comparison of the structures of DAKD and BK in free and BR‐bound forms determined by means of NMR spectroscopy.12
Figure 2Workflow for NMR spectroscopy structural investigations of small proteins, including synthesis/expression and purification steps. CD: circular dichroism.
Figure 3Left: SP‐22 small protein from Haloferax volcanii; middle: SP‐6 small protein from Methanosarcina mazei; right: SP‐10 small protein from Sinorhizobium fredii. A) CD spectra in phosphate buffer at pH 7. B) MALDI mass analysis of the purified small protein. C) The 1D 1H NMR spectra with an enlargement of the amide proton region. D) The 2D 1H,15N HSQC spectra at 600 MHz, 298 K. E) Left and middle: TALOS secondary‐structure prediction of the residues that are classified as “good”; right: expanded regions from the 2D 1H,15N HSQC spectrum for three signals showing differences in line width and intensity of the NMR signals, which are characteristic for molten globule‐type conformational behavior. These results agree perfectly with calculations by the Espritz method in free‐form propensity of structures SP‐22 (73.3 % folded), SP‐6 (0 % structured), and SP‐10 (61.3 % molten globule).
Structural analysis validated by means of NMR and CD spectroscopy of the small proteins screened in this study. The table is shown with an ascending molecular weight of small proteins. A full overview can be found in Table S1 A–D.
|
ID |
aa |
MW |
Microorganism |
CD and NMR |
|---|---|---|---|---|
|
|
|
[kDa] |
|
structural analysis |
|
SP‐1 |
14 |
1.6 |
|
unstructured |
|
SP‐2 |
14 |
1.8 |
|
unstructured |
|
SP‐3 |
18 |
1.9 |
|
unstructured |
|
SP‐5 |
23 |
2.6 |
|
unstructured |
|
SP‐6 |
23 |
2.8 |
|
unstructured |
|
SP‐7 |
27 |
2.9 |
|
unstructured |
|
SP‐8 |
28 |
3.1 |
|
unstructured |
|
SP‐9 |
29 |
3.1 |
|
unstructured |
|
SP‐10 |
31 |
3.7 |
|
molten globule |
|
SP‐11 |
38 |
4.0 |
|
molten globule |
|
SP‐12 |
39 |
4.5 |
|
molten globule |
|
SP‐13 |
43 |
4.8 |
|
molten globule |
|
SP‐19 |
51 |
5.7 |
|
structured |
|
SP‐21 |
59 |
6.5 |
|
structured |
|
SP‐22 |
60 |
6.7 |
|
structured |
|
SP‐23 |
61 |
6.9 |
|
molten globule |
|
SP‐24 |
61 |
7.1 |
|
structured |
|
SP‐25 |
61 |
7.2 |
|
molten globule |
|
SP‐27 |
78 |
8.1 |
|
partially structured |
Figure 4Structural characterization of SP‐22 small protein from H. volcanii. (PDB ID: 6Q2Z; BMRB ID: 34334). A) The amino acid sequence and the schematic representation of the secondary‐structure elements based on the solution‐state NMR spectroscopy structure. B) Sequential assignment for residues K3 to D23. The 3D HNCACB NMR spectrum was recorded at 700 MHz, 298 K; it contains 5 mm protein, 50 mm sodium phosphate buffer pH 7.5, 100 mm NaCl, 5 % D2O, 0.5 mm DSS. CA are shown in red and CB are highlighted in green. C) 1H,15N Best TROSY spectrum (600 MHz) of 5 mm small protein in 50 mm sodium phosphate buffer, pH 7.5, 100 mm NaCl, 95 % H2O/5 % D2O at 298 K. Backbone resonance assignment is indicated. D) Solution‐state NMR spectroscopy structure of SP‐22 protein. Ribbon representation of the best 20 structures is shown as a symmetrical dimer. The monomer consists of one α helix and four β‐sheet regions. Black and red represent two monomeric subunits. The figure was generated with PyMOL.50
Espritz NMR spectroscopy predictions of sequences in free form. Dynamic transitions induced by interactions were computed by using the FuzPred method with a reference to the Espritz NMR spectroscopy free form. Small proteins were combined in classes with respect to experimental secondary‐structure screening analysis.
|
|
ID |
aa |
Free form [%] |
Bound form [%] | ||
|---|---|---|---|---|---|---|
|
|
|
|
Structured |
Disordered |
Structured |
Disordered |
|
folded |
SP‐19 |
51 |
68.6 |
31.4 |
92.2 |
7.8 |
|
|
SP‐21 |
59 |
71.2 |
28.8 |
84.7 |
15.3 |
|
|
SP‐22 |
60 |
73.3 |
26.7 |
100 |
0 |
|
|
SP‐24 |
61 |
100 |
0 |
100 |
0 |
|
|
SP‐27 |
78 |
100 |
0 |
100 |
0 |
|
molten |
SP‐10 |
31 |
61.3 |
38.8 |
93.5 |
6.5 |
|
globule |
SP‐11 |
38 |
73.7 |
26.4 |
100 |
0 |
|
|
SP‐12 |
39 |
20.5 |
79.4 |
41 |
59 |
|
|
SP‐13 |
43 |
53.5 |
46.5 |
67.4 |
32.6 |
|
|
SP‐23 |
61 |
67.2 |
32.8 |
78.7 |
21.3 |
|
|
SP‐25 |
61 |
19.7 |
80.4 |
75.4 |
24.6 |
|
unstructured |
SP‐1 |
14 |
78.6 |
21.4 |
100 |
0 |
|
|
SP‐2 |
14 |
21.4 |
78.6 |
78.6 |
21.4 |
|
|
SP‐3 |
18 |
0 |
100 |
61.1 |
38.9 |
|
|
SP‐5 |
23 |
21.7 |
78.2 |
87 |
13 |
|
|
SP‐6 |
23 |
0 |
100 |
52.2 |
47.8 |
|
|
SP‐7 |
27 |
0 |
100 |
63 |
37 |
|
|
SP‐8 |
28 |
0 |
100 |
57.1 |
42.9 |
|
|
SP‐9 |
29 |
79.3 |
20.7 |
100 |
0 |