| Literature DB >> 32707977 |
Sergei Y Grishin1, Evgeniya I Deryusheva2, Andrey V Machulin3, Olga M Selivanova1, Anna V Glyakina1,4, Elena Y Gorbunova5, Leila G Mustaeva5, Vyacheslav N Azev5, Valentina V Rekstina6, Tatyana S Kalebina6, Alexey K Surin1,5,7, Oxana V Galzitskaya1,8.
Abstract
Structural S1 domains belong to the superfamily of oligosaccharide/oligonucleotide-binding fold domains, which are highly conserved from prokaryotes to higher eukaryotes and able to function in RNA binding. An important feature of this family is the presence of several copies of the structural domain, the number of which is determined in a strictly limited range from one to six. Despite the strong tendency for the aggregation of several amyloidogenic regions in the family of the ribosomal S1 proteins, their fibril formation process is still poorly understood. Here, we combined computational and experimental approaches for studying some features of the amyloidogenic regions in this protein family. The FoldAmyloid, Waltz, PASTA 2.0 and Aggrescan programs were used to assess the amyloidogenic propensities in the ribosomal S1 proteins and to identify such regions in various structural domains. The thioflavin T fluorescence assay and electron microscopy were used to check the chosen amyloidogenic peptides' ability to form fibrils. The bioinformatics tools were used to study the amyloidogenic propensities in 1331 ribosomal S1 proteins. We found that amyloidogenicity decreases with increasing sizes of proteins. Inside one domain, the amyloidogenicity is higher in the terminal parts. We selected and synthesized 11 amyloidogenic peptides from the Escherichia coli and Thermus thermophilus ribosomal S1 proteins and checked their ability to form amyloids using the thioflavin T fluorescence assay and electron microscopy. All 11 amyloidogenic peptides form amyloid-like fibrils. The described specific amyloidogenic regions are actually responsible for the fibrillogenesis process and may be potential targets for modulating the amyloid properties of bacterial ribosomal S1 proteins.Entities:
Keywords: amyloidogenic regions; antimicrobial properties; fibrillogenesis; ribosomal S1 proteins
Mesh:
Substances:
Year: 2020 PMID: 32707977 PMCID: PMC7432502 DOI: 10.3390/ijms21155199
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Percentages of the amyloidogenic regions in the full-sized ribosomal S1 proteins. The largest and smallest values are highlighted in bold.
| Amount of Structural S1 Domains | FoldAmyloid | Waltz | PASTA 2.0 | AggreScan | ||||
|---|---|---|---|---|---|---|---|---|
| % Amyloidogenic Regions | Number of Sequences | % Amyloidogenic Regions | Number of Sequences | % Amyloidogenic Regions | Number of Sequences | % Amyloidogenic Regions | Number of Sequences | |
| 1S1 | 24 ± 10 | 12 | 15 ± 13 | 7 | 29 ± 19 | 12 |
| 12 |
| 2S1 | 17 ± 6 | 9 | 9 ± 6 | 6 | 16 ± 7 | 9 | 39 ± 10 | 9 |
| 3S1 | 16 ± 3 | 26 | 5 ± 3 | 26 | 10 ± 4 | 26 | 27 ± 4 | 26 |
| 4S1 | 12 ± 2 | 460 | 6 ± 2 | 458 | 13 ± 4 | 460 | 28 ± 6 | 459 |
| 5S1 | 14 ± 5 | 16 | 5 ± 3 | 15 | 10 ± 4 | 16 | 31 ± 9 | 15 |
| 6S1 | 12 ± 3 | 851 |
| 816 | 8 ± 3 | 851 | 29 ± 5 | 848 |
Figure 1Amyloidogenic propensities of ribosomal proteins within the family of 30S ribosomal S1 proteins according to the FoldAmyloid program. (a) Violin plot of the frequency distribution of amyloidogenic regions in the ribosomal S1 proteins. Median: white dot; average: blue stripe; boxplot ends: first and third quantiles; thin black line: +/− the 1.5 fold interquartile range of values and outliers; probability density: green field. (b) Average percentage of amyloidogenic regions within each domain as well as for the full-sized S1 proteins.
Percentages of the amyloidogenic regions in the separate S1 domains. The largest and smallest values are highlighted in bold.
| Number of Structural S1 Domains | FoldAmyloid | Waltz | PASTA2.0 | Aggrescan | ||||
|---|---|---|---|---|---|---|---|---|
| % Amyloidogenic Regions | Number of Sequences | % Amyloidogenic Regions | Number of Sequences | % Amyloidogenic Regions | Number of Sequences | % Amyloidogenic Regions | Number of Sequences | |
| 1-1 |
| 12 |
| 5 |
| 12 |
| 12 |
| 2-1 | 21 ± 9 | 8 | 14 ± 10 | 5 | 28 ± 17 | 8 | 47±13 | 9 |
| 2-2 | 23 ± 8 | 9 | 15 ± 8 | 5 | 31 ± 13 | 5 | 54 ± 17 | 9 |
| 3-1 | 15 ± 4 | 25 | 13 ± 4 | 23 |
| 20 | 35 ± 13 | 26 |
| 3-2 | 22 ± 8 | 26 | 15 ± 5 | 5 | 22 ± 10 | 8 |
| 26 |
| 3-3 | 27 ± 6 | 26 | 13 ± 6 | 10 | 33 ± 10 | 22 | 56 ± 7 | 26 |
| 4-1 | 13 ± 6 | 453 | 12 ± 4 | 140 | 25 ± 12 | 252 | 28 ± 16 | 459 |
| 4-2 | 19 ± 6 | 459 | 13 ± 5 | 57 | 24 ± 14 | 294 | 36 ± 9 | 459 |
| 4-3 | 15 ± 4 | 456 | 14 ± 3 | 343 | 42 ± 20 | 355 | 49 ± 6 | 459 |
| 4-4 | 17 ± 5 | 460 | 14 ± 6 | 387 | 23 ± 9 | 331 | 46 ± 7 | 459 |
| 5-1 | 18 ± 7 | 15 | 16 ± 9 | 4 | 28 ± 11 | 11 | 38 ± 15 | 15 |
| 5-2 | 18 ± 9 | 15 | 12 ± 4 | 9 | 17 ± 8 | 4 | 38 ± 10 | 15 |
| 5-3 | 23 ± 11 | 15 | 14 ± 3 | 4 | 31 ± 16 | 13 | 43 ± 14 | 15 |
| 5-4 | 17 ± 7 | 16 | 13 ± 8 | 9 | 17 ± 5 | 7 | 42 ± 10 | 15 |
| 5-5 | 18 ± 8 | 16 | 11 ± 7 | 7 | 27 ± 12 | 8 | 38 ± 12 | 15 |
| 6-1 |
| 637 |
| 581 | 32 ± 8 | 715 | 35 ± 11 | 847 |
| 6-2 | 14 ± 8 | 847 | 14 ± 6 | 346 | 20 ± 11 | 159 | 38 ± 11 | 848 |
| 6-3 | 21 ± 5 | 851 | 13 ± 7 | 166 | 27 ± 16 | 329 | 43 ± 7 | 848 |
| 6-4 | 14 ± 6 | 850 | 13 ± 7 | 166 | 20 ± 7 | 457 | 43 ± 11 | 848 |
| 6-5 | 19 ± 6 | 848 | 13 ± 6 | 239 | 15 ± 8 | 266 | 39 ± 9 | 848 |
| 6-6 | 13 ± 6 | 524 | 13 ± 7 | 176 | 19 ± 10 | 310 | 25 ± 13 | 809 |
Figure 2Distribution of amyloidogenic regions in the separate Table S1 domains according to the FoldAmyloid, Waltz, PASTA 2.0 and Aggrescan programs. The color scale corresponds to the frequency distribution of the amyloidogenic regions along the protein chain (aligned protein sequences). The size of each structural domain is about 70 amino acid residues (a.a.).
Figure 3Distribution of amyloidogenic regions (orange) and secondary structures (green for β-strands; blue for α-helix) in the separate S1 domains according to FoldAmyloid for (a) one-domain S1 proteins, (b) two-domain S1 proteins (2-1 and 2-2) and (c) three-domain S1 proteins (3-1, 3-2 and 3-3). The color scale corresponds to the frequency distribution of residues along the protein chain (aligned protein sequences). The size of each structural domain is about 70 a.a. For the prediction of the secondary structure for each sequence in our dataset, the Jpred4 program [26] was used.
Sequence logos of amyloidogenic regions in S1 domains.
| Number of S1 Structural Domains | Position of the Amyloidogenic Regions in S1 Domains (a.a.)* | |||
|---|---|---|---|---|
| 10–15 | 25–30 | 55–60 | 65–70 | |
| 1-1 | VXRY | (F/K)(G/C)(E/Y)(L/Y) | (V/E)KVL | L(L/V)(L/V)SFK |
| 2-1 | G(V/A/L)X(V/A) | (F/R)G(F/V)YP | EV(K/X)VL | R/G(L/V)(V/Y)LS |
| 2-2 | G(A/Y)EV(R/V)(L/Y) | (G/A)(L/F)(V/L)(H/P) | (V/E)(V/L)X(F/V)KV | (L/V)(S/H)X(K/R) |
| 3-1 | (E/H)(F/V)(F/L)(I/V) | (Q/M)(L/V)ILS | ||
| 3-2 | LRGFIP | (L/I)(P/T)(V/L)(K/A)FL | (K/R)LVLS | |
| 3-3 |
| GLLHIS |
| |
| 4-1 | (E/Q)(V/A)L(V/L)D | V(I/L)(P/T)(S/L)R | (V/I/L)(E/D) | LSK(K/R) |
| 4-2 | L(I/V)(V/L)X(D/E) | (M/L)V(D/E)(M/T/H)R | (N/R)(V/E) (I/V)LSRR | |
| 4-3 | (E/Q) | |||
| 4-4 |
| LVH(I/V)S | ||
| 5-1 | VXVDI | (G/A)X(V/I/L)PL | DX | GX(I/Y)(L/Y)LSR |
| 5-2 | (I/V)X (G/V)X(I/V) | G(I/V)R(G/A)F(M/L) | (N/R)(V/I)(V/I)(L/Y)LSR | |
| 5-3 |
| (E/R)( | VX | |
| 5-4 | (E/D)GL(I/V)H(I/V) | VE | ||
| 5-5 | E | (E/D)GL(I/V)H(I/V) | EXX | |
| 6-1 | (G/A)XV(I/V) | (V/A)(L/Y)(V/I)(D/N)(A/Y) | DX | |
| 6-2 | GX(I/V) | GETV(D/E) | EFK(V/I)(I/L)K | |
| 6-3 | YG(A/V)D | LH(I/V)T | ||
| 6-4 | YG(A/C)FV | EGL(V/Y)H(V/I) | (V/E) | |
| 6-5 |
|
|
| |
| 6-6 | VXGX(I/V) | VEAK(V/I/L) | RX(V/I/L)XLS(V/I)(K/R) | |
* - approximate locations of region within structural domains; X—any amino acid residue; conserved amino acid residues are highlighted in bold.
List of amyloidogenic peptides for chemical synthesis.
| Sequences, Company | Localization in S1 Sequence, Domain (D), Species | Peptides Mw * | Peptides pI ** |
|---|---|---|---|
| Peptide 1, VVEGTVVEVT (V10T)T, 1 | (211–220 a.a), D3, | 1031.2 | 3.5 |
| Peptide 2, DFGVFVNLG (D9G)T, 2 | (221–229 a.a.), D3, | 967.1 | 3.8 |
| Peptide 3, VTDFGVFVEI (V10I)T, 1 | (391–400 a.a.), D5, | 1125.3 | 3.5 |
| Peptide 4, EMEVVVLNID (E10D)T, 2 | (430–439), D5, | 1160.3 | 3.4 |
| Peptide 1, IVRGVVVAID (I10D)E, 3 | (23–32 a.a.), D1, | 1040.3 | 6.3 |
| Peptide 2, DEITVKVLKF (D10F)E, 3 | (239–248 a.a.), D3, | 1191.4 | 6.3 |
| Peptide 3, TDYGCFVEIE (T10E)E, 3 | (288–297 a.a.), D4, | 1175.3 | 3.4 |
| Peptide 4, VVNVGDVVEV (V10V)E, 3 | (321–330 a.a.), D4, | 1028.2 | 3.5 |
| Peptide 5, DFGIFIGLDG (D10G)E, 3 | (376–385 a.a.), D5, | 1053.2 | 3.5 |
| Peptide 6, VHLSDISWNV (V10NV)E,3 | (391–400 a.a.), D5, | 1169.3 | 5.5 |
| Peptide 7, EIAAVVLQVD (E10D)E, 3 | (414–423 a.a.), D5, | 1055.6 | 3.5 |
1—The Branch of the Institute of Bioorganic Chemistry, Russian Academy of Sciences, Pushchino, Russia. 2—ELABSCIENCE, China (intermediary—“Immunotex”, LLC NPO, Stavropol, Russia). 3—IQChemical, Saint-Petersburg, Russia. * and **—molecular weight and isoelectric point were calculated by the ExPASy server (https://web.expasy.org/cgi-bin/peptide_mass/peptide-mass.pl) [47]. T— peptide from the sequence of the ribosomal S1 protein from T. thermophilus. E— peptide from the sequence of the ribosomal S1 protein from E. coli.
Figure 43D structures of the S1 domains from E. coli. Experimentally studied amyloidogenic regions (position and amino acid sequence are given) are highlighted with red color. Domain 1–PDB (Protein Data Bank, https://www.rcsb.org/) code: 2MFI; domain 4–PDB code: 2KHI; domain 5–PDB code: 5XQ5. 3D structure of domain 3 was predicted using the Robetta server. Scale bar = 100 nm.
Figure 53D structures of the S1 domains from T. thermophilus. Experimentally studied amyloidogenic regions (position and sequence are given) are highlighted with red color. 3D structures of domains 3 and 5 were predicted using the Robetta server. Scale bar = 100 nm.