| Literature DB >> 35479026 |
Adewale Olamoyesan1, Dale Ang1, Alison Rodger1.
Abstract
Many proteins and peptides are increasingly being recognised to contain unfolded domains or populations that are key to their function, whether it is in ligand binding or material assembly. We report an approach to determine the secondary structure for proteins with suspected significant unfolded domains or populations using our neural network approach SOMSpec. We proceed by derandomizing spectra by removing fractions of random coil (RC) spectra prior to secondary structure fitting and then regenerating α-helical and β-sheet contents for the experimental proteins. Application to bovine serum albumin spectra as a function of temperature proved to be straightforward, whereas lysozyme and insulin have hidden challenges. The importance of being able to interrogate the SOMSpec output to understand the best matching units used in the predictions is illustrated with lysozyme and insulin whose partially melted proteins proved to have significant βII content and their CD spectrum looks the same as that for a random coil. This journal is © The Royal Society of Chemistry.Entities:
Year: 2021 PMID: 35479026 PMCID: PMC9036635 DOI: 10.1039/d1ra02898g
Source DB: PubMed Journal: RSC Adv ISSN: 2046-2069 Impact factor: 4.036
Fig. 2SOMSpec output for BSA (a) at 20 °C with 0% RC. (b) SOMSpec spectral NRMSDs for 70 °C BSA with different degrees of randomisation and output for 70 °C with (c) −20% RC, (d) −30% RC, and (e) −40% RC. The x and y axis labels indicate the self-organising map size and node positions for each run.
Fig. 3SOMSpec output for lysozyme at 20 °C (a) 0% RC, (b) −60% RC and at 100 °C (c) 0% RC, (d) −50% RC. The x and y axis labels indicate the self-organising map size.
Fig. 1CD spectra of (a) BSA and (b) lysozyme over the temperature range of 20 to 100 °C in 10 °C steps. Data were collected on 0.1 mg mL−1 samples in water. (c) BSA 100° experimental spectrum overlaid with the SELCON and SOMSpec best predicted spectra.
Spectral NRMSDs and structure predictions from BSA CD spectra as a function of temperature derived using derandomized spectra and structure predictions for the original spectra (where best fit requires >0% RC to be added). Column 2 indicates the NRMSD of the identified best fits. Columns 3 indicates the RC percentage added with the next 3 columns indicating the secondary structure of that modified spectrum (note these columns are empty for 0% RC added). The final columns indicate the secondary content of the original protein (0% RC) when the RC coil content has been added back in
| BSA (°C) | Best NRMSD | RC added | Derandomized | Regenerated original protein | ||||
|---|---|---|---|---|---|---|---|---|
| α-Helix | β-Sheet | Other | α-Helix | β-Sheet | Other | |||
| 20 | 0.009 | 0% | 0.76 | 0.00 | 0.24 | |||
| 30 | 0.006 | 0% | 0.76 | 0.00 | 0.24 | |||
| 40 | 0.016 | 0% | 0.73 | 0.01 | 0.26 | |||
| 50 | 0.015 | 0% | 0.75 | 0.00 | 0.25 | |||
| 60 | 0.017 | 20% | 0.77 | 0.00 | 0.23 | 0.62 | 0.00 | 0.38 |
| 70 | 0.024 | 30% | 0.68 | 0.01 | 0.31 | 0.48 | 0.01 | 0.52 |
| 80 | 0.022 | 50% | 0.77 | 0.00 | 0.23 | 0.39 | 0.00 | 0.62 |
| 90 | 0.023 | 50% | 0.66 | 0.02 | 0.32 | 0.33 | 0.01 | 0.66 |
| 100 | 0.026 | 60% | 0.61 | 0.04 | 0.35 | 0.24 | 0.02 | 0.74 |
Spectral NRMSDs and structure predictions from lysozyme CD spectra as a function of temperature derived using the original data and derandomized spectra (where best fit requires >0% RC to be added). Column identity is as for Table 1. Bold indicates preferred values where more than one option gave a reasonable fit as discussed in the text
| Lysozyme (°C) | Best NRMSD | RC added | Derandomized | Regenerated original protein | ||||
|---|---|---|---|---|---|---|---|---|
| α-Helix | β-Sheet | Other | α-Helix | β-Sheet | Other | |||
| 20 | 0.021 (0.019) | 0% |
|
|
| |||
| (60%) | (0.85) | (0) | (0.15) | 0.34 | 0.00 | 0.66 | ||
| 30 | 0.021 (0.019) | 0% |
|
|
| |||
| (60%) | (0.85) | (0) | (0.15) | 0.34 | 0.00 | 0.66 | ||
| 40 | 0.022 (0.019) | 0% |
|
|
| |||
| (60%) | (0.85) | (0) | (0.15) | 0.34 | 0.00 | 0.66 | ||
| 50 | 0.023 (0.020) | 0% |
|
|
| |||
| (60%) | (0.85) | (0) | (0.15) | 0.34 | 0.00 | 0.66 | ||
| 60 | 0.0234 | 0% |
|
|
| |||
| (0.0237) | (60%) | (0.86) | (0) | (0.14) | (0.34) | (0) | (0.66) | |
| 70 | 0.023 | 10% | 0.39 | 0.15 | 0.46 |
|
|
|
| 80 | 0.038 | 70% | 0.64 | 0.02 | 0.34 | 0.19 | 0.00 | 0.80 |
| 0.045 | 40% | 0.27 | 0.20 | 0.52 |
|
|
| |
| 90 | 0.036 | 0% | 0.11 | 0.34 | 0.55 | |||
| 0.044 | (50%) | 0.26 | 0.21 | 0.53 |
|
|
| |
| 100 | 0.035 | 0% | 0.11 | 0.34 | 0.55 | |||
| 0.041 | 50% | 0.26 | 0.21 | 0.53 |
|
|
| |