| Literature DB >> 31659184 |
Albert Perez-Riba1,2, Elizabeth Komives3, Ewan R G Main4, Laura S Itzhaki5.
Abstract
The simple topology and modular architecture of tandem-repeat proteins such as tetratricopeptide repeats (TPRs) and ankyrin repeats makes them straightforward to dissect and redesign. Repeat-protein stability can be manipulated in a predictable way using site-specific mutations. Here we explore a different type of modification - loop insertion - that will enable a simple route to functionalisation of this versatile scaffold. We previously showed that a single loop insertion has a dramatically different effect on stability depending on its location in the repeat array. Here we dissect this effect by a combination of multiple and alternated loop insertions to understand the origins of the context-dependent loss in stability. We find that the scaffold is remarkably robust in that its overall structure is maintained. However, adjacent repeats are now only weakly coupled, and consequently the increase in solvent protection, and thus stability, with increasing repeat number that defines the tandem-repeat protein class is lost. Our results also provide us with a rulebook with which we can apply these principles to the design of artificial repeat proteins with precisely tuned folding landscapes and functional capabilities, thereby paving the way for their exploitation as a versatile and truly modular platform in synthetic biology.Entities:
Mesh:
Substances:
Year: 2019 PMID: 31659184 PMCID: PMC6817815 DOI: 10.1038/s41598-019-49905-4
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Schematics showing sequences and topologies of the proteins used in this study. (a) The CTPRa and CTPRa-loop motifs used. The two alpha-helices and connecting loop sequence (-QGD-) of the CTPR motif are coloured orange, with the wild-type inter repeat loop sequence (-PRS-) coloured green. The large inter-repeat loop insertions are shown as blue. Three variants of the large loops were used, one with 25-residue insertions and two with 10-residue insertions (panels D,E). (b) Topology of the CTPRa series of four proteins containing only the “CTPRa” motif (CTPRa2, CTPRa3, CTPRa4 and CTPRa6). Repeats are coloured as per panel (A). (c) Ribbon representation of the atomic structures of CTPRa2 and CTPRa6 based on the crystal structure PDB: 2HYI[31]. The dots represent the fact that this series also includes CTPRa3 & CTPRa4 (not shown). Repeats are coloured as per panel (A). (d) Topology of the CTPRa proteins series with multi-loop inter-repeat insertions. Each contains the same 4 proteins as the wild-type CTPRa series (panel B), with either a 25-residue loop insertion or two different 10-residue loops insertions (one has a double mutation GSLVPRGS to GSDDPRGS denoted by a red line). Repeats are coloured as per panel (A). (e) Topology of the CTPRa proteins series with either a single or alternate inter-repeat loop insertions. Each contains the 10-residue loop insertion with sequence: GSDDPRGS. Repeats are coloured as per panel (A). (f) Schematic representation of the crystal structure of an 8-repeat protein, showing that the repeats form a super-helical array and the inter-repeat loops are offset by 45° relative to each other (PDB: 2HYZ)[31].
Figure 2Fluorescence-monitored equilibrium denaturation curves for: (a) CTPRa series, (b) CTPRm25 series, (c) CTPRm10 and (d) CTPRm10D series. Fluorescence signal was converted to fraction unfolded for ease of comparison.
Parameters obtained by fitting the equilibrium denaturations monitored by fluorescence of the CTPRa, CTPRm25, CTPRm10, CTPRm10D series to a two-state folding model.
| Protein Series | D50% (M) | ||
|---|---|---|---|
| CTPRa Series | |||
| CTPR2a | 2.97 ± 0.01 | 2.1 ± 0.04 | 6.3 ± 0.1 |
| CTPR3a | 3.76 ± 0.01 | 2.8 ± 0.1 | 10.4 ± 0.3 |
| CTPR4a | 4.04 ± 0.01 | 3.1 ± 0.1 | 12.7 ± 0.5 |
| CTPR6a | 4.35 ± 0.01 | 4.0 ± 0.1 | 17.3 ± 0.5 |
| CTPRm25 Series | |||
| CTPR2m25 | 2.41 ± 0.02 | 2.2 ± 0.1 | 5.3 ± 0.1 |
| CTPR3m25 | 2.58 ± 0.01 | 2.2 ± 0.1 | 5.6 ± 0.2 |
| CTPR4m25 | 2.62 ± 0.01 | 2.2 ± 0.1 | 5.7 ± 0.2 |
| CTPR6m25 | 2.62 ± 0.01 | 2.2 ± 0.04 | 5.7 ± 0.1 |
| CTPRm10 Series | |||
| CTPR2m10 | 2.70 ± 0.01 | 2.1 ± 0.1 | 5.6 ± 0.1 |
| CTPR3m10 | 2.90 ± 0.01 | 2.0 ± 0.1 | 5.8 ± 0.2 |
| CTPR4m10 | 3.07 ± 0.02 | 2.0 ± 0.1 | 6.2 ± 0.3 |
| CTPR6m10 | 3.17 ± 0.01 | 2.0 ± 0.1 | 6.4 ± 0.3 |
| CTPRm10D Series | |||
| CTPR2m10D | 2.79 ± 0.02 | 2.2 ± 0.1 | 5.7 ± 0.3 |
| CTPR3m10D | 3.01 ± 0.02 | 2.3 ± 0.1 | 6.8 ± 0.3 |
| CTPR4m10D | 3.08 ± 0.01 | 2.4 ± 0.1 | 7.3 ± 0.2 |
| CTPR6m10D | 3.18 ± 0.01 | 2.8 ± 0.1 | 8.8 ± 0.3 |
Measurements for the CTPRa series were performed in triplicate, and measurements for the CTPRm25 series were performed six times. Errors in were propagated from the errors obtained from the mean standard errors of the fitted variables.
Figure 3Butterfly plots of HDX MS experiments for CTPR4aX and CTPR4m25X proteins. These plots show deuterium uptake as a function of time for: (a) CTPR4aX, (c) CTPR4m25X and (d) denatured CTPR4m25X in 5 M urea. Each symbol corresponds to a different reporter peptide, as shown in (b) mapped onto a model of the CTPR4a structure as blue ribbon. Differing time points are coloured according to the legend in panel (a) and are identical in each plot. The amino-acid sequences of the reporter peptides are described in the Methods section. The x-axis of the plots represents the position of the peptide in the protein’s amino acid sequence. Above each plot, a schematic of the repeat proteins shows the secondary structure of the peptides in the protein. The deuterium uptake was corrected for back-exchange and normalised (see Methods for details of the analysis).
Figure 4Denaturation curves monitored by fluorescence and CD for CTPRalt10D, single-loop CTPR and “wild-type” parent CTPRa proteins. (a) Normalised fluorescence of CTPR2m10D, CTPR4alt10D, CTPR6alt10D & CTPR6a, (b) Normalised fluorescence of CTPR2m10D, CTPR6L1-2, CTPR6L3-4 & CTPR6a, (c) Normalised ellipticity at 222 nm of CTPR2m10D, CTPR4alt10D, CTPR6alt10D & CTPR6a and (d) Normalised ellipticity at 222 nm of CTPR2m10D, CTPR6L1-2, CTPR6L3-4 & CTPR6a. The plots are fitted to a two-state model to guide the eye.
Energetic cost of loop insertion is dependent on location and context within the repeat array.
| Protein and loop location and context | Estimated energetic cost per loop insertion (kcal mol−1) | |
|---|---|---|
| CTPR2 |
| 0.6 |
| CTPR6, L3-4 |
| 4.2 |
| CTPR4, alt loop |
| 2.0 |
| CTPR6, alt loop |
| 2.4 |
For each protein, the energetic cost per loop insertion was estimated by subtraction of the free energy of unfolding of the protein from that of the respective loop-free CTPR protein and then dividing by the total number of loops in the protein. Thus: CTPR2 = CTPR2a-CTPR2m10D.
CTPR6, L3-4 = CTPR6a- CTPR6L3-4.
CTRP4, alt loop = (CTPR4a-CTPR4alt10D)/2.
CTPR6, alt loop = (CTPR6a-CTPR6alt10D)/3.
Free energies of unfolding are taken from Tables S3 and S7.