| Literature DB >> 29168376 |
Martijn van Rosmalen1, Mike Krom1, Maarten Merkx1.
Abstract
Flexible polypeptide linkers composed of glycine and serine are important components of engineered multidomain proteins. We have previously shown that the conformational properties of Gly-Gly-Ser repeat linkers can be quantitatively understood by comparing experimentally determined Förster resonance energy transfer (FRET) efficiencies of ECFP-linker-EYFP proteins to theoretical FRET efficiencies calculated using wormlike chain and Gaussian chain models. Here we extend this analysis to include linkers with different glycine contents. We determined the FRET efficiencies of ECFP-linker-EYFP proteins with linkers ranging in length from 25 to 73 amino acids and with glycine contents of 33.3% (GSSGSS), 16.7% (GSSSSSS), and 0% (SSSSSSS). The FRET efficiency decreased with an increasing linker length and was overall lower for linkers with less glycine. Modeling the linkers using the WLC model revealed that the experimentally observed FRET efficiencies were consistent with persistence lengths of 4.5, 4.8, and 6.2 Å for the GSSGSS, GSSSSS, and SSSSSS linkers, respectively. The observed increase in linker stiffness with reduced glycine content is much less pronounced than that predicted by a classical model developed by Flory and co-workers. We discuss possible reasons for this discrepancy as well as implications for using the stiffer linkers to control the effective concentrations of connected domains in engineered multidomain proteins.Entities:
Mesh:
Substances:
Year: 2017 PMID: 29168376 PMCID: PMC6150656 DOI: 10.1021/acs.biochem.7b00902
Source DB: PubMed Journal: Biochemistry ISSN: 0006-2960 Impact factor: 3.162
Figure 1Cyan-linker-yellow (CLY) proteins for studying the behavior of flexible linkers. (A) Schematic representation of the CLY fusion protein. The region between parentheses was varied in length and glycine content. re is the distance between the ends of the linker. rc is the distance between the two chromophores. v is the vector connecting the start of the linker to the ECFP chromophore. v is the vector connecting the end of the linker to the EYFP chromophore. The two wireframe polyhedra depict the different orientations over which both ECFP and EYFP were averaged for calculation of interchromophore distances and FRET efficiencies. (B) Schematic representation of the partial digestion and religation method used to obtain multiple linker length variants. (C) Gel image of colony PCR products encompassing the linker regions of partially digested (with 0.1–1 unit of SacI) and religated (S6)9 constructs.
Experimental and Calculated Properties of CLY Proteins
| CLY | no. of amino acids | Gly (%) | ⟨ | ⟨ | anisotropy | |
|---|---|---|---|---|---|---|
| (G4S2)1 | 23 | 26.1 | 0.71 | 0.67/0.68 | 23.8/23.1 | 0.370 |
| (G4S2)2 | 29 | 34.5 | 0.65 | 0.64/0.65 | 26.6/26.0 | 0.352 |
| (G4S2)3 | 35 | 40.0 | 0.63 | 0.61/0.62 | 29.2/28.5 | 0.351 |
| (G4S2)4 | 41 | 43.9 | 0.59 | 0.58/0.59 | 31.5/30.9 | 0.350 |
| (G4S2)5 | 47 | 46.8 | 0.56 | 0.56/0.57 | 33.7/33.1 | 0.350 |
| (G4S2)6 | 53 | 49.1 | 0.52 | 0.54/0.54 | 35.8/35.1 | 0.343 |
| (G4S2)7 | 59 | 50.8 | 0.51 | 0.51/0.52 | 37.7/37.1 | 0.348 |
| (G4S2)8 | 65 | 52.3 | 0.48 | 0.49/0.50 | 39.6/38.9 | 0.341 |
| (G4S2)9 | 71 | 53.5 | 0.43 | 0.47/0.48 | 41.4/40.7 | 0.332 |
| (G2S4)1 | 25 | 16.0 | 0.64 | 0.63/0.64 | 27.4/27.1 | 0.363 |
| (G2S4)2 | 31 | 19.4 | 0.62 | 0.60/0.60 | 30.4/30.2 | 0.368 |
| (G2S4)3 | 37 | 21.6 | 0.58 | 0.57/0.57 | 33.1/33.0 | 0.365 |
| (G2S4)5 | 49 | 24.5 | 0.51 | 0.51/0.51 | 38.0/38.0 | 0.363 |
| (G2S4)6 | 55 | 25.5 | 0.49 | 0.49/0.49 | 40.2/40.2 | 0.362 |
| (G2S4)7 | 61 | 26.2 | 0.45 | 0.46/0.46 | 42.4/42.4 | 0.360 |
| (G2S4)9 | 73 | 27.4 | 0.40 | 0.42/0.42 | 46.3/46.3 | 0.361 |
| (GS5)1 | 25 | 12.0 | 0.64 | 0.62/0.62 | 28.6/28.2 | 0.362 |
| (GS5)2 | 31 | 12.9 | 0.61 | 0.58/0.69 | 31.8/31.4 | 0.363 |
| (GS5)3 | 37 | 13.5 | 0.49 | 0.55/0.55 | 34.6/34.3 | 0.363 |
| (GS5)5 | 49 | 14.3 | 0.50 | 0.49/0.49 | 39.7/39.5 | 0.358 |
| (GS5)6 | 55 | 14.5 | 0.48 | 0.47/0.47 | 42.0/41.9 | 0.354 |
| (GS5)9 | 73 | 15.1 | 0.39 | 0.40/0.40 | 48.3/48.2 | 0.352 |
| (S6)1 | 25 | 8.0 | 0.61 | 0.57/0.58 | 32.4/32.3 | 0.362 |
| (S6)2 | 31 | 6.5 | 0.60 | 0.53/0.53 | 35.9/35.9 | 0.366 |
| (S6)4 | 43 | 4.7 | 0.44 | 0.47/0.46 | 42.0/42.3 | 0.362 |
| (S6)5 | 49 | 4.1 | 0.42 | 0.44/0.43 | 44.8/45.2 | 0.360 |
| (S6)6 | 55 | 3.6 | 0.39 | 0.41/0.41 | 47.4/47.9 | 0.358 |
| (S6)8 | 67 | 3.0 | 0.37 | 0.37/0.37 | 52.2/52.8 | 0.355 |
| (S6)9 | 73 | 2.7 | 0.32 | 0.35/0.35 | 54.4/55.2 | 0.321 |
The linker length and glycine content were calculated from the entire linker, including the flexible C- and N-termini of ECFP and EYFP.
Eobs is the experimentally determined FRET efficiency.
⟨re⟩ensemble and ⟨E⟩ensemble are calculated using WLC/GC with (lp = 3.7 Å)/(C = 1.9), (lp = 4.5 Å)/(C = 2.4), (lp = 4.9 Å)/(C = 2.6), and (lp = 6.2 Å)/(C = 3.4) for G4S2, G2S4, G1S5, and S6, respectively.
Values reported in ref (14).
Figure 2Spectra of CLY proteins with linkers of varying lengths and glycine contents. Normalized fluorescence emission spectra of 200 nM CLY proteins with different lengths of (A) (G2S4) linkers, (B) (GS5) linkers, and (C) (S6) linkers.
Figure 3Predicting FRET efficiencies of CLY proteins with the wormlike chain model. (A) End-to-end distance probability distributions of WLCs with a contour length of 73 amino acids and different persistence lengths. (B) Numerical relationship between the end-to-end distance and the average interchromophore distance (red dashed line) and the average FRET efficiency (black solid line). (C) Curves obtained by multiplying the probability density distributions in panel A by the average FRET efficiency at each end-to-end distance. The area under these curves is the ensemble average FRET efficiency for a CLY protein.
Figure 4Comparison of experimental FRET efficiencies with those predicted by a wormlike chain model. Symbols show Eobs values for all CLY proteins. Solid lines represent best fit modeled ⟨E⟩ensemble vs linker length curves with indicated values of lp. Best fits were determined by minimizing the average root-mean-square deviation (RMSD) between the model and experimental data. The purple dashed line (lp = 17.1 Å) is the modeled FRET efficiency vs linker length curve for polyserine linkers according to theoretical predictions for which lp = C × b0/2.[18]
Figure 5Effect of linker stiffness on effective concentration. Effective concentrations (Ceff) were calculated using the wormlike chain model as described by Zhou[43] for distances ranging from 0 to 100 Å. Solid lines represent Ceff as a function of linker length for polyserine linkers (lp = 6.2 Å). Dashed lines represent Ceff as a function of linker length for GlyGlySer linkers (lp = 3.7 Å).