Literature DB >> 31502834

Predicting Protein-Polymer Block Copolymer Self-Assembly from Protein Properties.

Aaron Huang¹, Justin M Paloni¹, Amy Wang¹, Allie C Obermeyer¹, Hursh V Sureka¹, Helen Yao¹, Bradley D Olsen¹.

Abstract

Protein-polymer bioconjugate self-assembly has attracted a great deal of attention as a method to fabricate protein nanomaterials in solution and the solid state. To identify protein properties that affect phase behavior in protein-polymer block copolymers, a library of 15 unique protein-b-poly(N-isopropylacrylamide) (PNIPAM) copolymers comprising 11 different proteins was compiled and analyzed. Many attributes of phase behavior are found to be similar among all studied bioconjugates regardless of protein properties, such as formation of micellar phases at high temperature and low concentration, lamellar ordering with increasing temperature, and disordering at high concentration, but several key protein-dependent trends are also observed. In particular, hexagonal phases are only observed for proteins within the molar mass range 20-36 kDa, where ordering quality is also significantly enhanced. While ordering is generally found to improve with increasing molecular weight outside of this range, most large bioconjugates exhibited weaker than predicted assembly, which is attributed to chain entanglement with increasing polymer molecular weight. Additionally, order-disorder transition boundaries are found to be largely uncorrelated to protein size and quality of ordering. However, the primary finding is that bioconjugate ordering can be accurately predicted using only protein molecular weight and percentage of residues contained within β sheets. This model provides a basis for designing protein-PNIPAM bioconjugates that exhibit well-defined self-assembly and a modeling framework that can generalize to other bioconjugate chemistries.

Entities: Chemical Disease Gene Mutation Species

Mesh：

Substances：

Year: 2019 PMID： 31502834 PMCID： PMC6794641 DOI： 10.1021/acs.biomac.9b00768

Source DB: PubMed Journal: Biomacromolecules ISSN： 1525-7797 Impact factor: 6.988

Introduction

In nature, enzymes have developed as powerful and efficient biocatalysts capable of performing reactions under mild conditions with high activity, selectivity, and specificity.[1] These traits make them attractive prospects for application in industrial catalysis and sensing. For example, enzymes have proven effective in addressing challenges in a variety of fields including energy generation applications such as fuel catalysis[2,3] and light harvesting,[4] environmental applications such as waste treatment[5,6] and pollutant remediation,[7,8] or medical applications such as glucose detection[9,10] and high-throughput immunoassays.[11] Other globular proteins, such as the fluorescent proteins enhanced green fluorescent protein (EGFP) and monomeric Cherry (mCherry), have even been used in the gain media in optofluidic and solid-state lasers.[12,13] However, enzymes and other globular proteins suffer from limitations in processability and stability that complicate their use in industrial applications, where biocatalysts should be heterogeneous to allow for separation and reuse of the material.[14] The globular proteins utilized in these applications must retain good activity, selectivity, and stability in conditions that may stray quite far from physiological environments, settings in which most proteins lose activity or completely denature. One important solution to address these limitations is immobilization of the protein. Not only is immobilization necessary for the majority of practical applications such as localization of a protein on biosensor transducers,[15] but immobilization of enzymes has also been demonstrated to improve enzyme stability by restricting unfolding[16,17] or subunit dissociation[18,19] and even to potentially enhance activity by kinetically trapping the enzyme in hyperactive conformations.[20,21] Nevertheless, effective immobilization of proteins and enzymes has proven challenging. Harsh immobilization chemistries and conditions can result in protein denaturation,[22] steric crowding or nonspecific protein orientation can hinder access to active sites,[23] and encapsulation can restrict transport of substrates and products.[24,25] A variety of techniques have been developed to capture the benefits of immobilization while mitigating the drawbacks, such as using site-specific conjugation to control protein orientation[26] and using flexible linkers to relieve crowding.[27] Additionally, various methods for nanopatterning of the protein utilizing lithography[28−30] or self-assembled templates allow for control over protein loading density and transport properties within biocatalysts.[11,31,32] Direct self-assembly of enzymes in protein–polymer diblock copolymers offers a promising method for achieving control over nanostructure and protein orientation while maintaining stability and activity of the protein. Self-assembly of coil–coil diblock copolymers has been studied extensively, and a rich phase behavior encompassing a variety of nanostructured morphologies, such as micelles in dilute solution and lamellae, hexagonally packed cylinders, and gyroids in bulk, has been predicted theoretically and observed experimentally.[33,34] These principles have been applied to globular protein–polymer diblocks, which have been increasingly investigated in part due to their ease of synthesis.[35−37] Protein–polymer block copolymers have demonstrated the ability to form many ordered nanostructures, largely analogous to those observed in coil–coil diblocks, in dilute solution, concentrated solution, bulk solids, and thin films.[38−42] Globular protein–polymer diblocks have also proven effective in heterogeneous catalysis and sensing applications.[43−45] Although the phase behavior of coil–coil diblock copolymers is well understood, the monomers that comprise a globular protein are presented in a defined sequence and folded structure, resulting in topological and sequence-specific interactions between globular proteins. These spatially dependent structures and chemistries result in considerable anisotropy of ionic, hydrophobic, and hydrogen bonding interactions as well as complex sterics in the assembled domains from shape anisotropy of the globular structure. Simulations of polymer tethered nanoparticles have demonstrated shifting of phase boundaries resulting from different nanoparticle geometries,[46] and polyhedra of different symmetries have been demonstrated to self-assemble into numerous crystalline, liquid crystalline, and disordered morphologies,[47] suggesting that colloidal shape is an important factor in self-assembly. All of these effects make understanding the phase behavior of globular protein–polymer diblock copolymers a very complex problem. Several studies have been performed on mCherry–polymer conjugate systems to elucidate the effects that govern self-assembly in protein–polymer block copolymers. Changes in polymer chemistry and chain topology have demonstrated significant shifts in phase transitions and observed nanostructures,[48−50] and modifications to protein conjugation site have been shown to affect micellar stability as well as offer a handle for manipulation of protein orientation while keeping overall phase behavior relatively constant.[51] However, modifications to the surface amino acid residues of the folded protein block have resulted in minimal changes to phase behavior,[52] while large modifications of the overall charge of superfolder GFP (sfGFP) have demonstrated significantly improved ordering for neutrally charged variants and suppressed ordering for supercharged variants. This demonstrates that overall charge is one method of controlling ordering.[53] These findings also suggest that coarse-grained properties such as protein size, shape, and other colloidal properties may prove to be better predictors of self-assembly than the specific amino acid sequence of the protein. Herein, the self-assembly behavior of 15 distinct protein-b-poly(N-isopropylacrylamide) (PNIPAM) conjugates in concentrated solution is investigated using small-angle X-ray scattering (SAXS). Within this set of 15 bioconjugates, seven have been newly synthesized for this study, with the remainder taken from the existing literature. The full structural data set is analyzed for correlations between various physical and chemical properties of the proteins with various metrics of ordering, particularly focusing on quality of ordering within the observed phases, boundaries of the phase transitions, and the identity of each observed phase. It is found that a model incorporating only protein molecular weight and percent of residues contained within β sheets can account for the majority of the variation in ordering quality.

Methods

Polymer Synthesis

PNIPAM of various molecular weights was synthesized using reversible addition–fragmentation chain-transfer (RAFT) polymerization using a maleimide-functionalized chain-transfer agent, as previously reported.[42] The molar masses and dispersities of all PNIPAM samples are included in Table , and gel permeation chromatography (GPC) traces for each of the newly synthesized polymers are included in Figure S1.

Table 1

Characterization of Proteins and Polymers Used

Protein	Protein MW (kDa)	PNIPAM M_n (kDa)	PNIPAM Đ	PNIPAM Weight Fraction
rcSso7d	9.4	9.8	1.09	0.51
HMb	19.7	18.9	1.10	0.49
mCherry	28.1	29.0	1.11	0.51
EGFP	29.5	26.3	1.10	0.47
HCAII	29.4	29.0	1.11	0.50
HTPI	27.9	29.0	1.11	0.51
DFPase	36.1	40.9	1.09	0.53
PhoA	48.7	49.7	1.07	0.51
P450	54.7	55.4	1.04	0.50
BSA	66.7	66.6	1.07	0.50
IgG	149.0	68.5	1.04	0.31

Protein Biosynthesis and Purification

The genes encoding native diisopropyl fluorophosphatase (DFPase)[54] and human carbonic anhydrase II (HCAII)[55] contained in vector pET15b were purchased from GenScript, the plasmid containing the sequence for human triosephosphate isomerase (HTPI)[56] in vector pET20b was a gift from Dr. Markus Ralser (Addgene plasmid # 50723), and the gene for Alkaline Phosphatase (PhoA) in the pTrc99a plasmid was obtained as a gift from Dr. Jeff Glasgow. To introduce conjugation sites into these sequences, site-directed mutagenesis was performed using a QuikChange II XL kit (product #200521, Agilent) to produce variants with an N-terminal cysteine for DFPase (DFPaseN), a C-terminal cysteine for DFPase (DFPaseC), an S43C mutation for HCAII, an S4C mutation for HTPI, and a T93C mutation for PhoA. Each plasmid was then transformed into the Escherichia coli expression cell line Tuner(DE3) (product #70623, Novagen). The sequences for each of these genes and the expressed proteins are provided in the Supporting Information. Each protein was biosynthetically expressed in 1 L media in 2.8 L Fernbach flasks within a shaking incubator. The expression conditions for each protein are summarized in Table , along with the typical yield for each protein after purification.

Table 2

Expression Conditions for Proteins

Protein	Cell Line	Media	Induction conditions	Growth conditions	Approximate yield (mg protein/L culture)
HTPI	Tuner(DE3)	LB + Ampicillin	0.5 mM IPTG added at OD₆₀₀ = 0.5	37 °C for 6 h after induction	30
HCAII	Tuner(DE3)	LB + Ampicillin	1.0 mM IPTG added at OD₆₀₀ = 1.0	37 °C for 6 h after induction	50
DFPase	Tuner(DE3)	LB + Ampicillin	No induction	30 °C for 24 h	30
P450	Tuner(DE3)	LG + Ampicillin	1.0 mM IPTG added at OD₆₀₀ = 1.0	25 °C for 12 h after induction; low shake rate (150 rpm)	20
PhoA	Tuner(DE3)	2xYT + Ampicillin	1.0 mM IPTG, 1 mM MgSO₄, and 0.1 mM ZnSO₄ added at OD₆₀₀ = 0.4	37 °C for 12 h after induction	50

Cell pellets for expressed proteins DFPase, HCAII, and HTPI were harvested by centrifugation at 4000g, resuspended in lysis buffer (50 mM NaH2PO4, 300 mM NaCl, 10 mM imidazole, 10 mM β-mercaptoethanol (BME), adjusted to pH 8.0 with NaOH), and incubated with 1 mg/mL lysozyme at 4 °C for 30 min prior to sonication. The cell lysate was then clarified by centrifugation, and the proteins were bound to Ni-NTA resin overnight. The bound resin was washed with 10 column volumes of wash buffer (50 mM NaH2PO4, 300 mM NaCl, 20 mM imidazole, 10 mM BME, adjusted to pH 8.0 with NaOH) and then eluted using 5 column volumes of elution buffer (300 mM NaCl, 250 mM imidazole, 10 mM BME, adjusted to pH 8.0 with NaOH). Following Ni-NTA purification, the proteins were dialyzed against 20 mM Tris–Cl buffer at pH 8.0. Protein purity was assessed using sodium dodecyl sulfate polyacrylamide gel electrophoresis (SDS-PAGE) (Figures S2–4). Cell pellets for PhoA were harvested by centrifugation at 3500g and resuspended in 50 mM Tris–Cl buffer at pH 8. Pelleting and resuspension were then repeated to remove impurities. The periplasm was isolated by adding solid sucrose (0.5 M), ethylenediaminetetraacetic acid (EDTA, 2.5 mM), and lysozyme (0.6 mg mL–1) to the cells and incubating at 37 °C for 30 min. The sphaeroplasts were removed by centrifuging at 10,000g for 20 min. The resulting supernatant was the periplasmic fraction. The supernatant was dialyzed against 20 mM Tris–Cl pH 8.0, and then purified via anion exchange fast protein liquid chromatography (FPLC) using a 5 mL HiTrap Q HP column (product #17115401, GE Healthcare) on an AKTA Pure FPLC (GE Healthcare) with 20 mM Tris–Cl pH 8.0 containing 5 mM BME as the mobile phase. The protein was eluted from the column using a linear 0–0.5 M NaCl gradient. The purified fractions, determined by SDS-PAGE, were combined and dialyzed against 20 mM Tris–Cl pH 8.0 containing 1 mM MgSO4 and 0.1 mM ZnSO4. Bovine serum albumin (BSA) was purchased from Millipore-Sigma (product A2153) and purified by anion exchange FPLC using 5 mL HiTrap Q HP anion exchange columns (GE Healthcare) on an AKTA Pure FPLC (GE Healthcare) with 20 mM Tris–Cl pH 8.0 as the mobile phase. The protein was eluted from the column using a linear 0–0.3 M NaCl gradient.

Protein Bioconjugation to PNIPAM

Proteins were coupled to PNIPAM using thiol-maleimide coupling chemistry. The protein solutions were diluted to either 1 or 0.5 mg/mL in 20 mM Tris–HCl buffer. For most bioconjugations, 10 mol equiv of tris(2-caroxyethyl)phosphine (TCEP) was added, and the solutions were stirred for 30 min at 4 °C to reduce thiol groups. PNIPAM was added in stoichiometric ratios between 2:1 and 10:1 to the protein solution, and the resulting mixtures were stirred overnight at 4 °C. The reaction conditions for each of the different conjugates are summarized in Table . The DFPase, PhoA, and BSA conjugates were then purified using 5 mL HiTrap Q HP anion exchange columns (GE Healthcare) on an AKTA Pure FPLC (GE Healthcare) with 20 mM Tris–Cl pH 8.0 as the mobile phase. The protein was eluted from the column using a linear 0–0.3 M NaCl gradient. The purified fractions, determined by SDS-PAGE (Figures S5–7), were combined. HCAII and HTPI conjugates were purified by precipitation in 1.0 M ammonium sulfate three times and then bound to Ni-NTA resin for 24 h. The resin was then washed with 12 column volumes of 4°C deionized water and eluted using 5 column volumes of elution buffer (300 mM NaCl, 250 mM imidazole, adjusted to pH 8.0 with NaOH). Bioconjugate purity was confirmed using SDS-PAGE (Figures S8 and S9). Following purification, the conjugate solutions were concentrated to approximately 100–200 mg/mL using Millipore Ultra-15 centrifugal filters, and 20 μL aliquots were deposited as droplets onto poly(tetrafluoroethylene) sheets. The droplets were then dried to solid pellets by ramping the pressure within the chamber down at a rate of 100 Torr/h, and then holding at the lowest achievable pressure (∼13 Torr) for an additional 12 h. The resulting solid pellets were stored at 4 °C until use.

Table 3

Bioconjugation Conditions for Each Protein

Protein	Protein Concentration(mg/mL)	TCEP Concentration (protein molar equivalents)	PNIPAM Concentration (protein molar equivalents)
HTPI	1.0	10	5
HCAII	1.0	10	5
DFPase	1.0	10	3
PhoA	1.0	10	10
P450	1.0	10	10
BSA	0.5	5	2

Small-Angle X-ray Scattering (SAXS)

Concentrated solution-state samples for SAXS were prepared by hydrating the pellets to the desired concentration and then sealing the concentrated bioconjugate solutions into the center of 1 mm thick aluminum washers using Kapton tape. SAXS patterns were obtained at the Advanced Photon Source (APS) at Argonne National Lab on beamline 12-ID-C for DFPase and BSA, the Stanford Synchrotron Radiation Lightsource (SSRL) on beamline 1-5 for HCAII, PhoA, and P450, and the Advanced Light Source (ALS) at Lawrence Berkeley National Lab on beamline 7.3.3 for HTPI. Samples were equilibrated for 10 min before measurement at each temperature, and the collected data were corrected for empty cell scattering.

Circular Dichroism (CD) Spectroscopy

CD spectroscopy was performed to confirm proper folding of expressed proteins at all stages of sample preparation (Figure S10). Measurements were completed using a JASCO Model J-1500 CD spectrometer to measure far UV CD spectroscopy between 195 and 250 nm in a 0.1 cm path length quartz cuvette. All proteins were measured in 20 mM Tris–Cl buffer at pH 8.0, except for PhoA, for which the buffer also contained 1 mM MgSO4 and 0.1 mM ZnSO4.

Protein Physical and Structural Parameter Determination

The size, shape, and structure of proteins were quantified with parameters estimated using the PDB files for each studied protein: Sso7d (PDB: 1SSO),[57] HMb (PDB: 1A6G),[58] mCherry (PDB: 2H5Q),[59] EGFP (PDB: 2Y0G),[60] HCAII (PDB: 5JDV),[61] HTPI (PDB: 7TIM),[62] DFPase (PDB: 3O4P),[63] PhoA (PDB: 5C66),[64] P450 (PDB: 1W0E),[65] BSA (PDB: 3 V03),[66] and IgG (PDB: 1IGT).[67] Protein volume, surface area, sphericity, and effective radius (the radius of a sphere with the same surface area-to-volume ratio as the protein) were calculated on the 3V server using high grid resolution.[68] The number and percentage of residues contained within α helices, β sheets, and coil regions of each protein were determined using the VADAR server.[69] All calculated parameter values are listed in Table S1.

Development of Predictive Model for Bioconjugate Ordering Quality

Regression analysis was performed using JMP software. All variables listed in Table S1 were used in the preliminary regression except for protein volume due to the high correlation between volume and molecular weight. Volume was removed from the model instead of molecular weight to additionally remove the interdependence between volume, surface area, and sphericity. A complete methodology for model development, including training, testing, and validation, is provided in the Supporting Information.

Results and Discussion

Self-Assembly Behavior of New Bioconjugates

While in traditional coil–coil diblock copolymers it is widely accepted that a single universal phase diagram is representative of the behavior of the molecules regardless of the specific chemistry of each block, it is unclear whether the great diversity of proteins can be understood in terms of a similarly coarse-grained representation. To understand how protein variation impacts self-assembly, a relatively large sampling of structurally different protein–polymer bioconjugates has been prepared and compared, including eight bioconjugates consisting of five proteins from previously published studies and a set of seven new conjugates from six new proteins. All bioconjugates are protein–PNIPAM block copolymers. PNIPAM was selected as the polymer block, as conjugates incorporating this polymer have been shown to form ordered phases under the widest range of concentrations.[48,49] Each of the six new conjugates was synthesized from proteins of various structures, properties, and functions: HCAII, HTPI, DFPase, PhoA, Cytochrome P450, and BSA. To form bioconjugates, these proteins were expressed biosynthetically, or, in the case of BSA, purchased from a commercial source and purified. These proteins were then used to synthesize bioconjugates via site-specific bioconjugation chemistry using thiol-maleimide coupling (Scheme ). In addition to these six conjugates, structural data of conjugates of reduced charged Sso7d (rcSso7d),[45] human heart myoglobin (HMb),[44] human immunoglobulin G (IgG),[43] mCherry,[51] and EGFP[52] from other studies were compared to identify trends in how the properties of each protein affects the quality of ordering and general self-assembly behavior of protein–polymer bioconjugates. For the six bioconjugates in this study as well as the referenced bioconjugates, the weight fraction of protein and polymer was kept roughly symmetric except in the case of IgG, where the mass of the protein was too large to form a symmetric conjugate. The cartoon folded structures with marked conjugation sites for each of these 11 proteins is shown in Figure approximately to scale, and characterization details of each protein and polymer used in this study are summarized in Table .

Scheme 1

General Synthetic Scheme for Conjugation of a Cysteine-Modified Protein to Maleimide-Functionalized PNIPAM To Form Protein–PNIPAM Bioconjugates

Figure 1

Panel of 11 proteins. Images are rendered approximately to-scale. All studied conjugation sites are marked with yellow spheres.

Panel of 11 proteins. Images are rendered approximately to-scale. All studied conjugation sites are marked with yellow spheres. SAXS was used to identify order–disorder transitions (ODTs) and order–order transitions (OOTs) for each bioconjugate sample as a function of both temperature and concentration. Phase diagrams for each of the six new bioconjugates are shown in Figure . In each case, a disordered phase is observed at low concentrations. This phase is characterized either by the absence of any peak or by a single broad peak resulting from the correlation hole effect for block copolymers.[70] Characteristic SAXS traces for this phase as well as each observed phase for each bioconjugate can be found in Figure . For each conjugate at concentrations of 30 wt % and below, heating the samples to 35 °C results in a transition to a disordered micellar phase, as evidenced by a sharpening of the primary peak accompanied by at least one broad, higher order peak. This phase separation is consistent with observations in mCherry, EGFP, HMb, and rcSso7d systems[44,45,52] and is attributed to water becoming a poor solvent for PNIPAM above its lower critical solution temperature (LCST). As concentration is increased, a concentration ODT (CODT) is observed to the lamellar phase for each protein with the exceptions of rcSso7d and P450, which remain disordered. The lamellar phase is characterized by scattering peaks at q* and 2q*. This transition occurs between 40 and 50 wt % for HMb, between 40 and 45 wt % for DFPaseC, and between 45 and 50 wt % for BSA, HCAII, and HTPI, and DFPaseN. These CODT values are considerably higher than for EGFP and mCherry, which have CODT values as low as 30 wt %.[40,51,52] Additionally, 3 of the proteins exhibit ODT temperatures (TODT values) at concentrations of 40 and 45 wt %: HTPI transitions from the disordered phase to a hexagonal phase, and both DFPaseN and BSA transition from disordered to lamellar phases. These transitions are likely due to desolvation of the polymer resulting in phase separation as the water partitions to the protein domains. However, this type of thermotropic transition has not previously been observed in other protein–polymer block copolymer systems.

Figure 2

Figure 3

Representative SAXS patterns for each phase observed in (a) HTPI, (b) HCAII, (c) DFPase, (d) PhoA, (e) P450, and (f) BSA bioconjugates.

Concentration vs temperature phase diagrams for (a) HTPI, (b) HCAII, (c) PhoA, (d) P450, (e) BSA, (f) DFPaseN, (g) DFPaseC. Phases are labeled as disordered (Dis), disordered micellar (DM), lamellar (lam), and hexagonal (hex). Representative SAXS patterns for each phase observed in (a) HTPI, (b) HCAII, (c) DFPase, (d) PhoA, (e) P450, and (f) BSA bioconjugates. Moving the conjugation site of DFPase between the N-terminus and C-terminus shows very little effect on the phase behavior. Micellar stability is observed to be very similar, but a very slight change in CODT is observed, with DFPaseN transitioning from disordered to lamellar between 45 and 47 wt % at low temperatures, and DFPaseC transitioning between 40 and 45 wt %. DFPaseC also exhibits TODT values from disordered to micellar at 30 and 25 °C for 40 and 45 wt % samples, respectively, while this behavior is not observed for DFPaseN. Although there are shifts in phase boundaries between these two conjugates, the differences are small. Previous studies on conjugation-site modified mCherry-PNIPAM conjugates[51] have shown similarly minor effects on phase boundaries, suggesting that the phase transitions within protein–polymer bioconjugates are generally insensitive to changes in conjugation site.

Ordering Quality Trends

Although all of the symmetric bioconjugates that form self-assembled phases display regions of lamellar ordering, the lamellar phases of the previously studied and newly characterized proteins exhibit widely varying degrees of ordering. This is demonstrated by a comparison of the sharpness of the primary scattering peaks, which indicates the ability of the conjugates to assemble into highly correlated lamellar nanostructures. Peak sharpness was quantified using the inverse full width at half-maximum (fwhm–1) of the peak and was calculated by fitting a Lorentzian function to the primary scattering peak:where I0 is the peak height, q0 is the peak position, and 2γ is the fwhm. Though greater values of fwhm–1 do correspond to higher ordering quality, the relationship between these two variables is nonlinear. As such, quantitative comparisons between fwhm–1 values do not necessarily reflect commensurate comparisons in ordering quality. Rather, fwhm–1 provides a useful metric by which the relative ordering quality of conjugates can be compared. In a preliminary attempt to understand trends in this metric of ordering, the fwhm–1 was plotted against several coarse-grained biophysical properties of the proteins: protein volume, effective radius, sphericity, surface area, molar mass, and secondary structure content (Figures , S11–17).

Figure 4

Quality of ordering, quantified as fwhm–1, as a function of protein molecular weight illustrating a trend of increasing order with molecular weight with a small range of greatly improved ordering. Data are presented for 50 wt % bioconjugate solutions at 25 °C. Protein molar mass was found to display the most prominent trend with ordering quality (Figure ). Based on the well-known behavior of other block copolymer systems, it is natural to expect that the quality of ordering improves with the size (molar mass) of the bioconjugate. This expectation originates from block copolymer self-assembly theories,[71,72] which demonstrate that the segregation strength of the blocks scales as χN: the product of the Flory–Huggins interaction parameter and the copolymer degree of polymerization. While the theory is not applicable to protein–polymer conjugates, this trend is nonetheless observed in these protein–polymer diblocks up to a protein molar mass of approximately 30 kDa. However, beyond this point, the quality of ordering begins to decrease. While the smaller diblocks of rcSso7d and HMb may be too weakly segregated to order well, ordering in the larger diblocks of BSA, P450, and PhoA is likely hindered by low chain mobility from chain entanglement. Indeed, the molar masses of the PNIPAM samples conjugated to these proteins (66.6, 55.4, and 49.7 kDa, respectively) all fall within the expected 37.5–75 kDa range for the entanglement molar mass in a 50 wt % solution (calculation of this range is provided in the Supporting Information). Chain entanglement is expected to have a significant effect on the self-assembly of protein–PNIPAM conjugates since these conjugates cannot be thermally annealed to reach an equilibrium state without denaturing the protein block. As such, entanglements likely greatly or completely restrict diffusion of large, rigid proteins, kinetically trapping the conjugates in a nonequilibrium state, which has been demonstrated in systems of analogously rigid nanoparticles in entangled polymer networks and melts.[73] This results in a small window of molecular weights that allow for both high segregation strength of the blocks as well as sufficient chain mobility to self-assemble into well-ordered phases. Temperature is also found to significantly affect ordering quality in the studied bioconjugates. For the majority of the bioconjugates, the sharpness of the primary peak increases with temperature, indicating improved ordering within the observed lamellar phase (Figure ). This effect results from increased segregation strength between the domains as the temperature approaches the transition temperature of PNIPAM (between 30 and 35 °C for most of the conjugates), causing the solvent to become more selective for the protein block. The small magnitude of these differences in ordering quality is consistent with observations that solvent quality of water for PNIPAM only changes slightly between 10 and 25 °C.[74] Oddly, the best-ordered bioconjugates—mCherry, EGFP, and HCAII—exhibit the opposite trend, where quality of ordering decreases with increasing temperature. This behavior may suggest that the entropic penalty for forming well-ordered domains exhibited by these three bioconjugates is sufficiently high such that it dominates other temperature-dependent factors. Regardless of the physical origin, this difference strongly implies that for mCherry, EGFP, and HCAII, there is a difference in the nature of the forces governing their self-assembly that leads to greater ordering quality.

Figure 5

fwhm–1 of the primary peaks for lamellar phases observed in 50 wt % bioconjugates. This concentration was chosen as it is the condition where every bioconjugate studied here except for rcSso7 displays a lamellar phase. Although protein molecular weight displays a clear trend with quality of ordering measured by peak sharpness, when compared to CODT, another metric of segregation strength, no such trend is observed (Figures , S18–21). CODT values for each studied protein occur between 40 and 50 wt %, with the exception of the β-barrel proteins mCherry and EGFP. It is worth noting that modifications to the conjugation sites of mCherry and DFPase have been shown to shift the CODT without significant change to the overall quality of ordering of the structured phases above the CODT.[51] Similarly, studies altering the charge of superfolder GFP have demonstrated that quality of ordering can be suppressed without shifting CODT.[53] These findings imply that quality of ordering within the structured nanophases and the transition boundaries for the observed phases can be controlled independently.

Figure 6

CODT as a function of protein molecular weight. Error bars indicate the concentration resolution of the SAXS measurements used to determine phase transitions. Data are presented for 50 wt % bioconjugate solutions at 25 °C. The type of phase formed by the block copolymers at a constant coil fraction also depends on molar mass. This is possible even in the coarse-grained colloid–polymer framework because changing the molar mass can also alter the relative aspect ratio of the molecule, an effect that has been extensively observed in rod–coil systems.[75] While all ordered bioconjugates exhibit a lamellar phase, a hexagonally packed phase, characterized by scattering peaks at q*, √3q*, and 2q*, is observed in only 4 of the proteins: mCherry, EGFP, HCAII, and HTPI. This OOT only occurs above 25 °C, as the higher temperatures lead to water becoming a poor solvent for PNIPAM, resulting in deswelling of the polymer domains. The corresponding reduced coil volume fraction results in a collapse of the polymer domain into hexagonally packed cylinders. Interestingly, all 4 proteins that exhibit this phase are all bounded within the same molecular weight range of 20–36 kDa. While the transition boundaries for the hexagonal phase differ, the observation of the hexagonal phase is ubiquitous for all symmetric protein–polymer bioconjugates in this size range studied at these conditions, with the existence of this phase being robust to conjugation site,[51] protein charge,[53] and polymer chemistry.[48,49] However, no other conjugates exhibit this phase, not even well-ordered conjugates such as BSA or IgG.[43] Instead, these conjugates tend to remain lamellar up to the highest measured temperature of 40 °C. Even for IgG bioconjugates with small PNIPAM coil fraction, a condition demonstrated to promote the hexagonal phase in mCherry and EGFP systems,[52,76] the hexagonal phase is absent. The observation of a small window of hexagonally packed ordering seemingly independent of ordering quality suggests that for the bioconjugate to exhibit a hexagonal phase, the size of the protein is critical for accommodating the additional interfacial curvature of this phase. As such, there may only be a narrow region within the compositional space of solution phase protein–polymer bioconjugates where the hexagonal phase is accessible.

Predicting Bioconjugate Ordering Quality

A predictive model for the expected bioconjugate ordering quality in terms of fwhm–1 was generated using only the protein biophysical properties considered in this study. By reducing the full set of properties to a subset of those most strongly correlated with fwhm–1 using regression analysis, it was determined that an accurate model could be constructed using only four terms with six total coefficients: a constant, a term linear in molecular weight, a Lorentzian fit to the peak in the molecular weight data, and a term linear in % residues in β sheets. The corresponding model is presented below:where MW is molecular weight in kDa and β is the percentage of residues in β sheets. Error bounds on the coefficients and p-values for each parameter in the model are listed in Table .

Table 4

Best-Fit Parameter Values and Statistical Significance

Term	Coefficienta	p-Valueb
Intercept	–9 ± 6	0.1816
MW (linear)	0.38 ± 0.08	0.0009*
MW (Lorentzian) peak height	42 ± 8	0.0001*
MW (Lorentzian) peak position	28.4 ± 3.7	0.0001*
MW (Lorentzian) peak growth rate	0.97 ± 0.60	0.1055
% Beta sheets	0.3 ± 0.1	0.0001*

Ranges represent the 95% confidence interval for parameter estimates.

Values calculated using F-test of overall significance. Asterisk indicates statistical significance at the α=0.05 level.

Ranges represent the 95% confidence interval for parameter estimates. Values calculated using F-test of overall significance. Asterisk indicates statistical significance at the α=0.05 level. The ordering quality model overall shows good agreement with the experimentally measured fwhm–1 values (Figure ). The model requires very little information about a protein, only including variables that provide coarse-grained descriptions of protein size (molecular weight) and structure (% residues in β sheets). Despite its simplicity and small sample size, the model is surprisingly accurate, accounting for over 86% of the observed variation in fwhm–1 values and is estimated to explain over 75% of the variation in fwhm–1 when predicting conjugate ordering (Table ). This accuracy is particularly noteworthy since in both this and previous studies it has been observed that various factors unaccounted for in this model, including conjugation site[51] and protein surface charge,[53] affect bioconjugate ordering quality. The 95% confidence bounds on the predicted fwhm–1 value at 25 nm is ±9 nm, which is not high enough precision to distinguish between ordered and disordered phases within the approximate fwhm–1 range of 10–30 nm.[45] However, the precision of the model is sufficient to predict that conjugates with a predicted fwhm–1 below 10 nm will be disordered and those with a predicted fwhm–1 greater than 30 nm will self-assembled into ordered phases. Indeed, of the 10 conjugates from this study that fall outside the predicted fwhm–1 range of 10–30 nm, the model accurately assigns the ordering behavior of 9, only incorrectly predicting that the PhoA-PNIPAM conjugate will be disordered. All but one of the nonconstant variables in the model are statistically significant, and the positive coefficients associated with the linear molecular weight and % β sheet terms suggests that, holding other protein properties constant, larger proteins with a higher fraction of residues in β sheets exhibit stronger ordering.

Figure 7

Comparison of actual and predicted fwhm–1 values for 50 wt % bioconjugate solutions at 25 °C. Shaded region represents 95% confidence region.

Table 5

Regression Statistics

Regression Statistic	Value
R-squared	0.862
Adjusted R-squared	0.825
Predicted R-squared	0.758

Comparison of actual and predicted fwhm–1 values for 50 wt % bioconjugate solutions at 25 °C. Shaded region represents 95% confidence region. The ordering trends with molecular weight and β sheet content can largely be rationalized by considering mobility and packing entropy within the protein domain. The irregularly shaped surfaces of proteins likely contribute to protein movement becoming frustrated at high concentration, resulting in a reduced mobility that inhibits self-assembly into well-ordered structures. In terms of molecular weight, larger proteins tend to have a greater volume-to-surface area ratio, reducing the interfacial area between proteins within a given volume and thereby enhancing mobility. Protein mobility is also likely affected by entanglements within the polymer domains, however, as mobility should significantly decrease above the polymer entanglement molar mass. The combination of these two molecular weight effects may produce the Lorentzian-shaped peak in ordering quality around 28 kDa (Figure ), where proteins are large enough to avoid significant packing frustrations but are conjugated to PNIPAM of a sufficiently low molecular weight such that entanglements do not form. Additionally, proteins with a higher β sheet content are more likely to contain β sheets at the surface and therefore may be more likely to exhibit smoother, more regular surfaces, reducing packing frustrations between proteins. Indeed, mCherry, EGFP, and DFPase all have very high β sheet content and display greater than expected ordering relative to a linear fit to molecular weight, whereas HMb and BSA each contain essentially no β sheets (both are almost entirely composed of α helices) and exhibit weaker than expected ordering. Furthermore, even though both P450 and BSA have very similar low β sheet content, only BSA is observed to form ordered lamellar phases presumably due to its larger molecular weight. Thus, this model based on only two protein properties accurately identified trends between these variables and bioconjugate fwhm–1 that can be used to predict the ordering quality of protein–polymer bioconjugates. As a result, the model provides important predictive design criteria that can be used to estimate whether a specific protein–polymer bioconjugate is likely to self-assemble before the conjugate is synthesized.

Conclusions

The self-assembly of 15 protein–PNIPAM bioconjugates was compared to provide insight into the factors affecting the quality of ordering and self-assembly behavior in protein–polymer diblock copolymers. Many attributes of the phase behavior remain constant among all 15 bioconjugates. In each case, a transition from disordered to a disordered micellar phase is observed at low concentrations as temperature crosses the LCST of PNIPAM. In addition, increasing concentration in most of the conjugates leads to a disordered to ordered lyotropic transition. However, this comparison also revealed differing phase behaviors in several of the proteins, such as a thermotropic ODT from a disordered phase into lamellar or hexagonal phases. Additionally, the β-barrel proteins mCherry and EGFP were found to have uniquely lower CODTs among the set, and only proteins with molar masses in the narrow range of roughly 20–36 kDa were observed to exhibit hexagonally packed morphologies. Although the β-barrel proteins exhibited both the highest degree of ordering and the lowest CODT values, these two attributes were largely uncorrelated for the remainder of the proteins. Coupled with previous findings, these observations suggest that CODT and quality of ordering can be affected independently of one another. Outside of the optimum molar mass range of 20–36 kDa, bioconjugate ordering generally improves with increasing molecular weight, as predicted by block copolymer self-assembly theories. Even very large bioconjugates, though, struggle to match the degree of ordering exhibited by bioconjugates of proteins within the optimum window, likely due to chain entanglement at high polymer molecular weight. These trends suggest that to optimize self-assembly in protein–PNIPAM bioconjugates, a balance must be made between increasing segregation strength from bioconjugate size while avoiding polymer entanglement to maintain species mobility. A model constructed based on only protein molecular weight and % residues contained in β sheets was able to capture 86% of the variation in fwhm–1 values across the studied bioconjugates and demonstrate good predictive capabilities. The model was demonstrated to provide an accurate prediction of whether a conjugate will form ordered phases for predicted fwhm–1 values outside the range of 10–30 nm. While the model would benefit from a larger sample set of protein–polymer block copolymer SAXS data for further validation, the developed model suggests that certain aspects of protein architecture promote strong bioconjugate ordering. In particular, it suggests that likely candidates for forming well-ordered protein–PNIPAM block copolymers would contain protein blocks that are either high molecular weight and have a large fraction of β sheets or have a molecular weight within the range of 20–36 kDa. These insights should allow the selection and design of proteins with enhanced ordering quality when conjugated to PNIPAM, broadening the scope of proteins that can be used in functional self-assembled biomaterials.

57 in total

1. Steric crowding effects on target detection in an affinity biosensor.

Authors: Lisa M Bonanno; Lisa A Delouise
Journal: Langmuir Date: 2007-04-11 Impact factor: 3.882

Review 2. Enzyme-based biofuel cells.

Authors: Shelley D Minteer; Bor Yann Liaw; Michael J Cooney
Journal: Curr Opin Biotechnol Date: 2007-03-30 Impact factor: 9.740

3. Improved Ordering in Low Molecular Weight Protein-Polymer Conjugates Through Oligomerization of the Protein Block.

Authors: Justin M Paloni; Eric A Miller; Hadley D Sikes; Bradley D Olsen
Journal: Biomacromolecules Date: 2018-08-22 Impact factor: 6.988

Review 4. Control of protein immobilization: coupling immobilization and site-directed mutagenesis to improve biocatalyst or biosensor performance.

Authors: Karel Hernandez; Roberto Fernandez-Lafuente
Journal: Enzyme Microb Technol Date: 2010-10-20 Impact factor: 3.493

Review 5. The potential of optofluidic biolasers.

Authors: Xudong Fan; Seok-Hyun Yun
Journal: Nat Methods Date: 2014-02 Impact factor: 28.547

6. X-ray crystallographic studies of alanine-65 variants of carbonic anhydrase II reveal the structural basis of compromised proton transfer in catalysis.

Authors: L R Scolnick; D W Christianson
Journal: Biochemistry Date: 1996-12-24 Impact factor: 3.162

7. Structural and immunologic characterization of bovine, horse, and rabbit serum albumins.

Authors: Karolina A Majorek; Przemyslaw J Porebski; Arjun Dayal; Matthew D Zimmerman; Kamila Jablonska; Alan J Stewart; Maksymilian Chruszcz; Wladek Minor
Journal: Mol Immunol Date: 2012-06-06 Impact factor: 4.407

8. Crystal structures of myoglobin-ligand complexes at near-atomic resolution.

Authors: J Vojtechovský; K Chu; J Berendzen; R M Sweet; I Schlichting
Journal: Biophys J Date: 1999-10 Impact factor: 4.033

9. Creating nanopatterns of His-tagged proteins on surfaces by nanoimprint lithography using specific NiNTA-histidine interactions.

Authors: Pascale Maury; Maryana Escalante; Mária Péter; David N Reinhoudt; Vinod Subramaniam; Jurriaan Huskens
Journal: Small Date: 2007-09 Impact factor: 13.281

Review 10. The architecture and function of the light-harvesting apparatus of purple bacteria: from single molecules to in vivo membranes.

Authors: Richard J Cogdell; Andrew Gall; Jürgen Köhler
Journal: Q Rev Biophys Date: 2006-10-12 Impact factor: 5.318

2 in total

Review 1. Coiled-Coils: the Molecular Zippers that Self-Assemble Protein Nanostructures.

Authors: Won Min Park
Journal: Int J Mol Sci Date: 2020-05-19 Impact factor: 5.923

2. Functional enzyme-polymer complexes.

Authors: Curt Waltmann; Carolyn E Mills; Jeremy Wang; Baofu Qiao; John M Torkelson; Danielle Tullman-Ercek; Monica Olvera de la Cruz
Journal: Proc Natl Acad Sci U S A Date: 2022-03-21 Impact factor: 12.779

2 in total