Protein-polymer bioconjugate self-assembly has attracted a great deal of attention as a method to fabricate protein nanomaterials in solution and the solid state. To identify protein properties that affect phase behavior in protein-polymer block copolymers, a library of 15 unique protein-b-poly(N-isopropylacrylamide) (PNIPAM) copolymers comprising 11 different proteins was compiled and analyzed. Many attributes of phase behavior are found to be similar among all studied bioconjugates regardless of protein properties, such as formation of micellar phases at high temperature and low concentration, lamellar ordering with increasing temperature, and disordering at high concentration, but several key protein-dependent trends are also observed. In particular, hexagonal phases are only observed for proteins within the molar mass range 20-36 kDa, where ordering quality is also significantly enhanced. While ordering is generally found to improve with increasing molecular weight outside of this range, most large bioconjugates exhibited weaker than predicted assembly, which is attributed to chain entanglement with increasing polymer molecular weight. Additionally, order-disorder transition boundaries are found to be largely uncorrelated to protein size and quality of ordering. However, the primary finding is that bioconjugate ordering can be accurately predicted using only protein molecular weight and percentage of residues contained within β sheets. This model provides a basis for designing protein-PNIPAM bioconjugates that exhibit well-defined self-assembly and a modeling framework that can generalize to other bioconjugate chemistries.
Protein-polymer bioconjugate self-assembly has attracted a great deal of attention as a method to fabricate protein nanomaterials in solution and the solid state. To identify protein properties that affect phase behavior in protein-polymer block copolymers, a library of 15 unique protein-b-poly(N-isopropylacrylamide) (PNIPAM) copolymers comprising 11 different proteins was compiled and analyzed. Many attributes of phase behavior are found to be similar among all studied bioconjugates regardless of protein properties, such as formation of micellar phases at high temperature and low concentration, lamellar ordering with increasing temperature, and disordering at high concentration, but several key protein-dependent trends are also observed. In particular, hexagonal phases are only observed for proteins within the molar mass range 20-36 kDa, where ordering quality is also significantly enhanced. While ordering is generally found to improve with increasing molecular weight outside of this range, most large bioconjugates exhibited weaker than predicted assembly, which is attributed to chain entanglement with increasing polymer molecular weight. Additionally, order-disorder transition boundaries are found to be largely uncorrelated to protein size and quality of ordering. However, the primary finding is that bioconjugate ordering can be accurately predicted using only protein molecular weight and percentage of residues contained within β sheets. This model provides a basis for designing protein-PNIPAM bioconjugates that exhibit well-defined self-assembly and a modeling framework that can generalize to other bioconjugate chemistries.
In nature, enzymes
have developed as powerful and efficient biocatalysts
capable of performing reactions under mild conditions with high activity,
selectivity, and specificity.[1] These traits
make them attractive prospects for application in industrial catalysis
and sensing. For example, enzymes have proven effective in addressing
challenges in a variety of fields including energy generation applications
such as fuel catalysis[2,3] and light harvesting,[4] environmental applications such as waste treatment[5,6] and pollutant remediation,[7,8] or medical applications
such as glucose detection[9,10] and high-throughput
immunoassays.[11] Other globular proteins,
such as the fluorescent proteins enhanced green fluorescent protein
(EGFP) and monomeric Cherry (mCherry), have even been used in the
gain media in optofluidic and solid-state lasers.[12,13] However, enzymes and other globular proteins suffer from limitations
in processability and stability that complicate their use in industrial
applications, where biocatalysts should be heterogeneous to allow
for separation and reuse of the material.[14] The globular proteins utilized in these applications must retain
good activity, selectivity, and stability in conditions that may stray
quite far from physiological environments, settings in which most
proteins lose activity or completely denature.One important
solution to address these limitations is immobilization
of the protein. Not only is immobilization necessary for the majority
of practical applications such as localization of a protein on biosensor
transducers,[15] but immobilization of enzymes
has also been demonstrated to improve enzyme stability by restricting
unfolding[16,17] or subunit dissociation[18,19] and even to potentially enhance activity by kinetically trapping
the enzyme in hyperactive conformations.[20,21] Nevertheless, effective immobilization of proteins and enzymes has
proven challenging. Harsh immobilization chemistries and conditions
can result in protein denaturation,[22] steric
crowding or nonspecific protein orientation can hinder access to active
sites,[23] and encapsulation can restrict
transport of substrates and products.[24,25] A variety
of techniques have been developed to capture the benefits of immobilization
while mitigating the drawbacks, such as using site-specific conjugation
to control protein orientation[26] and using
flexible linkers to relieve crowding.[27] Additionally, various methods for nanopatterning of the protein
utilizing lithography[28−30] or self-assembled templates allow for control over
protein loading density and transport properties within biocatalysts.[11,31,32]Direct self-assembly of
enzymes in protein–polymer diblock
copolymers offers a promising method for achieving control over nanostructure
and protein orientation while maintaining stability and activity of
the protein. Self-assembly of coil–coil diblock copolymers
has been studied extensively, and a rich phase behavior encompassing
a variety of nanostructured morphologies, such as micelles in dilute
solution and lamellae, hexagonally packed cylinders, and gyroids in
bulk, has been predicted theoretically and observed experimentally.[33,34] These principles have been applied to globular protein–polymer
diblocks, which have been increasingly investigated in part due to
their ease of synthesis.[35−37] Protein–polymer block
copolymers have demonstrated the ability to form many ordered nanostructures,
largely analogous to those observed in coil–coil diblocks,
in dilute solution, concentrated solution, bulk solids, and thin films.[38−42] Globular protein–polymer diblocks have also proven effective
in heterogeneous catalysis and sensing applications.[43−45]Although the phase behavior of coil–coil diblock copolymers
is well understood, the monomers that comprise a globular protein
are presented in a defined sequence and folded structure, resulting
in topological and sequence-specific interactions between globular
proteins. These spatially dependent structures and chemistries result
in considerable anisotropy of ionic, hydrophobic, and hydrogen bonding
interactions as well as complex sterics in the assembled domains from
shape anisotropy of the globular structure. Simulations of polymer
tethered nanoparticles have demonstrated shifting of phase boundaries
resulting from different nanoparticle geometries,[46] and polyhedra of different symmetries have been demonstrated
to self-assemble into numerous crystalline, liquid crystalline, and
disordered morphologies,[47] suggesting that
colloidal shape is an important factor in self-assembly. All of these
effects make understanding the phase behavior of globular protein–polymer
diblock copolymers a very complex problem.Several studies have
been performed on mCherry–polymer conjugate
systems to elucidate the effects that govern self-assembly in protein–polymer
block copolymers. Changes in polymer chemistry and chain topology
have demonstrated significant shifts in phase transitions and observed
nanostructures,[48−50] and modifications to protein conjugation site have
been shown to affect micellar stability as well as offer a handle
for manipulation of protein orientation while keeping overall phase
behavior relatively constant.[51] However,
modifications to the surface amino acid residues of the folded protein
block have resulted in minimal changes to phase behavior,[52] while large modifications of the overall charge
of superfolder GFP (sfGFP) have demonstrated significantly improved
ordering for neutrally charged variants and suppressed ordering for
supercharged variants. This demonstrates that overall charge is one
method of controlling ordering.[53] These
findings also suggest that coarse-grained properties such as protein
size, shape, and other colloidal properties may prove to be better
predictors of self-assembly than the specific amino acid sequence
of the protein.Herein, the self-assembly behavior of 15 distinct
protein-b-poly(N-isopropylacrylamide)
(PNIPAM)
conjugates in concentrated solution is investigated using small-angle
X-ray scattering (SAXS). Within this set of 15 bioconjugates, seven
have been newly synthesized for this study, with the remainder taken
from the existing literature. The full structural data set is analyzed
for correlations between various physical and chemical properties
of the proteins with various metrics of ordering, particularly focusing
on quality of ordering within the observed phases, boundaries of the
phase transitions, and the identity of each observed phase. It is
found that a model incorporating only protein molecular weight and
percent of residues contained within β sheets can account for
the majority of the variation in ordering quality.
Methods
Polymer Synthesis
PNIPAM of various
molecular weights
was synthesized using reversible addition–fragmentation chain-transfer
(RAFT) polymerization using a maleimide-functionalized chain-transfer
agent, as previously reported.[42] The molar
masses and dispersities of all PNIPAM samples are included in Table , and gel permeation
chromatography (GPC) traces for each of the newly synthesized polymers
are included in Figure S1.
Table 1
Characterization of Proteins and Polymers
Used
Protein
Protein MW (kDa)
PNIPAM Mn (kDa)
PNIPAM Đ
PNIPAM Weight Fraction
rcSso7d
9.4
9.8
1.09
0.51
HMb
19.7
18.9
1.10
0.49
mCherry
28.1
29.0
1.11
0.51
EGFP
29.5
26.3
1.10
0.47
HCAII
29.4
29.0
1.11
0.50
HTPI
27.9
29.0
1.11
0.51
DFPase
36.1
40.9
1.09
0.53
PhoA
48.7
49.7
1.07
0.51
P450
54.7
55.4
1.04
0.50
BSA
66.7
66.6
1.07
0.50
IgG
149.0
68.5
1.04
0.31
Protein Biosynthesis and Purification
The genes encoding
native diisopropyl fluorophosphatase (DFPase)[54] and humancarbonic anhydrase II (HCAII)[55] contained in vector pET15b were purchased from GenScript, the plasmid
containing the sequence for humantriosephosphate isomerase (HTPI)[56] in vector pET20b was a gift from Dr. Markus
Ralser (Addgene plasmid # 50723), and the gene for Alkaline Phosphatase
(PhoA) in the pTrc99a plasmid was obtained as a gift from Dr. Jeff
Glasgow. To introduce conjugation sites into these sequences, site-directed
mutagenesis was performed using a QuikChange II XL kit (product #200521,
Agilent) to produce variants with an N-terminal cysteine for DFPase
(DFPaseN), a C-terminal cysteine for DFPase (DFPaseC), an S43C mutation
for HCAII, an S4C mutation for HTPI, and a T93C mutation for PhoA.
Each plasmid was then transformed into the Escherichia
coli expression cell line Tuner(DE3) (product #70623,
Novagen). The sequences for each of these genes and the expressed
proteins are provided in the Supporting Information. Each protein was biosynthetically expressed in 1 L media in 2.8
L Fernbach flasks within a shaking incubator. The expression conditions
for each protein are summarized in Table , along with the typical yield for each protein
after purification.
Table 2
Expression Conditions
for Proteins
Protein
Cell Line
Media
Induction conditions
Growth conditions
Approximate yield (mg protein/L culture)
HTPI
Tuner(DE3)
LB + Ampicillin
0.5
mM IPTG added at OD600 = 0.5
37 °C for 6 h after induction
30
HCAII
Tuner(DE3)
LB
+ Ampicillin
1.0 mM IPTG added at OD600 = 1.0
37 °C for 6 h
after induction
50
DFPase
Tuner(DE3)
LB + Ampicillin
No induction
30 °C for 24 h
30
P450
Tuner(DE3)
LG + Ampicillin
1.0 mM IPTG added at OD600 = 1.0
25 °C for
12 h after induction; low shake rate (150 rpm)
20
PhoA
Tuner(DE3)
2xYT + Ampicillin
1.0 mM IPTG, 1 mM MgSO4, and 0.1 mM ZnSO4 added
at OD600 = 0.4
37 °C for 12 h after induction
50
Cell pellets for expressed
proteins DFPase, HCAII, and HTPI were
harvested by centrifugation at 4000g, resuspended
in lysis buffer (50 mM NaH2PO4, 300 mM NaCl,
10 mM imidazole, 10 mM β-mercaptoethanol (BME), adjusted to
pH 8.0 with NaOH), and incubated with 1 mg/mL lysozyme at 4 °C
for 30 min prior to sonication. The cell lysate was then clarified
by centrifugation, and the proteins were bound to Ni-NTA resin overnight.
The bound resin was washed with 10 column volumes of wash buffer (50
mM NaH2PO4, 300 mM NaCl, 20 mM imidazole, 10
mM BME, adjusted to pH 8.0 with NaOH) and then eluted using 5 column
volumes of elution buffer (300 mM NaCl, 250 mM imidazole, 10 mM BME,
adjusted to pH 8.0 with NaOH). Following Ni-NTA purification, the
proteins were dialyzed against 20 mM Tris–Cl buffer at pH 8.0.
Protein purity was assessed using sodium dodecyl sulfate polyacrylamide
gel electrophoresis (SDS-PAGE) (Figures S2–4).Cell pellets for PhoA were harvested by centrifugation at
3500g and resuspended in 50 mM Tris–Cl buffer
at pH 8.
Pelleting and resuspension were then repeated to remove impurities.
The periplasm was isolated by adding solid sucrose (0.5 M), ethylenediaminetetraacetic
acid (EDTA, 2.5 mM), and lysozyme (0.6 mg mL–1)
to the cells and incubating at 37 °C for 30 min. The sphaeroplasts
were removed by centrifuging at 10,000g for 20 min.
The resulting supernatant was the periplasmic fraction. The supernatant
was dialyzed against 20 mM Tris–Cl pH 8.0, and then purified
via anion exchange fast protein liquid chromatography (FPLC) using
a 5 mL HiTrap Q HP column (product #17115401, GE Healthcare) on an
AKTA Pure FPLC (GE Healthcare) with 20 mM Tris–Cl pH 8.0 containing
5 mM BME as the mobile phase. The protein was eluted from the column
using a linear 0–0.5 M NaCl gradient. The purified fractions,
determined by SDS-PAGE, were combined and dialyzed against 20 mM Tris–Cl
pH 8.0 containing 1 mM MgSO4 and 0.1 mM ZnSO4.Bovine serum albumin (BSA) was purchased from Millipore-Sigma
(product
A2153) and purified by anion exchange FPLC using 5 mL HiTrap Q HP
anion exchange columns (GE Healthcare) on an AKTA Pure FPLC (GE Healthcare)
with 20 mM Tris–Cl pH 8.0 as the mobile phase. The protein
was eluted from the column using a linear 0–0.3 M NaCl gradient.
Protein Bioconjugation to PNIPAM
Proteins were coupled
to PNIPAM using thiol-maleimide coupling chemistry. The protein solutions
were diluted to either 1 or 0.5 mg/mL in 20 mM Tris–HCl buffer.
For most bioconjugations, 10 mol equiv of tris(2-caroxyethyl)phosphine
(TCEP) was added, and the solutions were stirred for 30 min at 4 °C
to reduce thiol groups. PNIPAM was added in stoichiometric ratios
between 2:1 and 10:1 to the protein solution, and the resulting mixtures
were stirred overnight at 4 °C. The reaction conditions for each
of the different conjugates are summarized in Table . The DFPase, PhoA, and BSA conjugates were
then purified using 5 mL HiTrap Q HP anion exchange columns (GE Healthcare)
on an AKTA Pure FPLC (GE Healthcare) with 20 mM Tris–Cl pH
8.0 as the mobile phase. The protein was eluted from the column using
a linear 0–0.3 M NaCl gradient. The purified fractions, determined
by SDS-PAGE (Figures S5–7), were
combined. HCAII and HTPI conjugates were purified by precipitation
in 1.0 M ammonium sulfate three times and then bound to Ni-NTA resin
for 24 h. The resin was then washed with 12 column volumes of 4°C
deionized water and eluted using 5 column volumes of elution buffer
(300 mM NaCl, 250 mM imidazole, adjusted to pH 8.0 with NaOH). Bioconjugate
purity was confirmed using SDS-PAGE (Figures S8 and S9). Following purification, the conjugate solutions were
concentrated to approximately 100–200 mg/mL using Millipore
Ultra-15 centrifugal filters, and 20 μL aliquots were deposited
as droplets onto poly(tetrafluoroethylene) sheets. The droplets were
then dried to solid pellets by ramping the pressure within the chamber
down at a rate of 100 Torr/h, and then holding at the lowest achievable
pressure (∼13 Torr) for an additional 12 h. The resulting solid
pellets were stored at 4 °C until use.
Table 3
Bioconjugation
Conditions for Each
Protein
Protein
Protein Concentration(mg/mL)
TCEP Concentration (protein molar equivalents)
PNIPAM Concentration (protein molar equivalents)
HTPI
1.0
10
5
HCAII
1.0
10
5
DFPase
1.0
10
3
PhoA
1.0
10
10
P450
1.0
10
10
BSA
0.5
5
2
Small-Angle X-ray Scattering
(SAXS)
Concentrated solution-state
samples for SAXS were prepared by hydrating the pellets to the desired
concentration and then sealing the concentrated bioconjugate solutions
into the center of 1 mm thick aluminum washers using Kapton tape.
SAXS patterns were obtained at the Advanced Photon Source (APS) at
Argonne National Lab on beamline 12-ID-C for DFPase and BSA, the Stanford
Synchrotron Radiation Lightsource (SSRL) on beamline 1-5 for HCAII,
PhoA, and P450, and the Advanced Light Source (ALS) at Lawrence Berkeley
National Lab on beamline 7.3.3 for HTPI. Samples were equilibrated
for 10 min before measurement at each temperature, and the collected
data were corrected for empty cell scattering.
Circular Dichroism (CD)
Spectroscopy
CD spectroscopy
was performed to confirm proper folding of expressed proteins at all
stages of sample preparation (Figure S10). Measurements were completed using a JASCO Model J-1500 CD spectrometer
to measure far UV CD spectroscopy between 195 and 250 nm in a 0.1
cm path length quartz cuvette. All proteins were measured in 20 mM
Tris–Cl buffer at pH 8.0, except for PhoA, for which the buffer
also contained 1 mM MgSO4 and 0.1 mM ZnSO4.
Protein Physical and Structural Parameter Determination
The size, shape, and structure of proteins were quantified with parameters
estimated using the PDB files for each studied protein: Sso7d (PDB: 1SSO),[57] HMb (PDB: 1A6G),[58] mCherry (PDB: 2H5Q),[59] EGFP (PDB: 2Y0G),[60] HCAII (PDB: 5JDV),[61] HTPI (PDB: 7TIM),[62] DFPase (PDB: 3O4P),[63] PhoA (PDB: 5C66),[64] P450 (PDB: 1W0E),[65] BSA (PDB: 3 V03),[66] and IgG
(PDB: 1IGT).[67] Protein volume, surface area, sphericity, and
effective radius (the radius of a sphere with the same surface area-to-volume
ratio as the protein) were calculated on the 3V server using high
grid resolution.[68] The number and percentage
of residues contained within α helices, β sheets, and
coil regions of each protein were determined using the VADAR server.[69] All calculated parameter values are listed in Table S1.
Development of Predictive
Model for Bioconjugate Ordering Quality
Regression analysis
was performed using JMP software. All variables
listed in Table S1 were used in the preliminary
regression except for protein volume due to the high correlation between
volume and molecular weight. Volume was removed from the model instead
of molecular weight to additionally remove the interdependence between
volume, surface area, and sphericity. A complete methodology for model
development, including training, testing, and validation, is provided
in the Supporting Information.
Results
and Discussion
Self-Assembly Behavior of New Bioconjugates
While in
traditional coil–coil diblock copolymers it is widely accepted
that a single universal phase diagram is representative of the behavior
of the molecules regardless of the specific chemistry of each block,
it is unclear whether the great diversity of proteins can be understood
in terms of a similarly coarse-grained representation. To understand
how protein variation impacts self-assembly, a relatively large sampling
of structurally different protein–polymer bioconjugates has
been prepared and compared, including eight bioconjugates consisting
of five proteins from previously published studies and a set of seven
new conjugates from six new proteins. All bioconjugates are protein–PNIPAM
block copolymers. PNIPAM was selected as the polymer block, as conjugates
incorporating this polymer have been shown to form ordered phases
under the widest range of concentrations.[48,49] Each of the six new conjugates was synthesized from proteins of
various structures, properties, and functions: HCAII, HTPI, DFPase,
PhoA, Cytochrome P450, and BSA. To form bioconjugates, these proteins
were expressed biosynthetically, or, in the case of BSA, purchased
from a commercial source and purified. These proteins were then used
to synthesize bioconjugates via site-specific bioconjugation chemistry
using thiol-maleimide coupling (Scheme ). In addition to these six conjugates, structural
data of conjugates of reduced charged Sso7d (rcSso7d),[45] human heart myoglobin (HMb),[44] human immunoglobulin G (IgG),[43] mCherry,[51] and EGFP[52] from other studies were compared to identify trends in
how the properties of each protein affects the quality of ordering
and general self-assembly behavior of protein–polymer bioconjugates.
For the six bioconjugates in this study as well as the referenced
bioconjugates, the weight fraction of protein and polymer was kept
roughly symmetric except in the case of IgG, where the mass of the
protein was too large to form a symmetric conjugate. The cartoon folded
structures with marked conjugation sites for each of these 11 proteins
is shown in Figure approximately to scale, and characterization details of each protein
and polymer used in this study are summarized in Table .
Scheme 1
General Synthetic
Scheme for Conjugation of a Cysteine-Modified Protein
to Maleimide-Functionalized PNIPAM To Form Protein–PNIPAM Bioconjugates
Figure 1
Panel of 11 proteins. Images are rendered approximately
to-scale.
All studied conjugation sites are marked with yellow spheres.
Panel of 11 proteins. Images are rendered approximately
to-scale.
All studied conjugation sites are marked with yellow spheres.SAXS was used to identify order–disorder
transitions (ODTs)
and order–order transitions (OOTs) for each bioconjugate sample
as a function of both temperature and concentration. Phase diagrams
for each of the six new bioconjugates are shown in Figure . In each case, a disordered
phase is observed at low concentrations. This phase is characterized
either by the absence of any peak or by a single broad peak resulting
from the correlation hole effect for block copolymers.[70] Characteristic SAXS traces for this phase as
well as each observed phase for each bioconjugate can be found in Figure . For each conjugate
at concentrations of 30 wt % and below, heating the samples to 35
°C results in a transition to a disordered micellar phase, as
evidenced by a sharpening of the primary peak accompanied by at least
one broad, higher order peak. This phase separation is consistent
with observations in mCherry, EGFP, HMb, and rcSso7d systems[44,45,52] and is attributed to water becoming
a poor solvent for PNIPAM above its lower critical solution temperature
(LCST). As concentration is increased, a concentration ODT (CODT) is observed to the lamellar phase for each
protein with the exceptions of rcSso7d and P450, which remain disordered.
The lamellar phase is characterized by scattering peaks at q* and
2q*. This transition occurs between 40 and 50 wt % for HMb, between
40 and 45 wt % for DFPaseC, and between 45 and 50 wt % for BSA, HCAII,
and HTPI, and DFPaseN. These CODT values
are considerably higher than for EGFP and mCherry, which have CODT values as low as 30 wt %.[40,51,52] Additionally, 3 of the proteins exhibit
ODT temperatures (TODT values) at concentrations
of 40 and 45 wt %: HTPI transitions from the disordered phase to a
hexagonal phase, and both DFPaseN and BSA transition from disordered
to lamellar phases. These transitions are likely due to desolvation
of the polymer resulting in phase separation as the water partitions
to the protein domains. However, this type of thermotropic transition
has not previously been observed in other protein–polymer block
copolymer systems.
Figure 2
Concentration vs temperature phase diagrams for (a) HTPI,
(b) HCAII,
(c) PhoA, (d) P450, (e) BSA, (f) DFPaseN, (g) DFPaseC. Phases are
labeled as disordered (Dis), disordered micellar (DM), lamellar (lam),
and hexagonal (hex).
Figure 3
Representative SAXS patterns
for each phase observed in (a) HTPI,
(b) HCAII, (c) DFPase, (d) PhoA, (e) P450, and (f) BSA bioconjugates.
Concentration vs temperature phase diagrams for (a) HTPI,
(b) HCAII,
(c) PhoA, (d) P450, (e) BSA, (f) DFPaseN, (g) DFPaseC. Phases are
labeled as disordered (Dis), disordered micellar (DM), lamellar (lam),
and hexagonal (hex).Representative SAXS patterns
for each phase observed in (a) HTPI,
(b) HCAII, (c) DFPase, (d) PhoA, (e) P450, and (f) BSA bioconjugates.Moving the conjugation site of DFPase between the
N-terminus and
C-terminus shows very little effect on the phase behavior. Micellar
stability is observed to be very similar, but a very slight change
in CODT is observed, with DFPaseN transitioning
from disordered to lamellar between 45 and 47 wt % at low temperatures,
and DFPaseC transitioning between 40 and 45 wt %. DFPaseC also exhibits TODT values from disordered to micellar at 30
and 25 °C for 40 and 45 wt % samples, respectively, while this
behavior is not observed for DFPaseN. Although there are shifts in
phase boundaries between these two conjugates, the differences are
small. Previous studies on conjugation-site modified mCherry-PNIPAM
conjugates[51] have shown similarly minor
effects on phase boundaries, suggesting that the phase transitions
within protein–polymer bioconjugates are generally insensitive
to changes in conjugation site.
Ordering Quality Trends
Although all of the symmetric
bioconjugates that form self-assembled phases display regions of lamellar
ordering, the lamellar phases of the previously studied and newly
characterized proteins exhibit widely varying degrees of ordering.
This is demonstrated by a comparison of the sharpness of the primary
scattering peaks, which indicates the ability of the conjugates to
assemble into highly correlated lamellar nanostructures. Peak sharpness
was quantified using the inverse full width at half-maximum (fwhm–1) of the peak and was calculated by fitting a Lorentzian
function to the primary scattering peak:where I0 is the
peak height, q0 is the peak position,
and 2γ is the fwhm. Though greater values of fwhm–1 do correspond to higher ordering quality, the relationship between
these two variables is nonlinear. As such, quantitative comparisons
between fwhm–1 values do not necessarily reflect
commensurate comparisons in ordering quality. Rather, fwhm–1 provides a useful metric by which the relative ordering quality
of conjugates can be compared. In a preliminary attempt to understand
trends in this metric of ordering, the fwhm–1 was
plotted against several coarse-grained biophysical properties of the
proteins: protein volume, effective radius, sphericity, surface area,
molar mass, and secondary structure content (Figures , S11–17).
Figure 4
Quality of ordering, quantified as fwhm–1, as
a function of protein molecular weight illustrating a trend of increasing
order with molecular weight with a small range of greatly improved
ordering. Data are presented for 50 wt % bioconjugate solutions at
25 °C.
Quality of ordering, quantified as fwhm–1, as
a function of protein molecular weight illustrating a trend of increasing
order with molecular weight with a small range of greatly improved
ordering. Data are presented for 50 wt % bioconjugate solutions at
25 °C.Protein molar mass was found to
display the most prominent trend
with ordering quality (Figure ). Based on the well-known behavior of other block copolymer
systems, it is natural to expect that the quality of ordering improves
with the size (molar mass) of the bioconjugate. This expectation originates
from block copolymer self-assembly theories,[71,72] which demonstrate that the segregation strength of the blocks scales
as χN: the product of the Flory–Huggins
interaction parameter and the copolymer degree of polymerization.
While the theory is not applicable to protein–polymer conjugates,
this trend is nonetheless observed in these protein–polymer
diblocks up to a protein molar mass of approximately 30 kDa. However,
beyond this point, the quality of ordering begins to decrease. While
the smaller diblocks of rcSso7d and HMb may be too weakly segregated
to order well, ordering in the larger diblocks of BSA, P450, and PhoA
is likely hindered by low chain mobility from chain entanglement.
Indeed, the molar masses of the PNIPAM samples conjugated to these
proteins (66.6, 55.4, and 49.7 kDa, respectively) all fall within
the expected 37.5–75 kDa range for the entanglement molar mass
in a 50 wt % solution (calculation of this range is provided in the Supporting Information). Chain entanglement is
expected to have a significant effect on the self-assembly of protein–PNIPAM
conjugates since these conjugates cannot be thermally annealed to
reach an equilibrium state without denaturing the protein block. As
such, entanglements likely greatly or completely restrict diffusion
of large, rigid proteins, kinetically trapping the conjugates in a
nonequilibrium state, which has been demonstrated in systems of analogously
rigid nanoparticles in entangled polymer networks and melts.[73] This results in a small window of molecular
weights that allow for both high segregation strength of the blocks
as well as sufficient chain mobility to self-assemble into well-ordered
phases.Temperature is also found to significantly affect ordering
quality
in the studied bioconjugates. For the majority of the bioconjugates,
the sharpness of the primary peak increases with temperature, indicating
improved ordering within the observed lamellar phase (Figure ). This effect results from
increased segregation strength between the domains as the temperature
approaches the transition temperature of PNIPAM (between 30 and 35
°C for most of the conjugates), causing the solvent to become
more selective for the protein block. The small magnitude of these
differences in ordering quality is consistent with observations that
solvent quality of water for PNIPAM only changes slightly between
10 and 25 °C.[74] Oddly, the best-ordered
bioconjugates—mCherry, EGFP, and HCAII—exhibit the opposite
trend, where quality of ordering decreases with increasing temperature.
This behavior may suggest that the entropic penalty for forming well-ordered
domains exhibited by these three bioconjugates is sufficiently high
such that it dominates other temperature-dependent factors. Regardless
of the physical origin, this difference strongly implies that for
mCherry, EGFP, and HCAII, there is a difference in the nature of the
forces governing their self-assembly that leads to greater ordering
quality.
Figure 5
fwhm–1 of the primary peaks for lamellar phases
observed in 50 wt % bioconjugates. This concentration was chosen as
it is the condition where every bioconjugate studied here except for
rcSso7 displays a lamellar phase.
fwhm–1 of the primary peaks for lamellar phases
observed in 50 wt % bioconjugates. This concentration was chosen as
it is the condition where every bioconjugate studied here except for
rcSso7 displays a lamellar phase.Although protein molecular weight displays a clear trend with quality
of ordering measured by peak sharpness, when compared to CODT, another metric of segregation strength, no such trend
is observed (Figures , S18–21). CODT values for each studied protein occur between 40 and 50
wt %, with the exception of the β-barrel proteins mCherry and
EGFP. It is worth noting that modifications to the conjugation sites
of mCherry and DFPase have been shown to shift the CODT without significant change to the overall quality
of ordering of the structured phases above the CODT.[51] Similarly, studies altering
the charge of superfolder GFP have demonstrated that quality of ordering
can be suppressed without shifting CODT.[53] These findings imply that quality
of ordering within the structured nanophases and the transition boundaries
for the observed phases can be controlled independently.
Figure 6
CODT as a function of protein molecular
weight. Error bars indicate the concentration resolution of the SAXS
measurements used to determine phase transitions. Data are presented
for 50 wt % bioconjugate solutions at 25 °C.
CODT as a function of protein molecular
weight. Error bars indicate the concentration resolution of the SAXS
measurements used to determine phase transitions. Data are presented
for 50 wt % bioconjugate solutions at 25 °C.The type of phase formed by the block copolymers at a constant
coil fraction also depends on molar mass. This is possible even in
the coarse-grained colloid–polymer framework because changing
the molar mass can also alter the relative aspect ratio of the molecule,
an effect that has been extensively observed in rod–coil systems.[75] While all ordered bioconjugates exhibit a lamellar
phase, a hexagonally packed phase, characterized by scattering peaks
at q*, √3q*, and 2q*, is observed in only 4 of the proteins:
mCherry, EGFP, HCAII, and HTPI. This OOT only occurs above 25 °C,
as the higher temperatures lead to water becoming a poor solvent for
PNIPAM, resulting in deswelling of the polymer domains. The corresponding
reduced coil volume fraction results in a collapse of the polymer
domain into hexagonally packed cylinders. Interestingly, all 4 proteins
that exhibit this phase are all bounded within the same molecular
weight range of 20–36 kDa. While the transition boundaries
for the hexagonal phase differ, the observation of the hexagonal phase
is ubiquitous for all symmetric protein–polymer bioconjugates
in this size range studied at these conditions, with the existence
of this phase being robust to conjugation site,[51] protein charge,[53] and polymer
chemistry.[48,49] However, no other conjugates
exhibit this phase, not even well-ordered conjugates such as BSA or
IgG.[43] Instead, these conjugates tend to
remain lamellar up to the highest measured temperature of 40 °C.
Even for IgG bioconjugates with small PNIPAM coil fraction, a condition
demonstrated to promote the hexagonal phase in mCherry and EGFP systems,[52,76] the hexagonal phase is absent. The observation of a small window
of hexagonally packed ordering seemingly independent of ordering quality
suggests that for the bioconjugate to exhibit a hexagonal phase, the
size of the protein is critical for accommodating the additional interfacial
curvature of this phase. As such, there may only be a narrow region
within the compositional space of solution phase protein–polymer
bioconjugates where the hexagonal phase is accessible.
Predicting
Bioconjugate Ordering Quality
A predictive
model for the expected bioconjugate ordering quality in terms of fwhm–1 was generated using only the protein biophysical
properties considered in this study. By reducing the full set of properties
to a subset of those most strongly correlated with fwhm–1 using regression analysis, it was determined that an accurate model
could be constructed using only four terms with six total coefficients:
a constant, a term linear in molecular weight, a Lorentzian fit to
the peak in the molecular weight data, and a term linear in % residues
in β sheets. The corresponding model is presented below:where MW is molecular
weight in kDa and β
is the percentage of residues in β sheets. Error bounds on the
coefficients and p-values for each parameter in the
model are listed in Table .
Table 4
Best-Fit Parameter Values and Statistical
Significance
Term
Coefficienta
p-Valueb
Intercept
–9 ± 6
0.1816
MW (linear)
0.38 ± 0.08
0.0009*
MW (Lorentzian) peak height
42 ± 8
0.0001*
MW (Lorentzian) peak position
28.4 ± 3.7
0.0001*
MW (Lorentzian) peak growth rate
0.97 ± 0.60
0.1055
% Beta sheets
0.3 ± 0.1
0.0001*
Ranges
represent the 95% confidence
interval for parameter estimates.
Values calculated using F-test of
overall significance. Asterisk indicates statistical significance
at the α=0.05 level.
Ranges
represent the 95% confidence
interval for parameter estimates.Values calculated using F-test of
overall significance. Asterisk indicates statistical significance
at the α=0.05 level.The ordering quality model overall shows good agreement with the
experimentally measured fwhm–1 values (Figure ). The model requires
very little information about a protein, only including variables
that provide coarse-grained descriptions of protein size (molecular
weight) and structure (% residues in β sheets). Despite its
simplicity and small sample size, the model is surprisingly accurate,
accounting for over 86% of the observed variation in fwhm–1 values and is estimated to explain over 75% of the variation in
fwhm–1 when predicting conjugate ordering (Table ). This accuracy is
particularly noteworthy since in both this and previous studies it
has been observed that various factors unaccounted for in this model,
including conjugation site[51] and protein
surface charge,[53] affect bioconjugate ordering
quality. The 95% confidence bounds on the predicted fwhm–1 value at 25 nm is ±9 nm, which is not high enough precision
to distinguish between ordered and disordered phases within the approximate
fwhm–1 range of 10–30 nm.[45] However, the precision of the model is sufficient to predict
that conjugates with a predicted fwhm–1 below 10
nm will be disordered and those with a predicted fwhm–1 greater than 30 nm will self-assembled into ordered phases. Indeed,
of the 10 conjugates from this study that fall outside the predicted
fwhm–1 range of 10–30 nm, the model accurately
assigns the ordering behavior of 9, only incorrectly predicting that
the PhoA-PNIPAM conjugate will be disordered. All but one of the nonconstant
variables in the model are statistically significant, and the positive
coefficients associated with the linear molecular weight and % β
sheet terms suggests that, holding other protein properties constant,
larger proteins with a higher fraction of residues in β sheets
exhibit stronger ordering.
Figure 7
Comparison of actual and predicted fwhm–1 values
for 50 wt % bioconjugate solutions at 25 °C. Shaded region represents
95% confidence region.
Table 5
Regression
Statistics
Regression Statistic
Value
R-squared
0.862
Adjusted R-squared
0.825
Predicted R-squared
0.758
Comparison of actual and predicted fwhm–1 values
for 50 wt % bioconjugate solutions at 25 °C. Shaded region represents
95% confidence region.The ordering trends with
molecular weight and β sheet content
can largely be rationalized by considering mobility and packing entropy
within the protein domain. The irregularly shaped surfaces of proteins
likely contribute to protein movement becoming frustrated at high
concentration, resulting in a reduced mobility that inhibits self-assembly
into well-ordered structures. In terms of molecular weight, larger
proteins tend to have a greater volume-to-surface area ratio, reducing
the interfacial area between proteins within a given volume and thereby
enhancing mobility. Protein mobility is also likely affected by entanglements
within the polymer domains, however, as mobility should significantly
decrease above the polymer entanglement molar mass. The combination
of these two molecular weight effects may produce the Lorentzian-shaped
peak in ordering quality around 28 kDa (Figure ), where proteins are large enough to avoid
significant packing frustrations but are conjugated to PNIPAM of a
sufficiently low molecular weight such that entanglements do not form.
Additionally, proteins with a higher β sheet content are more
likely to contain β sheets at the surface and therefore may
be more likely to exhibit smoother, more regular surfaces, reducing
packing frustrations between proteins. Indeed, mCherry, EGFP, and
DFPase all have very high β sheet content and display greater
than expected ordering relative to a linear fit to molecular weight,
whereas HMb and BSA each contain essentially no β sheets (both
are almost entirely composed of α helices) and exhibit weaker
than expected ordering. Furthermore, even though both P450 and BSA
have very similar low β sheet content, only BSA is observed
to form ordered lamellar phases presumably due to its larger molecular
weight. Thus, this model based on only two protein properties accurately
identified trends between these variables and bioconjugate fwhm–1 that can be used to predict the ordering quality
of protein–polymer bioconjugates. As a result, the model provides
important predictive design criteria that can be used to estimate
whether a specific protein–polymer bioconjugate is likely to
self-assemble before the conjugate is synthesized.
Conclusions
The self-assembly of 15 protein–PNIPAM bioconjugates was
compared to provide insight into the factors affecting the quality
of ordering and self-assembly behavior in protein–polymer diblock
copolymers. Many attributes of the phase behavior remain constant
among all 15 bioconjugates. In each case, a transition from disordered
to a disordered micellar phase is observed at low concentrations as
temperature crosses the LCST of PNIPAM. In addition, increasing concentration
in most of the conjugates leads to a disordered to ordered lyotropic
transition.However, this comparison also revealed differing
phase behaviors
in several of the proteins, such as a thermotropic ODT from a disordered
phase into lamellar or hexagonal phases. Additionally, the β-barrel
proteins mCherry and EGFP were found to have uniquely lower CODTs among the set, and only proteins with molar masses in
the narrow range of roughly 20–36 kDa were observed to exhibit
hexagonally packed morphologies. Although the β-barrel proteins
exhibited both the highest degree of ordering and the lowest CODT values, these two attributes were largely
uncorrelated for the remainder of the proteins. Coupled with previous
findings, these observations suggest that CODT and quality of ordering can be affected independently of one another.Outside of the optimum molar mass range of 20–36 kDa, bioconjugate
ordering generally improves with increasing molecular weight, as predicted
by block copolymer self-assembly theories. Even very large bioconjugates,
though, struggle to match the degree of ordering exhibited by bioconjugates
of proteins within the optimum window, likely due to chain entanglement
at high polymer molecular weight. These trends suggest that to optimize
self-assembly in protein–PNIPAM bioconjugates, a balance must
be made between increasing segregation strength from bioconjugate
size while avoiding polymer entanglement to maintain species mobility.A model constructed based on only protein molecular weight and
% residues contained in β sheets was able to capture 86% of
the variation in fwhm–1 values across the studied
bioconjugates and demonstrate good predictive capabilities. The model
was demonstrated to provide an accurate prediction of whether a conjugate
will form ordered phases for predicted fwhm–1 values
outside the range of 10–30 nm. While the model would benefit
from a larger sample set of protein–polymer block copolymer
SAXS data for further validation, the developed model suggests that
certain aspects of protein architecture promote strong bioconjugate
ordering. In particular, it suggests that likely candidates for forming
well-ordered protein–PNIPAM block copolymers would contain
protein blocks that are either high molecular weight and have a large
fraction of β sheets or have a molecular weight within the range
of 20–36 kDa. These insights should allow the selection and
design of proteins with enhanced ordering quality when conjugated
to PNIPAM, broadening the scope of proteins that can be used in functional
self-assembled biomaterials.
Authors: Curt Waltmann; Carolyn E Mills; Jeremy Wang; Baofu Qiao; John M Torkelson; Danielle Tullman-Ercek; Monica Olvera de la Cruz Journal: Proc Natl Acad Sci U S A Date: 2022-03-21 Impact factor: 12.779