Dominic J Glover1, Douglas S Clark1. 1. Department of Chemical and Biomolecular Engineering, University of California , Berkeley, California 94720, United States.
Abstract
The ability to assemble molecules into supramolecular architectures of controllable size and symmetry is a long sought after goal of nanotechnology and material engineering. Proteins are particularly attractive for molecular assembly due to their inherent molecular recognition and self-assembly capabilities. Advances in the computational prediction of protein folding and quaternary assembly have enabled the design of proteins that self-assemble into complex yet predictable shapes. These protein nanostructures are opening new possibilities in biomaterials, metabolic engineering, molecular delivery, tissue engineering, and a plethora of nanomaterials. Images of protein constructs assembled from simpler structures draw comparison to characters of calligraphy. In both cases, elaborate designs emerge from basic subunits, resulting in the translation of form into function with a high degree of artistry.
The ability to assemble molecules into supramolecular architectures of controllable size and symmetry is a long sought after goal of nanotechnology and material engineering. Proteins are particularly attractive for molecular assembly due to their inherent molecular recognition and self-assembly capabilities. Advances in the computational prediction of protein folding and quaternary assembly have enabled the design of proteins that self-assemble into complex yet predictable shapes. These protein nanostructures are opening new possibilities in biomaterials, metabolic engineering, molecular delivery, tissue engineering, and a plethora of nanomaterials. Images of protein constructs assembled from simpler structures draw comparison to characters of calligraphy. In both cases, elaborate designs emerge from basic subunits, resulting in the translation of form into function with a high degree of artistry.
The intricate and ordered complexes that
proteins adopt in nature is central to many biological processes,
ranging from cellular scaffolding provided by cytoskeletal proteins
to the encapsulation of nucleic acids in viral capsids. Exploiting
this remarkable fidelity and precision in self-assembly is highly
attractive for the fabrication of structurally defined materials with
nanometer dimensions. Researchers have spent considerable effort attempting
to mimic nature to sculpt proteins into structural templates and devices.
Early attempts focused on repurposing naturally occurring protein
nanostructures such as viral capsids.[1] However,
these top-down approaches are limited in terms of engineerability
and versatility. Modern nanobiotechnology aims to build structures
from the ground up, creating assemblies not found in nature, which
can be functionalized and used in a diverse range of applications
from nanoelectronics and energy, to biomedicine and the environment.
Indeed, nascent efforts to control the assembly of proteins into precise
shapes and patterns can be thought of in similar terms as the development
of letters or characters to comprise an alphabet, with the former
activity directed toward communication and the latter toward new technological
capabilities. Both, however, reflect an intrinsic artistry.As with all self-assembling systems, the final structure of a biopolymer,
such as a protein, is encoded by interactions of the material’s
components defined by their properties and order within the linear
polymer.[2] Therefore, the engineering of
biopolymers seeks to exploit sequence–structure relationships
to drive the folding and assembly of specific, well-ordered shapes.
This has been strikingly illustrated by DNA nanotechnology,[3] in which strands of DNA are designed to fold
by complementary Watson–Crick base pairing into a variety of
structures including lattices, tubes, tiles, and bricks.[4,5] Moving beyond the creation of simple shapes, the functional use
of DNA nanotechnology has been impressively demonstrated by the production
of nanoscale cages for therapeutic drug encapsulation,[6] molecular machines,[7] and biocomputing
nanorobots.[8] However, real-world application
of DNA nanostructures is in its infancy and requires expensive and
laborious chemical synthesis that limits technological applications.[9]The creation of protein nanostructures has lagged behind due to
the greater complexity of their structures. Proteins are composed
of amino acids that vary in their electrostatic charge and hydrophobicity,
which results in difficult-to-predict cooperative and long-range interactions.
While nature has had millennia of trial-and-error to evolve proteins
to fold into specific shapes, the engineering of proteins has proven
to be challenging due to incomplete understanding of how a sequence
of amino acids determines a protein’s three-dimensional shape,
and of how multiple proteins assemble into complex arrangements. The
complexity of proteins, however, has advantages in terms of chemistry
and molecular recognition, as well as architecture. Proteins are capable
of performing catalytic reactions, of interacting with both organic
and inorganic molecules, and of assembling into intricate structures,
all of which are ideal attributes for advanced materials. Furthermore,
mature technology exists for the efficient and economical production
of recombinant proteins in a range of microbial hosts.[10]Recent improvements in the prediction
of protein folding[11] and protein–protein
interactions[12,13] indicate that the design of protein
nanostructures is becoming increasingly feasible. This has occurred
in part through advances in computational power that enable increasingly
sophisticated algorithms that accurately model protein structure.
Critical to the success of this molecular modeling has been the crystallography-solved
protein structures submitted to the Protein Data Bank (PDB). This
constantly growing database of over 100,000 protein structures has
also revealed that many proteins in nature are oligomeric in structure,
and assemble either with themselves or with other proteins into complexes.
Using a combination of computational design and exploitation of natural
protein assemblies, researchers are seeking to create novel protein
shapes. In many ways, this engineering of protein assembly is reminiscent
of the masterful strokes of a calligrapher who joins simple lines
into complex patterns (Figure ). As with calligraphy, these protein materials are more than
just the sum of their parts, as the specific dimensions and topology
of the structures yield unique properties. Therefore, an ultimate
goal of protein nanotechnology is the ability to reliably create specific
protein characters without error, which can subsequently be functionalized
and combined for diverse applications. Extending the calligraphy analogy
one step further, much like combining meaningless letters into a meaningful
narrative, a set of protein characters could create the potential
for vast combinations that produce functions far beyond the properties
of the characters themselves.
Figure 1
Assembly of proteins into nanostructured shapes.
Simple filamentous structures can be assembled together into aesthetically
ordered patterns as shown by transmission electron microscopy. The
interlocking shapes produced conjure the aesthetic of “protein
calligraphy”, as shown by the comparison to characters of Chinese
calligraphy.
Assembly of proteins into nanostructured shapes.
Simple filamentous structures can be assembled together into aesthetically
ordered patterns as shown by transmission electron microscopy. The
interlocking shapes produced conjure the aesthetic of “protein
calligraphy”, as shown by the comparison to characters of Chinese
calligraphy.
Scripting Novel Protein
Nanostructures
Pioneering approaches to create novel protein
nanostructures relied on naturally occurring oligomerization domains
to guide the assembly process.[14] Many proteins
have evolved to self-associate through noncovalent oligomeric interactions
to form dimers or higher-order oligomers.[12] Domains that associate two at a time (dimers) or three at a time
(trimers) are particularly common in nature, and are a rich source
of raw building materials for creating self-assembling objects. An
elegant approach for building protein nanostructures joins two different
oligomerization domains by a semirigid linker into a single fusion
protein (Figure A).
Each domain in the fusion protein has a strong tendency to associate
with other copies of itself. As a consequence of this design, many
identical copies of the fusion protein self-assemble into filaments
or symmetrical cagelike objects, depending on the arrangement of oligomerization
domains.[14,15] A recent demonstration of this technique
involved joining dimerization and trimerization domains together (Figure A), which assembled
into a homogeneous 24-subunit porous cube with a 23 nm diameter and
large internal volume.[16] Furthermore, varying
the angle or symmetry between the oligomerization domains enables
control over the cube’s dimensions[17,15] or the creation of two-dimensional lattices over extended surface
areas.[18,19]
Figure 2
Strategies
for engineering self-assembling protein nanostructures. (a) The fusion
of two different naturally occurring oligomeric protein domains by
a semirigid linker for the creation of building blocks that self-assemble
into cagelike structures. (b) Design of de novo coiled
coils to attach multiple protein filaments together into complex geometries.
Strategies
for engineering self-assembling protein nanostructures. (a) The fusion
of two different naturally occurring oligomeric protein domains by
a semirigid linker for the creation of building blocks that self-assemble
into cagelike structures. (b) Design of de novo coiled
coils to attach multiple protein filaments together into complex geometries.The assembly of protein nanostructures
is not limited to using naturally occurring oligomerization domains
as improvements in our understanding of the rules governing protein–protein
interactions[12] are facilitating more direct
strategies for designing large protein assemblies.[13] Molecular modeling software is increasingly able to predict
protein–protein interactions,[20,21] which make
it possible to design proteins to form novel contact interfaces for
the creation of nanostructures. Using this strategy, cagelike structures
have recently been created with atomic level accuracy.[22,23] This was achieved by redesigning the interface between protein domains
to provide the energetic forces required to drive the assembly process,
as well as orient the proteins into the geometry of the desired structure.
Importantly, the final structures of the 24-subunit protein assemblies—either
tetrahedrons or octahedrons—were in close agreement with the
computational design. The engineering of new protein–protein
interfaces is not without challenges, however, as current methods
have a low success rate, which requires many individual designs to
be experimentally evaluated in order to find a functional interaction.[24] Therefore, the long-term goal is to have reliable
computational methods for designing protein assemblies without experimental
trial-and-error, but this will require deeper insights into the nature
of protein–protein interactions.[12]The next level of sophistication in protein nanostructure
engineering is increasingly focused on creating small modular domains
composed of de novo designed secondary structure
that assemble with specific partners into larger assemblies. This
methodology avoids having to redesign the entire contact interface
of a protein, and instead imparts modularity to control assembly of
individual proteins. One approach that has had notable success seeks
to use the simple but well-understood structural elements of coiled
coils to create complex shapes. Coiled-coil domains are intertwined
helical sequences that associate together by inter- and intramolecular
protein–protein interactions into elongated bundles.[25] These protein domains are attractive building
blocks as the rules governing coiled-coil assembly have been thoroughly
characterized,[26,27] which has facilitated the rational
and computational design of artificial coiled coils.[28,29]Engineering coiled-coil interaction specificity has enabled
a variety of protein nanostructures to be created, including fibers,[30,31] tubes,[32] cages,[33,34] and responsive hydrogels.[35] Our recent
paper demonstrated that the natural protein–protein interface
of a filamentous protein can be redesigned using coiled coils to impart
specificity and drive filament assembly into multifaceted structures.[36] In this approach, modular connector proteins
were created by replacing one of the two contact interfaces in the
filament subunit with opposing helical domains that associate together
as tight heterodimeric coiled coils (Figure B). The resulting pair of connector proteins
bound to each other with high specificity while also incorporating
into nascent filaments. Furthermore, the addition of a trimerization
domain into one of the engineered subunits enabled the creation of
a three-way connector that assembles with filaments containing the
opposing helical sequence into geometrically ordered shapes such as
pinwheels (Figure ). These protein shapes were ideal templates for building nanomaterials
that included highly conductive nanowires,[36] and demonstrate that the calligraphy of proteins is maturing beyond
the proof of principle stage and moving toward application in solving
a host of challenging problems.
Spatial Organization of
Enzymes
The controlled assembly of multiple proteins has
found application in the construction of novel enzymatic pathways
for metabolic engineering, which are promising alternatives to synthetic
chemistry for cheaply and renewably producing molecules of value.
Biopolymer scaffolds such as the hypothetical protein example in Figure B could prove ideal
for the alignment of multiple enzymes to facilitate substrate channelling,
a process whereby the reaction product of one enzyme is transferred
directly from the enzyme’s active site into the active site
of the next enzyme of a multistep reaction sequence. This can accelerate
the rate of metabolic reactions[37,38] and reduce diversion
of substrate/intermediates into competing pathways that lead to undesirable
side reactions.[39,40] An emerging strategy has been
to use protein scaffolds composed of a series of protein–protein
interaction domains that bind to specific peptide sequences (Figure A). Using this approach,
enzymes fused to cognate peptide sequences are attached along a protein
or nucleic acid scaffold in ordered arrangements for sequential enzymatic
reactions.[37,40,41] The natural modularity of this system enables stoichiometric control
over the enzymes and reaction fluxes, resulting in improved efficiency
of a variety of enzymatic pathways.[42] These
scaffold systems lack physical barriers and work by concentrating
intermediates before diffusion into the environment can occur.[38] The actual mechanism for the enhanced metabolic
flux observed by positioning enzymes in close proximity is at present
unclear. The distance between the active sites of adjacent enzymes
is most likely too large for an intermediate produced by one enzyme
to be processed by an adjacent enzyme before diffusing in solution.
Instead, recent molecular modeling suggests that the clustering of
enzymes into large agglomerates improves the probability that the
intermediate will be processed by a downstream enzyme.[38]
Figure 3
Spatial organization of enzymes for substrate channelling. (a) Synthetic protein scaffold to position enzymes for modular control over metabolic pathways. Adapted from reference ([40]). (b) Proposed scheme for creating a protein pipeline to integrate multiple-enzyme pathways for sequential catalytic reactions. Enzymes are encapsulated within tubular proteins that are assembled together, which enables direct diffusion of intermediates between active sites.
Spatial organization of enzymes for substrate channelling. (a) Synthetic protein scaffold to position enzymes for modular control over metabolic pathways. Adapted from reference ([40]). (b) Proposed scheme for creating a protein pipeline to integrate multiple-enzyme pathways for sequential catalytic reactions. Enzymes are encapsulated within tubular proteins that are assembled together, which enables direct diffusion of intermediates between active sites.A more direct strategy for spatial isolation of enzymes would
be to encapsulate them within an engineered protein cage, which has
the inherent advantages of isolating reactive intermediates from competing
or incompatible processes.[43] Initial attempts
to encapsulate enzymes exploited naturally occurring protein containers
such as virus-like particles[44] and bacterial
microcompartments.[45,46] Engineered protein cages (Figure A) are also promising
for enzyme encapsulation; however, practical application of enzyme
encapsulation will require methods to be developed for the regulation
of metabolite entry and exit, and ways in which to localize enzymes
into the cage. If these problems can be solved, then elegant enzyme
cascades can be envisaged whereby individual enzymes are positioned
in a series of protein containers that prevent metabolic intermediates
from diffusing away before reacting with sequential enzymes (Figure B). These containers
could ultimately be assembled into networks, allowing multiple inputs
into branched metabolic pathways, with intermediates reacted upon
in a precise and ordered manner, a long sought after goal in metabolic
engineering.
Nanomaterial Scaffolds
Beyond generating
2D structures to serve as templates, assembling organized protein
nanostructures has advantages over traditional processing methods
for the fabrication of nanoscale devices. In particular, mature technology
exists to evolve peptide sequences for the recognition and binding
of nearly any conceivable material.[47] Judicious
selection of binding peptides combined with an appropriate protein
nanostructure enables the incorporation of functional materials into
regular patterns with nanometer precision. The almost encyclopedic
design of binding peptides has facilitated the creation of a diverse
range of materials including conductive metallic nanowires,[1] lithium ion batteries,[48] and carbon nanotube solar cells.[49] Many
of these materials, particular in the area of nanoelectronics, are
beginning to move out of the laboratory and into real-world application.
Furthermore, the natural ability of many protein domains to bind to
specific nucleic acid sequences, for example, zinc fingers, has enabled
the creation of protein–nucleic acid coassembling nanomaterials.[41,50] These hybrid materials have the potential to meld the highly ordered
assembly of DNA nanotechnology with the functionality of proteins.In addition to the templation of functional materials upon the
surface of protein nanostructures, the internal cavity of protein
cages or containers is ideal for the compartmentalization of functional
molecules (e.g., Figure A). Protein nanostructures with larger internal openings could be
useful as reaction vessels or as vehicles for delivering cargoes,
including as agents for nanomedicine. Indeed, functionalization of
protein delivery vectors with subcellular targeting signals can facilitate
the active transport and delivery of therapeutic drugs to specific
locations within mammalian cells.[51] These
multicomponent delivery vectors have virus-like functionality, and
mimic the ability of viruses to target and enter specific cells, with
the release of their drug/DNA cargo triggered by a decrease in the
pH of the endosome/lysozyme.[52] The assembly
of multiple cages (Figure A) would also facilitate the creation of structures with high
porosity.[16] Highly porous materials, as
exemplified by metal–organic frameworks (MOFs),[53] have been examined for applications in catalysis,
gas storage, and biomimetic mineralization;[54] presumably protein nanostructures could find similar roles, especially
as scaffolds for three-dimensional tissue culture.
Figure 4
Future directions for
the creation of functional protein nanostructures and materials. (a)
Embedding of functional molecules within large-scale protein frameworks.
(b) An example of a nanorobot capable of encapsulating a cargo within
a compartment and transportation using an F1-ATPase molecular motor
attached to a filament nanopropeller.
Future directions for
the creation of functional protein nanostructures and materials. (a)
Embedding of functional molecules within large-scale protein frameworks.
(b) An example of a nanorobot capable of encapsulating a cargo within
a compartment and transportation using an F1-ATPase molecular motor
attached to a filament nanopropeller.It is important to note, however, that many proteins have
specific environmental limitations, such as a narrow range of structural
stability, which may render them impractical for harsh material fabrication
conditions. Therefore, of particular interest is the use of ultrastable
proteins that retain their native structure at extremes of pH or temperature
or in the presence of chemical denaturants.[55] The continual discovery of extremophilic organisms that thrive under
adverse environmental conditions is providing new source material
as well as insights into protein stability and will enable the creation
of ultrastable protein materials.[56,57]Combining
elements of structural biology, mechanical engineering, and materials
science, the design and fabrication of biological nanomachines ranging
from levers and rotors to motor-driven assemblies is advancing rapidly
in scope and complexity. Proposed applications of such systems include
sorting and transporting nanoscale cargo;[58] driving and accelerating self-assembly processes of nanostructures;[59] mixing and pumping fluids;[60] and various other mechanical maneuvers.[61] Many of these engineered protein machines exploit naturally
occurring molecular motors; however, de novo design
of proteins is progressing to a point whereby novel functions can
be created such as a recently described metal ion transporter that
actively moves Zn2+ and Co2+ across lipid membranes.[62]An ultimate goal, which bridges science fiction
with emerging reality, is the construction of biobased or bioinspired
“nanobots” that are capable of performing myriad mechanical
and analytical tasks in a multitude of environments. Recently, DNA
nanotechnology has made large strides in the creation of nanobots
for the transport and delivery of therapeutic agents;[58] however, despite numerous proof-of-concept examples of
biological moving parts and simple devices, protein nanobots have
yet to advance to the same level of sophistication. Well-known examples
of biological rotary and linear-motion motors in nature include F1-ATPase[63] and kinesin,[64] which
have been incorporated into a variety of nanoscale mechanical constructs.
A surface-immobilized F1-ATPase motor, for example, has been used
to rotate an attached filamentous protein through the efficient conversion
of chemical energy.[60] Improvements in the
assembly of proteins into ordered shapes should enable the next level
of nanobots to be created, whereby a motor such as an F1-ATPase is
able to drive a nanoscale protein propeller and transport a cargo
encapsulated within a protein compartment (Figure B). Combined with targeting peptides and
environmental responsiveness, these nanobots could be programmed to
seek out and interact with or destroy specific targets, e.g., cancer
cells.
Protein Calligraphy: Form and Function
Overall, this is an exciting time in the development of protein
nanostructures. The building of protein complexes has improved to
the point where highly ordered structures can be designed and built
in a customizable manner; next-generation genome sequencing and crystallography
are rapidly expanding the database of known protein structures; the
dramatic reduction in the cost and time required to synthesize genes[65] and build protein expression constructs[66] greatly accelerates prototyping of new protein
designs; and near-atomic resolution imaging by cryo-electron microscopy
enables the structure of large protein assemblies to be resolved.[67] Taken together, this confluence of scientific
advances will enable “protein calligraphy” to move from
“writing” single characters to crafting the equivalent
of sentences, i.e., rational assemblages of shapes that can translate
abstract matter into powerful devices, even convey information, for
creating function and solving problems.
Authors: Sebastian E Ahnert; Joseph A Marsh; Helena Hernández; Carol V Robinson; Sarah A Teichmann Journal: Science Date: 2015-12-11 Impact factor: 47.728
Authors: Christopher W Wood; Marc Bruning; Amaurys Á Ibarra; Gail J Bartlett; Andrew R Thomson; Richard B Sessions; R Leo Brady; Derek N Woolfson Journal: Bioinformatics Date: 2014-07-26 Impact factor: 6.937
Authors: Adam J M Wollman; Carlos Sanchez-Cano; Helen M J Carstairs; Robert A Cross; Andrew J Turberfield Journal: Nat Nanotechnol Date: 2013-11-10 Impact factor: 39.213
Authors: Yun X Chen; Nicole L Ing; Fengbin Wang; Dawei Xu; Nancy B Sloan; Nga T Lam; Daniel L Winter; Edward H Egelman; Allon I Hochbaum; Douglas S Clark; Dominic J Glover Journal: ACS Nano Date: 2020-05-07 Impact factor: 15.881
Authors: Eric J Young; Rodney Burton; Jyoti P Mahalik; Bobby G Sumpter; Miguel Fuentes-Cabrera; Cheryl A Kerfeld; Daniel C Ducat Journal: Front Microbiol Date: 2017-07-31 Impact factor: 5.640