Evan Janzen1,2, Celia Blanco1, Huan Peng1, Josh Kenchel1,2, Irene A Chen1,2,3. 1. Department of Chemistry and Biochemistry, University of California, Santa Barbara, Santa Barbara, California 93109, United States. 2. Biomolecular Sciences and Engineering Program, University of California, Santa Barbara, Santa Barbara, California 93109, United States. 3. Department of Chemical and Biomolecular Engineering, University of California, Los Angeles, Los Angeles, California 90095, United States.
Abstract
The ability of enzymes, including ribozymes, to catalyze side reactions is believed to be essential to the evolution of novel biochemical activities. It has been speculated that the earliest ribozymes, whose emergence marked the origin of life, were low in activity but high in promiscuity, and that these early ribozymes gave rise to specialized descendants with higher activity and specificity. Here, we review the concepts related to promiscuity and examine several cases of highly promiscuous ribozymes. We consider the evidence bearing on the question of whether de novo ribozymes would be quantitatively more promiscuous than later evolved ribozymes or protein enzymes. We suggest that while de novo ribozymes appear to be promiscuous in general, they are not obviously more promiscuous than more highly evolved or active sequences. Promiscuity is a trait whose value would depend on selective pressures, even during prebiotic evolution.
The ability of enzymes, including ribozymes, to catalyze side reactions is believed to be essential to the evolution of novel biochemical activities. It has been speculated that the earliest ribozymes, whose emergence marked the origin of life, were low in activity but high in promiscuity, and that these early ribozymes gave rise to specialized descendants with higher activity and specificity. Here, we review the concepts related to promiscuity and examine several cases of highly promiscuous ribozymes. We consider the evidence bearing on the question of whether de novo ribozymes would be quantitatively more promiscuous than later evolved ribozymes or protein enzymes. We suggest that while de novo ribozymes appear to be promiscuous in general, they are not obviously more promiscuous than more highly evolved or active sequences. Promiscuity is a trait whose value would depend on selective pressures, even during prebiotic evolution.
Catalytic
RNA sequences, or ribozymes, are widely accepted to have
been central to the origin of life.[1,2] Their dual
capacity for information storage and catalytic activity is the basis
for the RNA world theory,[3−5] that an RNA-based metabolism could
have preceded the more complex DNA-RNA-protein system that we observe
in biology today. Regardless of whether an RNA world existed on early
Earth, ribozymes represent an excellent laboratory model system for
molecular evolution. Beginning with a pool of random sequences, strategies
can be devised to select for particular activities. Cycles of selection
and amplification by PCR allow enrichment and eventually isolation
of active sequences. A prerequisite of successful in vitro evolution is the presence of one or more molecules with some activity,
however slight, in the initial pool or early rounds. Once this kernel
of activity exists, the active sequences can be selected and activity
possibly improved by mutation during the evolutionary process. In
addition to developing new ribozymes, in vitro evolution
of RNA allows well-controlled experiments to observe and analyze the de novo emergence of biochemical functions.[6−8]Promiscuous catalytic activities have been invoked as being
particularly
significant for the origin of enzymes,[9,10] as one might
intuit that early, simple ribozymes or enzymes would have little specificity
and therefore might catalyze many reactions, albeit with slow rates.
These sequences might possess kernels of activity for many different
substrates or reactions. One landmark study of such a ribozyme is
a sequence which was engineered to adopt two possible folds, one of
which acted as a ligase and one of which acted as a self-cleaving
ribozyme.[11] This sequence had very low
activity for each function, but a relatively small number of mutations
would increase function to near wild-type in both directions. Such
promiscuity would promote evolutionary innovation by poising sequences
at a nonzero fitness for multiple activities, each of which could
be potentially optimized by natural selection. This idea also raises
the interesting question of whether ribozymes are intrinsically more
promiscuous than protein enzymes. From extensive work on the directed
evolution of enzymes, it has become clear that much of the success
of the field as a whole is due to the presence of low levels of apparently
promiscuous activity in extant protein enzymes. This surprising degree
of promiscuity in highly evolved enzymes suggests that promiscuity
is actually the rule rather than an exception for protein enzymes.Here, we review what is known about the specificity and promiscuity
of ribozymes. We first introduce major concepts and definitions in
specificity and promiscuity, including how to quantify them, which
were originally developed in the enzyme literature. An interesting
concept is the relationship between activity and specificity, which
underlies the intuition that early, relatively low-activity ribozymes
would be more promiscuous. We then review several cases of ribozymes
in which studies have demonstrated promiscuity in some way. When possible,
we calculate a promiscuity index from what is known about these ribozymes,
a first step toward rigorous comparisons of the promiscuity of ribozymes
and protein enzymes. We end with a discussion of the implications
of these comparisons for the hypothesis that early ribozymes were
particularly promiscuous.
Promiscuity and Specificity:
Concepts and Definitions
Defining Specificity
Specificity
is the ability of an enzyme to discriminate between two different
substrates assuming both are present. The question of how to measure
enzyme specificity has been a matter of debate in the past (see refs (12 and 13) and references therein), but
it is generally agreed that specificity in the presence of two different
substrates should be compared based on the discrimination factor,[14] defined as the ratio of the catalytic efficiencies
(kcat/KM)
for the corresponding reactions. According to transition-state theory,
the logarithm of the catalytic efficiency (kcat/KM) is proportional to the
free energy difference between the free enzyme and substrate vs the
transition-state complex (ΔG⧧).[15] When comparing the cognate with an
alternate substrate, the discrimination factor is also called the
accuracy A. Thus, A is exponentially
dependent on the difference ΔΔG⧧ between the cognate and alternate substrates.In some cases
(e.g., polymerases), the use of an error ratio (the rate of incorrect
product formation divided by the rate of correct product formation)
is more appropriate. To gain an intuition about the possible scale
of this ratio, one may note that the theoretical maximum discrimination
between alternative substrates undergoing analogous reactions occurs
when the formation of the enzyme–substrate complex is much
faster than product conversion and release (as assumed in Michaelis–Menten
kinetics). In this case, the theoretical minimum error ratio is equal
to the ratio of KM values.[16,17]
General Mechanisms for Specificity and the
Possible Trade-off with Rate
Discrimination among substrates
can arise from different affinities in the initial enzyme–substrate
complexes (ground-state discrimination) or in the transition-state
complexes (catalytic or transition-state discrimination).[18] The accuracy (A) for a cognate
vs alternative substrate can be increased by three scenarios: (a)
higher rate of substrate association (ground-state discrimination
with koncog > konalt), (b) lower rate of substrate dissociation
(ground-state discrimination with koffcog < koffalt), or (c) higher rate of conversion of the enzyme–substrate
complex into the transition state (transition-state discrimination
with kcatcog > kcatalt).In ground-state discrimination,
lowering the energy of the enzyme–substrate complex has two
effects, namely decreasing KM as well
as decreasing kcat. In other words, although
selectivity may be improved via increased substrate affinity, the
reaction rate suffers. Examples of enzymes exhibiting ground state
discrimination include DNA methyltransferases and the ribosome.[18] The trade-off between accuracy and rate might
impose an evolutionary constraint limiting selectivity.[18] Indeed, selection for activity on one substrate
does not seem to induce high selectivity by itself,[19] and therefore, negative selection against undesired substrates
is used when engineering new enzymes.[20,21] Interestingly,
a trade-off between rate and accuracy created by ground-state discrimination
would contradict the idea that early, less optimized ribozymes or
enzymes were more promiscuous.On the other hand, in transition-state
discrimination, which tends
to apply to relatively small substrates (e.g., DNA polymerases[18,22−25]), lowering the activation barrier increases kcat without necessarily affecting KM. Thus, in principle, transition-state discrimination might achieve
higher selectivity at high activity because there is not necessarily
a trade-off between accuracy and rate. In addition, nonequilibrium
mechanisms driven by release of chemical energy may improve selectivity
with or without a trade-off between accuracy and rate.[26] Furthermore, such mechanisms can allow accuracy
to surpass the theoretical thermodynamic limit based on binding energies.
For example, in kinetic proofreading,[16,27] discrimination
between two possible substrates is achieved by the presence of one
or more irreversible steps in the reaction pathway, whose rate(s)
are biased by the identity of the substrate. These steps are made
irreversible by consumption of chemical energy, and concatenation
of such steps could be used to achieve arbitrarily small error ratios,
in principle. Some biological processes can afford high specificity
by using this mechanism.[16] For example,
although the valine concentration in vivo is ∼5-fold
higher than that of isoleucine, and isoleucyl-tRNA synthetase favors
the reaction with isoleucine over valine by only ∼100-fold,
the rapid hydrolysis of misincorporated valine-tRNA decreases the
error ratio to 1 in 3000. While kinetic proofreading can increase
reaction specificity substantially, this comes with a relatively high
energetic cost.[18,28]However, in the absence
of proofreading mechanisms, substrate specificity
is inherently limited due to physicochemical reasons. Indeed, a recent
survey of the BRENDA database (The Comprehensive Enzyme Information
System[29]) suggests discrimination is usually
much lower than the theoretical maximum.[14] In the case of substrates differing by a single methyl group, discrimination
was found to be lower than the theoretical maximum, for 23 out of
the 25 enzymes surveyed, by typically 1–2 orders of magnitude.
Interestingly, a similar discrepancy is found in nonenzymatic, template-directed
polymerization of activated nucleotides,[17] suggesting that this phenomenon is not specific to enzymes. A discrimination
level lower than the expected theoretical maximum might reflect prioritization
of increased rate during evolution, if the enzyme is subject to an
accuracy-rate trade-off; in other words, the marginal fitness benefit
of increased specificity may come with a larger fitness decrement
due to slower rate. Thus, in general, specificity tends to be lower
than the theoretical maximum, possibly because of the costs associated
with accuracy.[18]Specificity may
appear to be quite suboptimal even for presumably
highly evolved enzymes. For example, the carboxylase enzyme Rubisco
plays an essential role in fixing atmospheric carbon dioxide into
sugars during photosynthesis. However, considering its biomass and
critical role, it is surprisingly slow and nonspecific, as oxygenation
constitutes a major side reaction. Trade-off models have been proposed
to explain the observed correlations between specificity and other
kinetic parameters,[30,31] which were recently revisited
using an extended data set.[32] A strong
correlation was found between the catalytic efficiencies for carboxylation
and oxygenation, indicating that lowering the effective CO2 addition energy barrier (i.e., faster carboxylation) entails a similar
reduction in the effective O2 addition energy barrier (i.e.,
faster oxygenation). Therefore, the accuracy of Rubisco appears to
be highly constrained.
Promiscuous vs Multispecific
Enzymes
The term “catalytic promiscuity” was
originally used
to refer to enzymes known to catalyze more than one type of reaction.[9,33] However, in practice, “promiscuity” has not been well-defined
and thus has been used to refer to fundamentally different phenomena.[14,34] Generally, catalytic promiscuity refers to the
capability of enzymes to catalyze reactions mechanistically different
from the primary biological reaction,[33] and substrate promiscuity refers to the capability
of enzymes to transform different substrates.[35] These terms warrant additional consideration here, as their usage
varies and can depend on incomplete knowledge.The native function
of an enzyme refers to the physiologically relevant chemical transformation
and substrate for which an enzyme has evolved. Native function is
selected for and contributes to organismal fitness. In this context,
any physiological functions for which an enzyme has evolved are considered
native, even if they are not the enzyme’s primary function.
For example, while the primary function of aminoacyl-tRNA synthetases
is to catalyze the attachment of tRNAs to their respective amino acids,
some also catalyze generation of 5′,5′-diadenosine tetraphosphate
in a reaction that appears to be physiologically relevant,[36] and thus this additional function would be considered
native. In practice, whether a particular function contributes to
organismal fitness may be difficult to assess.It is nowadays
well-accepted that many, if not most, enzymes have
multiple side activities.[10,37,38] However, such side activities may or may not be a product of evolution.
In the evolutionary biochemistry literature, promiscuity refers to side activities that are non-native (i.e., not evolved),
so the alternative transformation or substrate is fortuitous. By definition,
there is no evolutionary pressure on non-native activities, as they
do not impact organismal fitness (e.g., alternative substrates are
not available in the cell).[37,39,40] For biologically evolved enzymes, promiscuity (as defined by evolutionary
biochemists) is nearly impossible to ascertain in practice because
we do not know what past environments and selective pressures might
have applied to the protein. If promiscuity of a naturally evolved
enzyme is suspected, we suggest use of the term apparent promiscuity, in contrast to true promiscuity, to acknowledge
this uncertainty.Interestingly, there are two special scenarios
in which true promiscuity
can indeed be characterized unequivocally. First, if the enzyme transforms
a man-made compound not present in nature, the enzyme could not have
evolved this activity, and the activity must be non-native. Examples
are the atrazine chlorohydrolase and melamine deaminase enzymes, which
degrade the man-made compounds atrazine and melamine, respectively.
Despite very high similarity (98% identity), both enzymes show little
activity on the alternative substrate.[12] While it is likely that their host strains evolved the atrazine
or melamine degradation function in response to environmental exposure
(both strains were isolated from areas contaminated by the substrate[12,41]), it may be presumed that neither strain experienced both contaminants
simultaneously. If so, these enzymes can be considered as lacking
in true promiscuity.[42,43] The second scenario in which
true promiscuity might be determined is in the case of in
vitro evolved enzymes and ribozymes, in which the different
environments and selective pressures applied to the sequences are
known.An important contrast to promiscuous enzymes, whose side
reactions
are non-native, is multispecific enzymes (or broad-specificity
enzymes), which evolved to perform many native transformations, such
as on a broad range of available substrates. These enzymes are characterized
by small accuracy values, with different substrates having similar kcat/KM. For example,
theta class glutathione transferases from various species can catalyze
the conjugation of the tripeptideglutathione to a variety of electrophilic
substrates.[44] The enzyme family of cytochromes
P450 metabolizes a variety of different substrates, with activities
including biosynthesis of steroids, fatty acids, or fat-soluble vitamins
as well as the degradation of herbicides and insecticides. In particular,
cytochrome P450 3A4 contributes to the metabolism of approximately
50% of marketed drugs.[45−47] Most of the terpene cyclase enzymes (a.k.a. terpene
synthases) are also multispecific. For example, the class I sesquiterpene
cyclase γ-humulene synthase generates 52 different products,
of which γ-humulene constitutes less than 30% in abundance.[48,49] Methane monooxygenase oxidizes more than 150 different substrates.[50] The RecBCD nuclease, originally named Exonuclease
V, accepts both linear double-stranded DNA and single-stranded DNA
with very low specificity.[51,52] The distinction drawn
between promiscuous and multispecific enzymes hinges on whether the
additional substrates represented selective pressures on the enzyme.
While this is an important conceptual distinction, assessing whether
an enzyme is promiscuous vs multispecific may be difficult in practice
due to lack of knowledge of the evolutionary and environmental history
of the enzyme.It should be noted that additional usages of
the term “promiscuous”
also exist. Promiscuity is sometimes used to refer to the capacity
of an enzyme to transform different physiologically relevant substrates
(see ref (14) and references
therein), to be contrasted with “multifunctional enzymes”
whose side activities may be either physiologically useful or detrimental.[53] Unfortunately, this definition of promiscuity
can be contradictory to the one given earlier, in which promiscuity
refers to the capacity to perform non-native reactions.
In addition, the determination of physiological relevance is difficult
to make and again raises questions of evolutionary history. A third
usage of the term promiscuity refers to enzymes whose catalytic domain
executes multiple functions.[53−56] In this review, we have favored the definition from
the evolutionary biochemistry literature, because ribozymes are often
evolved under known conditions in vitro, allowing true promiscuity to be characterized, while physiological
relevance is unspecified and multiple domains are relatively uncommon.
Promiscuity and Evolutionary Innovation
Fortuitous side reactions of a promiscuous enzyme are believed
to be central to evolutionary innovation, as an initial kernel of
activity for a side reaction is a starting point for optimization
of the new activity by evolution.[9] In addition,
an enzyme might exhibit new side activities under new environmental
conditions (e.g., temperature, pH), and such enzymes are called condition-promiscuous enzymes. Conditional promiscuity is
a possible path for adaptation in new environments.[57−59] However, in
the absence of selective pressure, side activities would be subject
to neutral drift and may be lost if they are uncorrelated to native
functions of the enzyme.Interestingly, contrary to native functions,
which are usually tolerant of mutations, directed evolution of enzymes
has shown that the non-native functions can be greatly optimized by
just a small number of mutations.[19,55] The flip side
of this so-called plasticity is that newly evolved non-native functions
are typically not tolerant to mutations. One might therefore suspect
that evolutionary robustness, if it is observed for native functions,
likely evolved as a trait or correlate of a selected trait. In addition,
the duality of plasticity for non-native functions and robustness
for native functions implies that evolutionary optimization of non-native
functions might not always lead to a significant decrease of the original
native function. However, in the absence of continued selection pressure
on the original function, specialization has been shown to occur due
to trade-offs during selection of the secondary function, even without
negative selection against the original function.[60]The idea that enzyme evolution and promiscuity are
connected goes
back to the mid-1970s, when Yčas and Jensen proposed, independently,
the first model for enzyme evolution.[9,61] This general
model hypothesizes that primitive life had minimal gene content and
the number of available enzymes was limited. It posits that primordial
enzymes may have been less specific, being able to catalyze broad
classes of reactions on a variety of substrates. Gene duplication
and mutation would have then increased genetic diversity, leading
to the divergence of new enzymes whose secondary activities might
give an evolutionary advantage under newly encountered selective pressures.[62,63] This hypothesis is widely accepted, although direct evidence is
scant.[14,55] The molecular processes and evolutionary
forces involved in the biological evolution of enzymes are very difficult
to reconstruct, and hence the mechanisms under which duplication and
specialization events shape enzyme evolution have been the subject
of much debate.[64] Nevertheless, understanding
how specificity and promiscuity arise during in vitro selection and evolution of ribozymes, recapitulating an origin of
life, can address this problem experimentally.
Promiscuity
and the Fitness Landscape
The fitness landscape is a well-studied
conceptualization of evolution
through the space of all possible sequences (sequence space).[65,66] Each point of sequence space is specified by a sequence and its
associated fitness (e.g., activity on a given substrate), giving the
fitness “landscape” in sequence space. At each point
in sequence space, one might also imagine the large chemical space
of possible substrates, and an activity profile for that sequence
over substrate space, which reflects the promiscuity of the sequence.Properties of the fitness landscape are not necessarily expected
to correlate with properties of the promiscuity profile. Different
fitness landscapes over sequence space can give rise to the same promiscuity
profile in substrate space (Figure ). In general, optimization for higher activity need
not correspond to increased specificity. However, specific mechanisms,
such as a trade-off between rate and specificity, could produce correlations
between the fitness landscape and the associated promiscuity profiles.
The idea that early, nonoptimized ribozymes were particularly promiscuous
would translate into a correlation in which highly active sequences
on the fitness landscape have lower promiscuity compared to less active
sequences. Whether the specificity of a ribozyme can be improved through
mutation or evolution would depend not only on the specificity of
that individual ribozyme, but also on the fitness landscapes for the
cognate and alternative substrates (Figure ).
Figure 1
Fitness landscapes and promiscuity profiles.
Different fitness
landscapes (A–D) can correspond to the same promiscuity profile
(E–H). Vertical dashed lines (a–g) correspond to different
ribozyme sequences. Ribozyme fitness landscapes (A–D) for two
substrates may differ (blue and red) with or without overlap. The
promiscuity profile (E–H), depicted here for two substrates
(1: blue and 2: red) depends on the sequence tested, as seen in the
comparison among sequences a–c in panel E. In addition, similar
promiscuity profiles can be derived from qualitatively different fitness
landscapes. Compare sequence a from (A, E) with sequence d from (B,
F), sequence b from (A, E) with sequence e from (C, G), and sequence
c from (A, E) with sequence g from (D, H). While ribozymes a and d
have similar promiscuity profiles, their evolutionary potential is
strikingly different. Ribozyme a could evolve through mutations to
specialized activity, but ribozyme d is already at a local maximum
and has no evolutionary potential for increasing activity. Similarly,
ribozymes b and e have the same promiscuity profile, but only ribozyme
b has the possibility to evolve into a sequence of higher activity
and selectivity. Ribozymes c, f, and g are highly specific, but unlike
ribozymes f and g, ribozyme c has increased potential to evolve into
a promiscuous ribozyme.
Fitness landscapes and promiscuity profiles.
Different fitness
landscapes (A–D) can correspond to the same promiscuity profile
(E–H). Vertical dashed lines (a–g) correspond to different
ribozyme sequences. Ribozyme fitness landscapes (A–D) for two
substrates may differ (blue and red) with or without overlap. The
promiscuity profile (E–H), depicted here for two substrates
(1: blue and 2: red) depends on the sequence tested, as seen in the
comparison among sequences a–c in panel E. In addition, similar
promiscuity profiles can be derived from qualitatively different fitness
landscapes. Compare sequence a from (A, E) with sequence d from (B,
F), sequence b from (A, E) with sequence e from (C, G), and sequence
c from (A, E) with sequence g from (D, H). While ribozymes a and d
have similar promiscuity profiles, their evolutionary potential is
strikingly different. Ribozyme a could evolve through mutations to
specialized activity, but ribozyme d is already at a local maximum
and has no evolutionary potential for increasing activity. Similarly,
ribozymes b and e have the same promiscuity profile, but only ribozyme
b has the possibility to evolve into a sequence of higher activity
and selectivity. Ribozymes c, f, and g are highly specific, but unlike
ribozymes f and g, ribozyme c has increased potential to evolve into
a promiscuous ribozyme.
Quantifying
Promiscuity: The Promiscuity Index
Several possible methods
exist to quantify substrate specificity.
Here, we describe the promiscuity index (I) proposed
by Nath and Atkins, which is a metric similar to a normalized information
entropy[67] (eq ):where N is the number of
substrates that can be transformed and e corresponds to their individual associated catalytic
efficiencies. Due to the normalization, this metric goes from 0 (only
uses one substrate) to 1 (equally efficient on all N substrates).While this promiscuity index is simple and intuitive,
it might be strongly influenced by the experimenter’s choice
of substrates to test. In particular, when comparing promiscuity indices
for different ribozymes or enzymes, one sequence might appear more
promiscuous only because many chemically similar substrates were assayed.
To account for this problem, a weighted promiscuity index (J) factoring in substrate similarity can be calculated (eq ).[67] Chemical similarity can be calculated using a bitwise dissimilarity
metric between a pair of substrates (δ), which is based on the
presence or absence of a number of different functional groups.Any method for quantifying promiscuity from experimental data is
likely to be biased in at least two ways. First, there is an experimental
bias in the selection of substrates (e.g., synthetically accessible,
similar to known substrates). For comparisons among enzymes, differences
in these biases might affect the promiscuity index calculated, even
when using the weighted value. Second, these metrics do not consider
the chemical context in which an enzyme functions. If the environment
never provides a certain substrate, it may not be justifiable to include
such a substrate in the calculation even if the enzyme has nonzero
activity on it in vitro. Additionally, the relationship
between chemical similarity and promiscuity has not been well-established,
and often little difference between unweighted (I) and weighted (J) values has been observed.[67,68] Other metrics for promiscuity also exist, such as a measure based
on structural information on the catalytic residues.[69,70]Despite these limitations, the promiscuity index serves as
a starting
point for characterization and comparison of substrate specificities.
In this review, we calculate promiscuity indices for ribozymes for
which sufficient data is available in the literature.[71,72] However, when necessary, catalytic rates were used in place of catalytic
efficiency when the catalytic efficiency was inappropriate or unknown.
Quantitative metrics like the promiscuity index provide the opportunity
to compare the specificity of different molecules and potentially
study the relationship between promiscuity and other measurable characteristics,
such as activity.
Ribozymes Illustrating Promiscuity
In this section, we first describe substrate promiscuity using
aminoacylation ribozymes, for which different substrates have been
studied in some depth. Then, to gain mechanistic insight into a specific
case, we turn to the hammerhead ribozyme, where specificity can be
understood in terms of RNA annealing. An important consequence of
this mechanism is that promiscuity is dependent on environmental conditions,
such as temperature. The expression of promiscuity under new conditions
(conditional promiscuity) is a possible mechanism for uncovering latent
side activities. We then follow a series of in vitro evolution experiments seeking an RNA replicase, in which the presumption
of promiscuous activity was essential to the design and success of
the experiments. We next describe a different kind of promiscuity,
catalytic promiscuity, in a case of a nucleotide synthase ribozyme
that unexpectedly possesses two distinct catalytic mechanisms. Then,
we end this section with a brief discussion of the ribosome, a proteinaceous
ribozyme whose promiscuity appears to be unparalleled.
Substrate Promiscuity: Aminoacylation Ribozymes
Aminoacylation
of tRNA is a key step in protein synthesis, and
high selectivity for tRNA-amino acid pairs is crucial for the stability
of the genetic code.[73] It is presumed that
ribozymes carried out aminoacylation reactions in the earliest stages
of the evolution of the translation apparatus. Indeed, several aminoacyl-RNA
synthase ribozymes have been identified through in vitro selection, which use a variety of activated amino acid substrates[74−77] (Figure ). These
aminoacylating ribozymes show a range of specificities for the substrate
side chain. For example, selection using a phenylalanine adenylate
substrate 1 produced ribozymes that showed little discrimination
(i.e., promiscuous ribozymes) as well as ribozymes showing a strong
preference for aromatic amino acids.[78] Although
they are derived from the same selection, these ribozymes have quite
different promiscuity profiles and indices (Figure , Table ). Selection for aminoacylation with coenzyme A (CoA)
thioester 2 produced ribozymes that could function with
other CoA thioesters, but required the presence of a free α-amino
group.[76] None of these ribozymes match
the specificity of the aminoacyl-tRNA synthetase enzymes found in
modern biochemistry. This discrepancy cannot be the result of a trade-off
between activity and specificity, because the ribozymes are generally
much less efficient (∼1000-fold) than the corresponding enzymes.[79,80] Instead, the general finding of promiscuity and the variation of
specificities found among these ribozymes are consistent with the
understanding that newly evolved sequences are not necessarily specific
if they have not been selected for specificity.
Figure 2
Substrates for aminoacylation
ribozymes. Phenylalanyl-adenosine
monophosphate[74,78] (1), biocytinCoA[76] (2), biotinyl-Tyr(Me)-oxazolone[77] (3), amino acid cyanomethyl ester,[90] (4) and amino acid 3,5-dinitrobenzyl
ester[90] (5). Substrates 4 and 5 are flexizyme substrates. The amino acid
backbone is depicted in green; side chains are depicted in blue, and
leaving groups are depicted in red. R indicates possible chemical
variation in the side chain.
Figure 3
Promiscuity
profiles for two aminoacylation ribozymes. Promiscuity
profiles for ribozyme 77 (blue squares) and ribozyme 29 (pink circles)
show catalytic rates for each tested amino acid substrate,[78] ordered by hydrophilicity as defined by Hopp
and Woods[91,92] (Phe = −2.5, Tyr = −2.3, Ile
= −1.8, Ala = −0.5, Gln = 0.2, and Ser = 0.3). Also
see Table .
Table 1
Promiscuity Indices Calculated for
Two Aminoacylation Ribozymes (77 and 29)
substrate
side chaina
CIDa
k (M–1 min–1)b
Ic
Jc
ribozyme 77
phenylalanine
6140
60 000
0.376
0.439
tyrosine
6057
40 000
isoleucine
6306
1.5
alanine
5950
3.4
glutamine
5961
5.3
serine
5951
0.9
ribozyme 29
phenylalanine
6140
140
0.810
0.807
tyrosine
6057
220
isoleucine
6306
20
alanine
5950
1000
glutamine
5961
650
serine
5951
600
Amino acids (CID:
PubChem Compound
Identifier) were used to determine similarities for calculation of J.[67,71,71] The substrates used are aminoacyl adenylates with the side chain
indicated.
Rate constants
are from Illangasekare
et al.[78]
Both the unweighted (I) and weighted
(J) promiscuity indices were calculated
from the rate constants (k) shown.
Substrates for aminoacylation
ribozymes. Phenylalanyl-adenosine
monophosphate[74,78] (1), biocytinCoA[76] (2), biotinyl-Tyr(Me)-oxazolone[77] (3), amino acid cyanomethyl ester,[90] (4) and amino acid 3,5-dinitrobenzyl
ester[90] (5). Substrates 4 and 5 are flexizyme substrates. The amino acid
backbone is depicted in green; side chains are depicted in blue, and
leaving groups are depicted in red. R indicates possible chemical
variation in the side chain.Promiscuity
profiles for two aminoacylation ribozymes. Promiscuity
profiles for ribozyme 77 (blue squares) and ribozyme 29 (pink circles)
show catalytic rates for each tested amino acid substrate,[78] ordered by hydrophilicity as defined by Hopp
and Woods[91,92] (Phe = −2.5, Tyr = −2.3, Ile
= −1.8, Ala = −0.5, Gln = 0.2, and Ser = 0.3). Also
see Table .Amino acids (CID:
PubChem Compound
Identifier) were used to determine similarities for calculation of J.[67,71,71] The substrates used are aminoacyl adenylates with the side chain
indicated.Rate constants
are from Illangasekare
et al.[78]Both the unweighted (I) and weighted
(J) promiscuity indices were calculated
from the rate constants (k) shown.The aminoacylating ribozymes discussed
above have also been observed
to catalyze reactions using alternative nucleophilic substrates to
generate amide bonds in addition to esterification. In particular,
a minimized, 29 nucleotide version of an aminoacylating ribozyme that
utilizes Phe-AMP was found to catalyze successive reactions: aminoacylation
of the RNA and the subsequent amide bond formation to generate a conjugated
peptide.[81,82] The rate of peptide formation was approximately
13-fold less than that for aminoacylation, but this difference could
be tuned. Extending the 3′ tail of the RNA by three nucleotides
resulted in a 3-fold reduction in the rate of aminoacylation and a
2-fold increase in the rate of peptide formation, presumably by increasing
the flexibility around the active site.The potential promiscuity
of aminoacylating ribozymes is highlighted
by the “flexizymes” developed by Suga and colleagues,
so named for their flexibility in accommodating a variety of substrates.
These ribozymes were generated over a series of selections with the
ultimate goal of producing catalysts capable of charging tRNAs with
a wide variety of both natural and non-natural substrates. The starting
point of the selection was a library containing a 5′ random
region and a 3′ tRNA. Ribozymes were selected for their ability
to aminoacylate the 3′ terminus of the conjugated tRNA. This
first selection produced ribozymes with a high level of specificity
to both the tRNA and phenylalanine substrates.[75,83,84] To broaden the tRNA substrate range, further
design and selection was performed with an alternative tRNA sequence,
which resulted in ribozymes capable of accepting a variety of tRNAs.[85] These early flexizymes exhibited high affinity
for the aromatic side chains. Using the ribozyme’s affinity
to the aromatic group to broaden the side chain specificity, the initial
substrate 4 was redesigned to substrate 5, which contains a 3,5-dinitrobenzyl ester as the leaving group in
the aminoacylation reaction. The idea was that this leaving group
could be kept constant, ensuring affinity to the ribozyme, while the
side chain itself was varied. This substrate necessitated an altered
reaction mechanism, but nevertheless, the strategy was successful,
with further selection resulting in ribozymes capable of charging
tRNAs without regard to amino acid side chain.[85] More recently, flexizymes have been used to charge tRNAs
with various non-natural amino acids, including d-amino acids,
β-amino acids, and α-hydroxy acids, and 3′-aminoacyl-NH-tRNA
can also be charged.[86−89] Although the flexizyme does exhibit a minor degree of side chain
specificity, yields with the non-natural analogues often rival those
for the l-amino acids used in the initial selections. The
additional substrates represent both promiscuous (non-native) as well
as native activities. Overall, the flexizyme demonstrates the surprisingly
broad substrate specificity that can be evolved and designed when
substrate generality is a desired goal.
Conditional
Promiscuity: The Hammerhead Ribozyme
Due to their historical
importance in the discovery of ribozymes,
much is known about the self-cleaving ribozymes, which function through
general acid–base catalysis.[175] The
ribozyme fold brings the reactant nucleotides to the vicinity of the
cleavage site, with the catalytic strand acting as the general base
or acid to activate the nucleophile or stabilize the leaving group,
respectively (Figure ). Many of these ribozymes can also catalyze the same transesterification
reaction in reverse, using nucleophilic attack from a 5′-hydroxyl
to ligate two substrate strands,[93] which
represents a possible case of catalytic promiscuity, a phenomenon
discussed in Section in the context of a different ribozyme. In this section,
we focus on the substrate promiscuity of a self-cleaving ribozyme
and how it arises. Although these ribozymes are cis-acting in vivo, they can be engineered to accept
oligonucleotide substrates in trans with multiple
turnover. While there are numerous self-cleaving ribozymes, here we
confine our discussion to the case of the hammerhead ribozyme, a naturally
occurring ribozyme found in plant viroid transcripts,[94] for which the specificity of trans-acting
variants has been extensively investigated.
Figure 4
Proposed mechanism of
RNA self-scission by general acid–base
catalysis. A general base promotes deprotonation of the 2′-hydroxyl
of the nucleophile, initiating formation of the cyclic intermediate.
A general acid stabilizes the 5′-hydroxyl leaving group, allowing
resolution of the intermediate to generate the cleavage products.
Proposed mechanism of
RNA self-scission by general acid–base
catalysis. A general base promotes deprotonation of the 2′-hydroxyl
of the nucleophile, initiating formation of the cyclic intermediate.
A general acid stabilizes the 5′-hydroxyl leaving group, allowing
resolution of the intermediate to generate the cleavage products.The trans-acting hammerhead ribozyme
can be engineered
from the cis-acting ancestor by removing a nucleotide
loop of one helical arm, thereby creating a cleavable substrate strand
and a catalytic strand.[95−99] In such constructs, the catalytic strand can bind and cleave substrate
strands with multiple turnover. In particular, separation of stem
I from stem III of the ribozyme (the I/III construct) is most widely
studied[100−102] because this construction places most of
the conserved nucleotides in the catalytic strand (Figure ). This allows substrate specificity
in the substrate strand to be probed.
Figure 5
Structure of a trans-acting
I/III hammerhead ribozyme
HH16. The catalytic strand is shown in black, and substrate is shown
in green.[103] The red arrow indicates the
cleavage site.
Structure of a trans-acting
I/III hammerhead ribozyme
HH16. The catalytic strand is shown in black, and substrate is shown
in green.[103] The red arrow indicates the
cleavage site.There are two main expectations
for sequence specificity of the
substrate in a trans-acting construct. First, residues
critical for the catalytic mechanism are expected to be relatively
intolerant to mutations, which would primarily affect kcat.[104] Second, aside from
critical residues, promiscuity for the substrate is expected to be
determined by binding interactions (Km) between enzyme and substrate, namely base-pairing, which can lead
to large variation in dissociation rates among different substrates.
In the HH16 ribozyme (Figure ), substrate affinity, which was dominated by stem III, was
very high, implying a low dissociation rate, such that truncation
from the 3′ end, down to a 2-nucleotide version of stem I,
had little effect on the overall rate of cleavage.[103] Specificity in either stem I or III of the substrate was
therefore only observed when stem III was destabilized to give a dissociation
rate that was on par with or slower than the overall cleavage rate.
Conversely, extending the recognition sequence reduced specificity,[105] in keeping with the idea that, if substrate
dissociation is slow relative to cleavage, mutations in the substrate
are tolerated because the bound complex is sufficiently populated.
In terms of the active site itself, the hammerhead ribozyme has limited
substrate promiscuity; substitution of the reactive phosphate with
thiophosphate greatly reduces kcat.[106]The example of the hammerhead ribozyme,
particularly the sequence
dependence of the substrate, illustrates, at a molecular level, the
property of conditional substrate promiscuity, in
which the apparent promiscuity depends on the environmental condition.
Variants having longer binding regions or higher substrate affinity
can tolerate weakening (or strengthening) of binding without much
change in population of the bound state, and therefore are relatively
insensitive to mutations and have high apparent promiscuity. On the
other hand, variants that exist on the threshold of binding can display
high specificity as they are sensitive to small changes in dissociation
rate. Thus, exhibition of promiscuity depends on conditions such as
substrate concentration, pH, ionic strength, and temperature. Conditional
promiscuity can be the basis for cryptic genetic variation, in which
an altered phenotype is uncovered in new environments. Thus, it is
likely to be underappreciated in the literature due to observational
bias, because most experimental studies tend to focus on a small set
of reaction or environmental conditions. This is an area ripe for
future research given the likely importance of conditional promiscuity
for evolutionary innovation.
Convergent Mechanism, Convergent
Promiscuity:
A Tale of Two “Capping” Ribozymes
The influence
of mechanism on promiscuity, illustrated by the hammerhead ribozyme,
is exemplified in a comparison of two independently derived ribozymes
that share a common mechanism. These ribozymes, isolated under different
selection conditions, promote the formation of a phosphate-phosphate
anhydride bond between the terminal phosphate of a nucleotide and
the 5′-α-phosphate of RNA. The final product is similar
to the 5′ cap found on eukaryotic mRNAs.These two RNA
capping ribozymes, the Iso6 and 6.17 ribozymes, were discovered in
the Yarus and Unrau groups, respectively.[107,108] Interestingly, both ribozymes were isolated from different selections
for which this capping reaction was not the desired function. Iso6
was recovered from a selection originally designed to identify ribozymes
that could produce aminoacyl adenylates through reaction between amino
acids and triphosphorylated RNA. Instead, pyrophosphate release was
observed in the absence of amino acids, and the selection pool even
developed labeling with PPi. Selection for capping activity
using UTP instead of PPi quickly resulted in high activity
in the pool and the identification of Iso6.[107] On the other hand,the 6.17 ribozyme derived from a selection initially
designed to identify polymerase activity by incorporating labeled
UMP into a primer annealed to a poly(A) template. The resultant ribozyme
with the fastest kinetics, 6.17, instead was found to act on the 5′
end of the RNA, forming a 5′–5′ cap.[108]Iso6 and 6.17 display no apparent sequence
similarities and are
expected to adopt different secondary structures, consistent with
their unique origins. Despite these differences, the molecular mechanisms
for these two ribozymes appear to be surprisingly similar. Both ribozymes
are predicted to have helices that terminate at the site of capping,
with the terminal 5′ nucleotide retaining some flexibility;
this position is unpaired in Iso6 and requires wobble pairing in 6.17.
Both ribozymes also display increased activity at lower pH and require
divalent cations for activity, although Iso6 prefers Ca2+ while 6.17 tolerates Mg2+, Mn2+, and Ca2+. The ribozymes even possess similar substrate binding affinities.[108,109] Finally, both ribozymes appear to have minimal substrate requirements.
The identities of the sugar and base have little impact on activity,
despite possible hydrogen bonding interactions with these moieties.
However, decreasing the length of the phosphate chain results in a
large decrease in substrate binding,[108,110,111] indicating that the phosphate itself is responsible
for most substrate interactions.Thus, these ribozymes suggest
a common molecular mechanism for
RNA capping that permits a high degree of substrate promiscuity, provided
a small number of key features is present. The fact that two independently
evolved, structurally dissimilar ribozymes have the same requirements
supports the idea that substrate promiscuity is determined by mechanism.
In this case, evolutionary convergence on the same mechanism resulted
in convergence to similar promiscuity as well.[112]
Relying on Promiscuity:
Searching for an RNA
Replicase
For those interested in the origin of life, one
of the most sought-after de novo ribozyme functions
is catalysis of template-directed RNA polymerization (an “RNA
replicase”), which is thought to be important, if not essential,
to a self-replicating RNA system. One of the major avenues for this
search has relied heavily on the promiscuity of newly evolved ribozymes.
The first ribozymes developed by use of in vitro selection
from a large pool of random sequences were RNA ligase ribozymes, including
the “class I ligase” (Figure ). The class I ligase was selected to catalyze
the ligation between a 5′-triphosphate on the ribozyme and
a 3′-hydroxyl on an RNA oligonucleotide substrate, which caused
the ribozyme to tag itself with a sequence on the substrate that was
necessary for purification and amplification.[113,114] The 3′,5′-phosphodiester bond formed during ligation
is identical to that formed during template-directed polymerization.
Modification of the original class I ligase to bind a primer-template
complex generated a ribozyme that possessed some polymerization activity,
being able to extend a primer through the incorporation of mononucleotide
triphosphates.[115,116] This reaction occurred with
92% fidelity, though activity decreased with successive nucleotide
additions, topping out at six nucleotides added after a six-day incubation.
Nevertheless, this initial finding signaled that catalytic promiscuity
of the class I ligase could potentially lead to an RNA replicase.
Figure 6
The class
I ligase ribozyme and its descendants. Structures for
the class I ligase;[133,134] the round-18 polymerase introducing
the new 3′ accessory domain (blue), which is known to interact
with the loop depicted on the lower right;[117,122] the tC19Z polymerase, introducing a new 5′ accessory domain;[123] and the 24–3 polymerase.[124] Blue regions denote new additions to the ribozyme
with point mutations marked by blue diamonds. Primer and template
oligonucleotides are shown in orange and red.
The class
I ligase ribozyme and its descendants. Structures for
the class I ligase;[133,134] the round-18 polymerase introducing
the new 3′ accessory domain (blue), which is known to interact
with the loop depicted on the lower right;[117,122] the tC19Z polymerase, introducing a new 5′ accessory domain;[123] and the 24–3 polymerase.[124] Blue regions denote new additions to the ribozyme
with point mutations marked by blue diamonds. Primer and template
oligonucleotides are shown in orange and red.Subsequent work with the class I ligase and its derivatives aimed
to increase its processivity, fidelity, and template generality. Important
progress was made through attachment of an accessory domain to the
3′ end of the ribozyme, which was selected from a 76-nt random
sequence with the idea that this domain could facilitate interaction
of the ribozyme with the primer-template complex.[117] Polymerization activity was selected for through incorporation
of tagged nucleotides opposite an attached primer. To increase the
sequence generality of polymerase activity, shorter templates were
used to reduce hybridization with the ribozyme, and different primer-template
sequences and lengths were used in different rounds of selection.
One ribozyme, isolate 10.2, was found to function without attachment
to the primer, and without recognition of a specific sequence. This
feature was conferred by the new accessory domain, which increased
binding of the primer-template complex, but polymerization activity
itself occurred with minimal change to the ligase domain. Mutagenized
versions of the 10.2 ribozyme were further selected for function on
longer single-stranded templates and in the presence of higher concentrations
of untagged nucleotides to improve fidelity. After eight more rounds
of selection, the best resultant ribozyme, termed the round-18 polymerase
(Figure ), functioned
much better with longer templates, and allowed for the extension of
up to 14 nucleotides.While catalytic promiscuity was key to
the discovery of an RNA
polymerase ribozyme, substrate promiscuity at a given template base
is highly undesirable. That is, fidelity is important for an RNA replicase,
because error rates represent a serious limit in the transmission
of information.[118] The round-18 polymerase
copied templates with a per-base fidelity of 96.7%, which corresponds
to relatively low promiscuity values (0.01–0.22; Table ). One of the major determinants
of this fidelity is misincorporation resulting from G:U wobble pairs,
which is reflected by their higher promiscuity values compared to
A and C (Table ).
While extension across a template A or C resulted in the correct addition
(U or G, respectively) in over 99% of cases, G templated with an incorporation
fidelity of 95.7%, and U templated with a fidelity of 92.1%, with
the vast majority of mismatches resulting in a G:U mispair. This type
of mispairing also appears to be the major limitation on the fidelity
of nonenzymatic replication, and may be an echo of the thermodynamic
limit on specificity.[17,119]
Table 2
Promiscuity
Index (I) and Weighted Promiscuity Index (J) for the Round-18
Polymerase
substrate
CID
kA (M–1min–1)a
kC (M–1min–1)a
kG (M–1min–1)a
kU (M–1min–1)a
ATP
5957
0.30
0.057
0.023
5.3
CTP
6176
0.02
0.008
5.4
0.0002
GTP
135 398 633
0.02
41
0.003
0.23
UTP
6133
87
0.004
0.46
0.001
fidelity
0.991
0.9996
0.957
0.921
I
0.020
0.010
0.219
0.126
J
0.020
0.010
0.220
0.125
Rate constants
(kN, for N = A, C, G,
U) are from Johnston
et al.[117]
Rate constants
(kN, for N = A, C, G,
U) are from Johnston
et al.[117]Despite the 3′ accessory domain, a major limitation
of the
round-18 polymerase continued to be low binding affinity for the primer-template
complex, which was the primary contributor to the low processivity
of the class I polymerase. The affinity also had a high degree of
variability with regard to primer-template sequence, suggesting that
further reduction of sequence specificity was still needed.[120] Further progress was achieved by selecting
directly for activity in trans using a water-in-oil
emulsion. The first product of this method was the B6.61 polymerase,
which was capable of generating sequences 20 nucleotides long.[121,122] B6.61 showed a much faster polymerization rate than its predecessor,
with an extension rate over 75 times faster for longer sequences.
While there was no significant improvement in binding to the primer-template,
this rate increase was accompanied by increased fidelity, including
a minimization of G:U wobble insertions. As with the aminoacylation
ribozymes (Table ),
this trend is a counterexample to the idea of a general trade-off
between activity and specificity.A substantial improvement
to processivity came using a similar
compartmentalization technique with the addition of a 5′ random
region with the aim of improving interactions between the ribozyme
and the primer-template complex. This yielded a 5′ accessory
domain that forms stabilizing interactions with downstream portions
of the template, thus increasing binding of the ribozyme to the template.
Randomization and selection of the template sequence strengthened
these interactions. With the optimized template, the new ribozyme,
named tC19, ultimately yielded up to 95 nucleotide extensions with
a per-base fidelity of 97.3%. However, the new interactions were largely
intermolecular base-pairing, such that activity was strongly dependent
on sequence. Selection on different templates identified four point
mutations which, when introduced into tC19 to make tC19Z (Figure ), improved the sequence
generality. These new mutations further increased the measured per-base
fidelity to 99.1%, largely due to a decrease in G insertion across
template U. The tC19Z polymerase was shown to be capable of transcribing
a functional 24 nt variant of the hammerhead ribozyme.[123]Consideration of the promiscuity of the
RNA polymerase ribozyme
raises an interesting irony: while substrate promiscuity of the incoming
monomer across a given template base is undesirable because it leads
to copying errors, substrate promiscuity with respect to the template
itself is highly desirable to obtain a ribozyme capable of copying
many different, and ideally any, sequences. Sequences of particular
concern are those with a high degree of structure that would need
to be locally melted for ribozyme access, including sequences that
comprise the ribozyme itself. Recent selections based on the RNA polymerase
ribozyme focus on improving its sequence generality. One such study
selected for the polymerase’s ability to synthesize complex
folded RNA molecules, with selection tied to the creation of two functional
aptamers, imposing pressure for sequence generality and high fidelity.[124] The most active selected ribozyme, 24–3
(Figure ), showed
a ∼100-fold increased incorporation rate through structured
sequences compared to the parent ribozyme. Likely as a result of selection
for functional molecules instead of sequence fidelity, the 24–3
ribozyme displayed a higher error rate than its predecessors, in particular
an increased tolerance for G-U wobble pairing. Despite this limitation,
24–3 was able to synthesize functional RNAs up to 76 nt long
and could perform exponential amplification of an RNA template.A different approach to overcoming the substrate generality problem
takes advantage of plasticity, which occurs when non-native functions
can be found through a relatively small number of mutations. In this
case, it was hypothesized that copying via ligation of oligonucleotides
could improve copying through structured sequences, because base-pairing
to the oligonucleotide would mitigate some of the free energy cost
of melting the template. Knowing that the RNA polymerase ribozyme
was originally derived from the promiscuous activity of an RNA ligase,
Attwater et al. engineered and evolved an ancestor of the tC19Z ribozyme
to copy templates using trinucleotide triphosphates instead of NTPs.[125] The triplet oligonucleotides use strand invasion
to unfold structured RNA sequences for improved copying. The atavistic
ribozyme t5+1 displayed reduced fidelity compared to its
NTP-using counterpart, but selection for fidelity yielded an improved
variant able to synthesize its own catalytic subunit. Interestingly,
the t5+1 ribozyme consists of a heterodimer of the catalytic
subunit and an RNA “cofactor” that assists interaction
with the primer-template complex. Both subunits are descended from
the same ancestral pool, illustrating how specialized descendants
originated from distinct domains of the ancestor.An ingenious
orthogonal strategy to overcome the problem of sequence
generality was developed by Joyce and coworkers, who reasoned that
base-pairing between ribozyme and template was the major contributor
to the energetics determining template specificity. Base-pairing is
essentially absent between d-RNA and l-RNA sequences,[126,127] and thus, a d-ribozyme is expected to have little base-pairing
interaction with l-substrates. Selection for ligase activity
indeed discovered d-ribozymes that could ligate l-RNA oligonucleotides.[128] As expected,
the non-natural, mirror-image l-ribozyme could perform the
complementary reaction using d-RNA substrates and template.
Furthermore, as with the class I ligase, these cross-chiral ligases
also possessed polymerization activity. Unlike nonenzymatic templated
RNA replication,[129] these ribozymes displayed
very little chiral inhibition, showing a high specificity for substrates
of the desired chirality. A cross-chiral ligase was efficient enough
to produce its mirror image enantiomer, which could then produce the
original enantiomer. The cross-chiral ribozymes were not entirely
sequence-general, as some substrates, such as those with 3′-terminal
C or G residues, were more efficient than others. Nevertheless, given
the precedent of the evolutionary strides demonstrated by the promiscuous
class I ligase, the cross-chiral ligases represent an intriguing starting
point for further development.Polymerase (and ligase) ribozymes
present a unique challenge in
simultaneously requiring broad template accommodation and strict fidelity.
Although this work was not undertaken for the purpose of studying
promiscuity in ribozymes, the advances made with the class I ligase,
spanning more than two decades of work by multiple groups, rely heavily
on the promiscuity and plasticity of the ribozyme. Because this lineage
of RNA polymerase ribozymes has only been selected on RNA substrates,
true promiscuity can be clearly identified if the ribozymes accept
different nucleic acids. One ribozyme displays some activity for incorporation
of non-natural sugars and nucleobases, although it often stalls if
modified nucleotides are present at specific positions.[130] Unlike other ribozymes in its lineage, the
24–3 ribozyme, perhaps as a consequence of its selection for
tolerance of different RNA aptamer templates, was observed to polymerize
DNA on an RNA template (i.e., reverse transcription), permitting extension
by up to 32 deoxyribonucleotides.[131] A
later generation of this ribozyme, 38–6, shows remarkable promiscuity,
with activity on templates or nucleotides composed of multiple combinations
of RNA, DNA, threose nucleic acid (TNA), or arabinose nucleic acids
(ANA), though with reduced activity compared to an RNA-only system.[132] 38–6 also performs DNA-templated RNA
synthesis and RNA-templated DNA synthesis more effectively than synthesis
involving TNA or ANA, likely due to the lower structural similarities.
These results further demonstrate the promiscuity of this ribozyme
lineage. While the class I ligase and its descendants constitute a
fascinating case study, it is unknown whether other ribozymes could
exhibit similar versatility.
Catalytic Promiscuity:
The Nucleotide Synthase
Ribozyme
While substrate promiscuity appears to be commonly
found among ribozymes,[108,135,136] one may ask whether true catalytic promiscuity is also observed.
Indeed, an interesting case was found in the pR1 nucleotide synthase.[137] Selected to catalyze a reaction between ribose
5-phosphate (PR) and 6-thioguanosine (6SGua), this ribozyme
was found to also be capable of catalyzing the reaction between 6SGua and 5-phosphoribosyl 1-pyrophospate (PRPP), an intermediate
in the biological synthesis of nucleotides. These two reactions appear
to have distinct reaction mechanisms and resultant products, depending
on the substrate provided. Reaction with PRPP generates a glycosidic
bond, resulting in the corresponding nucleotide, 6SGMP.
However, the reaction with PR appears to require acyclization of ribose,
allowing 6SGua to react with the corresponding aldehyde
and generate a Schiff base (Figure ). Each reaction has a unique dependence on magnesium
ion concentration, supporting the existence of two different mechanisms.
Interestingly, ribozymes selected for reactivity with PRPP instead
of PR did not exhibit analogous activity on PR. Thus, not all ribozymes
with the same function possess catalytic promiscuity. Despite the
catalytic promiscuity of pR1, the ribozyme still displays a high degree
of substrate specificity toward 6SGua, as analogous sulfur-containing
purines are not recognized, in contrast to purine synthase ribozymes
selected independently.[138] The pR1 ribozyme
demonstrates that catalytic promiscuity may differ in important ways
from substrate promiscuity. While substrate promiscuity might be readily
evolved through a relaxed binding mode, catalytic promiscuity requires
a new reaction mechanism whose spontaneous emergence might be relatively
unusual.
Figure 7
Reactions catalyzed by the pR1 nucleotide synthase ribozyme. (A)
Given a ribose 5-phosphate substrate, the acyclic form of ribose is
stabilized and 6-thioguanosine (6SGua) reacts to form a
Schiff base, which can then undergo an Amadori rearrangement. (B)
Reaction with 5-phosphoribosyl 1-pyrophospate produces the desired
nucleotide, 6SGMP. Adapted with permission from ref (137). Copyright 2009 Elsevier
Ltd. Cell Press.
Reactions catalyzed by the pR1 nucleotide synthase ribozyme. (A)
Given a ribose 5-phosphate substrate, the acyclic form of ribose is
stabilized and 6-thioguanosine (6SGua) reacts to form a
Schiff base, which can then undergo an Amadori rearrangement. (B)
Reaction with 5-phosphoribosyl 1-pyrophospate produces the desired
nucleotide, 6SGMP. Adapted with permission from ref (137). Copyright 2009 Elsevier
Ltd. Cell Press.
A Highly
Promiscuous Ribozyme: The Ribosome
The ribosome is a ribozyme
that translates genetic information
on mRNA into protein sequences and is conserved across all the domains
of life. The ribosome core consists of catalytic RNA, but farther
from the catalytic center, both proteins and RNA are found.[139] In eukaryotes, four rRNAs (rRNAs) associate
with about 70 proteins, while in E. coli, the ribosome
consists of 3 rRNAs and 52 proteins.[140,141] Because the
ribosome is a ribozyme and conserved across all domains, it is presumed
to have existed in the last universal common ancestor (LUCA), and
is also taken as circumstantial evidence of the RNA World.[142] Protein translation necessitates high fidelity,
with error rates of the overall process on the order of 10–4 per codon. Fidelity is primarily maintained through factors other
than the ribosome, such as aminoacyl-tRNA synthetase editing and EF-Tu
binding.[143,144] While the peptidyl transferase
center of the ribosome provides some steric selectivity (e.g., preferring l- rather than d-amino acids[145]), the ribosome itself is a surprisingly promiscuous molecule overall,
permitting a wide assortment of substrates so long as there is a correct
codon-anticodon match.The ribosome accepts two aminoacyl-tRNAs
at a time and catalyzes the formation of a peptide bond between the
amino acids, releasing an uncharged tRNA and retaining a peptidyl-tRNA.[141] The ribosome must accommodate a large variety
of substrates: there are 20 canonical amino acids that can be associated
with 50 or more different tRNAs, depending on the species.[146] Even if one only considers two canonical amino
acids coming together to be joined by a peptide bond, there are 400
possible substrate permutations that the ribosome must accommodate
and catalyze. This level of multispecificity is essential for the
production of all extant proteins in the organism. In addition to
accommodating different amino acids and peptides in the active site,
the ribosome must also accommodate different peptides in the exit
tunnel. Interestingly, the exit tunnel is lined primarily by RNA and
lacks significant patches of hydrophobicity, creating a “nonstick”
character that allows peptides through regardless of sequence.[139]The ribosome is similarly multispecific
with respect to the mRNA
templates, on which there are minimal sequence restrictions. Following
initiation, which does involve sequence-specific interactions in some
organisms, ribosome binding to mRNA is primarily facilitated through
interactions with the mRNA backbone.[147] However, the ribosome does display some slight substrate preferences.
Early research on the ribosome, for example, discovered roughly 2-fold
higher reactivity with leucine than phenylalanine.[148] Additionally, ribosomes display codon preferences that
can alter the elongation rate,[149] a property
which is used for regulation of gene expression. Still, the degree
to which ribosomes are capable of utilizing a wide variety of substrates,
including many noncanonical amino acids,[150] representing promiscuous activity, is truly striking.As with
the RNA polymerase ribozymes, the substrate promiscuity
of the ribosome must coexist with a requirement for high fidelity
of information transfer. The promiscuity of the ribosome is tolerated
by the cell in part because translation fidelity is handled during
aminoacylation of tRNAs, including proofreading processes.[151] In the ribosome, cognate and noncognate tRNAs
can be distinguished through minor differences in base-pairing to
mRNA. Recognition of the cognate tRNA leads to a structural change
that is identified by elongation factor proteins which permit translation
to proceed.[152] Another restriction imposed
on the incoming tRNA is the 3′ terminal CCA sequence, which
forms specific interactions with the ribosome.[139,147] This CCA sequence is required, and occasionally sufficient, for
peptidyl transfer to occur.[148,153] A small number of
important interactions between tRNAs and the ribosome provide high
fidelity of translation while permitting minimal restrictions on the
mRNA or protein sequences.The innate promiscuity of the ribosome
is occasionally exploited
by nature. One such example is puromycin, an antibiotic produced by
the bacteria Streptomyces alboniger. Puromycin is
an aminonucleoside, containing nucleoside and amino acid analogues,
linked through an amide bond instead of the conventional ester. This
structure mimics the 3′ terminus of a charged tRNA, which allows
it to enter the ribosome and be irreversibly incorporated into the
nascent polypeptide, terminating translation.[154,155] The efficacy of this molecule suggests that evolutionary escape
from this promiscuous activity has been difficult despite the selective
pressure engendered by the antibiotic.Synthetic biologists
have also taken advantage of the substrate
promiscuity of the ribosome, fundamentally altering the genetic code
itself. tRNAs recognizing the amber stop codon can be charged with
noncanonical amino acids. Because the ribosome is, to a first approximation,
essentially agnostic with respect to the side chain of the incoming
monomer, the amber codon is translated into the new amino acid.[156,157] Amino acids with a remarkably diverse set of unnatural functional
groups have been successfully incorporated by this method, including
alkanes, polybenzenes, sugars and phosphate-containing species.[158,159] The ribosome can even catalyze the formation of ester bonds, yielding
mRNA-encoded polyesters, without mutation in the ribosome itself.[86] Further evolution can push this versatility
further, as seen with the ribosome variants ribo-Q1 and ribo-X, which
translate quadruplet codons and thus introduce many “blank”
codons to the genetic code.[160−162] Although it was postulated early
on that the genetic code might be a “frozen accident”,[4] it now seems clear that the code itself has been
the subject of evolution, as evidenced by the different version of
the code found in mitochondria[163] as well
as statistical analyses suggesting that the code has evolved to minimize
the biophysical impact of mutations.[164] The evolvability and malleability of the genetic code attests to
the remarkable combination of substrate promiscuity and informational
fidelity in the ribosome.
Primordial
Ribozymes: More Promiscuous?
We now return to a question
posed at the beginning of this review:
would primordial ribozymes be particularly promiscuous? There are
two reasons why one might hypothesize this. First, given the importance
of promiscuity for evolutionary innovation, one may suppose that primordial
ribozymes might have been more promiscuous than highly evolved enzymes
due to evolutionary pressure for greater specificity. Second, given
the wider chemical diversity of functional groups available to proteins,
one may suppose that proteins will have both superior specificity
and activity, in general, compared to ribozymes, due to their ability
to engage in more types of interactions. Although there is insufficient
data in the literature to answer these questions definitively, here
we consider two comparisons that bear on these issues. First, we consider
whether newly evolved ribozymes are more promiscuous than highly evolved
ribozymes. Second, we ask whether ribozymes are more promiscuous than
proteins by examining a head-to-head comparison of a ribozyme and
a protein enzyme, both of which were evolved de novo.
Newly Evolved vs Highly Evolved Ribozymes
An interesting comparison can be made between the promiscuity of
ribozymes from in vitro selection, which have very
short evolutionary histories, to highly evolved ribozymes, in particular,
the ribosome. The first ribozymes to catalyze amide bond formation
were initially selected for a different activity, to catalyze the
transfer of an aminoacyl group from the 3′-hydroxyl of a short
tRNA mimic to the 5′-hydroxyl of the ribozyme.[165] However, like the ribosome, one of the selected
ribozymes was able to use an alternative nucleophilic substrate. When
the 5′-hydroxyl was substituted with an amino group, amide
bond formation was observed at a similar rate. In both cases, the
ribozyme accelerated the respective noncatalyzed reaction by over
1000-fold. Later, ribozymes were selected to perform peptide bond
formation by linking a phenylalanine to the 5′ end of the RNA
and selecting for the ability to attach a biotinylated methionine
from a 3′ acylated AMP substrate.[166] The best ribozymes from this selection displayed a rate enhancement
of ∼106. This reaction was inhibited by the presence
of AMP, but not other nucleotides or methionine, suggesting that the
ribozyme functions primarily through specific interactions with AMP.
Consistent with this, activity was also observed with leucine, phenylalanine,
and lysine substrates, with methionine and leucine being the best
peptidyl donors.These peptide synthase ribozymes, much like
the flexizyme, illustrate that if there are sufficient interactions
with other parts of the small molecule substrate, the amino acid side
chain may be quite variable. In addition, the fact that one of the
ribozymes is capable of both ester and amide bond formation, much
like the ribosome, further corroborates its promiscuity of function.
(In this case, these non-native activities represent true promiscuity
because we know the complete environmental history of the ribozyme.)
However, while it may be possible to evolve increased promiscuity
in these ribozymes, it seems a hard task indeed to match or exceed
the promiscuity of the ribosome. This comparison at least suggests
that newly evolved ribozymes are not necessarily more promiscuous
than highly evolved ones. Instead, specificity or promiscuity itself
may be a selectable trait, and natural selection may favor either
greater or lesser promiscuity.
De Novo Ribozyme vs de Novo Protein Enzyme:
The Diels–Alderases
Are protein enzymes superior to
ribozymes, such that ribozymes emerging
in the RNA world would be worse than their protein counterparts? While
it seems clear that proteins have greater activity in general, nearly
all protein enzymes have much longer evolutionary histories compared
to ribozymes, most of which have been evolved in vitro. To avoid this confounding factor, one may compare a de
novo ribozyme with a de novo protein enzyme.
Such a comparison can be made with the Diels–Alderase ribozyme
and protein enzymes, which both catalyze a reaction (Figure ) previously not known to occur
in biology.
Figure 8
Diels–Alder cycloaddition. A concerted reaction between
a conjugated diene and an alkene (dienophile) results in the formation
of a cyclized product.[172] The dienophile
substituents shown (R3 and R4) are added to
the same face of the cyclohexane ring.
Diels–Alder cycloaddition. A concerted reaction between
a conjugated diene and an alkene (dienophile) results in the formation
of a cyclized product.[172] The dienophile
substituents shown (R3 and R4) are added to
the same face of the cyclohexane ring.Interestingly, the first biochemical catalyst discovered for the
Diels–Alder reaction was a ribozyme, not a protein. While most
ribozyme reactions involve RNA or amino acid modifications and often
involve base-pairing interactions, an early discovery that demonstrated
the catalytic versatility of ribozymes was carbon–carbon bond
formation by Diels–Alderase ribozymes.[167,168] The specificity of one such ribozyme was extensively characterized
by the Jäschke laboratory through testing a series of potential
substrates.[169,170] The initial experiments selected
for cycloaddition of a biotin-maleimide to anthracene, which was conjugated
to the RNA via a polyethylene glycol linker. The ribozyme produced
from this selection could catalyze this reaction on free substrate
with a high degree of enantioselectivity. Additionally, a synthesized
mirror image of this ribozyme composed of l-nucleotides produced
the opposite enantiomer. This enantioselectivity was the result of
a “tail” group on the anthracene substrate (e.g., the
PEG linker), which restricted the molecule’s orientation in
the binding pocket. Important structural features of both substrates
include: the diene must contain three linearly annellated rings, the
dienophile must be a five-membered maleimide ring with a hydrophobic
tail, and both substrates must be arranged in a stacked, coplanar
manner.[170] These results present one of
the most rigorous characterizations of ribozyme specificity on a non-nucleotide
substrate.One decade later, a Diels–Alderase protein
enzyme was developed
by the Baker lab.[171] This enzyme was created de novo using computational design and site-directed mutagenesis
to catalyze the reaction between 4-carboxybenzyl trans-1,3-butadiene-1-carbamate
and N,N-dimethylacrylamide. Like
the ribozyme, this protein enzyme demonstrated a high level of product
stereoselectivity (>97%). The best Diels–Alderase enzymes
possessed
higher catalytic activity than the Diels–Alderase ribozymes,
but were still markedly slower than natural enzymes.Although
these catalysts were discovered through different means
(in vitro selection vs computational design), both
were created in a laboratory setting independent of natural evolutionary
influences, and therefore they are an interesting test comparison
to understand the promiscuity of de novo functions.
While the reactions catalyzed by these molecules use different substrates,
the promiscuity indices can be compared between them (Table vs Table ). Note that different values are calculated
for the diene and dienophile when possible, and that I and J varies depending on whether they are calculated
from kcat or from kcat/kuncat (catalytic power). Despite
these differences, in general, the promiscuity indices are not very
different; all values for the dienophiles lie in the range of 0.66–0.83,
with the values for the protein enzyme lying in the middle of this
range. Therefore, comparing these two de novo catalysts,
it does not appear that the ribozyme is more promiscuous than the
protein enzyme.
Table 3
Unweighted (I) and
Weighted (J) Promiscuity Indices for Diels–Alderase
Ribozymes, Calculated Separately for the Diene and Dienophile Substrates
HEG indicates hexa(ethylene
glycol).
kcat and kuncat values from
Stuhlmann and
Jäschke.[170]
Dash indicates kuncat not reported; these values could not be included
in the calculation of I or J.
Table 4
Unweighted (I) and
Weighted (J) Promiscuity Indices for the Protein
Diels–Alderase, Calculated for Dienophile Substrates
Rate values were
estimated from
Siegel et al.[171]
HEG indicates hexa(ethylene
glycol).kcat and kuncat values from
Stuhlmann and
Jäschke.[170]Dash indicates kuncat not reported; these values could not be included
in the calculation of I or J.Rate values were
estimated from
Siegel et al.[171]It is of practical interest to note that the weighted
and unweighted
promiscuity indices (Tables –4) are often not very different
from one another, as the difference between these values ranges from
0 to 0.1. This may reflect ruggedness in the promiscuity profile over
the chemical space of the substrates. The motivation for creating
a weighted index was to account for the expectation that chemically
similar substrates would have similar activity. While this must be
true to some extent, over the substrates that were tested and reported
in these examples, the additional accounting did not alter the overall
calculation by much.
Concluding Remarks
Ribozymes identified by in vitro selection or
evolution represent an ideal model system for studying true promiscuity
because the selective pressures on these ribozymes are controlled
by the experimenter and their entire evolutionary history is available
for study. In addition, the promiscuity of ribozymes in particular
is a fascinating question relating to the origin of living systems.
An attractive but untested hypothesis is that the earliest ribozymes
emerging from the prebiotic milieu of random polymers would be highly
promiscuous, presenting a kernel of activity across many functions
that could be optimized by evolution individually (e.g., after duplication
events). Although a rigorous test of this hypothesis is currently
lacking, we may consider how current knowledge informs this hypothesis
of promiscuous ribozymes.What are the likely properties of
a ribozyme selected de
novo, i.e., a primordial ribozyme? It is clear that the activity
is likely to be low initially, simply because there are more sequences
of low activity compared to high activity (i.e., the frequency of
sequences is a decreasing function of activity),[173] leaving room for optimization of activity by natural selection.
What about promiscuity? While it might seem intuitive that simple,
low activity ribozymes would have high promiscuity, we do not see
solid evidence for this so far in the literature. As discussed above,
a de novo peptide synthase ribozyme is less promiscuous than its highly evolved counterpart (the ribosome).
While there are some examples of ribozymes where in vitro evolution resulted in both improved activity and specificity (discussed
above), it is not clear that there would be a positive correlation
between activity and specificity in general. Indeed at least one mechanism
(discussed in Section ) has the opposite effect, causing a negative correlation,
i.e., a trade-off, between activity and specificity.The intuition
that there should be a positive correlation between
activity and specificity is based on the general idea that increased
molecular interactions give both increased activity and increased
specificity. This seems to be reasonable, but in one rigorous study
of RNA aptamers, activity was found to be uncorrelated to specificity.[174] It is even less clear that increased interactions
should increase specificity in ribozymes, particularly cis-acting ribozymes, because the entire reaction pathway would be stabilized.
How the ground state, transition state, and product would be affected
in relative terms is not clear. Therefore, it should not be assumed
that the earliest emerging ribozymes were particularly promiscuous.
Empirical data is required to resolve the relationship between activity
and specificity of ribozymes.The reason why the hypothesis
of promiscuous primordial ribozymes
is attractive, despite the current lack of evidence to support it,
is that it solves an important problem in prebiotic evolution. If
the first ribozyme to emerge by chance possesses the ability to catalyze
many reactions, albeit at low activity, this ribozyme could serve
as the ancestral catalyst to a suite of different reactions, rapidly
forming a metabolic network of ribozymes. In evolutionary terms, a
network of ribozymes might then arise from exaptation (or preadaptation)
of a small number of ancestral ribozymes. However, it may be that
promiscuity, rather than being an automatic property of a low-activity
primordial ribozyme, should be considered as an evolvable or fortuitous
trait itself, possibly uncorrelated to activity. In this case, the
selective pressures on the RNA world would play an important role
in shaping ribozyme evolvability.
Authors: Jorge Azpurua; Zhonghe Ke; Iris X Chen; Quanwei Zhang; Dmitri N Ermolenko; Zhengdong D Zhang; Vera Gorbunova; Andrei Seluanov Journal: Proc Natl Acad Sci U S A Date: 2013-09-30 Impact factor: 11.205