Literature DB >> 32011135

Promiscuous Ribozymes and Their Proposed Role in Prebiotic Evolution.

Evan Janzen^1,2, Celia Blanco¹, Huan Peng¹, Josh Kenchel^1,2, Irene A Chen^1,2,3.

Abstract

The ability of enzymes, including ribozymes, to catalyze side reactions is believed to be essential to the evolution of novel biochemical activities. It has been speculated that the earliest ribozymes, whose emergence marked the origin of life, were low in activity but high in promiscuity, and that these early ribozymes gave rise to specialized descendants with higher activity and specificity. Here, we review the concepts related to promiscuity and examine several cases of highly promiscuous ribozymes. We consider the evidence bearing on the question of whether de novo ribozymes would be quantitatively more promiscuous than later evolved ribozymes or protein enzymes. We suggest that while de novo ribozymes appear to be promiscuous in general, they are not obviously more promiscuous than more highly evolved or active sequences. Promiscuity is a trait whose value would depend on selective pressures, even during prebiotic evolution.

Entities: Chemical Disease Species

Mesh：

Substances：
RNA, Catalytic

Year: 2020 PMID： 32011135 PMCID： PMC7291351 DOI： 10.1021/acs.chemrev.9b00620

Source DB: PubMed Journal: Chem Rev ISSN： 0009-2665 Impact factor: 60.622

Introduction

Catalytic RNA sequences, or ribozymes, are widely accepted to have been central to the origin of life.[1,2] Their dual capacity for information storage and catalytic activity is the basis for the RNA world theory,[3−5] that an RNA-based metabolism could have preceded the more complex DNA-RNA-protein system that we observe in biology today. Regardless of whether an RNA world existed on early Earth, ribozymes represent an excellent laboratory model system for molecular evolution. Beginning with a pool of random sequences, strategies can be devised to select for particular activities. Cycles of selection and amplification by PCR allow enrichment and eventually isolation of active sequences. A prerequisite of successful in vitro evolution is the presence of one or more molecules with some activity, however slight, in the initial pool or early rounds. Once this kernel of activity exists, the active sequences can be selected and activity possibly improved by mutation during the evolutionary process. In addition to developing new ribozymes, in vitro evolution of RNA allows well-controlled experiments to observe and analyze the de novo emergence of biochemical functions.[6−8] Promiscuous catalytic activities have been invoked as being particularly significant for the origin of enzymes,[9,10] as one might intuit that early, simple ribozymes or enzymes would have little specificity and therefore might catalyze many reactions, albeit with slow rates. These sequences might possess kernels of activity for many different substrates or reactions. One landmark study of such a ribozyme is a sequence which was engineered to adopt two possible folds, one of which acted as a ligase and one of which acted as a self-cleaving ribozyme.[11] This sequence had very low activity for each function, but a relatively small number of mutations would increase function to near wild-type in both directions. Such promiscuity would promote evolutionary innovation by poising sequences at a nonzero fitness for multiple activities, each of which could be potentially optimized by natural selection. This idea also raises the interesting question of whether ribozymes are intrinsically more promiscuous than protein enzymes. From extensive work on the directed evolution of enzymes, it has become clear that much of the success of the field as a whole is due to the presence of low levels of apparently promiscuous activity in extant protein enzymes. This surprising degree of promiscuity in highly evolved enzymes suggests that promiscuity is actually the rule rather than an exception for protein enzymes. Here, we review what is known about the specificity and promiscuity of ribozymes. We first introduce major concepts and definitions in specificity and promiscuity, including how to quantify them, which were originally developed in the enzyme literature. An interesting concept is the relationship between activity and specificity, which underlies the intuition that early, relatively low-activity ribozymes would be more promiscuous. We then review several cases of ribozymes in which studies have demonstrated promiscuity in some way. When possible, we calculate a promiscuity index from what is known about these ribozymes, a first step toward rigorous comparisons of the promiscuity of ribozymes and protein enzymes. We end with a discussion of the implications of these comparisons for the hypothesis that early ribozymes were particularly promiscuous.

Promiscuity and Specificity: Concepts and Definitions

Defining Specificity

Specificity is the ability of an enzyme to discriminate between two different substrates assuming both are present. The question of how to measure enzyme specificity has been a matter of debate in the past (see refs (12 and 13) and references therein), but it is generally agreed that specificity in the presence of two different substrates should be compared based on the discrimination factor,[14] defined as the ratio of the catalytic efficiencies (kcat/KM) for the corresponding reactions. According to transition-state theory, the logarithm of the catalytic efficiency (kcat/KM) is proportional to the free energy difference between the free enzyme and substrate vs the transition-state complex (ΔG⧧).[15] When comparing the cognate with an alternate substrate, the discrimination factor is also called the accuracy A. Thus, A is exponentially dependent on the difference ΔΔG⧧ between the cognate and alternate substrates. In some cases (e.g., polymerases), the use of an error ratio (the rate of incorrect product formation divided by the rate of correct product formation) is more appropriate. To gain an intuition about the possible scale of this ratio, one may note that the theoretical maximum discrimination between alternative substrates undergoing analogous reactions occurs when the formation of the enzyme–substrate complex is much faster than product conversion and release (as assumed in Michaelis–Menten kinetics). In this case, the theoretical minimum error ratio is equal to the ratio of KM values.[16,17]

General Mechanisms for Specificity and the Possible Trade-off with Rate

Discrimination among substrates can arise from different affinities in the initial enzyme–substrate complexes (ground-state discrimination) or in the transition-state complexes (catalytic or transition-state discrimination).[18] The accuracy (A) for a cognate vs alternative substrate can be increased by three scenarios: (a) higher rate of substrate association (ground-state discrimination with koncog > konalt), (b) lower rate of substrate dissociation (ground-state discrimination with koffcog < koffalt), or (c) higher rate of conversion of the enzyme–substrate complex into the transition state (transition-state discrimination with kcatcog > kcatalt). In ground-state discrimination, lowering the energy of the enzyme–substrate complex has two effects, namely decreasing KM as well as decreasing kcat. In other words, although selectivity may be improved via increased substrate affinity, the reaction rate suffers. Examples of enzymes exhibiting ground state discrimination include DNA methyltransferases and the ribosome.[18] The trade-off between accuracy and rate might impose an evolutionary constraint limiting selectivity.[18] Indeed, selection for activity on one substrate does not seem to induce high selectivity by itself,[19] and therefore, negative selection against undesired substrates is used when engineering new enzymes.[20,21] Interestingly, a trade-off between rate and accuracy created by ground-state discrimination would contradict the idea that early, less optimized ribozymes or enzymes were more promiscuous. On the other hand, in transition-state discrimination, which tends to apply to relatively small substrates (e.g., DNA polymerases[18,22−25]), lowering the activation barrier increases kcat without necessarily affecting KM. Thus, in principle, transition-state discrimination might achieve higher selectivity at high activity because there is not necessarily a trade-off between accuracy and rate. In addition, nonequilibrium mechanisms driven by release of chemical energy may improve selectivity with or without a trade-off between accuracy and rate.[26] Furthermore, such mechanisms can allow accuracy to surpass the theoretical thermodynamic limit based on binding energies. For example, in kinetic proofreading,[16,27] discrimination between two possible substrates is achieved by the presence of one or more irreversible steps in the reaction pathway, whose rate(s) are biased by the identity of the substrate. These steps are made irreversible by consumption of chemical energy, and concatenation of such steps could be used to achieve arbitrarily small error ratios, in principle. Some biological processes can afford high specificity by using this mechanism.[16] For example, although the valine concentration in vivo is ∼5-fold higher than that of isoleucine, and isoleucyl-tRNA synthetase favors the reaction with isoleucine over valine by only ∼100-fold, the rapid hydrolysis of misincorporated valine-tRNA decreases the error ratio to 1 in 3000. While kinetic proofreading can increase reaction specificity substantially, this comes with a relatively high energetic cost.[18,28] However, in the absence of proofreading mechanisms, substrate specificity is inherently limited due to physicochemical reasons. Indeed, a recent survey of the BRENDA database (The Comprehensive Enzyme Information System[29]) suggests discrimination is usually much lower than the theoretical maximum.[14] In the case of substrates differing by a single methyl group, discrimination was found to be lower than the theoretical maximum, for 23 out of the 25 enzymes surveyed, by typically 1–2 orders of magnitude. Interestingly, a similar discrepancy is found in nonenzymatic, template-directed polymerization of activated nucleotides,[17] suggesting that this phenomenon is not specific to enzymes. A discrimination level lower than the expected theoretical maximum might reflect prioritization of increased rate during evolution, if the enzyme is subject to an accuracy-rate trade-off; in other words, the marginal fitness benefit of increased specificity may come with a larger fitness decrement due to slower rate. Thus, in general, specificity tends to be lower than the theoretical maximum, possibly because of the costs associated with accuracy.[18] Specificity may appear to be quite suboptimal even for presumably highly evolved enzymes. For example, the carboxylase enzyme Rubisco plays an essential role in fixing atmospheric carbon dioxide into sugars during photosynthesis. However, considering its biomass and critical role, it is surprisingly slow and nonspecific, as oxygenation constitutes a major side reaction. Trade-off models have been proposed to explain the observed correlations between specificity and other kinetic parameters,[30,31] which were recently revisited using an extended data set.[32] A strong correlation was found between the catalytic efficiencies for carboxylation and oxygenation, indicating that lowering the effective CO2 addition energy barrier (i.e., faster carboxylation) entails a similar reduction in the effective O2 addition energy barrier (i.e., faster oxygenation). Therefore, the accuracy of Rubisco appears to be highly constrained.

Promiscuous vs Multispecific Enzymes

The term “catalytic promiscuity” was originally used to refer to enzymes known to catalyze more than one type of reaction.[9,33] However, in practice, “promiscuity” has not been well-defined and thus has been used to refer to fundamentally different phenomena.[14,34] Generally, catalytic promiscuity refers to the capability of enzymes to catalyze reactions mechanistically different from the primary biological reaction,[33] and substrate promiscuity refers to the capability of enzymes to transform different substrates.[35] These terms warrant additional consideration here, as their usage varies and can depend on incomplete knowledge. The native function of an enzyme refers to the physiologically relevant chemical transformation and substrate for which an enzyme has evolved. Native function is selected for and contributes to organismal fitness. In this context, any physiological functions for which an enzyme has evolved are considered native, even if they are not the enzyme’s primary function. For example, while the primary function of aminoacyl-tRNA synthetases is to catalyze the attachment of tRNAs to their respective amino acids, some also catalyze generation of 5′,5′-diadenosine tetraphosphate in a reaction that appears to be physiologically relevant,[36] and thus this additional function would be considered native. In practice, whether a particular function contributes to organismal fitness may be difficult to assess. It is nowadays well-accepted that many, if not most, enzymes have multiple side activities.[10,37,38] However, such side activities may or may not be a product of evolution. In the evolutionary biochemistry literature, promiscuity refers to side activities that are non-native (i.e., not evolved), so the alternative transformation or substrate is fortuitous. By definition, there is no evolutionary pressure on non-native activities, as they do not impact organismal fitness (e.g., alternative substrates are not available in the cell).[37,39,40] For biologically evolved enzymes, promiscuity (as defined by evolutionary biochemists) is nearly impossible to ascertain in practice because we do not know what past environments and selective pressures might have applied to the protein. If promiscuity of a naturally evolved enzyme is suspected, we suggest use of the term apparent promiscuity, in contrast to true promiscuity, to acknowledge this uncertainty. Interestingly, there are two special scenarios in which true promiscuity can indeed be characterized unequivocally. First, if the enzyme transforms a man-made compound not present in nature, the enzyme could not have evolved this activity, and the activity must be non-native. Examples are the atrazine chlorohydrolase and melamine deaminase enzymes, which degrade the man-made compounds atrazine and melamine, respectively. Despite very high similarity (98% identity), both enzymes show little activity on the alternative substrate.[12] While it is likely that their host strains evolved the atrazine or melamine degradation function in response to environmental exposure (both strains were isolated from areas contaminated by the substrate[12,41]), it may be presumed that neither strain experienced both contaminants simultaneously. If so, these enzymes can be considered as lacking in true promiscuity.[42,43] The second scenario in which true promiscuity might be determined is in the case of in vitro evolved enzymes and ribozymes, in which the different environments and selective pressures applied to the sequences are known. An important contrast to promiscuous enzymes, whose side reactions are non-native, is multispecific enzymes (or broad-specificity enzymes), which evolved to perform many native transformations, such as on a broad range of available substrates. These enzymes are characterized by small accuracy values, with different substrates having similar kcat/KM. For example, theta class glutathione transferases from various species can catalyze the conjugation of the tripeptide glutathione to a variety of electrophilic substrates.[44] The enzyme family of cytochromes P450 metabolizes a variety of different substrates, with activities including biosynthesis of steroids, fatty acids, or fat-soluble vitamins as well as the degradation of herbicides and insecticides. In particular, cytochrome P450 3A4 contributes to the metabolism of approximately 50% of marketed drugs.[45−47] Most of the terpene cyclase enzymes (a.k.a. terpene synthases) are also multispecific. For example, the class I sesquiterpene cyclase γ-humulene synthase generates 52 different products, of which γ-humulene constitutes less than 30% in abundance.[48,49] Methane monooxygenase oxidizes more than 150 different substrates.[50] The RecBCD nuclease, originally named Exonuclease V, accepts both linear double-stranded DNA and single-stranded DNA with very low specificity.[51,52] The distinction drawn between promiscuous and multispecific enzymes hinges on whether the additional substrates represented selective pressures on the enzyme. While this is an important conceptual distinction, assessing whether an enzyme is promiscuous vs multispecific may be difficult in practice due to lack of knowledge of the evolutionary and environmental history of the enzyme. It should be noted that additional usages of the term “promiscuous” also exist. Promiscuity is sometimes used to refer to the capacity of an enzyme to transform different physiologically relevant substrates (see ref (14) and references therein), to be contrasted with “multifunctional enzymes” whose side activities may be either physiologically useful or detrimental.[53] Unfortunately, this definition of promiscuity can be contradictory to the one given earlier, in which promiscuity refers to the capacity to perform non-native reactions. In addition, the determination of physiological relevance is difficult to make and again raises questions of evolutionary history. A third usage of the term promiscuity refers to enzymes whose catalytic domain executes multiple functions.[53−56] In this review, we have favored the definition from the evolutionary biochemistry literature, because ribozymes are often evolved under known conditions in vitro, allowing true promiscuity to be characterized, while physiological relevance is unspecified and multiple domains are relatively uncommon.

Promiscuity and Evolutionary Innovation

Fortuitous side reactions of a promiscuous enzyme are believed to be central to evolutionary innovation, as an initial kernel of activity for a side reaction is a starting point for optimization of the new activity by evolution.[9] In addition, an enzyme might exhibit new side activities under new environmental conditions (e.g., temperature, pH), and such enzymes are called condition-promiscuous enzymes. Conditional promiscuity is a possible path for adaptation in new environments.[57−59] However, in the absence of selective pressure, side activities would be subject to neutral drift and may be lost if they are uncorrelated to native functions of the enzyme. Interestingly, contrary to native functions, which are usually tolerant of mutations, directed evolution of enzymes has shown that the non-native functions can be greatly optimized by just a small number of mutations.[19,55] The flip side of this so-called plasticity is that newly evolved non-native functions are typically not tolerant to mutations. One might therefore suspect that evolutionary robustness, if it is observed for native functions, likely evolved as a trait or correlate of a selected trait. In addition, the duality of plasticity for non-native functions and robustness for native functions implies that evolutionary optimization of non-native functions might not always lead to a significant decrease of the original native function. However, in the absence of continued selection pressure on the original function, specialization has been shown to occur due to trade-offs during selection of the secondary function, even without negative selection against the original function.[60] The idea that enzyme evolution and promiscuity are connected goes back to the mid-1970s, when Yčas and Jensen proposed, independently, the first model for enzyme evolution.[9,61] This general model hypothesizes that primitive life had minimal gene content and the number of available enzymes was limited. It posits that primordial enzymes may have been less specific, being able to catalyze broad classes of reactions on a variety of substrates. Gene duplication and mutation would have then increased genetic diversity, leading to the divergence of new enzymes whose secondary activities might give an evolutionary advantage under newly encountered selective pressures.[62,63] This hypothesis is widely accepted, although direct evidence is scant.[14,55] The molecular processes and evolutionary forces involved in the biological evolution of enzymes are very difficult to reconstruct, and hence the mechanisms under which duplication and specialization events shape enzyme evolution have been the subject of much debate.[64] Nevertheless, understanding how specificity and promiscuity arise during in vitro selection and evolution of ribozymes, recapitulating an origin of life, can address this problem experimentally.

Promiscuity and the Fitness Landscape

The fitness landscape is a well-studied conceptualization of evolution through the space of all possible sequences (sequence space).[65,66] Each point of sequence space is specified by a sequence and its associated fitness (e.g., activity on a given substrate), giving the fitness “landscape” in sequence space. At each point in sequence space, one might also imagine the large chemical space of possible substrates, and an activity profile for that sequence over substrate space, which reflects the promiscuity of the sequence. Properties of the fitness landscape are not necessarily expected to correlate with properties of the promiscuity profile. Different fitness landscapes over sequence space can give rise to the same promiscuity profile in substrate space (Figure ). In general, optimization for higher activity need not correspond to increased specificity. However, specific mechanisms, such as a trade-off between rate and specificity, could produce correlations between the fitness landscape and the associated promiscuity profiles. The idea that early, nonoptimized ribozymes were particularly promiscuous would translate into a correlation in which highly active sequences on the fitness landscape have lower promiscuity compared to less active sequences. Whether the specificity of a ribozyme can be improved through mutation or evolution would depend not only on the specificity of that individual ribozyme, but also on the fitness landscapes for the cognate and alternative substrates (Figure ).

Figure 1

Fitness landscapes and promiscuity profiles. Different fitness landscapes (A–D) can correspond to the same promiscuity profile (E–H). Vertical dashed lines (a–g) correspond to different ribozyme sequences. Ribozyme fitness landscapes (A–D) for two substrates may differ (blue and red) with or without overlap. The promiscuity profile (E–H), depicted here for two substrates (1: blue and 2: red) depends on the sequence tested, as seen in the comparison among sequences a–c in panel E. In addition, similar promiscuity profiles can be derived from qualitatively different fitness landscapes. Compare sequence a from (A, E) with sequence d from (B, F), sequence b from (A, E) with sequence e from (C, G), and sequence c from (A, E) with sequence g from (D, H). While ribozymes a and d have similar promiscuity profiles, their evolutionary potential is strikingly different. Ribozyme a could evolve through mutations to specialized activity, but ribozyme d is already at a local maximum and has no evolutionary potential for increasing activity. Similarly, ribozymes b and e have the same promiscuity profile, but only ribozyme b has the possibility to evolve into a sequence of higher activity and selectivity. Ribozymes c, f, and g are highly specific, but unlike ribozymes f and g, ribozyme c has increased potential to evolve into a promiscuous ribozyme.

Quantifying Promiscuity: The Promiscuity Index

Several possible methods exist to quantify substrate specificity. Here, we describe the promiscuity index (I) proposed by Nath and Atkins, which is a metric similar to a normalized information entropy[67] (eq ):where N is the number of substrates that can be transformed and e corresponds to their individual associated catalytic efficiencies. Due to the normalization, this metric goes from 0 (only uses one substrate) to 1 (equally efficient on all N substrates). While this promiscuity index is simple and intuitive, it might be strongly influenced by the experimenter’s choice of substrates to test. In particular, when comparing promiscuity indices for different ribozymes or enzymes, one sequence might appear more promiscuous only because many chemically similar substrates were assayed. To account for this problem, a weighted promiscuity index (J) factoring in substrate similarity can be calculated (eq ).[67] Chemical similarity can be calculated using a bitwise dissimilarity metric between a pair of substrates (δ), which is based on the presence or absence of a number of different functional groups. Any method for quantifying promiscuity from experimental data is likely to be biased in at least two ways. First, there is an experimental bias in the selection of substrates (e.g., synthetically accessible, similar to known substrates). For comparisons among enzymes, differences in these biases might affect the promiscuity index calculated, even when using the weighted value. Second, these metrics do not consider the chemical context in which an enzyme functions. If the environment never provides a certain substrate, it may not be justifiable to include such a substrate in the calculation even if the enzyme has nonzero activity on it in vitro. Additionally, the relationship between chemical similarity and promiscuity has not been well-established, and often little difference between unweighted (I) and weighted (J) values has been observed.[67,68] Other metrics for promiscuity also exist, such as a measure based on structural information on the catalytic residues.[69,70] Despite these limitations, the promiscuity index serves as a starting point for characterization and comparison of substrate specificities. In this review, we calculate promiscuity indices for ribozymes for which sufficient data is available in the literature.[71,72] However, when necessary, catalytic rates were used in place of catalytic efficiency when the catalytic efficiency was inappropriate or unknown. Quantitative metrics like the promiscuity index provide the opportunity to compare the specificity of different molecules and potentially study the relationship between promiscuity and other measurable characteristics, such as activity.

Ribozymes Illustrating Promiscuity

In this section, we first describe substrate promiscuity using aminoacylation ribozymes, for which different substrates have been studied in some depth. Then, to gain mechanistic insight into a specific case, we turn to the hammerhead ribozyme, where specificity can be understood in terms of RNA annealing. An important consequence of this mechanism is that promiscuity is dependent on environmental conditions, such as temperature. The expression of promiscuity under new conditions (conditional promiscuity) is a possible mechanism for uncovering latent side activities. We then follow a series of in vitro evolution experiments seeking an RNA replicase, in which the presumption of promiscuous activity was essential to the design and success of the experiments. We next describe a different kind of promiscuity, catalytic promiscuity, in a case of a nucleotide synthase ribozyme that unexpectedly possesses two distinct catalytic mechanisms. Then, we end this section with a brief discussion of the ribosome, a proteinaceous ribozyme whose promiscuity appears to be unparalleled.

Substrate Promiscuity: Aminoacylation Ribozymes

Aminoacylation of tRNA is a key step in protein synthesis, and high selectivity for tRNA-amino acid pairs is crucial for the stability of the genetic code.[73] It is presumed that ribozymes carried out aminoacylation reactions in the earliest stages of the evolution of the translation apparatus. Indeed, several aminoacyl-RNA synthase ribozymes have been identified through in vitro selection, which use a variety of activated amino acid substrates[74−77] (Figure ). These aminoacylating ribozymes show a range of specificities for the substrate side chain. For example, selection using a phenylalanine adenylate substrate 1 produced ribozymes that showed little discrimination (i.e., promiscuous ribozymes) as well as ribozymes showing a strong preference for aromatic amino acids.[78] Although they are derived from the same selection, these ribozymes have quite different promiscuity profiles and indices (Figure , Table ). Selection for aminoacylation with coenzyme A (CoA) thioester 2 produced ribozymes that could function with other CoA thioesters, but required the presence of a free α-amino group.[76] None of these ribozymes match the specificity of the aminoacyl-tRNA synthetase enzymes found in modern biochemistry. This discrepancy cannot be the result of a trade-off between activity and specificity, because the ribozymes are generally much less efficient (∼1000-fold) than the corresponding enzymes.[79,80] Instead, the general finding of promiscuity and the variation of specificities found among these ribozymes are consistent with the understanding that newly evolved sequences are not necessarily specific if they have not been selected for specificity.

Figure 2

Figure 3

Promiscuity profiles for two aminoacylation ribozymes. Promiscuity profiles for ribozyme 77 (blue squares) and ribozyme 29 (pink circles) show catalytic rates for each tested amino acid substrate,[78] ordered by hydrophilicity as defined by Hopp and Woods[91,92] (Phe = −2.5, Tyr = −2.3, Ile = −1.8, Ala = −0.5, Gln = 0.2, and Ser = 0.3). Also see Table .

Table 1

Promiscuity Indices Calculated for Two Aminoacylation Ribozymes (77 and 29)

substrate side chaina	CIDa	k (M^–1 min^–1)b	Ic	Jc
ribozyme 77
phenylalanine	6140	60 000	0.376	0.439
tyrosine	6057	40 000
isoleucine	6306	1.5
alanine	5950	3.4
glutamine	5961	5.3
serine	5951	0.9
ribozyme 29
phenylalanine	6140	140	0.810	0.807
tyrosine	6057	220
isoleucine	6306	20
alanine	5950	1000
glutamine	5961	650
serine	5951	600

Amino acids (CID: PubChem Compound Identifier) were used to determine similarities for calculation of J.[67,71,71] The substrates used are aminoacyl adenylates with the side chain indicated.

Rate constants are from Illangasekare et al.[78]

Both the unweighted (I) and weighted (J) promiscuity indices were calculated from the rate constants (k) shown.

Substrates for aminoacylation ribozymes. Phenylalanyl-adenosine monophosphate[74,78] (1), biocytinCoA[76] (2), biotinyl-Tyr(Me)-oxazolone[77] (3), amino acid cyanomethyl ester,[90] (4) and amino acid 3,5-dinitrobenzyl ester[90] (5). Substrates 4 and 5 are flexizyme substrates. The amino acid backbone is depicted in green; side chains are depicted in blue, and leaving groups are depicted in red. R indicates possible chemical variation in the side chain. Promiscuity profiles for two aminoacylation ribozymes. Promiscuity profiles for ribozyme 77 (blue squares) and ribozyme 29 (pink circles) show catalytic rates for each tested amino acid substrate,[78] ordered by hydrophilicity as defined by Hopp and Woods[91,92] (Phe = −2.5, Tyr = −2.3, Ile = −1.8, Ala = −0.5, Gln = 0.2, and Ser = 0.3). Also see Table . Amino acids (CID: PubChem Compound Identifier) were used to determine similarities for calculation of J.[67,71,71] The substrates used are aminoacyl adenylates with the side chain indicated. Rate constants are from Illangasekare et al.[78] Both the unweighted (I) and weighted (J) promiscuity indices were calculated from the rate constants (k) shown. The aminoacylating ribozymes discussed above have also been observed to catalyze reactions using alternative nucleophilic substrates to generate amide bonds in addition to esterification. In particular, a minimized, 29 nucleotide version of an aminoacylating ribozyme that utilizes Phe-AMP was found to catalyze successive reactions: aminoacylation of the RNA and the subsequent amide bond formation to generate a conjugated peptide.[81,82] The rate of peptide formation was approximately 13-fold less than that for aminoacylation, but this difference could be tuned. Extending the 3′ tail of the RNA by three nucleotides resulted in a 3-fold reduction in the rate of aminoacylation and a 2-fold increase in the rate of peptide formation, presumably by increasing the flexibility around the active site. The potential promiscuity of aminoacylating ribozymes is highlighted by the “flexizymes” developed by Suga and colleagues, so named for their flexibility in accommodating a variety of substrates. These ribozymes were generated over a series of selections with the ultimate goal of producing catalysts capable of charging tRNAs with a wide variety of both natural and non-natural substrates. The starting point of the selection was a library containing a 5′ random region and a 3′ tRNA. Ribozymes were selected for their ability to aminoacylate the 3′ terminus of the conjugated tRNA. This first selection produced ribozymes with a high level of specificity to both the tRNA and phenylalanine substrates.[75,83,84] To broaden the tRNA substrate range, further design and selection was performed with an alternative tRNA sequence, which resulted in ribozymes capable of accepting a variety of tRNAs.[85] These early flexizymes exhibited high affinity for the aromatic side chains. Using the ribozyme’s affinity to the aromatic group to broaden the side chain specificity, the initial substrate 4 was redesigned to substrate 5, which contains a 3,5-dinitrobenzyl ester as the leaving group in the aminoacylation reaction. The idea was that this leaving group could be kept constant, ensuring affinity to the ribozyme, while the side chain itself was varied. This substrate necessitated an altered reaction mechanism, but nevertheless, the strategy was successful, with further selection resulting in ribozymes capable of charging tRNAs without regard to amino acid side chain.[85] More recently, flexizymes have been used to charge tRNAs with various non-natural amino acids, including d-amino acids, β-amino acids, and α-hydroxy acids, and 3′-aminoacyl-NH-tRNA can also be charged.[86−89] Although the flexizyme does exhibit a minor degree of side chain specificity, yields with the non-natural analogues often rival those for the l-amino acids used in the initial selections. The additional substrates represent both promiscuous (non-native) as well as native activities. Overall, the flexizyme demonstrates the surprisingly broad substrate specificity that can be evolved and designed when substrate generality is a desired goal.

Conditional Promiscuity: The Hammerhead Ribozyme

Due to their historical importance in the discovery of ribozymes, much is known about the self-cleaving ribozymes, which function through general acid–base catalysis.[175] The ribozyme fold brings the reactant nucleotides to the vicinity of the cleavage site, with the catalytic strand acting as the general base or acid to activate the nucleophile or stabilize the leaving group, respectively (Figure ). Many of these ribozymes can also catalyze the same transesterification reaction in reverse, using nucleophilic attack from a 5′-hydroxyl to ligate two substrate strands,[93] which represents a possible case of catalytic promiscuity, a phenomenon discussed in Section in the context of a different ribozyme. In this section, we focus on the substrate promiscuity of a self-cleaving ribozyme and how it arises. Although these ribozymes are cis-acting in vivo, they can be engineered to accept oligonucleotide substrates in trans with multiple turnover. While there are numerous self-cleaving ribozymes, here we confine our discussion to the case of the hammerhead ribozyme, a naturally occurring ribozyme found in plant viroid transcripts,[94] for which the specificity of trans-acting variants has been extensively investigated.

Figure 4

Proposed mechanism of RNA self-scission by general acid–base catalysis. A general base promotes deprotonation of the 2′-hydroxyl of the nucleophile, initiating formation of the cyclic intermediate. A general acid stabilizes the 5′-hydroxyl leaving group, allowing resolution of the intermediate to generate the cleavage products. The trans-acting hammerhead ribozyme can be engineered from the cis-acting ancestor by removing a nucleotide loop of one helical arm, thereby creating a cleavable substrate strand and a catalytic strand.[95−99] In such constructs, the catalytic strand can bind and cleave substrate strands with multiple turnover. In particular, separation of stem I from stem III of the ribozyme (the I/III construct) is most widely studied[100−102] because this construction places most of the conserved nucleotides in the catalytic strand (Figure ). This allows substrate specificity in the substrate strand to be probed.

Figure 5

Structure of a trans-acting I/III hammerhead ribozyme HH16. The catalytic strand is shown in black, and substrate is shown in green.[103] The red arrow indicates the cleavage site.

Structure of a trans-acting I/III hammerhead ribozyme HH16. The catalytic strand is shown in black, and substrate is shown in green.[103] The red arrow indicates the cleavage site. There are two main expectations for sequence specificity of the substrate in a trans-acting construct. First, residues critical for the catalytic mechanism are expected to be relatively intolerant to mutations, which would primarily affect kcat.[104] Second, aside from critical residues, promiscuity for the substrate is expected to be determined by binding interactions (Km) between enzyme and substrate, namely base-pairing, which can lead to large variation in dissociation rates among different substrates. In the HH16 ribozyme (Figure ), substrate affinity, which was dominated by stem III, was very high, implying a low dissociation rate, such that truncation from the 3′ end, down to a 2-nucleotide version of stem I, had little effect on the overall rate of cleavage.[103] Specificity in either stem I or III of the substrate was therefore only observed when stem III was destabilized to give a dissociation rate that was on par with or slower than the overall cleavage rate. Conversely, extending the recognition sequence reduced specificity,[105] in keeping with the idea that, if substrate dissociation is slow relative to cleavage, mutations in the substrate are tolerated because the bound complex is sufficiently populated. In terms of the active site itself, the hammerhead ribozyme has limited substrate promiscuity; substitution of the reactive phosphate with thiophosphate greatly reduces kcat.[106] The example of the hammerhead ribozyme, particularly the sequence dependence of the substrate, illustrates, at a molecular level, the property of conditional substrate promiscuity, in which the apparent promiscuity depends on the environmental condition. Variants having longer binding regions or higher substrate affinity can tolerate weakening (or strengthening) of binding without much change in population of the bound state, and therefore are relatively insensitive to mutations and have high apparent promiscuity. On the other hand, variants that exist on the threshold of binding can display high specificity as they are sensitive to small changes in dissociation rate. Thus, exhibition of promiscuity depends on conditions such as substrate concentration, pH, ionic strength, and temperature. Conditional promiscuity can be the basis for cryptic genetic variation, in which an altered phenotype is uncovered in new environments. Thus, it is likely to be underappreciated in the literature due to observational bias, because most experimental studies tend to focus on a small set of reaction or environmental conditions. This is an area ripe for future research given the likely importance of conditional promiscuity for evolutionary innovation.

Convergent Mechanism, Convergent Promiscuity: A Tale of Two “Capping” Ribozymes

The influence of mechanism on promiscuity, illustrated by the hammerhead ribozyme, is exemplified in a comparison of two independently derived ribozymes that share a common mechanism. These ribozymes, isolated under different selection conditions, promote the formation of a phosphate-phosphate anhydride bond between the terminal phosphate of a nucleotide and the 5′-α-phosphate of RNA. The final product is similar to the 5′ cap found on eukaryotic mRNAs. These two RNA capping ribozymes, the Iso6 and 6.17 ribozymes, were discovered in the Yarus and Unrau groups, respectively.[107,108] Interestingly, both ribozymes were isolated from different selections for which this capping reaction was not the desired function. Iso6 was recovered from a selection originally designed to identify ribozymes that could produce aminoacyl adenylates through reaction between amino acids and triphosphorylated RNA. Instead, pyrophosphate release was observed in the absence of amino acids, and the selection pool even developed labeling with PPi. Selection for capping activity using UTP instead of PPi quickly resulted in high activity in the pool and the identification of Iso6.[107] On the other hand,the 6.17 ribozyme derived from a selection initially designed to identify polymerase activity by incorporating labeled UMP into a primer annealed to a poly(A) template. The resultant ribozyme with the fastest kinetics, 6.17, instead was found to act on the 5′ end of the RNA, forming a 5′–5′ cap.[108] Iso6 and 6.17 display no apparent sequence similarities and are expected to adopt different secondary structures, consistent with their unique origins. Despite these differences, the molecular mechanisms for these two ribozymes appear to be surprisingly similar. Both ribozymes are predicted to have helices that terminate at the site of capping, with the terminal 5′ nucleotide retaining some flexibility; this position is unpaired in Iso6 and requires wobble pairing in 6.17. Both ribozymes also display increased activity at lower pH and require divalent cations for activity, although Iso6 prefers Ca2+ while 6.17 tolerates Mg2+, Mn2+, and Ca2+. The ribozymes even possess similar substrate binding affinities.[108,109] Finally, both ribozymes appear to have minimal substrate requirements. The identities of the sugar and base have little impact on activity, despite possible hydrogen bonding interactions with these moieties. However, decreasing the length of the phosphate chain results in a large decrease in substrate binding,[108,110,111] indicating that the phosphate itself is responsible for most substrate interactions. Thus, these ribozymes suggest a common molecular mechanism for RNA capping that permits a high degree of substrate promiscuity, provided a small number of key features is present. The fact that two independently evolved, structurally dissimilar ribozymes have the same requirements supports the idea that substrate promiscuity is determined by mechanism. In this case, evolutionary convergence on the same mechanism resulted in convergence to similar promiscuity as well.[112]

Relying on Promiscuity: Searching for an RNA Replicase

For those interested in the origin of life, one of the most sought-after de novo ribozyme functions is catalysis of template-directed RNA polymerization (an “RNA replicase”), which is thought to be important, if not essential, to a self-replicating RNA system. One of the major avenues for this search has relied heavily on the promiscuity of newly evolved ribozymes. The first ribozymes developed by use of in vitro selection from a large pool of random sequences were RNA ligase ribozymes, including the “class I ligase” (Figure ). The class I ligase was selected to catalyze the ligation between a 5′-triphosphate on the ribozyme and a 3′-hydroxyl on an RNA oligonucleotide substrate, which caused the ribozyme to tag itself with a sequence on the substrate that was necessary for purification and amplification.[113,114] The 3′,5′-phosphodiester bond formed during ligation is identical to that formed during template-directed polymerization. Modification of the original class I ligase to bind a primer-template complex generated a ribozyme that possessed some polymerization activity, being able to extend a primer through the incorporation of mononucleotide triphosphates.[115,116] This reaction occurred with 92% fidelity, though activity decreased with successive nucleotide additions, topping out at six nucleotides added after a six-day incubation. Nevertheless, this initial finding signaled that catalytic promiscuity of the class I ligase could potentially lead to an RNA replicase.

Figure 6

The class I ligase ribozyme and its descendants. Structures for the class I ligase;[133,134] the round-18 polymerase introducing the new 3′ accessory domain (blue), which is known to interact with the loop depicted on the lower right;[117,122] the tC19Z polymerase, introducing a new 5′ accessory domain;[123] and the 24–3 polymerase.[124] Blue regions denote new additions to the ribozyme with point mutations marked by blue diamonds. Primer and template oligonucleotides are shown in orange and red. Subsequent work with the class I ligase and its derivatives aimed to increase its processivity, fidelity, and template generality. Important progress was made through attachment of an accessory domain to the 3′ end of the ribozyme, which was selected from a 76-nt random sequence with the idea that this domain could facilitate interaction of the ribozyme with the primer-template complex.[117] Polymerization activity was selected for through incorporation of tagged nucleotides opposite an attached primer. To increase the sequence generality of polymerase activity, shorter templates were used to reduce hybridization with the ribozyme, and different primer-template sequences and lengths were used in different rounds of selection. One ribozyme, isolate 10.2, was found to function without attachment to the primer, and without recognition of a specific sequence. This feature was conferred by the new accessory domain, which increased binding of the primer-template complex, but polymerization activity itself occurred with minimal change to the ligase domain. Mutagenized versions of the 10.2 ribozyme were further selected for function on longer single-stranded templates and in the presence of higher concentrations of untagged nucleotides to improve fidelity. After eight more rounds of selection, the best resultant ribozyme, termed the round-18 polymerase (Figure ), functioned much better with longer templates, and allowed for the extension of up to 14 nucleotides. While catalytic promiscuity was key to the discovery of an RNA polymerase ribozyme, substrate promiscuity at a given template base is highly undesirable. That is, fidelity is important for an RNA replicase, because error rates represent a serious limit in the transmission of information.[118] The round-18 polymerase copied templates with a per-base fidelity of 96.7%, which corresponds to relatively low promiscuity values (0.01–0.22; Table ). One of the major determinants of this fidelity is misincorporation resulting from G:U wobble pairs, which is reflected by their higher promiscuity values compared to A and C (Table ). While extension across a template A or C resulted in the correct addition (U or G, respectively) in over 99% of cases, G templated with an incorporation fidelity of 95.7%, and U templated with a fidelity of 92.1%, with the vast majority of mismatches resulting in a G:U mispair. This type of mispairing also appears to be the major limitation on the fidelity of nonenzymatic replication, and may be an echo of the thermodynamic limit on specificity.[17,119]

Table 2

Promiscuity Index (I) and Weighted Promiscuity Index (J) for the Round-18 Polymerase

substrate	CID	k_A (M^–1min^–1)a	k_C (M^–1min^–1)a	k_G (M^–1min^–1)a	k_U (M^–1min^–1)a
ATP	5957	0.30	0.057	0.023	5.3
CTP	6176	0.02	0.008	5.4	0.0002
GTP	135 398 633	0.02	41	0.003	0.23
UTP	6133	87	0.004	0.46	0.001
	fidelity	0.991	0.9996	0.957	0.921
	I	0.020	0.010	0.219	0.126
	J	0.020	0.010	0.220	0.125

Rate constants (kN, for N = A, C, G, U) are from Johnston et al.[117]

Rate constants (kN, for N = A, C, G, U) are from Johnston et al.[117] Despite the 3′ accessory domain, a major limitation of the round-18 polymerase continued to be low binding affinity for the primer-template complex, which was the primary contributor to the low processivity of the class I polymerase. The affinity also had a high degree of variability with regard to primer-template sequence, suggesting that further reduction of sequence specificity was still needed.[120] Further progress was achieved by selecting directly for activity in trans using a water-in-oil emulsion. The first product of this method was the B6.61 polymerase, which was capable of generating sequences 20 nucleotides long.[121,122] B6.61 showed a much faster polymerization rate than its predecessor, with an extension rate over 75 times faster for longer sequences. While there was no significant improvement in binding to the primer-template, this rate increase was accompanied by increased fidelity, including a minimization of G:U wobble insertions. As with the aminoacylation ribozymes (Table ), this trend is a counterexample to the idea of a general trade-off between activity and specificity. A substantial improvement to processivity came using a similar compartmentalization technique with the addition of a 5′ random region with the aim of improving interactions between the ribozyme and the primer-template complex. This yielded a 5′ accessory domain that forms stabilizing interactions with downstream portions of the template, thus increasing binding of the ribozyme to the template. Randomization and selection of the template sequence strengthened these interactions. With the optimized template, the new ribozyme, named tC19, ultimately yielded up to 95 nucleotide extensions with a per-base fidelity of 97.3%. However, the new interactions were largely intermolecular base-pairing, such that activity was strongly dependent on sequence. Selection on different templates identified four point mutations which, when introduced into tC19 to make tC19Z (Figure ), improved the sequence generality. These new mutations further increased the measured per-base fidelity to 99.1%, largely due to a decrease in G insertion across template U. The tC19Z polymerase was shown to be capable of transcribing a functional 24 nt variant of the hammerhead ribozyme.[123] Consideration of the promiscuity of the RNA polymerase ribozyme raises an interesting irony: while substrate promiscuity of the incoming monomer across a given template base is undesirable because it leads to copying errors, substrate promiscuity with respect to the template itself is highly desirable to obtain a ribozyme capable of copying many different, and ideally any, sequences. Sequences of particular concern are those with a high degree of structure that would need to be locally melted for ribozyme access, including sequences that comprise the ribozyme itself. Recent selections based on the RNA polymerase ribozyme focus on improving its sequence generality. One such study selected for the polymerase’s ability to synthesize complex folded RNA molecules, with selection tied to the creation of two functional aptamers, imposing pressure for sequence generality and high fidelity.[124] The most active selected ribozyme, 24–3 (Figure ), showed a ∼100-fold increased incorporation rate through structured sequences compared to the parent ribozyme. Likely as a result of selection for functional molecules instead of sequence fidelity, the 24–3 ribozyme displayed a higher error rate than its predecessors, in particular an increased tolerance for G-U wobble pairing. Despite this limitation, 24–3 was able to synthesize functional RNAs up to 76 nt long and could perform exponential amplification of an RNA template. A different approach to overcoming the substrate generality problem takes advantage of plasticity, which occurs when non-native functions can be found through a relatively small number of mutations. In this case, it was hypothesized that copying via ligation of oligonucleotides could improve copying through structured sequences, because base-pairing to the oligonucleotide would mitigate some of the free energy cost of melting the template. Knowing that the RNA polymerase ribozyme was originally derived from the promiscuous activity of an RNA ligase, Attwater et al. engineered and evolved an ancestor of the tC19Z ribozyme to copy templates using trinucleotide triphosphates instead of NTPs.[125] The triplet oligonucleotides use strand invasion to unfold structured RNA sequences for improved copying. The atavistic ribozyme t5+1 displayed reduced fidelity compared to its NTP-using counterpart, but selection for fidelity yielded an improved variant able to synthesize its own catalytic subunit. Interestingly, the t5+1 ribozyme consists of a heterodimer of the catalytic subunit and an RNA “cofactor” that assists interaction with the primer-template complex. Both subunits are descended from the same ancestral pool, illustrating how specialized descendants originated from distinct domains of the ancestor. An ingenious orthogonal strategy to overcome the problem of sequence generality was developed by Joyce and coworkers, who reasoned that base-pairing between ribozyme and template was the major contributor to the energetics determining template specificity. Base-pairing is essentially absent between d-RNA and l-RNA sequences,[126,127] and thus, a d-ribozyme is expected to have little base-pairing interaction with l-substrates. Selection for ligase activity indeed discovered d-ribozymes that could ligate l-RNA oligonucleotides.[128] As expected, the non-natural, mirror-image l-ribozyme could perform the complementary reaction using d-RNA substrates and template. Furthermore, as with the class I ligase, these cross-chiral ligases also possessed polymerization activity. Unlike nonenzymatic templated RNA replication,[129] these ribozymes displayed very little chiral inhibition, showing a high specificity for substrates of the desired chirality. A cross-chiral ligase was efficient enough to produce its mirror image enantiomer, which could then produce the original enantiomer. The cross-chiral ribozymes were not entirely sequence-general, as some substrates, such as those with 3′-terminal C or G residues, were more efficient than others. Nevertheless, given the precedent of the evolutionary strides demonstrated by the promiscuous class I ligase, the cross-chiral ligases represent an intriguing starting point for further development. Polymerase (and ligase) ribozymes present a unique challenge in simultaneously requiring broad template accommodation and strict fidelity. Although this work was not undertaken for the purpose of studying promiscuity in ribozymes, the advances made with the class I ligase, spanning more than two decades of work by multiple groups, rely heavily on the promiscuity and plasticity of the ribozyme. Because this lineage of RNA polymerase ribozymes has only been selected on RNA substrates, true promiscuity can be clearly identified if the ribozymes accept different nucleic acids. One ribozyme displays some activity for incorporation of non-natural sugars and nucleobases, although it often stalls if modified nucleotides are present at specific positions.[130] Unlike other ribozymes in its lineage, the 24–3 ribozyme, perhaps as a consequence of its selection for tolerance of different RNA aptamer templates, was observed to polymerize DNA on an RNA template (i.e., reverse transcription), permitting extension by up to 32 deoxyribonucleotides.[131] A later generation of this ribozyme, 38–6, shows remarkable promiscuity, with activity on templates or nucleotides composed of multiple combinations of RNA, DNA, threose nucleic acid (TNA), or arabinose nucleic acids (ANA), though with reduced activity compared to an RNA-only system.[132] 38–6 also performs DNA-templated RNA synthesis and RNA-templated DNA synthesis more effectively than synthesis involving TNA or ANA, likely due to the lower structural similarities. These results further demonstrate the promiscuity of this ribozyme lineage. While the class I ligase and its descendants constitute a fascinating case study, it is unknown whether other ribozymes could exhibit similar versatility.

Catalytic Promiscuity: The Nucleotide Synthase Ribozyme

While substrate promiscuity appears to be commonly found among ribozymes,[108,135,136] one may ask whether true catalytic promiscuity is also observed. Indeed, an interesting case was found in the pR1 nucleotide synthase.[137] Selected to catalyze a reaction between ribose 5-phosphate (PR) and 6-thioguanosine (6SGua), this ribozyme was found to also be capable of catalyzing the reaction between 6SGua and 5-phosphoribosyl 1-pyrophospate (PRPP), an intermediate in the biological synthesis of nucleotides. These two reactions appear to have distinct reaction mechanisms and resultant products, depending on the substrate provided. Reaction with PRPP generates a glycosidic bond, resulting in the corresponding nucleotide, 6SGMP. However, the reaction with PR appears to require acyclization of ribose, allowing 6SGua to react with the corresponding aldehyde and generate a Schiff base (Figure ). Each reaction has a unique dependence on magnesium ion concentration, supporting the existence of two different mechanisms. Interestingly, ribozymes selected for reactivity with PRPP instead of PR did not exhibit analogous activity on PR. Thus, not all ribozymes with the same function possess catalytic promiscuity. Despite the catalytic promiscuity of pR1, the ribozyme still displays a high degree of substrate specificity toward 6SGua, as analogous sulfur-containing purines are not recognized, in contrast to purine synthase ribozymes selected independently.[138] The pR1 ribozyme demonstrates that catalytic promiscuity may differ in important ways from substrate promiscuity. While substrate promiscuity might be readily evolved through a relaxed binding mode, catalytic promiscuity requires a new reaction mechanism whose spontaneous emergence might be relatively unusual.

Figure 7

Reactions catalyzed by the pR1 nucleotide synthase ribozyme. (A) Given a ribose 5-phosphate substrate, the acyclic form of ribose is stabilized and 6-thioguanosine (6SGua) reacts to form a Schiff base, which can then undergo an Amadori rearrangement. (B) Reaction with 5-phosphoribosyl 1-pyrophospate produces the desired nucleotide, 6SGMP. Adapted with permission from ref (137). Copyright 2009 Elsevier Ltd. Cell Press.

A Highly Promiscuous Ribozyme: The Ribosome

The ribosome is a ribozyme that translates genetic information on mRNA into protein sequences and is conserved across all the domains of life. The ribosome core consists of catalytic RNA, but farther from the catalytic center, both proteins and RNA are found.[139] In eukaryotes, four rRNAs (rRNAs) associate with about 70 proteins, while in E. coli, the ribosome consists of 3 rRNAs and 52 proteins.[140,141] Because the ribosome is a ribozyme and conserved across all domains, it is presumed to have existed in the last universal common ancestor (LUCA), and is also taken as circumstantial evidence of the RNA World.[142] Protein translation necessitates high fidelity, with error rates of the overall process on the order of 10–4 per codon. Fidelity is primarily maintained through factors other than the ribosome, such as aminoacyl-tRNA synthetase editing and EF-Tu binding.[143,144] While the peptidyl transferase center of the ribosome provides some steric selectivity (e.g., preferring l- rather than d-amino acids[145]), the ribosome itself is a surprisingly promiscuous molecule overall, permitting a wide assortment of substrates so long as there is a correct codon-anticodon match. The ribosome accepts two aminoacyl-tRNAs at a time and catalyzes the formation of a peptide bond between the amino acids, releasing an uncharged tRNA and retaining a peptidyl-tRNA.[141] The ribosome must accommodate a large variety of substrates: there are 20 canonical amino acids that can be associated with 50 or more different tRNAs, depending on the species.[146] Even if one only considers two canonical amino acids coming together to be joined by a peptide bond, there are 400 possible substrate permutations that the ribosome must accommodate and catalyze. This level of multispecificity is essential for the production of all extant proteins in the organism. In addition to accommodating different amino acids and peptides in the active site, the ribosome must also accommodate different peptides in the exit tunnel. Interestingly, the exit tunnel is lined primarily by RNA and lacks significant patches of hydrophobicity, creating a “nonstick” character that allows peptides through regardless of sequence.[139] The ribosome is similarly multispecific with respect to the mRNA templates, on which there are minimal sequence restrictions. Following initiation, which does involve sequence-specific interactions in some organisms, ribosome binding to mRNA is primarily facilitated through interactions with the mRNA backbone.[147] However, the ribosome does display some slight substrate preferences. Early research on the ribosome, for example, discovered roughly 2-fold higher reactivity with leucine than phenylalanine.[148] Additionally, ribosomes display codon preferences that can alter the elongation rate,[149] a property which is used for regulation of gene expression. Still, the degree to which ribosomes are capable of utilizing a wide variety of substrates, including many noncanonical amino acids,[150] representing promiscuous activity, is truly striking. As with the RNA polymerase ribozymes, the substrate promiscuity of the ribosome must coexist with a requirement for high fidelity of information transfer. The promiscuity of the ribosome is tolerated by the cell in part because translation fidelity is handled during aminoacylation of tRNAs, including proofreading processes.[151] In the ribosome, cognate and noncognate tRNAs can be distinguished through minor differences in base-pairing to mRNA. Recognition of the cognate tRNA leads to a structural change that is identified by elongation factor proteins which permit translation to proceed.[152] Another restriction imposed on the incoming tRNA is the 3′ terminal CCA sequence, which forms specific interactions with the ribosome.[139,147] This CCA sequence is required, and occasionally sufficient, for peptidyl transfer to occur.[148,153] A small number of important interactions between tRNAs and the ribosome provide high fidelity of translation while permitting minimal restrictions on the mRNA or protein sequences. The innate promiscuity of the ribosome is occasionally exploited by nature. One such example is puromycin, an antibiotic produced by the bacteria Streptomyces alboniger. Puromycin is an aminonucleoside, containing nucleoside and amino acid analogues, linked through an amide bond instead of the conventional ester. This structure mimics the 3′ terminus of a charged tRNA, which allows it to enter the ribosome and be irreversibly incorporated into the nascent polypeptide, terminating translation.[154,155] The efficacy of this molecule suggests that evolutionary escape from this promiscuous activity has been difficult despite the selective pressure engendered by the antibiotic. Synthetic biologists have also taken advantage of the substrate promiscuity of the ribosome, fundamentally altering the genetic code itself. tRNAs recognizing the amber stop codon can be charged with noncanonical amino acids. Because the ribosome is, to a first approximation, essentially agnostic with respect to the side chain of the incoming monomer, the amber codon is translated into the new amino acid.[156,157] Amino acids with a remarkably diverse set of unnatural functional groups have been successfully incorporated by this method, including alkanes, polybenzenes, sugars and phosphate-containing species.[158,159] The ribosome can even catalyze the formation of ester bonds, yielding mRNA-encoded polyesters, without mutation in the ribosome itself.[86] Further evolution can push this versatility further, as seen with the ribosome variants ribo-Q1 and ribo-X, which translate quadruplet codons and thus introduce many “blank” codons to the genetic code.[160−162] Although it was postulated early on that the genetic code might be a “frozen accident”,[4] it now seems clear that the code itself has been the subject of evolution, as evidenced by the different version of the code found in mitochondria[163] as well as statistical analyses suggesting that the code has evolved to minimize the biophysical impact of mutations.[164] The evolvability and malleability of the genetic code attests to the remarkable combination of substrate promiscuity and informational fidelity in the ribosome.

Primordial Ribozymes: More Promiscuous?

We now return to a question posed at the beginning of this review: would primordial ribozymes be particularly promiscuous? There are two reasons why one might hypothesize this. First, given the importance of promiscuity for evolutionary innovation, one may suppose that primordial ribozymes might have been more promiscuous than highly evolved enzymes due to evolutionary pressure for greater specificity. Second, given the wider chemical diversity of functional groups available to proteins, one may suppose that proteins will have both superior specificity and activity, in general, compared to ribozymes, due to their ability to engage in more types of interactions. Although there is insufficient data in the literature to answer these questions definitively, here we consider two comparisons that bear on these issues. First, we consider whether newly evolved ribozymes are more promiscuous than highly evolved ribozymes. Second, we ask whether ribozymes are more promiscuous than proteins by examining a head-to-head comparison of a ribozyme and a protein enzyme, both of which were evolved de novo.

Newly Evolved vs Highly Evolved Ribozymes

An interesting comparison can be made between the promiscuity of ribozymes from in vitro selection, which have very short evolutionary histories, to highly evolved ribozymes, in particular, the ribosome. The first ribozymes to catalyze amide bond formation were initially selected for a different activity, to catalyze the transfer of an aminoacyl group from the 3′-hydroxyl of a short tRNA mimic to the 5′-hydroxyl of the ribozyme.[165] However, like the ribosome, one of the selected ribozymes was able to use an alternative nucleophilic substrate. When the 5′-hydroxyl was substituted with an amino group, amide bond formation was observed at a similar rate. In both cases, the ribozyme accelerated the respective noncatalyzed reaction by over 1000-fold. Later, ribozymes were selected to perform peptide bond formation by linking a phenylalanine to the 5′ end of the RNA and selecting for the ability to attach a biotinylated methionine from a 3′ acylated AMP substrate.[166] The best ribozymes from this selection displayed a rate enhancement of ∼106. This reaction was inhibited by the presence of AMP, but not other nucleotides or methionine, suggesting that the ribozyme functions primarily through specific interactions with AMP. Consistent with this, activity was also observed with leucine, phenylalanine, and lysine substrates, with methionine and leucine being the best peptidyl donors. These peptide synthase ribozymes, much like the flexizyme, illustrate that if there are sufficient interactions with other parts of the small molecule substrate, the amino acid side chain may be quite variable. In addition, the fact that one of the ribozymes is capable of both ester and amide bond formation, much like the ribosome, further corroborates its promiscuity of function. (In this case, these non-native activities represent true promiscuity because we know the complete environmental history of the ribozyme.) However, while it may be possible to evolve increased promiscuity in these ribozymes, it seems a hard task indeed to match or exceed the promiscuity of the ribosome. This comparison at least suggests that newly evolved ribozymes are not necessarily more promiscuous than highly evolved ones. Instead, specificity or promiscuity itself may be a selectable trait, and natural selection may favor either greater or lesser promiscuity.

De Novo Ribozyme vs de Novo Protein Enzyme: The Diels–Alderases

Are protein enzymes superior to ribozymes, such that ribozymes emerging in the RNA world would be worse than their protein counterparts? While it seems clear that proteins have greater activity in general, nearly all protein enzymes have much longer evolutionary histories compared to ribozymes, most of which have been evolved in vitro. To avoid this confounding factor, one may compare a de novo ribozyme with a de novo protein enzyme. Such a comparison can be made with the Diels–Alderase ribozyme and protein enzymes, which both catalyze a reaction (Figure ) previously not known to occur in biology.

Figure 8

Diels–Alder cycloaddition. A concerted reaction between a conjugated diene and an alkene (dienophile) results in the formation of a cyclized product.[172] The dienophile substituents shown (R3 and R4) are added to the same face of the cyclohexane ring. Interestingly, the first biochemical catalyst discovered for the Diels–Alder reaction was a ribozyme, not a protein. While most ribozyme reactions involve RNA or amino acid modifications and often involve base-pairing interactions, an early discovery that demonstrated the catalytic versatility of ribozymes was carbon–carbon bond formation by Diels–Alderase ribozymes.[167,168] The specificity of one such ribozyme was extensively characterized by the Jäschke laboratory through testing a series of potential substrates.[169,170] The initial experiments selected for cycloaddition of a biotin-maleimide to anthracene, which was conjugated to the RNA via a polyethylene glycol linker. The ribozyme produced from this selection could catalyze this reaction on free substrate with a high degree of enantioselectivity. Additionally, a synthesized mirror image of this ribozyme composed of l-nucleotides produced the opposite enantiomer. This enantioselectivity was the result of a “tail” group on the anthracene substrate (e.g., the PEG linker), which restricted the molecule’s orientation in the binding pocket. Important structural features of both substrates include: the diene must contain three linearly annellated rings, the dienophile must be a five-membered maleimide ring with a hydrophobic tail, and both substrates must be arranged in a stacked, coplanar manner.[170] These results present one of the most rigorous characterizations of ribozyme specificity on a non-nucleotide substrate. One decade later, a Diels–Alderase protein enzyme was developed by the Baker lab.[171] This enzyme was created de novo using computational design and site-directed mutagenesis to catalyze the reaction between 4-carboxybenzyl trans-1,3-butadiene-1-carbamate and N,N-dimethylacrylamide. Like the ribozyme, this protein enzyme demonstrated a high level of product stereoselectivity (>97%). The best Diels–Alderase enzymes possessed higher catalytic activity than the Diels–Alderase ribozymes, but were still markedly slower than natural enzymes. Although these catalysts were discovered through different means (in vitro selection vs computational design), both were created in a laboratory setting independent of natural evolutionary influences, and therefore they are an interesting test comparison to understand the promiscuity of de novo functions. While the reactions catalyzed by these molecules use different substrates, the promiscuity indices can be compared between them (Table vs Table ). Note that different values are calculated for the diene and dienophile when possible, and that I and J varies depending on whether they are calculated from kcat or from kcat/kuncat (catalytic power). Despite these differences, in general, the promiscuity indices are not very different; all values for the dienophiles lie in the range of 0.66–0.83, with the values for the protein enzyme lying in the middle of this range. Therefore, comparing these two de novo catalysts, it does not appear that the ribozyme is more promiscuous than the protein enzyme.

Table 3

Unweighted (I) and Weighted (J) Promiscuity Indices for Diels–Alderase Ribozymes, Calculated Separately for the Diene and Dienophile Substrates

HEG indicates hexa(ethylene glycol).

kcat and kuncat values from Stuhlmann and Jäschke.[170]

Dash indicates kuncat not reported; these values could not be included in the calculation of I or J.

Table 4

Unweighted (I) and Weighted (J) Promiscuity Indices for the Protein Diels–Alderase, Calculated for Dienophile Substrates

Rate values were estimated from Siegel et al.[171]

HEG indicates hexa(ethylene glycol). kcat and kuncat values from Stuhlmann and Jäschke.[170] Dash indicates kuncat not reported; these values could not be included in the calculation of I or J. Rate values were estimated from Siegel et al.[171] It is of practical interest to note that the weighted and unweighted promiscuity indices (Tables –4) are often not very different from one another, as the difference between these values ranges from 0 to 0.1. This may reflect ruggedness in the promiscuity profile over the chemical space of the substrates. The motivation for creating a weighted index was to account for the expectation that chemically similar substrates would have similar activity. While this must be true to some extent, over the substrates that were tested and reported in these examples, the additional accounting did not alter the overall calculation by much.

Concluding Remarks

Ribozymes identified by in vitro selection or evolution represent an ideal model system for studying true promiscuity because the selective pressures on these ribozymes are controlled by the experimenter and their entire evolutionary history is available for study. In addition, the promiscuity of ribozymes in particular is a fascinating question relating to the origin of living systems. An attractive but untested hypothesis is that the earliest ribozymes emerging from the prebiotic milieu of random polymers would be highly promiscuous, presenting a kernel of activity across many functions that could be optimized by evolution individually (e.g., after duplication events). Although a rigorous test of this hypothesis is currently lacking, we may consider how current knowledge informs this hypothesis of promiscuous ribozymes. What are the likely properties of a ribozyme selected de novo, i.e., a primordial ribozyme? It is clear that the activity is likely to be low initially, simply because there are more sequences of low activity compared to high activity (i.e., the frequency of sequences is a decreasing function of activity),[173] leaving room for optimization of activity by natural selection. What about promiscuity? While it might seem intuitive that simple, low activity ribozymes would have high promiscuity, we do not see solid evidence for this so far in the literature. As discussed above, a de novo peptide synthase ribozyme is less promiscuous than its highly evolved counterpart (the ribosome). While there are some examples of ribozymes where in vitro evolution resulted in both improved activity and specificity (discussed above), it is not clear that there would be a positive correlation between activity and specificity in general. Indeed at least one mechanism (discussed in Section ) has the opposite effect, causing a negative correlation, i.e., a trade-off, between activity and specificity. The intuition that there should be a positive correlation between activity and specificity is based on the general idea that increased molecular interactions give both increased activity and increased specificity. This seems to be reasonable, but in one rigorous study of RNA aptamers, activity was found to be uncorrelated to specificity.[174] It is even less clear that increased interactions should increase specificity in ribozymes, particularly cis-acting ribozymes, because the entire reaction pathway would be stabilized. How the ground state, transition state, and product would be affected in relative terms is not clear. Therefore, it should not be assumed that the earliest emerging ribozymes were particularly promiscuous. Empirical data is required to resolve the relationship between activity and specificity of ribozymes. The reason why the hypothesis of promiscuous primordial ribozymes is attractive, despite the current lack of evidence to support it, is that it solves an important problem in prebiotic evolution. If the first ribozyme to emerge by chance possesses the ability to catalyze many reactions, albeit at low activity, this ribozyme could serve as the ancestral catalyst to a suite of different reactions, rapidly forming a metabolic network of ribozymes. In evolutionary terms, a network of ribozymes might then arise from exaptation (or preadaptation) of a small number of ancestral ribozymes. However, it may be that promiscuity, rather than being an automatic property of a low-activity primordial ribozyme, should be considered as an evolvable or fortuitous trait itself, possibly uncorrelated to activity. In this case, the selective pressures on the RNA world would play an important role in shaping ribozyme evolvability.

166 in total

Review 1. The origins of the RNA world.

Authors: Michael P Robertson; Gerald F Joyce
Journal: Cold Spring Harb Perspect Biol Date: 2012-05-01 Impact factor: 10.005

2. INHIBITION BY PUROMYCIN OF AMINO ACID INCORPORATION INTO PROTEIN.

Authors: M B Yarmolinsky; G L Haba
Journal: Proc Natl Acad Sci U S A Date: 1959-12 Impact factor: 11.205

Review 3. The flexizyme system: a highly flexible tRNA aminoacylation tool for the translation apparatus.

Authors: Masaki Ohuchi; Hiroshi Murakami; Hiroaki Suga
Journal: Curr Opin Chem Biol Date: 2007-09-19 Impact factor: 8.822

4. A structural model of latent evolutionary potentials underlying neutral networks in proteins.

Authors: Richard Wroe; Hue Sun Chan; Erich Bornberg-Bauer
Journal: HFSP J Date: 2007-05-21

Review 5. Squalene hopene cyclases: highly promiscuous and evolvable catalysts for stereoselective CC and CX bond formation.

Authors: Stephan C Hammer; Per-Olof Syrén; Miriam Seitz; Bettina M Nestl; Bernhard Hauer
Journal: Curr Opin Chem Biol Date: 2013-02-26 Impact factor: 8.822

6. On the evolution of the genetic code.

Authors: C R Woese
Journal: Proc Natl Acad Sci U S A Date: 1965-12 Impact factor: 11.205

7. Enzyme kinetics based on free-energy profiles.

Authors: S Yagisawa
Journal: Biochem J Date: 1995-05-15 Impact factor: 3.857

8. On earlier states of the biochemical system.

Authors: M Ycas
Journal: J Theor Biol Date: 1974-03 Impact factor: 2.691

9. Naked mole-rat has increased translational fidelity compared with the mouse, as well as a unique 28S ribosomal RNA cleavage.

Authors: Jorge Azpurua; Zhonghe Ke; Iris X Chen; Quanwei Zhang; Dmitri N Ermolenko; Zhengdong D Zhang; Vera Gorbunova; Andrei Seluanov
Journal: Proc Natl Acad Sci U S A Date: 2013-09-30 Impact factor: 11.205

10. Aptamers selected for higher-affinity binding are not more specific for the target ligand.

Authors: James M Carothers; Stephanie C Oestreich; Jack W Szostak
Journal: J Am Chem Soc Date: 2006-06-21 Impact factor: 15.419

5 in total

1. Driving forces in the origins of life.

Authors: K A Dill; L Agozzino
Journal: Open Biol Date: 2021-02-03 Impact factor: 6.411

2. Using Steady-State Kinetics to Quantitate Substrate Selectivity and Specificity: A Case Study with Two Human Transaminases.

Authors: Alessio Peracchi; Eugenia Polverini
Journal: Molecules Date: 2022-02-18 Impact factor: 4.411