Daniel Wilhelm1, Jehoshua Bruck1,2, Lulu Qian3,4. 1. Computation & Neural Systems, California Institute of Technology, Pasadena, CA 91125. 2. Electrical Engineering, California Institute of Technology, Pasadena, CA 91125. 3. Bioengineering, California Institute of Technology, Pasadena, CA 91125; luluqian@caltech.edu. 4. Computer Science, California Institute of Technology, Pasadena, CA 91125.
Abstract
A natural feature of molecular systems is their inherent stochastic behavior. A fundamental challenge related to the programming of molecular information processing systems is to develop a circuit architecture that controls the stochastic states of individual molecular events. Here we present a systematic implementation of probabilistic switching circuits, using DNA strand displacement reactions. Exploiting the intrinsic stochasticity of molecular interactions, we developed a simple, unbiased DNA switch: An input signal strand binds to the switch and releases an output signal strand with probability one-half. Using this unbiased switch as a molecular building block, we designed DNA circuits that convert an input signal to an output signal with any desired probability. Further, this probability can be switched between 2 n different values by simply varying the presence or absence of n distinct DNA molecules. We demonstrated several DNA circuits that have multiple layers and feedback, including a circuit that converts an input strand to an output strand with eight different probabilities, controlled by the combination of three DNA molecules. These circuits combine the advantages of digital and analog computation: They allow a small number of distinct input molecules to control a diverse signal range of output molecules, while keeping the inputs robust to noise and the outputs at precise values. Moreover, arbitrarily complex circuit behaviors can be implemented with just a single type of molecular building block.
A natural feature of molecular systems is their inherent stochastic behavior. A fundamental challenge related to the programming of molecular information processing systems is to develop a circuit architecture that controls the stochastic states of individual molecular events. Here we present a systematic implementation of probabilistic switching circuits, using DNA strand displacement reactions. Exploiting the intrinsic stochasticity of molecular interactions, we developed a simple, unbiased DNA switch: An input signal strand binds to the switch and releases an output signal strand with probability one-half. Using this unbiased switch as a molecular building block, we designed DNA circuits that convert an input signal to an output signal with any desired probability. Further, this probability can be switched between 2 n different values by simply varying the presence or absence of n distinct DNA molecules. We demonstrated several DNA circuits that have multiple layers and feedback, including a circuit that converts an input strand to an output strand with eight different probabilities, controlled by the combination of three DNA molecules. These circuits combine the advantages of digital and analog computation: They allow a small number of distinct input molecules to control a diverse signal range of output molecules, while keeping the inputs robust to noise and the outputs at precise values. Moreover, arbitrarily complex circuit behaviors can be implemented with just a single type of molecular building block.
A simple but fundamental principle underlying the sophistication of life is that individual cells with the same genome can exhibit different types of behaviors in response to stochastic molecular events, and the fraction of cells at a given state can be precisely regulated, giving rise to complex system behaviors for a collection of cells (1, 2). In many cases, the stochastic events are not unbiased choices with equal probabilities. Instead, specific fractions of cell states are maintained in various circumstances (3). Similar to biological systems, stochastic information processing could also give rise to sophisticated behaviors in engineered molecular systems. Here we aim to understand the engineering principles for controlling the output of molecular circuits with arbitrary probabilities. Because it is simple to generate an equal probability between two choices, like flipping a coin, we ask the following question: Does there exist a molecular circuit architecture that generates an output with any desired probability, and thus any desired fraction of a molecular species, from unbiased molecular events controlled by simple building blocks?Circuits that are capable of processing molecular information have been developed to control complex behaviors in biological (4–6) and biochemical (7–9) systems. In particular, chemical reaction networks implemented using DNA strand displacement reactions (10) have been proposed as a framework for creating arbitrary chemical kinetics and universal computation (11). However, experimental demonstrations of the theoretical proposal have so far been limited to simple systems with specific functions involving no more than three formal reactions (12, 13). Moreover, in many cases, powerful circuit architectures do not necessarily require the full expressibility of chemical reaction networks. Thus, studies on using simpler and more straightforward DNA strand displacement implementations for certain types of information processing, including for both digital (9, 14) and analog (15, 16) computation, continue to play an important role in the development of biochemical circuits.Digital signals are discrete high or low concentrations of molecular species, which correspond to binary inputs and outputs that are ON or OFF, respectively. Analog signals are continuous concentrations of molecular species, which correspond to real-value inputs and outputs. Here, instead of idealized analog signals with infinite precision, which can be used to compute beyond the Turing limit (17), we are interested in real-world analog signals with finite precision. It has been articulated that digital computation and analog computation each have distinct advantages and thus should be combined in biological and biochemical circuits (18, 19). For example, digital computation is more robust to noise and analog computation is more efficient under certain circumstances. However, an unresolved challenge is to develop a DNA circuit architecture for generating arbitrary analog outputs controlled by digital inputs. This would allow a small number of distinct input molecules to control a diverse signal range of output molecules, while keeping the inputs robust to noise and the outputs at precise values. More importantly, the systematic constructions of DNA strand displacement circuits have so far been focused only on deterministic computation. For example, in DNA-based logic circuits (7, 9), the same input signals will yield the same output signals, whether in the low–copy-number regime or in bulk.Here we show a molecular circuit architecture that controls the stochastic states of individual molecular events with any desired probability: With the same input signals and at the single-molecule level, a circuit will yield either no output or a desired output. Alternatively, a circuit will yield one specific output among a few possible outputs. At the bulk level (i.e., for a collection of molecules), the circuit architecture controls an arbitrary fraction of the input molecules to yield a desired output. Functionally, these circuits enable output signals with analog concentration generated from a single input with analog concentration , for arbitrary binary and rational fraction , which can be controlled by a set of digital signals that are either present or absent.Our approach is a systematic implementation of probabilistic switching circuits, using DNA strand displacement reactions. Unlike the theory of the original switching circuits proposed by Shannon (20), wherein signals arriving at the input terminal of a deterministic switch always reach the output terminal if the switch is ON and always stop flowing if the switch is OFF, the theory of probabilistic switching circuits allows the signals to flow through a switch with a specified probability (21, 22). Exploiting the intrinsic stochasticity of molecular interactions, in our implementation, each input DNA signal is designed to bind to a DNA switch and release an output signal with a one-half probability. Composing these switches together, arbitrary probabilities for any input signal to yield a circuit output can be implemented with just a single type of DNA building block.
Results
Circuit Design.
In a probabilistic switching circuit, any circuit components can be combined in series or parallel (Fig. 1). Each probabilistic switch, or pswitch, is associated with a Bernoulli variable defining the probability that the switch is closed. When it is closed, the input and output terminals are connected and a signal can be propagated; otherwise it is open and the signal cannot flow through. When two pswitches with probabilities and are combined in series, a signal can flow through only if both pswitches are closed, and thus the probability is . When they are combined in parallel, a signal can flow through if either pswitch is closed, and thus the probability is . An extension of the pswitch is a probabilistic splitter wherein an input terminal is connected to one of two output terminals with probabilities and . Equivalently, the fan-out wires in the parallel construction can be replaced by a splitter and the pswitch by a wire, which enables a simple molecular implementation for splitting signal flow at a junction.
Fig. 1.
DNA implementation of probabilistic switching circuits. (A) Series and parallel circuit constructions. To the right of each circuit is the probability its terminals are connected, given the probability for each pswitch and splitter. (B) An example circuit closed with probability . Here, for all pswitches and splitters. (C) A universal probability generator. Binary fractions to are realized with to . (D) DNA implementation for each circuit component. Squiggled lines indicate short toehold domains and straight lines indicate long branch migration domains in DNA strands, with arrowheads marking their 3´ ends. Asterisks indicate domain complementarity. F indicates a fluorophore and Q indicates a quencher. The term is a standard concentration of, for example, 50 nM. The default probability is , which is implemented with . (E) Reaction mechanism for a signal species interacting with a gate species.
DNA implementation of probabilistic switching circuits. (A) Series and parallel circuit constructions. To the right of each circuit is the probability its terminals are connected, given the probability for each pswitch and splitter. (B) An example circuit closed with probability . Here, for all pswitches and splitters. (C) A universal probability generator. Binary fractions to are realized with to . (D) DNA implementation for each circuit component. Squiggled lines indicate short toehold domains and straight lines indicate long branch migration domains in DNA strands, with arrowheads marking their 3´ ends. Asterisks indicate domain complementarity. F indicates a fluorophore and Q indicates a quencher. The term is a standard concentration of, for example, 50 nM. The default probability is , which is implemented with . (E) Reaction mechanism for a signal species interacting with a gate species.In theory, it has been shown that using pswitches with just probability , arbitrary -bit binary fractions can be realized with pswitches (21). The construction is quite simple: Read from the least to the most significant bit, add a pswitch in parallel if the bit is 1, and add it in series if the bit is 0 (Fig. 1). Allowing feedback in the circuits, arbitrary rational fraction with can be realized with splitters (22).Using switches, including deterministic switches, pswitches, and splitters, a circuit that maps digital inputs to all
-bit binary fractions can be systematically constructed (21). The circuit is referred to as a universal probability generator (UPG). A one-bit UPG consists of a deterministic switch controlled by signal and a pswitch (Fig. 1). When is OFF, the circuit is open and the output is 0. When is ON, the circuit is closed with probability. An -bit UPG is recursively constructed by adding a splitter and a deterministic switch to an -bit UPG. A UPG is functionally equivalent to a digital-to-analog converter, except that the output value is controlled not only by a set of digital signals – but also by an analog circuit input .The three types of switches can be implemented with a single type of DNA molecule (Fig. 1). To test the circuit function, an analog signal and a reporter will be placed at the input and the output terminal, respectively. An arbitrary signal is implemented with a single-stranded DNA species that has a 15-nt history domain (Sh) and two 6-nt toehold domains (T) flanking a 15-nt branch migration domain (Sx). The concentration of corresponds to its analog value.A deterministic switch is implemented with a partially double-stranded gate species that has a signal strand with its end half bound to a complementary strand, which we refer to as a gate bottom strand. It has an uncovered toehold domain at the end. The gate species will be present or absent, depending on whether the switching signal is ON or OFF. If the switch is ON, input signal will be converted to output signal through an irreversible strand displacement reaction (23) (Fig. 1): The input strand first binds to the gate via the uncovered toehold domain. Branch migration occurs when the two Sx domains in the input and output strands compete for binding to the complementary domain on the gate bottom strand. When branch migration proceeds to the end of the input, the output strand will be released from the gate and become an active signal.A probabilistic switch is implemented with two gate species: One is the same as the deterministic switch and the other is that consumes the input signal without generating any active output signals (Fig. 1). The two gate species will compete with each other for interacting with the input strand, and the outcome of the competition depends on the rates of the two reactions, which in turn depend on the concentrations of the gate species and the rate constants. The rate constant of a strand displacement reaction is primarily determined by the standard free energy of the toehold (24), and thus using the same toehold for both reactions will result in roughly the same rate constant, allowing the competition to be simply controlled by the concentrations of the gates. Similarly, a probabilistic splitter is also implemented with two gate species at equal concentration, each generating a distinct output signal with probability (Fig. 1).A reporter converts an output strand to a fluorescent signal, which can then be measured by a spectrofluorometer. The reporter molecule has two DNA strands, one modified with a fluorophore and the other with a quencher (Fig. 1). It interacts with a signal strand just like the gate does, but upon completion of the irreversible strand displacement reaction the fluorophore will be separated from the quencher, resulting in increased fluorescence.
Simple Circuits.
We start the experimental demonstration with a single pswitch and splitter (). At first, we designed the second gate species in the pswitch to have no tail. The tail in other gate species contains a branch migration domain and a toehold domain—once the signal is released from the gate, these domains can then participate in reactions with downstream gates. Since no active signal should be produced by the second gate species in the pswitch, having no tail is the simplest way to satisfy that. Unlike the pswitch, the second gate species in the splitter must have a tail to generate another active signal, which was the only difference between the two circuits that we tested. For simplicity, we left the second output of the splitter unconnected to any downstream gate or reporter. With this setup, we expected the two circuits to produce the same amount of output signal, given the same amount of input. However, experimental data showed that the output signal of the pswitch was noticeably less than that of the splitter (). We hypothesized that the uncovered toehold in the tail of the gate species might have reversibly bound to the complementary toehold in the gate bottom strand, forming a loop structure (). In that case, at any given time, only a fraction of the gate species will be able to interact with the input signal as designed, resulting in a slower reaction rate compared with that of the gate species without a tail. Thus, the actual probability for the pswitch was smaller than intended.To solve this toehold problem, we made two design changes: First, a tail is added to the second gate species in all pswitches, but with a poly-A domain (S0) instead of an active branch migration domain. This way, the two competing gate species are now structurally the same. Second, three distinct toeholds are used instead of one universal toehold (). With a simple rule for toehold assignment, the toehold in the tail of any gate species will be different from that in the bottom strand: Choose the longest pathway from the circuit input to the output and assign two toeholds to each signal species along the pathway, following the order shown in . Trace all other pathways from the circuit output back to the input and assign the remaining toeholds based on the existing ones.With these changes, we constructed a one-bit UPG (Fig. 2 ). To convert raw fluorescence signal to concentration of output signal, we introduced a postexperiment triggering step that directly generates a reference output signal change which was then used to normalize the data (). To compare data with expected circuit behavior, we simulated the set of strand displacement reactions using mass-action kinetics. Since we designed the three toeholds to have similar binding energy, to simplify the model, we used a single rate constant for all reactions. In agreement with simulations, the circuit produced approximately no output and output when the deterministic switch was OFF and ON, respectively (Fig. 2).
Fig. 2.
Simple circuits. (A–C) Circuit diagram (A), DNA species (B), and simulations and fluorescence kinetics experiments (C) of a one-bit universal probability generator. The error of the DNA circuit is , comparing the last data point with the expected circuit output. ROX is the name of the fluorophore and RQ is the name of the quencher used in . (D–F) Circuit diagram (D), additional DNA species (E), and simulations and fluorescence kinetics experiments (F) of a two-bit universal probability generator. The error of the DNA circuit is , comparing the last data point with the expected circuit output. Dotted lines are experimental data and solid lines are simulations. Truth tables show expected values of the analog output based on the digital switching signals, and the values are the same as the reaction completion levels shown in simulations at roughly 25 min. Simulations were performed by solving a set of ordinary differential equations derived from the listed reactions, using mass-action kinetics. was used in all simulations.
Simple circuits. (A–C) Circuit diagram (A), DNA species (B), and simulations and fluorescence kinetics experiments (C) of a one-bit universal probability generator. The error of the DNA circuit is , comparing the last data point with the expected circuit output. ROX is the name of the fluorophore and RQ is the name of the quencher used in . (D–F) Circuit diagram (D), additional DNA species (E), and simulations and fluorescence kinetics experiments (F) of a two-bit universal probability generator. The error of the DNA circuit is , comparing the last data point with the expected circuit output. Dotted lines are experimental data and solid lines are simulations. Truth tables show expected values of the analog output based on the digital switching signals, and the values are the same as the reaction completion levels shown in simulations at roughly 25 min. Simulations were performed by solving a set of ordinary differential equations derived from the listed reactions, using mass-action kinetics. was used in all simulations.Adding a splitter and a second deterministic switch, we then constructed a two-bit UPG (Fig. 2 ). The circuit correctly produced the expected output for all four combinations of the two digital switching signals (Fig. 2), suggesting that the three types of switches compose well in a multilayer circuit with branches.To evaluate the predictive power of the simple model, we used the same rate constant estimated from the one-bit UPG experiment to simulate the two-bit UPG: The data and simulations semiquantitatively agreed with each other (Fig. 2). Adjusting the rate constant in the simulations led to a better fit to the data (), which is unsurprising given that an additional toehold sequence (T1) was used in the two-bit UPG. Leak reactions between an upstream and a downstream gate species (or between an upstream gate and a downstream reporter) can be included in simulations to better explain the gradual signal increase in the output trajectories (). It is also reasonable to assume that an up to 10% inaccuracy of the input concentration could occur in the experiments, because of pipetting errors as well as the signal loss due to synthesis errors in the DNA strands. With these two modifications in the model, the simulations quantitatively agreed with the data ().
More Complex Circuits.
Next, we wanted to understand whether the circuit architecture is robust enough for an increasing circuit size. To explore that, we constructed a three-bit UPG (Fig. 3). This was when we encountered a problem with one of the splitters: It yielded roughly 0.4 instead of the desired 0.5 output (). We hypothesized that either the effective concentration of one gate species in this splitter was 50% higher than that of the other or the rate constant for the input signal interacting with one gate was larger than that with the other, both resulting in one reaction pathway being faster. In either case, the desired circuit behavior should be restored by reducing the concentration of the gate involved in the faster pathway. Indeed, with a concentration of the gate in the faster pathway, the circuit produced the desired output for all possible three-bit switching signals (Fig. 3). Because synthesis, concentration, and pipetting errors all could affect the desired molecular behavior (14), it is important that the circuit architecture allows for a simple method to tune individual components and restore the overall circuit function.
Fig. 3.
More complex circuits. (A and B) Circuit diagram (A) and simulations and fluorescence kinetics experiments (B) of a three-bit universal probability generator. The error of the DNA circuit is , comparing the last data point with the expected circuit output. (C–E) Circuit diagram (C) and simulations and fluorescence kinetics experiments of a circuit without feedback (D) and with feedback (E) for generating rational fractions. The error of the DNA circuit without and with feedback is and , respectively, comparing the last data point with the expected circuit output. Dotted lines are experimental data and solid lines are simulations. To compensate for the observed difference between the two reaction pathways in a splitter (), instead of was used in the experiments shown in B. Similarly, was used in the experiments shown in D and E.
More complex circuits. (A and B) Circuit diagram (A) and simulations and fluorescence kinetics experiments (B) of a three-bit universal probability generator. The error of the DNA circuit is , comparing the last data point with the expected circuit output. (C–E) Circuit diagram (C) and simulations and fluorescence kinetics experiments of a circuit without feedback (D) and with feedback (E) for generating rational fractions. The error of the DNA circuit without and with feedback is and , respectively, comparing the last data point with the expected circuit output. Dotted lines are experimental data and solid lines are simulations. To compensate for the observed difference between the two reaction pathways in a splitter (), instead of was used in the experiments shown in B. Similarly, was used in the experiments shown in D and E.Similar to the two-bit UPG, simulations using the simpler model semiquantitatively agreed with the experimental data (Fig. 3), and simulations using the more complex model including leak reactions resulted in a better agreement (). Allowing different rate constants for reactions involving different toeholds and branch migration domains provided an even more ideal fit to the data ().Finally, we demonstrate the full power of probabilistic switching circuits by constructing a feedback circuit that realizes two rational fractions: and (Fig. 3 and ). The circuit consists of two splitters in a cascade, with one output of the downstream splitter connected to the input of the upstream splitter. Without feedback, the probability for an input molecule to reach each of the two output terminals is and , respectively. With feedback, an input molecule always has probability to follow the loop, and thus the overall probability for it to reach one of the two output terminals is simply adding the probabilities together: and , respectively. Similar to the three-bit UPG, we had to tune down the concentration of one gate species. But after this simple tuning, the circuit produced the expected output both without and with feedback. Again, simulations using the simple model (Fig. 3 ) and the complex model () semiquantitatively and quantitatively reproduced the experimental data, respectively.In theory, many classes of probabilistic switching circuits—including all UPGs and some feedback circuits—are surprisingly robust with imperfect building blocks: If the error of each pswitch is bounded by , the total error of a circuit is bounded by a constant multiple of , regardless of the circuit size (25). In our experiments, the error of the smallest circuit was 3.1% and that of the largest circuit was 1.7%, and the errors of all other circuits were between these two values (Figs. 2 and 3 legends).
Discussion
We used exactly one type of gate species to construct all three types of switches required for arbitrary probabilistic switching circuits. The structure of the gates is similar to that of seesaw gates (9, 26), except that an additional toehold is included in the signals to make the reactions irreversible. Unlike the seesaw circuit architecture wherein thresholding was required for the overall circuit function and was implemented by competition between a fast and a slower pathway, all reactions in the pswitch circuits require just one rate, which can be close to the maximum rate of DNA strand displacement reactions. As a result, all DNA circuits that we demonstrated yielded the desired output signals in just a few minutes, which were one to two orders of magnitude faster than the seesaw circuits.The unbiased DNA switch that we developed simply exploits the inherent stochasticity in molecular interactions: If one molecule can react with two different molecules, it will react with the one that it bumps into first through random diffusion. One might ask, Since the equal probability can be realized by equal concentration of the two reactive molecules, why not use different concentrations of different reactive molecules to create a biased probability? Yes, that is possible, but that would not be robust—if any environmental changes resulted in fluctuation of the concentrations, the circuit function would break down. However, if only unbiased choices are involved, the exact concentrations of the reactive molecules may vary, as long as their concentrations remain equal to each other. For example, it should be possible to make the pswitch or splitter in the form of a dimer: Two gate species could be linked together in multiple ways (), and complexes that include both gates could then be gel purified. This way, the concentration of the gates would have to be equal. The fact that arbitrary binary and rational fractions can be realized using just the pswitch and splitter is critical for the possibility of perfecting the building block and enabling even more robust construction of increasingly complex circuits. However, to truly demonstrate a perfectly unbiased switch, further study will be required to explore the different designs and understand their trade-offs.There are at least two other aspects of the DNA-based probabilistic switching circuits that merit further study: First, we used bulk fluorescence kinetics experiments to observe the circuit behavior, but in principle, the stochastic states of individual molecular events could be observed in droplets (27, 28), on microparticle surfaces (29), or on DNA origami surfaces (30, 31). Second, we used a simple clamp design to reduce undesired leak reactions between circuit components (), which was not very effective. Scaling up the complexity of these circuits will require a more advanced design to eliminate undesired reactions, for example by using leakless mechanisms (32).DNA-based probabilistic switching circuits could be directly composed together with previously developed DNA-based analog circuits (11, 13). With a revised design of deterministic switches (), the circuits could also be composed together with previously developed DNA-based logic circuits (9) and neural networks (26). Integrating multiple circuit architectures would enable more powerful molecular information processing in complex biochemical environments, while each part of the circuit could be optimized for a specific task, combining robustness and efficiency. Moreover, what we have shown here could have implications for natural molecular systems in biology and chemistry, as well as for engineered molecular systems in material science and medicine: Any desired probability of an individual molecular event, leading to any desired fraction of molecular species at a specific state, could be generated from just one type of molecular building block that generates an equal probability between two choices, as simple as flipping a coin. Finally, it is now conceivable to create engineered molecular systems with programmable stochastic behavior in simple and compartmentalized environments, and the communication among these simple systems could give rise to complex global behavior.
Materials and Methods
All DNA sequences are listed in . A 1-nt clamp was used in all gate bottom strands to reduce undesired gate–gate interactions. DNA oligonucleotides were purchased from Integrated DNA Technologies. All gate and reporter species were annealed at 20 M in 1 TE buffer with 12.5 mM Mg2+. After annealing, the gate species were purified using 15% PAGE. Fluorescence kinetics experiments were performed at 25 °C.
Authors: Anupama J Thubagere; Chris Thachuk; Joseph Berleant; Robert F Johnson; Diana A Ardelean; Kevin M Cherry; Lulu Qian Journal: Nat Commun Date: 2017-02-23 Impact factor: 14.919
Authors: Boya Wang; Chris Thachuk; Andrew D Ellington; Erik Winfree; David Soloveichik Journal: Proc Natl Acad Sci U S A Date: 2018-12-13 Impact factor: 11.205
Authors: Joseph Berleant; Christopher Berlind; Stefan Badelt; Frits Dannenberg; Joseph Schaeffer; Erik Winfree Journal: J R Soc Interface Date: 2018-12-21 Impact factor: 4.118