Anne Condon1. 1. The Department of Computer Science, 2366 Main Mall, University of British Columbia, Vancouver, British Columbia V6T 1Z4, Canada. condon@cs.ubc.ca
Abstract
Long admired for its informational role in the cell, DNA is now emerging as an ideal molecule for molecular nanotechnology. Biologists and biochemists have discovered DNA sequences and structures with new functional properties, which are able to prevent the expression of harmful genes or detect macromolecules at low concentrations. Physical and computational scientists can design rigid DNA structures that serve as scaffolds for the organization of matter at the molecular scale, and can build simple DNA-computing devices, diagnostic machines and DNA motors. The integration of biological and engineering advances offers great potential for therapeutic and diagnostic applications, and for nanoscale electronic engineering.
Long admired for its informational role in the cell, DNA is now emerging as an ideal molecule for molecular nanotechnology. Biologists and biochemists have discovered DNA sequences and structures with new functional properties, which are able to prevent the expression of harmful genes or detect macromolecules at low concentrations. Physical and computational scientists can design rigid DNA structures that serve as scaffolds for the organization of matter at the molecular scale, and can build simple DNA-computing devices, diagnostic machines and DNA motors. The integration of biological and engineering advances offers great potential for therapeutic and diagnostic applications, and for nanoscale electronic engineering.
The cellular role of DNA is relatively limited, perhaps because of the restrictions imposed by bonding between complementary strands. Outside the cell, however, nanoengineers are uncovering the many hidden talents of DNA. The DNA sequence is able to process information in biochemical assays, its structure is an ideal building material, and its folding pathway allows DNA to move and respond to its environment. Here, I review several innovative applications of designed DNA molecules, some underlying design principles and the prospects for future developments. The applications of new DNA-based technologies range from molecular diagnostics to protein purification and from therapeutics to the assembly of tiny electronic circuits. These uses exploit the properties of DNA at three levels — sequence only, structure (which depends on sequence) and folding pathways (which depend on both sequence and structure) — as the following examples show.By taking their cue from the genetic code found in nature, nanoengineers are now using DNA sequences in vitro to direct the synthesis and evolution of molecules with new functions and reactive properties. When attached to other molecules, detection methods that exploit DNA sequences allow complementary molecules to be identified at extremely low concentrations. DNA sequences can also be used, in a non-biological setting, to control the layout of electronic components in a nanocircuit.DNA strands can fold into stable structures that have valuable functional and material properties. Sensitive methods for detecting DNA have been extended to detect proteins and other macromolecular targets of DNA aptamers — that is, nucleic-acid molecules that have high binding specificity for their targets. Lattices and even three-dimensional (3D) objects have been self-assembled from rigid DNA structures. DNA lattices can organize proteins and nanoelectronic components on a surface with unprecedented precision, making it possible to gain a better understanding of the properties and interactions of proteins.The folding pathway of a DNA molecule to its stable structure allows it to move and perform mechanical functions. The energy released in DNA-folding pathways has been used in vitro to drive motors, providing the ability to release, grab or cleave target molecules and even to control the release of drugs, depending on the outcome of certain diagnostic tests.This review focuses on the recent developments in generating rationally designed DNA molecules and their applications. The past decade has seen huge advances in the development of design principles, particularly by exploiting DNA structure and folding pathways, but also in applications of designed DNA sequences. Together, techniques for the or molecular evolution of DNA molecules from random pools have given spectacular yields, including DNA aptamers and enzymes, which can inhibit the expression of harmful genes or disrupt protein function. Excellent reviews of selected DNA molecules can be found elsewhere[1,2,3]; this paper includes only examples of designed DNA molecules, or molecules that combine both selected and designed features. The examples are organized by the primary property of DNA used: sequence, structure or folding pathway (although the distinctions are sometimes blurred). Overall, I hope that the reader will gain a renewed appreciation for the extraordinary abilities of DNA, and that the range of examples will stimulate ideas for further practical applications.Innovative uses of DNA sequenceMany uses of DNA exploit its sequence properties, along with the ability of a DNA sequence to hybridize with its complement. Following a short description of DNA-sequence properties and the techniques for manipulating these molecules, I describe several applications, ranging from sensitive molecular detection to DNA computing and the assembly of nanoelectronic components. Finally, I describe some challenges and prospects for this field.DNA sequences have three major assets: a four-base digital make-up, an ability to bind a complementary sequence with high affinity, and stability in a wide range of environmental conditions. DNA has an incredible information density — roughly 1 bit per nm3 — that can be retrieved from an ssDNA sequence by its ability to recognize and hybridize with its complement. Hybridization is reversible and, in part, thermodynamically driven: when temperature decreases complementary ssDNA in solution tends to hybridize, and when temperature increases dsDNA tends to denature into single strands. The applications that exploit the features of DNA sequences also depend crucially on efficient technologies for the sequencing, synthesis, amplification (by PCR) and detection (by fluorescence spectroscopy and DNA microarray analysis) of DNA strands.DNA tags, which are sequences that are chemically attached to other molecules, represent the most successful and varied application of DNA-sequence properties. DNA tags are used in the highly sensitive detection of disease markers, the parallel synthesis of new compounds and the discovery of new reagents — as such, they have been applied in disease diagnosis and drug development. Two powerful principles unify the applications of DNA tags. First, new DNA codes[4] can direct the cheap and efficient parallel synthesis of large sets of polymers. In this case, each DNA tag is composed of ordered segments that specify the sequence of polymeric units of one molecule to be synthesized, just as a sequence of natural codons specifies the sequence of amino acids that make up a protein. Second, efficient methods for amplifying, sequencing and detecting DNA can be extended to new types of molecule using their attached DNA tags. Several applications of DNA tags are described next.Molecular-detection methods can help to identify disease markers, such as cytokines in human serum samples, and, in so doing, aid the diagnosis of some cancers and immunodeficiency-related diseases[5]. Detection methods can also identify allergens and pollutants[6]. Highly sensitive detection, at the attomolar (10−18 mole) level or better, is of great practical importance: it allows more efficient screening at blood banks through the pooling of samples, and it facilitates early disease detection when treatment might be more promising and paediatric research when sample sizes are necessarily small[6].Early DNA-based detection methods, such as immuno-PCR[6,7], involved many cumbersome steps, including PCR. Bio-barcode amplification (BCA), which has been developed by Nam and colleagues[8], is a new streamlined method that achieves highly sensitive detection at attomolar concentrations without the need for PCR amplification. In BCA, the target protein to be detected is 'sandwiched' between two probes: a tagging probe and a capture probe (Fig. 1). Both probes contain antibodies, which bind to the target protein such that the protein is sandwiched between the probes. Two key innovations are that many copies of a DNA tag are attached to the tagging probe, and that tags can be removed for detection purposes using a simple denaturing process. BCA has been successfully applied to the detection of prostate-specific antigen (an indicator of prostate cancer and breast cancer[8,9]), amyloid-β-derived diffusible ligands (ADDLs; indicators of Alzheimer disease[10]) and interleukin 2 (a cytokine protein that is involved in inflammation and immune responses in humans[11]). Georganopoulou and colleagues[10] quantified differences in ADDL levels in clinically realistic samples of cerebrospinal fluid that were taken from human tissue. Overall, they found a significantly higher (low femtomolar) concentration of ADDLs in diseased patients, compared with samples from control patients who had not been diagnosed with Alzheimer disease.
Figure 1
Exploiting DNA sequences: sensitive molecular detection.
Adleman[21] used select-and-prune parallel computation to determine whether there is a path (that is, a sequence of linked nodes) through a network, such that the path visits each node exactly once. In the illustrated example (simplified from the Adleman experiment), the input set contains a sample of five paths in a simple four-node network (shown on the left). Nodes are represented by short DNA strands, and paths are represented as concatenations of the corresponding node strands. The first round, which is accomplished by gel electrophoresis and shown on the right, selects paths that have exactly four nodes, therefore pruning two paths from the input sample. The second, third, fourth and fifth rounds prune out paths that do not visit node 1, 2, 3 or 4, in corresponding order (accomplished using DNA-sequence complementarity and magnetic-bead separation). Only one path remains at the end of the computation, revealing the solution: 1→3→2→4.
Exploiting DNA sequences: DNA computation.
Adleman[21] used select-and-prune parallel computation to determine whether there is a path (that is, a sequence of linked nodes) through a network, such that the path visits each node exactly once. In the illustrated example (simplified from the Adleman experiment), the input set contains a sample of five paths in a simple four-node network (shown on the left). Nodes are represented by short DNA strands, and paths are represented as concatenations of the corresponding node strands. The first round, which is accomplished by gel electrophoresis and shown on the right, selects paths that have exactly four nodes, therefore pruning two paths from the input sample. The second, third, fourth and fifth rounds prune out paths that do not visit node 1, 2, 3 or 4, in corresponding order (accomplished using DNA-sequence complementarity and magnetic-bead separation). Only one path remains at the end of the computation, revealing the solution: 1→3→2→4.Single-walled carbon nanotubes (CNTs) have been used to realize tiny electronic transistors, which have potential for assembly into nanocircuits. There are several challenges to building a circuit from CNTs, however, including controlling the layout of the component CNTs and placing the wires to connect the components according to a specific pattern. DNA tags (and also tags made of PNA) offer great promise in meeting these challenges. DNA tags that are attached to CNTs can direct their assembly by binding a tag with its complement[23]. DNA strands have been used to spatially position CNTs, making it possible to assemble them into circuits[24]. In addition, ssDNA can be stretched across a surface to connect two electrodes by binding the ends of the ssDNA with complementary strands that are attached to the electrodes. Silver can then be deposited along the stretched DNA strand, to create a nanowire that connects the electrodes[25,26].Varying the DNA strands that are wrapped around CNTs alters the electrostatic properties of the CNT–DNA hybrid in a way that depends on the diameter of the CNT[27]. DNA-wrapped CNTs also enhance the sensitivity of nanotube devices in the detection of gases[28]. Storhoff and Mirkin[29], and Niemeyer[30] review further uses of DNA for materials synthesis — for example, in the assembly of networks of gold nanoparticles or the organization of enzymes so as to speed up chemical reactions[31].DTS and the involved reactions have excellent potential for use in the discovery of drugs or other new compounds. For practical applications, it will be necessary to broaden the scale and range of these methods. An adaptive approach involving repeated rounds of synthesis, in which templates of already-synthesized molecules that show some useful function could be mutated or recombined, might diversify the synthesis pool[17]. Another possibility is to integrate DNA-computing methods, which select molecules on the basis of the information content of their labels, with methods that select or organize molecules on the basis of their function.Innovative uses of DNA structureThe diverse structures that DNA can have vastly expand its potential uses in nanotechnology and biotechnology applications. Structure is the key feature that allows the specific binding of DNA to proteins. It is now also possible to engineer rigid scaffolds from DNA. The integration of the scaffolding and binding structural properties of DNA yields two-dimensional (2D) ordered arrays of proteins, which might be useful for the detailed study of protein structure. DNA structure even supports computation, making it possible, in principle, to program the layout of arbitrarily complex 2D or 3D patterns from DNA.DNA structure, which might involve one or more DNA molecules, is largely shaped by the hybridization of complementary intermolecular or intramolecular base pairs. The most stable structure formed by DNA molecules in solution depends on environmental factors, such as temperature, pH level and salt concentration. The double helix is just one possible conformation; structures might have branching helices that stem from junctions or loops (Fig. 4a). The stacking of adjacent base pairs causes short helices to be rigid, unlike ssDNA. Some DNA molecules enjoy further structural complexity, such as G quartets (Fig. 4b), which can fit snugly in crevices of proteins, and therefore can have high binding specificity for important protein targets. One DNA aptamer exploits a G-quartet structure to bind to thrombin and can inhibit thrombin-catalysed fibrin-clot formation in vitro in samples of human plasma[32].
Figure 4
Exploiting DNA structure: branched structures.
a | The nanobarcode, which is a branched structure with fluorescent labels at the tips of its branches[33], is constructed from Y-shaped components, each formed from three DNA strands (indicated by colours). Each Y-shaped structure binds to another by complementary sticky ends. Ligation is then used to seal the nicks (at positions where distinct strands meet) in the overall structure. Nanobarcodes are distinguished by the ratio of red and green fluorophores (represented by circles) on their branches, and have been used for multiplexed detection. For each type of protein to be detected, a probe is selected that has high binding specificity for the target. A tagging probe is constructed by attaching nanobarcodes of common type (that is, ratio of red to green fluorophores) to copies of the probe. The tagging probe and an additional large capture probe (not shown) are introduced into a solution; if the target is present in the solution, it is sandwiched between the tagging and capture probes, and detected by fluorescence spectroscopy. b | A G-quartet structure in a nucleic-acid aptamer, which binds to thrombin (not shown) in the presence of potassium ions[81].
Exploiting DNA structure: branched structures.
a | The nanobarcode, which is a branched structure with fluorescent labels at the tips of its branches[33], is constructed from Y-shaped components, each formed from three DNA strands (indicated by colours). Each Y-shaped structure binds to another by complementary sticky ends. Ligation is then used to seal the nicks (at positions where distinct strands meet) in the overall structure. Nanobarcodes are distinguished by the ratio of red and green fluorophores (represented by circles) on their branches, and have been used for multiplexed detection. For each type of protein to be detected, a probe is selected that has high binding specificity for the target. A tagging probe is constructed by attaching nanobarcodes of common type (that is, ratio of red to green fluorophores) to copies of the probe. The tagging probe and an additional large capture probe (not shown) are introduced into a solution; if the target is present in the solution, it is sandwiched between the tagging and capture probes, and detected by fluorescence spectroscopy. b | A G-quartet structure in a nucleic-acid aptamer, which binds to thrombin (not shown) in the presence of potassium ions[81].Substantial progress has also been made recently in the design of branched and rigid shapes from DNA. These new structures support the creation of 2D scaffolds and complex patterns, and even 3D structures. Single molecules can now be arrayed with nanoscale precision on DNA scaffolds. Because many exciting developments in engineered DNA structures have been reported in the past year or so, they are the primary focus of this section.Sensitive detection of molecules can take advantage of structure as well as sequence. Li et al.[33]and Stavis et al.[34] considered the case of multiplexed detection, where the goal is to determine which of many DNA fragments, derived from pathogens, are present in solution. Instead of sequence tags, Li et al.[33] used branched DNA structures called nanobarcodes, with red or green fluorophores on the branch tips (Fig. 4a). Nanobarcodes are distinguishable, using fluorescence spectroscopy, by their ratios of red to green fluorophores. As proof of concept, nanobarcodes were used to detect the presence or absence of DNA fragments from four pathogens (Bacillus anthracis, Francisella tularensis, Ebola virus and the severe acute respiratory syndrome (SARS) coronavirus) in a prepared solution with a detection limit of 620 attomoles. Stavis et al.[34] showed that, by flowing nanobarcode tags through microfluidic channels, more rapid detection is possible.A long-term goal of Seeman and co-workers[35,36] has been to build lattices and other rigid structures from DNA in a controlled scalable fashion. DNA lattices can form a 2D scaffold on which biological macromolecules can be arrayed. DNA lattices could also bring enzymes into proximity, thereby catalysing numerous reactions on a large scale, and serve as guides for the placement of nanoelectronic components on a surface, thereby forming a nanocircuit. In a 3D DNA lattice, the cavities could serve as cages in which proteins are arrayed, making it possible to uncover new properties of the macromolecules with X-ray crystallography.Many DNA scaffolds are constructed from rectangular structures, called tiles, which are composed of numerous short DNA strands. Emanating from these tiles are one or more single strands, called sticky ends, which can bind to complementary sticky ends on other tiles. Self-assembly of large structures from small tiles ensues when many tiles bind to each other. Important challenges in tile design are ensuring that the tiles are rigid (a necessary property for scaffolding) and that adjacent assembled tiles are co-planar.The double-crossover (DX) molecule of Fu and Seeman[37] (Fig. 5a) is an early, elegant and widely used design, from which the first 2D DNA lattices were built by Winfree et al.[38]. Other designs and applications have followed[39,40,41], which additionally incorporate strategies to array protein molecules in regular arrangements on the lattice. Such arrays could be used to study protein structure using electron microscopy or to construct spatially well-defined complexes involving several enzymes. Specific examples are the lattices of Yan et al.[39] and Park et al.[40], in which streptavidin binds to biotinylated DNA molecules at lattice junctions, thereby precisely controlling the arrangement of the streptavidin molecules, for example, with a distance of approximately 20 nm between adjacent molecules (Fig. 5b). Liu et al.[41] incorporated the DNA aptamer for thrombin (Fig. 4b) into a hairpin structure at regular points of a one-dimensional (1D) lattice. This approach should, in principle, allow organization of any molecule for which there is a DNA aptamer.
Figure 5
Exploiting DNA structure: double-crossover structures and self assembly.
Construction of a two-dimensional aperiodic lattice.
Four types of double-crossover (DX) molecule, distinguished by their sticky ends (a), represent four types of tile from which a Sierpinski triangle can self-assemble (b). Tiles assemble upwards from the blue frame, starting from the position marked with the red asterisk, forming patterned rows (c). Each tile is marked with either 0 or 1 (corresponding to the sticky ends of the DX molecule) on each of its four corners, and is added to the assembly in such a way that the mark on its lower-left corner matches the mark on the adjoining tile immediately to its left in the row below it, and similarly the mark on its lower-right corner matches the mark on the adjoining tile immediately to its right in the row below it. Crucial to the success of the construction is that the two sticky ends on the lower end of a DX molecule (tile) bind cooperatively to the upper sticky ends of the two matching DX molecules (tiles) below it; a tile with just one matching sticky end is considered to be too weak to persist. Analogously, if working on a puzzle, by moving inward from one corner of the frame and ensuring that each inserted puzzle piece properly connects with two pieces already in the puzzle, mistakes can be reduced. The proof proposed by Winfree to show that DNA self-assembly is able to perform general-purpose computation builds on the classical work of Wang[82], and relies on the cooperative binding of DNA tiles. An atomic-force microscopy image (d; courtesy of Erik Winfree, California Institute of Technology) shows a recognizable Sierpinski triangle fragment, with each error marked by a red cross. Red asterisks again represent starting points of self-assembly. Scale bar represents 100 nm. Reproduced with permission from Ref. 54.
Construction of a two-dimensional aperiodic lattice.
Four types of double-crossover (DX) molecule, distinguished by their sticky ends (a), represent four types of tile from which a Sierpinski triangle can self-assemble (b). Tiles assemble upwards from the blue frame, starting from the position marked with the red asterisk, forming patterned rows (c). Each tile is marked with either 0 or 1 (corresponding to the sticky ends of the DX molecule) on each of its four corners, and is added to the assembly in such a way that the mark on its lower-left corner matches the mark on the adjoining tile immediately to its left in the row below it, and similarly the mark on its lower-right corner matches the mark on the adjoining tile immediately to its right in the row below it. Crucial to the success of the construction is that the two sticky ends on the lower end of a DX molecule (tile) bind cooperatively to the upper sticky ends of the two matching DX molecules (tiles) below it; a tile with just one matching sticky end is considered to be too weak to persist. Analogously, if working on a puzzle, by moving inward from one corner of the frame and ensuring that each inserted puzzle piece properly connects with two pieces already in the puzzle, mistakes can be reduced. The proof proposed by Winfree to show that DNA self-assembly is able to perform general-purpose computation builds on the classical work of Wang[82], and relies on the cooperative binding of DNA tiles. An atomic-force microscopy image (d; courtesy of Erik Winfree, California Institute of Technology) shows a recognizable Sierpinski triangle fragment, with each error marked by a red cross. Red asterisks again represent starting points of self-assembly. Scale bar represents 100 nm. Reproduced with permission from Ref. 54.Techniques for the scalable assembly of rigid 2D and 3D DNA structures suffer from errors that are due to inherent limitations in the specificity of sticky ends and the control of hybridization kinetics. Error-correction techniques have the potential to rectify mistakes in the self-assembly process[55,56]. DNA self-assembly of electronic components raises new research challenges, and opportunities for the design and modelling of circuit architectures[57].Because, at present, there is limited understanding of the rules of formation of DNA tertiary structure, the development of DNA aptamers and enzymes has employed in vitro-selection methods rather than rational design. However, to develop larger DNA molecules with new binding properties, it is likely that a combination of rational design and selection will be effective. Combined approaches have already been used for the design of new RNA molecules[58].Innovative uses of DNA-folding pathwaysWhen a DNA molecule is not in its stable structure, perhaps because of a change in environmental conditions, it switches to a form that is stable in the new environment. In doing so, the molecule typically passes through a sequence of intermediate structures, called the folding pathway. The folding pathway of DNA can be put to good use in molecular detection to diagnose and respond to environmental conditions, to release energy (which can be used to drive nanoscale motors) and to execute computations.Figure 7 illustrates several folding-pathway (or structure-switching) scenarios. In a typical case, DNA molecules are initially bound to each other, but have some single-stranded regions. When a new DNA molecule is introduced that can bind with a single-stranded region, it can displace base pairs by a process called branch migration. This displacement can cause the potential energy trapped in loops of the initial structure to be released, thereby providing fuel for DNA devices, such as DNA tweezers[59], and can also cause the movement of the DNA molecules. Simmel and Dittmer[60] provide a comprehensive overview of several feats of motion by DNA molecules, including unidirectional walking and movement along a track. Such devices might aid in the transport of tiny molecules in a nanoscale system. Alternatively, changes in buffer conditions, which are achieved by introducing positively charged ions, can cause a DNA helix to come apart and switch from its usual B form (in which the helix twists to the right) to the less standard Z form (in which the helix twists to the left)[61,62]. The addition of enzymes that cleave DNA also causes a change in the environment and a corresponding change in the stable structure of the collection of DNA molecules, which could be used to detect the presence of the enzymes. The following examples highlight work that directly explores the potential for applications of nanodevices that exploit folding pathways.
Authors: Ravinderjit S Braich; Nickolas Chelyapov; Cliff Johnson; Paul W K Rothemund; Leonard Adleman Journal: Science Date: 2002-03-14 Impact factor: 47.728
Authors: Daniel A Heller; Esther S Jeng; Tsun-Kwan Yeung; Brittany M Martinez; Anthonie E Moll; Joseph B Gastala; Michael S Strano Journal: Science Date: 2006-01-27 Impact factor: 47.728
Authors: Samuel M Stavis; Joshua B Edel; Yougen Li; Kevan T Samiee; Dan Luo; Harold G Craighead Journal: Nanotechnology Date: 2005-04-15 Impact factor: 3.874
Authors: Amy C Flor; Jimmy H Williams; Kelly M Blaine; Ryan C Duggan; Anne I Sperling; David A Schwartz; Stephen J Kron Journal: Chembiochem Date: 2013-12-27 Impact factor: 3.164