| Literature DB >> 25340050 |
Nathan Gould1, Oliver Hendy2, Dimitris Papamichail1.
Abstract
Advances in DNA synthesis have enabled the construction of artificial genes, gene circuits, and genomes of bacterial scale. Freedom in de novo design of synthetic constructs provides significant power in studying the impact of mutations in sequence features, and verifying hypotheses on the functional information that is encoded in nucleic and amino acids. To aid this goal, a large number of software tools of variable sophistication have been implemented, enabling the design of synthetic genes for sequence optimization based on rationally defined properties. The first generation of tools dealt predominantly with singular objectives such as codon usage optimization and unique restriction site incorporation. Recent years have seen the emergence of sequence design tools that aim to evolve sequences toward combinations of objectives. The design of optimal protein-coding sequences adhering to multiple objectives is computationally hard, and most tools rely on heuristics to sample the vast sequence design space. In this review, we study some of the algorithmic issues behind gene optimization and the approaches that different tools have adopted to redesign genes and optimize desired coding features. We utilize test cases to demonstrate the efficiency of each approach, as well as identify their strengths and limitations.Entities:
Keywords: codon bias; codon context; computational biology; gene design; synthetic biology
Year: 2014 PMID: 25340050 PMCID: PMC4186344 DOI: 10.3389/fbioe.2014.00041
Source DB: PubMed Journal: Front Bioeng Biotechnol ISSN: 2296-4185
Gene design tools.
| Gene design tool | Web URL | Reference |
|---|---|---|
| DNAWorks | Hoover and Lubkowski ( | |
| Jcat | Grote et al. ( | |
| Synthetic gene designer | Wu et al. ( | |
| GeneDesign | Richardson et al. ( | |
| Gene Designer 2.0 | Villalobos et al. ( | |
| OPTIMIZER | Puigbò et al. ( | |
| Visual gene developer | Jung and McDonald ( | |
| Eugene | Gaspar et al. ( | |
| mRNA Optimizer | Gaspar et al. ( | |
| COOL | Chin et al. ( | |
| D-Tailor | Guimaraes et al. ( |
Overview of gene design tools’ features.
| Gene design tool | Oligo generation | Rho-independent transcription terminator elimination | Prokaryotic ribosome binding site elimination | Codon usage | Codon context | mRNA secondary structure | GC/AT content | Restriction site manipulation | Motif avoidance | Repetitious base removal | Codon auto- correlation adjustment | Hidden stop codons | Hydropathy index optimization |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| DNAWorks | X | X | X | ||||||||||
| Jcat | X | X | X | X | |||||||||
| Synthetic gene designer | X | X | X | X | |||||||||
| GeneDesign | X | X | X | ||||||||||
| Gene Designer 2.0 | X | X | |||||||||||
| OPTIMIZER | X | X | X | X | X | ||||||||
| Visual gene developer | X | X | X | X | X | X | X | ||||||
| Eugene | X | X | X | X | X | X | X | X | |||||
| mRNA Optimizer | X | ||||||||||||
| COOL | X | X | X | X | X | X | X | ||||||
| D-Tailor | X | X | X | X | X |
Availability and ease of installation/use of computational gene design tools.
| Gene design tools | Availability | Operating system | Implementation language | Source code availability | Ease of installation | Ease of use |
|---|---|---|---|---|---|---|
| DNAWorks | Web based | Any | Fortran90 | N/A | N/A | Easy |
| Jcat | Web based | Any | Java | N/A | N/A | Easy |
| Synthetic gene designer | Web based | Any | PHP, Javascript, and Perl | N/A | N/A | Medium |
| GeneDesign | Web based | Any | Perl and C | N/A (github repository unavailable) | N/A | Easy |
| Gene Designer 2.0 | Stand-alone | Mac OS X, Windows, Linux | Unknown | N/A | Easy | Medium |
| OPTIMIZER | Web based | Any | PHP | N/A | N/A | Medium |
| Visual gene developer | Stand-alone | Windows | .Net Framework | Partially available | Easy | Medium |
| Eugene | Stand-alone | Mac OS X, Windows, Linux | Java | N/A | Easy | Easy |
| mRNA Optimizer | Stand-alone, command line | Mac OS X, Windows, Linux | Java | N/A | Easy | Medium |
| COOL | Web based | Any | Perl (functional back-end) | Partially available | N/A | Medium |
| D-Tailor | Stand-alone | Mac OS X, Windows, Linux | Python | Available | Hard | Hard |
.
.
The terms “easy,” “medium,” and “hard” are subjective and refer to the ease of installation and use of the tools by a user with limited to moderate ICT and programing skills.
Figure 1Gene design software data flow.
Figure 2Codon usage distributions of gene designs generated by optimization tools.
MFE optimization experiments of GFP mRNA.
| Tool | Minimized MFE | Maximized MFE | Running time (s) | |
|---|---|---|---|---|
| Maximization | Minimization | |||
| Eugene | −202.20 | −74.40 | 8.4 | 8.4 |
| mRNA Optimizer | −133.80 | −76.67 | 0.0 | 10.2 |
| D-Tailor | −190.40 | −99.04 | 144.5 | 142.0 |