| Literature DB >> 18445293 |
Damien Marsic1, Ronny C Hughes, Miranda L Byrne-Steele, Joseph D Ng.
Abstract
BACKGROUND: Gene synthesis technologies are an important tool for structural biology projects, allowing increased protein expression through codon optimization and facilitating sequence alterations. Existing methods, however, can be complex and not always reproducible, prompting researchers to use commercial suppliers rather than synthesize genes themselves.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18445293 PMCID: PMC2408586 DOI: 10.1186/1472-6750-8-44
Source DB: PubMed Journal: BMC Biotechnol ISSN: 1472-6750 Impact factor: 2.563
Oligonucleotides for the synthesis of polA
| f22 | TTTGTTTAACTTTAAGAAGGAGATATACATATGCTCCGCAAGG |
| f21 | ATATACATATGCTCCGCAAGGTCGAGGTACAAACCGTAGAGCCGGAGGACCGTGAAAGCT |
| f20 | GGAGGACCGTGAAAGCTGGAACGACTTCCTGGAAAATCCACTCCTCAGCCTGTGGCTCGA |
| f19 | TCAGCCTGTGGCTCGAGATGGACGGCGAAAACTATCATCGCGCAGAAATCATCGGTCTCG |
| f18 | GCAGAAATCATCGGTCTCGCACTGTCTGATGGTGAGACCCATCTGTACGTCCCTTGGCGC |
| f17 | GTACGTCCCTTGGCGCACTGCGCGTGACTGGGAGAACCTCCACCGTCTGCTCGCGGACGA |
| f16 | TCTGCTCGCGGACGAGGAACGCAAGAAGATTGTTTATGATGGTAAACGTCTGCAAGTGGT |
| f15 | GTAAACGTCTGCAAGTGGTGCTGAAGCGTCGTGGTCTGGAGGCTGGCGGTCTGGCTTTCG |
| f14 | GGCGGTCTGGCTTTCGACGCGCTCCTGGCTTCTTATCTGCTGGACCCGTCCGAATCTGGC |
| f13 | CCCGTCCGAATCTGGCCACTCTCTGTCCGACCTGGTACAGCGTAAGATGGATGGCTCTCT |
| f12 | CGTAAGATGGATGGCTCTCTCCCACCTGATGAGGAGGTTTACGGTAAAGGTGCCAAACGC |
| f11 | GGTAAAGGTGCCAAACGCCGCCTGCCAGGTGAGCGCGAACTCGCTGAACATCTCGCGCGT |
| f10 | CTGAACATCTCGCGCGTAAAGCGGAAGCGCTCAAGCGCCTCTACCCACTGCTGAGCGAGG |
| f9 | CCACTGCTGAGCGAGGAGATCCGTGAGGCCGGTATGGAGAGCCTGCTCTTTGAAATGGAA |
| f8 | GCCTGCTCTTTGAAATGGAACTGCCACTGAGCCGTGTTCTGGCAGAGATGGAGCTGCATG |
| f7 | CAGAGATGGAGCTGCATGGCGTACGTGTCGACCGCGACCGTCTCCTGGACCTGGGTGAAG |
| f6 | CCTGGACCTGGGTGAAGAACTCAAGGAACAGGCAGAATCCCTCACGCGTCAAATCTACGA |
| f5 | TCACGCGTCAAATCTACGAACTGGCGGGTACCGAATTCAACATTAACTCTCCGAAACAGC |
| f4 | ACATTAACTCTCCGAAACAGCTCGCCGAGATTCTCTACGACAAACTGGGTCTGCCGGTTC |
| f3 | CTGGGTCTGCCGGTTCTCAAAAAGACTAAAACGGGCTATTCTACCTCTGCGGACGTTCTG |
| f2 | CCTCTGCGGACGTTCTGGAGAAACTGGCACCGCAACATGAAATCGTTGAAAAAATCCTGC |
| f1 | GAAATCGTTGAAAAAATCCTGCACTACCGCCAAATCATGAAACTGATCTCCACCTATGTC |
| r1 | TTTACCGGATTCCGGGTCGATTTCCTTCAGGAGGCCCTCGACATAGGTGGAGATCAGTTT |
| r2 | GCGGCCAGTTGCAGTGATAGTCTGGTTGAAACGCGTGTGAATTTTACCGGATTCCGGGTC |
| r3 | CAGGCGAATCGGGATATTTTGCAGGTTCGGCTCGGTAGAGCTGAGGCGGCCAGTTGCAGT |
| r4 | CAGACGGAACGAATACCTGACGGATGCGACGACCTTCCTCCAGGCGAATCGGGATATTTT |
| r5 | AATCTGAGAGTAATCTGCGCTCAGGATCTGCCAACCCGGTTCAGACGGAACGAATACCTG |
| r6 | GAGACTCGTCGCCAGACAGATGAGCCAGTACGCGGAGCTCAATCTGAGAGTAATCTGCGC |
| r7 | GTCTTAGTATGAATGTCCATGTCTTCGCTAAAAGCCTGTTTGAGAGACTCGTCGCCAGAC |
| r8 | AACTTCATCTTCAGGGACACCGAACACATCCATAGCGGTCTTAGTATGAATGTCCATGTC |
| r9 | CGAAATTAACCGCCTTTGCTTGACGACGCATGAGAGAGGTAACTTCATCTTCAGGGACAC |
| r10 | GTTTTGGGACAGGCCGTAATCAGAAATGCCGTAGATGATACCGAAATTAACCGCCTTTGC |
| r11 | AACGTTCAATGAATTGCGCCGCTACTTTACGTGGGATGTTGAGGTTTTGGGACAGGCCGT |
| r12 | ACGATCCATGTATTCTTTAACACCAGGGTAAGATTGGAAGTAACGTTCAATGAATTGCGC |
| r13 | TGGTAACGTAACCGTCTTTACGGGCTTGCTCTACAACACGATCCATGTATTCTTTAACAC |
| r14 | AGCGAATTTCCGGGAGGTAGCGACGGCGGTTGAGCATGGTGGTAACGTAACCGTCTTTAC |
| r15 | ATCGCCGTACGTTCCGCGAAAGTGCGGCGATTGTAGTTACGAGAGCGAATTTCCGGGAGG |
| r16 | CGGTTTTGATAATATCTGCCGCAGAACCCTGGATCGGGGTGTTCATCGCCGTACGTTCCG |
| r17 | CGCGGCGACGTTTGATCTCACGATGCAGGCGTACCATGGCGGTTTTGATAATATCTGCCG |
| r18 | CAAAAATGAGTTCGTCGTGAACTTGGAGCAGCATACGAGACTTAACGCGGCGACGTTTGA |
| r19 | CCAGATTTTTCATTTCTTCGAGTTCCTCTTCTGGAACTTCAAAAATGAGTTCGTCGTGAA |
| r20 | ACGGAGAGCGGGACCGCTTGTTCCATAACCGTACGAACCAGATTTTTCATTTCTTCGAGT |
| r21 | GCTTCATACCAAGTCTGACCGGTGTGGATATCAACTTTGAGCGGCACGGAGAGCGGGACC |
| r22 | CTTCCTTTCGGGCTTTGTTAGCAGCCGGATCCTTATTTGGCTTCATACCAAGTCTGACC |
Oligonucleotides for the synthesis of paz
| f4 | TTTGTTTAACTTTAAGAAGGAGATATACATATGAGCCACCAGATCCGTTCTAAGAAAACCCTGTGGGAAC |
| f3 | CTAAGAAAACCCTGTGGGAACTCGTTGGTCGTAATAAAGACGCGCTGCGTGATTTCCTGAAAGAACACCGTGGCA |
| f2 | TGAAAGAACACCGTGGCACCATCCTGCTCCGTGACATCGCGTCTGAACACAAAGTTGTTTACAAACCGATCTTCA |
| f1 | GTTGTTTACAAACCGATCTTCAAACGCTACAACGGTGACCCGGACCTGATCGAAGACAACTCTAACGACGTTGAA |
| r1 | TTTTTTCAGTTCCGGGGTGTTCCAGTAACGTTCCAGGTGGTAGTCGTACCAGTGTTCAACGTCGTTAGAGTTGTC |
| r2 | TTCGCCAGGATGATTGGTTGGTTCAGGTCAACCGGACCGAACTTTTTGTAGAACTCTTTTTTCAGTTCCGGGGTG |
| r3 | ACAACGAACTGCGGCAGGAGGTGAACCAGGTCACCACGGTTGTGCTGACGCAGTGGTTTCGCCAGGATGATTGGT |
| r4 | CTTCCTTTCGGGCTTTGTTAGCAGCCGGATCCTTAGTTATAAACCGGAACAACGAACTGCGGCA |
Figure 1SeqTBIO method of PCR-based gene synthesis for . The nucleotide sequences paz and polA were assembled through a sequence of 4 and 22 reactions respectively and analyzed by agarose gel electrophoresis. Fragment sizes were compared against molecular markers ranging from 100 to 2000 bp. Two oligonucleotides were used for each amplification reaction starting with the central pair (f1 and r1), followed by the next pairs (f2 and r2, then f3 and r3, etc.) in the subsequent reactions, in an inside-out manner. The oligonucleotides overlapped by 15 to 23 nt. The entire paz sequence was synthesized in 4 reactions showing single products in each reaction with traces of unincorporated primers (A, lanes 1–4). The robustness of sequential step DNA assembly is shown for the synthesis of polA where single band fragments are clearly seen for each assembly step (B, lanes 1–22). Lane F shows the final synthesis product of polA. The assembled gene products for both proteins contained 30 nt of sequence at the 5'and 3' termini homologous to the terminal ends of a linearized plasmid vector of pET-3a for subsequent in vivo homologous recombination. These homologous regions (HR) were designed within the last outside primers sets used in the assembly process.
Figure 2Scheme of error correction using coupled methods of site directed mutagenesis and homologous recombination. Assembled synthetic gene fragments subcloned into a plasmid vector are subjected initially to sequence analysis of the synthesized product to detect nucleotide errors. Error correction can be conducted in two ways using oligonucleotide primers (30–35 nucleotides long) that are designed to include the correcting nucleotide (when needed) at the midpoint. First, mutagenic primers are targeted exclusively against the assembled synthetic gene (route A). Two, three and four pairs of primers are required to correct one, two and three point mutation (ptm) sites respectively. DNA fragments are amplified by two or all primer pairs F1-R1, F2-R2, F3-R3 and F1-R4 in separate reactions. The terminal primers have overlapping homologous regions with that of the targeted plasmid vector. Reaction products are mixed for transformation into an appropriate cell host. The second approach involves the amplification of the plasmid vector (routes B-D). To remove 3 point mutations, two correcting primers, reverse-complement of each other, are designed at each mutation site, with the correcting nucleotide being at the midpoint of each primer. DNA fragments are amplified by PCR using primer pairs F1-R1, F2-R2 and F3-R3 respectively in 3 separate reactions (D). Two pairs of primers are similarly used for 2 point mutations involving only 2 separate reactions (C). Single site error correction requires a non-mutagenic primer pair corresponding to a sequence in the vector backbone in addition to the correcting primer set such that 2 fragments are generated (as if 2 corrections were being made). Products of the correcting reactions are retransformed into competent cells for plasmid isolation and sequencing. Upon verification of error free clones, the plasmids are then transformed into an appropriate host cell for protein expression.
Figure 3SDS-PAGE analysis and crystallization of PAZ and PolA. Purified recombinant proteins PAZ and PolA are shown in the upper panels (lanes 2) against standard molecular markers (lanes 1) containing myosin, phosphorylase, BSA, glutamic dehydrogenase, alcohol dehydrogenase, carbonic anhydrase, myoglobin red, lysozyme, aprotinin and insulin B chain on a 11% polyacrylamide gel. Approximately 15 μg of purified protein was loaded in each gel (lanes 2) and visualized with Coomassie brilliant blue. The arrows indicate recombinant proteins at about 15 kDa and 70 kDa for PAZ and PolA respectively. The proteins were estimated to be more than 90% homogeneous. Crystals of PAZ and PolA are shown in the bottom panels along with their corresponding space group and unit cell parameters as determined by preliminary X-ray analysis.
Figure 4Comparative gene synthesis of polA coding region by methods of TBIO and SeqTBIO. The oligonucleotide pairs f1-r1, f2-r2, f3-r3 and f4-r4 were used to attempt the synthesis of a 351 bp central fragment of the polA coding region. The initial TBIO gene synthesis included the mixture of 8 primers and the reaction was subjected to 30 cycles of PCR amplification (lane 2). An early version of the SeqTBIO approach was performed in parallel (lanes 3–6) using the first 3 oligonucleotide pairs only. The central pair (f1-r1) was initially assembled (lane 3) followed by next pairs f2-r2 (lane 4, 5) and f3-r3 (lane 6). The TBIO reaction produced a smear and did not seem to generate a unique fragment but instead a range of product sizes ranging from 190 bp – 600 bp. In contrast, each SeqTBIO reaction generated a single-band product of the expected size. A PCR negative control is shown in lane 1. The reaction products were analyzed by electrophoresis on a 1.2% agarose gel stained with ethidium bromide and visualized by UV illumination.
Figure 5Gene to crystal flowchart. Gene synthesis can be integrated into an experimental pipeline where proteins can be designed for crystallization screening. It is possible to design variations of protein constructs by exploiting the SeqTBIO method of sequential DNA assembly (path 1 to 6). Different constructs can be screened for expression in 96-well format array and evaluated for the protein solubility and stability (path 7). In our case, we used heat stability as a biochemical selection criterion. However, recombinant tags can be easily incorporated at the DNA synthesis level whereby the resulting expressed proteins can be selected with affinity binding matrices (e.g. His or GST tags). Large scale production can then be performed for crystallization trials (path 8–9). Initial crystals obtained would undergo optimization and preliminary X-ray analysis (path 10). Further protein sequence changes can be made if necessary for further processing (path 11).