| Literature DB >> 31584076 |
Miguel Angel Martínez1, Ana Jordan-Paiz1, Sandra Franco1, Maria Nevot1.
Abstract
Synthetic genome recoding is a new means of generating designed organisms with altered phenotypes. Synonymous mutations introduced into the protein coding region tolerate modifications in DNA or mRNA without modifying the encoded proteins. Synonymous genome-wide recoding has allowed the synthetic generation of different small-genome viruses with modified phenotypes and biological properties. Recently, a decreased cost of chemically synthesizing DNA and improved methods for assembling DNA fragments (e.g. lambda red recombination and CRISPR-based editing) have enabled the construction of an Escherichia coli variant with a 4-Mb synthetic synonymously recoded genome with a reduced number of sense codons (n = 59) encoding the 20 canonical amino acids. Synonymous genome recoding is increasing our knowledge of microbial interactions with innate immune responses, identifying functional genome structures, and strategically ameliorating cis-inhibitory signaling sequences related to splicing, replication (in eukaryotes), and complex microbe functions, unraveling the relevance of codon usage for the temporal regulation of gene expression and the microbe mutant spectrum and adaptability. New biotechnological and therapeutic applications of this methodology can easily be envisaged. In this review, we discuss how synonymous genome recoding may impact our knowledge of microbial biology and the development of new and better therapeutic methodologies.Entities:
Year: 2019 PMID: 31584076 PMCID: PMC6846928 DOI: 10.1093/nar/gkz831
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.The standard genetic code and codon bias. Each amino acid is encoded by multiple codons, except methionine and tryptophan. On the right are the human frequencies per thousand for each codon. The Kazusa database (http://www.kazusa.or.jp/codon/) was used to compile human codon usage. Codons carrying CpG and UpA dinucleotides are underrepresented.
Methods for large-scale genome synonymous recoding
| Method | References |
|---|---|
| Codon bias usage | ( |
| Codon-pair usage | ( |
| Dinucleotide frequency (e.g. CpG/UpA) | ( |
| Codon-triplet bias usage | ( |
| Codons that could generate stop mutations after a single nucleotide substitution | ( |
Figure 2.Synthetic DNA oligonucleotide assembly and cloning. Overlapping oligonucleotides encoding a DNA duplex are successively assembled via polymerase chain reaction (PCR) or isothermal amplification. Designed DNA duplex are now cloned in yeast or bacterial plasmid that after E. coli transformation can be profusely amplified, sequenced and linearized for further cloning.
Figure 3.Synthetic DNA generation of a full-length bacterial genome. (A) Oligonucleotides (100–200 nucleotides long) are recombined by PCR, cloned in a bacterial plasmid, and sequenced. Synthetic DNA fragments from plasmids of ∼10 kb (pink, green, and blue) are assembled by homologous recombination in Saccharomyces cerevisiae using bacterial artificial chromosomes (BACs) to generate a length of ∼100 kb. BAC fragments contain Cas9 cleavage sites (black arrows) that allow them to be liberated and to replace genomic bacterial DNA by homologous recombination. The double-selection cassette (−1, +1) (e.g. rpsL-kanR) ensures the integration of the synthetic DNA, and the double-selection cassette (−2, +2) on the genome ensures the removal of the corresponding wild-type DNA. (B) Different BAC fragments have homologous overlapping sequences and corresponding selection cassettes for sequentially and iteratively replace the bacterial genome in the precise genome location.
Synonymous recoding as a new tool for exploring microbial biology
| Explored features | Microbe | Targeted genome region | References |
|---|---|---|---|
| Reduced number of synonymous sense codons |
| Full-genome | ( |
| Specific mRNA structures required for replication | Poliovirus | P2 and P3 | ( |
| Virus genome cis-inhibitory signal sequences relevant to complex viral functions | Respiratory Syncytial Virus | NS1 and NS2 | ( |
| Non-coding control features embedded within protein-coding sequences |
| Full-genome | ( |
| Immune innate response | Influenza Virus | Segment 5 | ( |
| HIV-1 | Env and Gag | ||
| SIV | Gag and Pol | ||
| Echovirus 7 | VP3, 3C, and 3D | ||
| Codon usage and the temporal regulation of viral gene expression | SIV Herpesvirus | Env ORF57 | ( |
| Genome position in sequence space, evolutionary trajectory, and pathogenicity | Poliovirus | Capsid | ( |
| Coxsackie virus | Capsid | ||
| Influenza Virus | Polymerase and Haemagglutinin | ||
| HIV-1 | Protease |
Examples of microbe attenuation by synonymous genome recoding
| Microbe | Targeted genome region | Method | References |
|---|---|---|---|
|
| Pneumolysin | Codon-pair usage | ( |
| Methylobacterium extorquens | Formaldehyde-activating enzyme | Codon bias usage | ( |
|
| Ribosomal protein S20, L-γ-glutamyl phosphate reductase | Codon bias usage | ( |
|
| Lac-Z, green florescence protein, Beta-lactamase | Codon bias usage | ( |
| Poliovirus | Capsid | Codon bias usage, Codon-pair usage and Dinucleotide frequency | ( |
| Echovirus 7 | VP1, VP2, 3C, 3D | Dinucleotide frequency | ( |
| Influenza virus | Polymerase subunit PB1, PB2, PA, nucleoprotein, matrix, nonstructural protein, neuraminidase, hemagglutinin | Codon-pair usage | ( |
| Human immunodeficiency virus type 1 | Protease | Codon-pair usage | ( |
| Simian immunodeficiency virus | Gag, Pol | Codon bias usage | ( |
| Chikungunya virus | Non-structural proteins: nsP1, nsP4 Structural proteins: E2, E1 | Codon bias usage | ( |
| Dengue virus | E, NS3, NS5 | Codon-pair usage | ( |
| Zika virus | E, NS1 | Codon-pair usage | ( |
| Tick-borne encephalitis virus | NS5 | Codon bias usage | ( |
| Respiratory syncytial virus | NS1, NS2, N, P, M, SH, G, F, L | Codon-pair usage | ( |
| Porcine reproductive and respiratory syndrome virus | Envelope GP5 | Codon-pair usage | ( |
| Vesicular stomatitis virus | L1 | Codon-pair usage | ( |
| Papillomavirus | Oncogenes E6, E7 | Codon bias usage | ( |
| Marek's disease virus | UL54/ICP27, UL49/VP22, UL30 | Codon-pair usage | ( |
| Adenovirus | Fiber | Codon bias usage | ( |
| Cucumber mosaic virus | Coat protein | Codon bias usage | ( |