| Literature DB >> 16924266 |
Anthony C Forster1, George M Church.
Abstract
Construction of a chemical system capable of replication and evolution, fed only by small molecule nutrients, is now conceivable. This could be achieved by stepwise integration of decades of work on the reconstitution of DNA, RNA and protein syntheses from pure components. Such a minimal cell project would initially define the components sufficient for each subsystem, allow detailed kinetic analyses and lead to improved in vitro methods for synthesis of biopolymers, therapeutics and biosensors. Completion would yield a functionally and structurally understood self-replicating biosystem. Safety concerns for synthetic life will be alleviated by extreme dependence on elaborate laboratory reagents and conditions for viability. Our proposed minimal genome is 113 kbp long and contains 151 genes. We detail building blocks already in place and major hurdles to overcome for completion.Entities:
Mesh:
Year: 2006 PMID: 16924266 PMCID: PMC1681520 DOI: 10.1038/msb4100090
Source DB: PubMed Journal: Mol Syst Biol ISSN: 1744-4292 Impact factor: 11.429
Figure 1A minimal cell containing biological macromolecules and pathways proposed to be necessary and sufficient for replication from small molecule nutrients. The macromolecules are all nucleic acid and protein polymers and are encapsulated within a bilayer lipid vesicle. The small molecules (brown) diffuse across the bilayer. The macromolecules are ordered according to the pathways in which they are synthesized and act. They are colored by biochemical subsystem as follows: blue=DNA synthesis, red=RNA synthesis and cleavage, green=RNA modification, purple=ribosome assembly, orange=post-translational modification and black=protein synthesis. MFT=methionyl-tRNAfMeti formyltransferase. The system could be bootstrapped with DNA, RNA polymerase, ribosome, translation factors, tRNAs, MTF, synthetases, chaperones and small molecules.
Biochemically derived list of genes that may encode a useful, near-minimal, self-replicating system dependent only on small molecule nutrients
| Mycoplasma | 3D structure | |
|---|---|---|
| Coliphage f29 DNA polymerase | + | + |
| Coliphage P1 Cre recombinase | − | + |
| >Coliphage Lox/Cre recombinase site | − | + |
| Coliphage T7 RNA polymerase | Analog | + |
| >Coliphage T7 RNA polymerase initiation site | Analog | + |
| >Coliphage T7 RNA polymerase class II termination site | Analog | + |
| Lucerne viral hammerhead RNA | − | + |
| RNase P RNA | + | + |
| RNase P protein | + | + |
| >RNase P site/RNA primer for DNA polymerase | + | + |
| Small subunit 16S ribosomal RNA | + | + |
| All 21 small subunit ribosomal proteins (1–21) | + except 1, 21 | + |
| Large subunit 5S ribosomal RNA | + | + |
| Large subunit 23S ribosomal RNA | + | + |
| − | ||
| − | ||
| Large subunit 23S rRNA U2457>pseudoU synthetase | − | |
| − | ||
| − | ||
| Large subunit 23S rRNA U2504>pseudoU synthetase | − | |
| All 33 large subunit ribosomal proteins (1–7, 9–11, 13–25, 27–36) | + except 25, 30 | + |
| Translational initiation factor 1 | + | + |
| Translational initiation factor 2 | + | + |
| Translational initiation factor 3 | + | + |
| Translational elongation factor Tu | + | + |
| Translational elongation factor Ts | + | + |
| Translational elongation factor G | + | + |
| Translational release factor 1 | + | + |
| Translational release factor 2 | − | + |
| Translational release factor Gln methylase | + | + |
| Translational release factor 3 | − | + |
| Ribosome recycling factor | + | + |
| Set of 29 | + | |
| tRNA C34>lysidine synthetase | + | |
| tRNA A34>I deaminase | + | |
| − | − | |
| tRNA U34>2sU Cys desulfurase | − | + |
| tRNA U34>2sU synthetase | + | |
| tRNA U34>cmnm5U GTPase | + | |
| tRNA U34>cmnm5U synthetase | + | |
| tRNA cmnm5U34>nm5U>mnm5U synthetase | − | |
| tRNA G37 N1-methylase | + | + |
| − | ||
| tRNA A37>i6A synthetase | − | + |
| tRNA i6A37>s2i6A>ms2i6A synthetase | − | + |
| All 22 aminoacyl-tRNA synthetase subunits (20 enzymes) | + except Gly sub., Gln | + except Gly sub., Ala |
| Met-tRNA formyltransferase | + | + |
| Chaperonin GroEL | + | + |
| Chaperonin GroES | + | + |
| 151 genes=38 RNAs+113 proteins |
Gaps in knowledge are in bold. Left column: chosen gene products and DNA sites. Middle column: relationship to the minimal genome of M. genitalium; clear sequence homolog=‘+'; known enzyme product without an evident sequence homolog=‘unidentified'; no functional homolog=‘−'. Right column: high-resolution, three-dimensional, structural information; >25% of the structure solved=‘+', <25%=‘−'. The small molecules known to be required are four dNTPs, four NTPs, 20 amino acids, N-5,10-methenyltetrahydrofolate, S-adenosylmethionine and isopentenyl pyrophosphate). Note: a full version listing the nomenclature, database link, length and sequence of each individual product is available in Supplementary Tables S1 and S2.
Figure 2A generalizable, physiologically compatible, theoretical scheme for accurate DNA replication and RNA synthesis in vitro. Polymerase movements are illustrated by colored arrowheads. DNA synthesis: a nicked double-stranded DNA circle (middle) undergoes rolling-circle DNA synthesis by coliphage φ29 DNA polymerase (Dahl ) to give an oligomeric single-stranded DNA (bottom, blue). RNA primers (red) then hybridize at two sites to prime lagging strand DNA synthesis (bottom, green). When two Lox sites (bottom, L) are completed, recombination occurs between them catalyzed by coliphage P1 Cre recombinase (black cross) to form a duplicate of the original circular template. RNA synthesis: the circular genetic operon (middle) contains a promoter for T7 RNA polymerase (P), a ribosomal RNA (rRNA) gene, two transfer RNA (tRNA) sequences, a self-cleaving hammerhead sequence (H) and a T7 terminator (T). RNA synthesis from P generates a precursor RNA (top, red) containing three cleavage sites (thin black arrows). The second tRNA sequence merely serves as a recognition site for RNase P cleavage. Cleavages yield the mature rRNA and tRNA1. Any cleavage product containing a 3′ hydroxyl group or primary RNA transcript can serve as a primer for DNA synthesis (bottom, red).
Figure 3All nucleoside modifications of all 33 synthetic tRNAs that may be sufficient for accurate translation. Outside (shaded): mRNA codons of the genetic code are illustrated in the standard format, except that the 3′ U and C are switched to simplify depiction of decoding. Inside: tRNA nucleotides 34–37 (from 5′ to 3′) and their cognate amino acids. Nucleotides 34–36 are the anticodons, and the 37th nucleotides are represented by black superscripts. Codon and anticodon positions that base pair with each other are colored similarly. Stop codon specificities of release factor (RF) proteins are included. The portions of the tRNA sequences not shown in the figure are unmodified. Expected modifications of in vitro transcripts by the enzymes in Table I, and expected amino-acid and codon specificities are given. *=unspecified modification, _=unknown modification status, ms2i6A=2-methylthio-N6-isopentenyladenosine, m1G=1-methylguanosine, t6A=N6-threonylcarbamoyladenosine, cmnm5U=5-carboxymethylaminomethyluridine, V=cmo5U=uridine 5-oxyacetic acid, I=inosine, cmnm5s2U=5-carboxymethylaminomethyl-2-thiouridine, k2C=lysidine, S=mnm5s2U=5-methylaminomethyl-2-thiouridine, mnm5U=5-methylaminomethyluridine.