| Literature DB >> 32019919 |
Bryan A Bartley1, Jacob Beal2, Jonathan R Karr3, Elizabeth A Strychalski4.
Abstract
Genome-scale engineering holds great potential to impact science, industry, medicine, and society, and recent improvements in DNA synthesis have enabled the manipulation of megabase genomes. However, coordinating and integrating the workflows and large teams necessary for gigabase genome engineering remains a considerable challenge. We examine this issue and recommend a path forward by: 1) adopting and extending existing representations for designs, assembly plans, samples, data, and workflows; 2) developing new technologies for data curation and quality control; 3) conducting fundamental research on genome-scale modeling and design; and 4) developing new legal and contractual infrastructure to facilitate collaboration.Entities:
Mesh:
Substances:
Year: 2020 PMID: 32019919 PMCID: PMC7000699 DOI: 10.1038/s41467-020-14314-z
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Fig. 1As capabilities for genome engineering have advanced rapidly, the size of teams involved in each pioneering genome engineering project has also increased.
a From 1980 to present, the size of the largest engineered genomes has grown exponentially, doubling approximately every 3 years. This trend suggests that gigabase engineering could become feasible by 2050. b The number of authors credited with producing these genomes has also grown exponentially. This trend suggests that engineering gigabase genomes will require the effort of ~500 individuals—either directly as part of a team or indirectly through an ecosystem of tools, services, automation, and other resources. The data for this figure are provided in Table 1.
Year, genome size (bp), and the number of authors involved in pioneering genome engineering projects of the last 30 years.
| Year | DNA size (bp) | Collaboration size (# authors) | Reference | Notes |
|---|---|---|---|---|
| 1979 | 207 | 1 | Khorana[ | First synthetic gene |
| 1990 | 2050 | 4 | Mandecki et al.[ | First synthetic plasmid |
| 1995 | 2700 | 5 | Stemmer et al.[ | Synthetic plasmid |
| 2002 | 7.5000E+03 | 3 | Cello et al.[ | Polio virus cDNA |
| 2004 | 1.4600E+04 | 7 | Tian et al.[ | rRNA genes |
| 2004 | 3.1656E+04 | 6 | Kodumal et al.[ | Gene cluster |
| 2008 | 5.8297E+05 | 17 | Gibson et al.[ | Mycoplasma genitalium |
| 2010 | 5.3100E+05 | 24 | Gibson et al.[ | Mycoplasma mycoide, JCVI synthetic cell |
| 2011 | 9.1010E+04 | 15 | Dymond et al.[ | Sc 2.0 synIXR |
| 2014 | 2.7287E+05 | 80 | Annaluru et al.[ | Yeast chromosome synIII |
| 2016 | 3.9700E+06 | 21 | Ostrov et al.[ | Partially recoded |
| 2017 | 2.0000E+05 | 13 | Lau et al.[ | Salmonella typhimurium partial genome |
| 2019 | 4.0000E+06 | 14 | Fredens et al.[ | Recoded |
| 2020 | 1.14E+07 | 172 | Richardson et al.[ | Sc 2.0 estimated completion date; Genome size from Table 3 in reference; Collaboration size estimated from Sc 2.0 website |
These data are plotted in Fig. 1
Fig. 2The emerging design–build–test–learn workflow for genome engineering is shown schematically with current (solid arrows) and likely future (dashed arrows) tasks, interfaces (circles), and repositories (cylinders), either digital (light) or physical (dark).
Potential approaches for integrating the emerging gigabase engineering workflow, labeled for reference.
For each interface in the emerging workflow, our recommendations fall into one of three categories: adopt or extend relatively mature existing methods (green), develop new solutions or expand nascent methods (yellow), and conduct additional fundamental research (red)