| Literature DB >> 36120530 |
Nicole LeBlanc1, Trevor C Charles1,2.
Abstract
Bacterial cells are widely used to produce value-added products due to their versatility, ease of manipulation, and the abundance of genome engineering tools. However, the efficiency of producing these desired biomolecules is often hindered by the cells' own metabolism, genetic instability, and the toxicity of the product. To overcome these challenges, genome reductions have been performed, making strains with the potential of serving as chassis for downstream applications. Here we review the current technologies that enable the design and construction of such reduced-genome bacteria as well as the challenges that limit their assembly and applicability. While genomic reductions have shown improvement of many cellular characteristics, a major challenge still exists in constructing these cells efficiently and rapidly. Computational tools have been created in attempts at minimizing the time needed to design these organisms, but gaps still exist in modelling these reductions in silico. Genomic reductions are a promising avenue for improving the production of value-added products, constructing chassis cells, and for uncovering cellular function but are currently limited by their time-consuming construction methods. With improvements to and the creation of novel genome editing tools and in silico models, these approaches could be combined to expedite this process and create more streamlined and efficient cell factories.Entities:
Keywords: bacteria; genome engineering; genome reduction; minimal genome; synthetic biology
Year: 2022 PMID: 36120530 PMCID: PMC9473318 DOI: 10.3389/fgeed.2022.957289
Source DB: PubMed Journal: Front Genome Ed ISSN: 2673-3439
FIGURE 1Schematic diagram of the construction of a reduced genome cell. Identification of essential genes using experimental and computational methods, genomic reduction by a top-down gene deletion or bottom-up synthesis approach, and evaluation of modifications of this strain.
Comparison between TnSeq and CRISPRiSeq methods to determine gene essentiality.
| Characteristic | TnSeq | CRISPRiSeq |
|---|---|---|
|
|
|
|
|
| ||
| Plasmid extraction from sgRNA library and electroporation into target | ||
|
| Growth of mutant library in desired conditions/media over multiple generations | Induction of dCas9 expression and growth of mutant library in desired conditions/media over multiple generations |
|
| Varies depending on the transposon used but for Mariner transposons; genome digestion to isolate transposon, blunting, and PCR with sequencing adaptors | PCR to amplify gRNAs |
|
| Only 1 transposon insertion per cell so cannot identify and screen those with duplicated genes | sgRNA can bind to multiple copies of the same region so can more effectively determine essentiality of those regions |
|
| No control | Can modify the repression level by altering the sgRNA homology |
|
| Random integration | Targeted integration by sgRNA sequence |
Depending on the transposon used (ie. mariner based transposons integrate into TA, dinucleotides but randomly across the genome).
Essential gene databases and computational programs to predict essential genes and design genomic deletions.
| Program | Function | URL | References |
|---|---|---|---|
|
| Database of essential genomic regions based on experimental data with embedded BLAST tools for homology searches |
|
|
|
| Database of predicted essential genes within |
|
|
|
| Database of predicted essential genes for more than 2,700 organisms based on protein-protein interaction network features |
|
|
|
| Database of predicted essential genes for more than 4,000 organisms based on previous experimental data and functional KEGG orthologs |
|
|
|
| Database of essential genes from experimental data for 91 bacterial strains with additional gene features like expression profiles, conservation, evolutionary origins etc. |
|
|
|
| Database of essential genes from experimental data for 11 different species with visualization and analysis on a subsystem diagram |
|
|
|
| |||
|
| Database of essential genes based on data from DEG that are clustered based on function |
|
|
|
| |||
|
| Within the CEG database, bases the prediction of essential genes on function |
|
|
|
| |||
|
| Predicts genes from an unannotated genome and will also predict essential genes from that |
|
|
|
| Predicts gene essentiality based on sequence conservation and orthology with comparing a fully sequenced organism to 37 other species |
|
|
|
| Machine learning-based method applying sequence composition features to predict essential genes with the input of nucleotide sequence |
|
|
|
| Deep neural network to identify essential genes in bacteria based on sequence features only |
|
|
|
| Prediction of essential genes based on 6 features not dependent on experimental or functional data with design of large genomic deletions to minimize the organism’s genome |
|
|
|
| Minimal genome design with large deletion predictions using whole cell models using biological knowledge on gene locations and essentiality |
|
|
|
| Comprehensive and computationally intensive simulation of gene essentiality with gene deletions to construct a true minimal cell that cannot be run on a single computer |
|
|
|
| Simultaneously assesses genomic regions that can be deleted, starting with larger deletion regions moving to individual genes |
|
|
FIGURE 2Experimental methods of determining gene essentiality. TnSeq inactivates a random gene by randomly inserting a transposon from a vector. LoxTnSeq deletes a random genomic region by inserting two transposons containing LoxP sites and activating recombination by Cre. CRISPRi-Seq inactivates a random gene by expressing a random gRNA and dCas9 that will bind and repress expression. For all three, the mutants will be pooled and subjected to various conditions followed by sequencing to identify remaining transposon location or guide RNA.
FIGURE 3Deletion methods used to make large genomic reductions. (A) Site-specific recombination deletion method using two recombination sites (RS) which are acted on by a recombinase to result in recombination and deletion of the target. (B) λ red recombineering with I-SceI uses a linear DNA fragment with I-SceI cut sites (S) to cause a double-stranded break with recombination between the homologous regions (HR) to repair, resulting in a deletion. (C) Homologous recombination mediated deletions using a counterselectable marker (CS) to select for the second recombination event resulting in the deletion or return of wild-type. (D) CRISPR-based deletions employing Cas9 to cause a double-stranded break, forcing the cell to repair using homologous recombination, resulting in the deletion.
Previous genome reduced bacterial strains.
| Strain | Deletion | Deletion size | Deletion method | Characteristics (relative to parental strain) | References |
|---|---|---|---|---|---|
|
| |||||
|
| Genomic islands, extracellular polysaccharide biosynthesis genes, prophages | 167 Kb (4.18%) | HR with | Faster growth, higher transformation efficiency, increased heterologous gene expression |
|
|
| |||||
|
| Prophages, | 323 Kb (7.7%) | HR with no CS | Comparable growth rate |
|
|
| Prophages, antibiotic production genes | 991 Kb (24%) | HR | Reduced growth rate, unstable recombinant protein production |
|
|
| 74 regions including prophages, secondary metabolite producing genes, etc | 873.5 Kb (20.7%) | HR with | Increase in cellulase (1.7-fold) and protease (2.5-fold) production |
|
| | Prophages, antibiotic production operons and other nonessential regions | 814 Kb (20%) | HR with | Decreased growth characteristics but 4.4-fold higher guanosine production |
|
| | Prophages, antibiotic production operons and other nonessential regions | 756 Kb (18.6%) | HR with | Decreased growth characteristics but 5.2-fold higher thymidine production |
|
|
| Many genes including those for sporulation, motility, secondary metabolism, prophages, secreted proteases, etc. | 1.46 Mb (36%) | HR with | Decreased growth rate, lower resource utilization for information processing, improved production of ‘difficult proteins’ that cannot be produced in other |
|
|
| |||||
|
| |||||
|
| 3 Prophages | 204.7 Kb (6%) | HR with | Improved growth under stress conditions, increased transformation efficiency, 30% increase in heterologous protein production |
|
|
| Non-essential genes including prophages, unknown genesetc. | 440 Kb (13.4%) | HR with | Robust against stresses, improved growth stability, similar growth rates |
|
|
| All prophages and IS elements | 249.4 Kb (7.6%) | HR with | Similar growth rate and transformation efficiency to MB001 |
|
|
| |||||
|
| Insertion sequences | 663.3 Kb (14.3%) | λ-Red HR with I-SceI + P1 transduction | Improved electroporation efficiency, similar growth rates |
|
|
| Various deletions across the | 1.38 Mb (29.7%) | λ-Red HR with | Slower growth and abnormal cell morphology |
|
|
| Various nonessential gene regions | 1.03 Mb (22%) | λ-Red HR + P1 transduction | 1.5-fold higher cell density and 2x threonine production from an introduced gene cassette |
|
|
| IS Elements, K-islands, flagella genes, LPS synthesis genes | 1.1 Mb (23%) | λ-Red HR with I-SceI + | 1.6-fold faster growth and improved genomic stability |
|
|
| |||||
|
| Prophages, integrases, and transposases | 71 Kb (2.83%) | Cre-LoxP | Faster growth rate, increased biomass yield, improved heterologous gene expression 3-4-fold |
|
|
| Prophages and genomic islands | 176 Kb (6.86%) | Cre-LoxP | Shortened generation time by 17%, similar nisin yield |
|
|
| |||||
|
| Prophages, transposases, nitrogen fixation genes, | 227 Kb (5.5%) | HR with | Comparable growth rate and magnetosome biosynthesis with improved genomic stability |
|
|
| |||||
|
| All nonessential or quasi essential genes | 669 Kb (55.2%) | Chemical synthesis | Improved growth rates compared to JCVI-syn3.0 |
|
|
| |||||
|
| Genomic islands | 254.5 Kb (4.1%) | HR with | 45-fold increase in transformation efficiency, 9.4-fold increase in heterologous protein expression, 39% increase in PHA production |
|
|
| Flagellar biosynthesis genes, prophages, transposases, recombinases | 265.8 Kb (4.3%) | HR with ISce-I | Improved growth rate, heterologous protein expression, plasmid stability, stress resistance, and more |
|
|
| |||||
|
| |||||
|
| Genomic island, prophages, hypothetical protein clusters | 418 Kb (7.7%) | HR with | Increased ATP/ADP ratio by 11x, Improved mcl-PHA and alginate oligosaccharide production by 114.8 and 27.8% respectively |
|
|
| |||||
|
| Megaplasmid, prophages, flagellar biosynthesis genes, and biofilm genes | 640 Kb (10.7%) | I-SceI HR with CS | Increased growth rates and biomass yield, improved production of chemicals including phenol |
|
|
| |||||
|
| 2 megaplasmids containing nonessential genes including toxin/antitoxin systems | 3.1 Mb (46%) | Flp/FRT | Identification of 4 toxin/antitoxin pairs that are essential |
|
|
| |||||
| J1074 (Del14) | 15 biosynthetic secondary metabolite gene clusters | 500 Kb (7.3%) | HR of mutant BAC library and | Comparable growth rates and improved heterologous gene expression of 7 products by 2–2.4 fold |
|
|
| |||||
|
| Biosynthetic genes, prophages, transposases | 1.67 Mb (18.5%) | Cre/LoxP | Increased streptomycin (4-fold) and cephamycin C (2-fold) production |
|
|
| |||||
|
| Biosynthetic clusters including the natamycin biosynthetic cluster | 700 Kb (7.7%) | Cre/LoxP | Increased ATP and NADPH availability, higher transformation efficiency, improved heterologous gene expression, and increased genetic stability |
|