| Literature DB >> 27713529 |
Yuan-Nong Ye1,2, Bin-Guang Ma3, Chuan Dong1,4, Hong Zhang3, Ling-Ling Chen3, Feng-Biao Guo1,4.
Abstract
A minimal gene set (MGS) is critical for the assembly of a minimal artificial cell. We have developed a proposal of simplifying bacterial gene set to approximate a bacterial MGS by the following procedure. First, we base our simplified bacterial gene set (SBGS) on experimentally determined essential genes to ensure that the genes included in the SBGS are critical. Second, we introduced a half-retaining strategy to extract persistent essential genes to ensure stability. Third, we constructed a viable metabolic network to supplement SBGS. The proposed SBGS includes 327 genes and required 431 reactions. This report describes an SBGS that preserves both self-replication and self-maintenance systems. In the minimized metabolic network, we identified five novel hub metabolites and confirmed 20 known hubs. Highly essential genes were found to distribute the connecting metabolites into more reactions. Based on our SBGS, we expanded the pool of targets for designing broad-spectrum antibacterial drugs to reduce pathogen resistance. We also suggested a rough semi-de novo strategy to synthesize an artificial cell, with potential applications in industry.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27713529 PMCID: PMC5054358 DOI: 10.1038/srep35082
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Comparison of the genes in our MGS and others.
(a). The distribution of genes involved in the metabolism of each species and in the minimal gene set. (b) A Venn diagram for the three MGSs showing that our MGS contains 91% of the genes (128 of 141) existing in both Koonin et al.’s and Gil et al.’s MGSs, as well as our 107 newly identified genes (underlined in Supplementary Table S2).
The number of genes in each COG categories of different gene sets.
| COG category | Number of gene in gene sets | ||||
|---|---|---|---|---|---|
| HPEGS | MMN | SBGS | Koonin | Gil | |
| Energy production and conversion (C) | 12 | 13 | 16 | 17 | 9 |
| Cell division and chromosome partitioning (D) | 5 | 0 | 5 | 2 | 1 |
| Amino acid transport and metabolism (E) | 5 | 23 | 24 | 12 | 5 |
| Nucleotide transport and metabolism (F) | 13 | 15 | 14 | 17 | 15 |
| Carbohydrate transport and metabolism (G) | 11 | 17 | 18 | 15 | 17 |
| Coenzyme metabolism (H) | 14 | 19 | 19 | 8 | 12 |
| Lipid metabolism (I) | 25 | 28 | 29 | 4 | 7 |
| Translation, ribosomal structure and biogenesis (J) | 98 | 8 | 98 | 93 | 98 |
| Transcription (K) | 6 | 1 | 7 | 11 | 8 |
| DNA replication, recombination and repair (L) | 19 | 1 | 19 | 24 | 16 |
| Cell envelope biogenesis, outer membrane (M) | 20 | 18 | 25 | 8 | 2 |
| Posttranslational modification, protein turnover, chaperones (O) | 8 | 8 | 10 | 14 | 14 |
| Inorganic ion transport and metabolism (P) | 1 | 7 | 7 | 9 | 2 |
| Secondary metabolites biosynthesis, transport and catabolism (Q) | 3 | 3 | 3 | 1 | 0 |
| General function prediction only (R) | 13 | 1 | 13 | 11 | 0 |
| Function unknown (S) | 1 | 1 | 1 | 3 | 0 |
| Signal transduction mechanisms (T) | 0 | 2 | 2 | 1 | 2 |
| Intracellular trafficking, secretion, and vesicular transport (U) | 9 | 0 | 9 | 5 | 0 |
| Defense mechanisms (V) | 0 | 5 | 5 | 2 | 0 |
Key metabolites in the network (out-degrees and in-degrees)a.
| Metabolite name | Node connectivity | Metabolite description | Remarks |
|---|---|---|---|
| cpd00067 | 215 | H | Current metabolite |
| cpd00001 | 122 | H2O | Current metabolite |
| cpd11493 | 79 | ACP | Current metabolite |
| cpd00002 | 73 | ATP | Current metabolite |
| cpd00011 | 56 | CO2 | Current metabolite |
| cpd00008 | 54 | ADP | Current metabolite |
| cpd00006 | 52 | NADP | Current metabolite |
| cpd00005 | 49 | NADPH | Current metabolite |
| cpd00012 | 45 | PPi | Current metabolite |
| cpd00003 | 42 | NAD | Current metabolite |
| cpd11492 | 40 | Malonyl-acyl-carrierprotein | Ma |
| cpd00010 | 39 | CoA | Ma |
| ++ | |||
| cpd00004 | 37 | NADH | Current metabolite |
| cpd00080 | 33 | Glycerol-3-phosphate | Ma |
| cpd00046 | 30 | CMP | Current metabolite |
| cpd00052 | 19 | CTP | Current metabolite |
| cpd00018 | 19 | AMP | Current metabolite |
| ++ | |||
| ++ | |||
| ++ | |||
| ++ | |||
| cpd00023 | 10 | L-Glutamate | Ma |
| cpd00020 | 9 | Pyruvate | Ma |
| cpd00061 | 9 | Phosphoenolpyruvate | Ma |
aThe key metabolites in bold and italics marked with “++” are newly found in this work. The metabolites marked as “Current metabolite” are consistent with the work of Ma et al. (Ma & Zeng, 2003) and of Jeong et al. (Jeong et al.31). The metabolites marked as “Ma” are consistent with the work of Ma (Ma & Zeng,32).
Spearman correlation between MGS essentiality and node connectivitya.
| Object | rho | p-value |
|---|---|---|
| Essentiality ~ node connectivity | 0.211 | 0.008198 |
| Essentiality ~ average in-degree | −0.274 | 0.000512 |
| Essentiality ~ average out-degree | −0.281 | 0.000369 |
| Essentiality ~ average degree | −0.278 | 0.000424 |
aMGS essentiality is represented by cluster size; node connectivity is represented by the number of reactions associated with a gene; average in-degree is represented by the number of reactants divided by that of reactions associated with a gene; average out-degree is represented by the number of products divided by that of reactions associated with a gene; average degree is represented by the number of metabolites divided by that of reactions associated with a gene.
Figure 2Our design for the semi-de novo synthesis of an artificial cell based on our MGS.
(A) involves transferring the genes in the MGS but not in M. genitalium to the genome. (B) is knocking out the genes of M. genitalium that are absent from our MGS in the genome obtained in (A). (C) involves supplementing with genes required for specific applications.
Figure 3The procedure of this work.