| Literature DB >> 23873957 |
Olga Tsoy1, Marina Yurieva, Andrey Kucharavy, Mary O'Reilly, Arcady Mushegian.
Abstract
Minimal bacterial gene set comprises the genetic elements needed for survival of engineered bacterium on a rich medium. This set is estimated to include 300-350 protein-coding genes. One way of simplifying an organism with such a minimal genome even further is to constrain the amino acid content of its proteins. In this study, comparative genomics approaches and the results of gene knockout experiments were used to extrapolate the minimal gene set of mollicutes, and bioinformatics combined with the knowledge-based analysis of the structure-function relationships in these proteins and their orthologs, paralogs and analogs was applied to examine the challenges of completely replacing the rarest residue, cysteine. Among several known functions of cysteine residues, their roles in the active centers of the enzymes responsible for deoxyribonucleoside synthesis and transfer RNA modification appear to be crucial, as no alternative chemistry is known for these reactions. Thus, drastic reduction of the content of the rarest amino acid in a minimal proteome appears to be possible, but its complete elimination is challenging.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23873957 PMCID: PMC3794579 DOI: 10.1093/nar/gkt610
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Computational design of minimal genome with reduced content of the rarest amino acid, cysteine.
The percentages of proteins lacking each amino acid
| Amino acid | In all COGs | In minimal genome | Amino acid | In all COGs | In minimal genome | Amino acid | In all COGs | In minimal genome |
|---|---|---|---|---|---|---|---|---|
| A | 0.38 | 0.13 | I | 0.59 | 0.29 | Q | 2.26 | 1.06 |
| C | 21.00 | 22.30 | K | 1.60 | 0.39 | R | 0.84 | 0.30 |
| D | 1.19 | 0.63 | L | 0.14 | 0.14 | S | 0.34 | 0.21 |
| E | 0.96 | 0.47 | initiatory M | 0.39 | 0.17 | T | 0.67 | 0.31 |
| F | 1.70 | 1.17 | internal M | 6.66 | 2.40 | V | 0.39 | 0.09 |
| G | 0.58 | 0.10 | N | 2.15 | 0.81 | W | 16.71 | 22.50 |
| H | 6.35 | 4.39 | P | 1.69 | 0.81 | Y | 3.50 | 3.04 |
Reducing cysteine content of proteins with different functions within minimal genome
| Function | All | List 1 | List 2 | List 3 | No Cys | Orthologs without Cys | Other ways of Cys removal | Indispensable | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| C: Energy production | 22 | 20 | 18 | 54 | 17 | 15 | 42 | 21 | 19 | 55 | 2 | 15 | 39 | 4 | 16 | 0 | 0 |
| D: Cell division, chromosome partitioning | 5 | 3 | 3 | 13 | 4 | 3 | 12 | 5 | 4 | 16 | 1 | 4 | 16 | 0 | 0 | 0 | 0 |
| E: Amino acid transport and metabolism | 13 | 10 | 10 | 41 | 11 | 11 | 41 | 12 | 12 | 45 | 0 | 11 | 42 | 1 | 3 | 0 | 0 |
| F: Nucleotide transport and metabolism | 24 | 23 | 21 | 83 | 21 | 19 | 76 | 24 | 22 | 85 | 2 | 15 | 55 | 3 | 11 | 4 | 19 |
| G: Carbohydrate transport and metabolism | 27 | 21 | 20 | 65 | 20 | 20 | 61 | 23 | 22 | 69 | 1 | 18 | 57 | 4 | 12 | 0 | 0 |
| H: Coenzyme transport and metabolism | 12 | 10 | 10 | 38 | 10 | 10 | 40 | 12 | 12 | 48 | 0 | 11 | 45 | 1 | 3 | 0 | 0 |
| I: Lipid transport and metabolism | 9 | 6 | 5 | 13 | 6 | 4 | 10 | 8 | 6 | 15 | 2 | 5 | 13 | 1 | 2 | 0 | 0 |
| J: Translation, ribosome biogenesis | 109 | 98 | 67 | 281 | 99 | 67 | 274 | 104 | 72 | 298 | 32 | 66 | 273 | 6 | 25 | 0 | 0 |
| K: Transcription | 15 | 13 | 11 | 46 | 12 | 9 | 43 | 14 | 11 | 46 | 3 | 11 | 46 | 0 | 0 | 0 | 0 |
| L: Replication, recombination and repair | 40 | 35 | 32 | 173 | 29 | 27 | 141 | 38 | 35 | 178 | 3 | 32 | 154 | 3 | 24 | 0 | 0 |
| M: Cell wall/membrane/envelope biogenesis | 12 | 6 | 6 | 25 | 9 | 9 | 40 | 12 | 12 | 54 | 0 | 11 | 49 | 1 | 5 | 0 | 0 |
| O: Protein modification/turnover, chaperones | 20 | 14 | 12 | 37 | 14 | 13 | 44 | 18 | 16 | 50 | 2 | 11 | 29 | 5 | 21 | 0 | 0 |
| P: Inorganic ion transport and metabolism | 19 | 17 | 15 | 54 | 14 | 12 | 45 | 18 | 16 | 56 | 2 | 16 | 54 | 1 | 2 | 0 | 0 |
| R: General (molecular) function | 37 | 29 | 26 | 109 | 27 | 25 | 101 | 34 | 31 | 123 | 3 | 30 | 119 | 1 | 5 | 0 | 0 |
| S: Conserved protein, unknown function | 16 | 9 | 6 | 19 | 13 | 11 | 32 | 15 | 12 | 35 | 3 | 12 | 35 | 0 | 0 | 0 | 0 |
| T: Signal transduction | 2 | 2 | 2 | 6 | 2 | 2 | 6 | 2 | 2 | 6 | 0 | 2 | 6 | 0 | 0 | 0 | 0 |
| U: Intracellular trafficking, secretion | 7 | 4 | 3 | 9 | 7 | 5 | 11 | 7 | 5 | 12 | 2 | 5 | 12 | 0 | 0 | 0 | 0 |
| V: Defense mechanisms | 8 | 5 | 5 | 22 | 6 | 6 | 19 | 7 | 7 | 28 | 0 | 6 | 19 | 1 | 9 | 0 | 0 |
| Unknown non-conserved | 90 | 3 | 2 | 11 | 65 | 47 | 127 | 65 | 47 | 127 | 18 | 23 | 59 | 24 | 68 | 0 | 0 |
| Total | 487 | 328 | 274 | 1099 | 386 | 315 | 1165 | 439 | 363 | 1346 | 76 | 304 | 1122 | 56 | 206 | 4 | 19 |
aThe number indicates the counts of proteins in each functional category.
bThree columns represent the total of all proteins, only Cys-containing proteins and the count of Cys residues in these proteins within each functional category.
cTwo numbers indicate Cys-containing proteins and the count of Cys residues in these proteins within each functional category.