| Literature DB >> 21050470 |
Conrad von Mandach1, Rainer Merkl.
Abstract
BACKGROUND: In many microbial genomes, a strong preference for a small number of codons can be observed in genes whose products are needed by the cell in large quantities. This codon usage bias (CUB) improves translational accuracy and speed and is one of several factors optimizing cell growth. Whereas CUB and the overrepresentation of individual proteins have been studied in detail, it is still unclear which high-level metabolic categories are subject to translational optimization in different habitats.Entities:
Mesh:
Substances:
Year: 2010 PMID: 21050470 PMCID: PMC3091758 DOI: 10.1186/1471-2164-11-617
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1A plot of . Panel A: For 388 microbial species constituting the set MG_CUB the number of tRNA genes and the -score was determined as described and plotted. A Spearman rank correlation confirms the statistically significant correlation of these two values (r= 0.71, p < 0.001). Panel B: The minimum generation time for 113 species of MG_CUB (as listed in [13]) was plotted versus the -score. A Spearman rank correlation confirms the statistically significant correlation of these two values (r= -0.75, p < 0.001).
Figure 2Abundance of metabolic functions in archaeal and bacterial effectomes. For all FunCat categories of level 1, the Abund-values were deduced from the datasets MG_CUB(Archaea) and MG_CUB(Bacteria). Positive scores indicate categories overrepresented in the effectomes. Underrepresented categories have negative values. The categories are numbered according to the FunCat scheme: "metabolism" (1), "energy" (2), "cell cycle and DNA processing" (10), "transcription" (11), "protein synthesis" (12), "protein fate (folding, modification, destination) " (14), "regulation of metabolism and protein function" (18), "cellular transport, transport facilitation and transport routes" (20), "cellular communication/signal transduction mechanism" (30), "cell rescue, defense and virulence" (32), "interaction with the environment" (34), "transposable elements, viral and plasmid proteins" (38), "cell fate" (40), "development (systemic) " (41), "biogenesis of cellular components" (42), "subcellular location" (70). Abund-values were plotted if the number #(Cat) was at least 100 (compare Table 1).
Abundance of functional categories (FunCat level 1) in archaeal and bacterial effectomes
| Archaea | Bacteria | ||||||
|---|---|---|---|---|---|---|---|
| 1 | Metabolism | 594 | 15495 | -0.21 | 15948 | 461317 | -0.24 |
| 2 | Energy | 110 | 1508 | 0.07 | 5129 | 47263 | 0.25 |
| 10 | Cell cycle and DNA processing | 30 | 2557 | -0.73 | 1775 | 116118 | -0.60 |
| 11 | Transcription | 32 | 2746 | -0.73 | 2587 | 105731 | -0.39 |
| 12 | Protein synthesis | 1006 | 3576 | 0.65 | 29446 | 67653 | 0.86 |
| 14 | Protein fate (folding, modification, destination) | 91 | 2106 | -0.16 | 2502 | 71643 | -0.24 |
| 18 | Regulation of metabolism and protein function | 8 | 395 | -0.48 | 61 | 8494 | -0.95 |
| 20 | Cellular transport, transport facilitation and transport routes | 160 | 3798 | -0.17 | 5551 | 155592 | -0.23 |
| 30 | Cellular communication/signal transduction mechanism | 0 | 931 | -2.23 | 138 | 37418 | -1.22 |
| 32 | Cell rescue, defense and virulence | 27 | 590 | -0.13 | 1026 | 24273 | -0.16 |
| 34 | Interaction with the environment | 21 | 548 | -0.21 | 1059 | 26204 | -0.17 |
| 38 | Transposable elements, viral and plasmid proteins | 0 | 68 | - | 23 | 5107 | -1.13 |
| 40 | Cell fate | 0 | 9 | - | 76 | 6919 | -0.72 |
| 41 | Development (systemic) | 0 | 2 | - | 3 | 385 | -0.30 |
| 42 | Biogenesis of cellular components | 13 | 1235 | -0.77 | 1271 | 83665 | -0.60 |
| 70 | Subcellular localization | 191 | 3511 | -0.06 | 8601 | 175358 | -0.09 |
| Sum: | 2283 | 39075 | 75196 | 1393140 | |||
For archaeal and bacterial genomes, the number of gene products contributing to FunCat categories was determined. The column labeled #(Cat) gives the number of genes deduced from the whole dataset. The column labeled #(Cat) gives the number of genes belonging to the respective effectomes. The column Abundlists the ratio log(f(Cat)/f(Cat)) for the category, if #(Cat) was at least 100. In all other cases a trend is given indicated by a "-" for underrepresentation. The line labeled Sum lists the number of genes being analyzed.
Functional categories (FunCat level 2) and their abundance in archaeal and bacterial effectomes
| FunCat Category | Description | Archaea | Bacteria | ||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| HT | MS | AQU | ANE | AB | TH | MS | PS | AQU | TER | HOA | AER | ANE | NHAL | MHAL | HITR | ||
| 2.01 | Glycolysis and gluconeogenesis | + | -0.32 | -0.01 | -0.17 | 0.49 | 0.34 | 0.52 | 0.42 | 0.44 | 0.51 | 0.46 | 0.35 | 0.55 | 0.56 | 0.36 | 0.60 |
| 2.04 | Glyoxylate cycle | x | x | x | x | 0.20 | + | 0.20 | + | 0.20 | 0.45 | -0.12 | 0.30 | + | 0.34 | + | -0.78 |
| 2.07 | Pentose-phosphate pathway | - | - | - | - | 0.00 | -0.32 | 0.00 | 0.03 | -0.03 | 0.12 | -0.12 | -0.26 | 0.09 | 0.04 | 0.23 | |
| 2.08 | Pyruvate dehydrogenase complex | x | x | x | x | 0.35 | - | 0.40 | + | + | + | 0.40 | 0.08 | x | 0.60 | + | 0.73 |
| 2.09 | Anaplerotic reactions | x | x | x | x | 0.48 | + | 0.48 | + | + | + | + | 0.60 | + | + | + | 0.00 |
| 2.10 | Tricarboxylic-acid pathway (citrate cycle, Krebs cycle, TCA cycle) | x | + | + | x | 0.49 | + | 0.50 | 0.40 | 0.51 | 0.58 | 0.37 | 0.58 | 0.43 | 0.49 | 0.46 | 0.39 |
| 2.11 | Electron transport and membrane-associated energy conservation | 0.10 | 0.30 | 0.10 | 0.25 | 0.28 | 0.18 | 0.28 | 0.21 | 0.32 | 0.37 | 0.22 | 0.31 | 0.21 | 0.26 | 0.28 | 0.25 |
| 2.30 | Photosynthesis | x | x | x | x | -0.12 | - | -0.12 | x | -0.12 | 0.33 | -0.48 | -0.60 | + | -0.48 | x | -0.30 |
| 2.45 | Energy conversion and regeneration | + | 0.29 | + | + | 0.32 | 0.19 | 0.31 | 0.30 | 0.39 | 0.44 | 0.23 | 0.39 | 0.17 | 0.29 | 0.27 | 0.33 |
| 12.01 | Ribosome biogenesis | 0.79 | 0.85 | 0.79 | 0.82 | 1.03 | 0.92 | 1.04 | 1.00 | 1.07 | 1.06 | 0.94 | 1.05 | 1.00 | 1.07 | 1.08 | 1.10 |
| 12.04 | Translation | 0.63 | 0.66 | 0.62 | 0.65 | 0.84 | 0.73 | 0.85 | 0.82 | 0.88 | 0.87 | 0.75 | 0.86 | 0.81 | 0.87 | 0.90 | 0.91 |
| 12.07 | Translational control | + | + | + | + | 0.68 | + | 0.68 | + | 0.64 | + | 0.65 | 0.75 | + | 0.74 | + | 0.75 |
| 12.10 | Aminoacyl-tRNA synthetases | - | - | - | - | -0.32 | - | -0.32 | - | - | - | - | - | - | - | - | 0.21 |
| 14.01 | Protein folding and stabilization | + | 0.46 | + | + | 0.45 | 0.53 | 0.47 | 0.35 | 0.46 | 0.55 | 0.36 | 0.48 | 0.46 | 0.46 | 0.50 | 0.52 |
| 32.07 | Detoxification | + | - | + | + | 0.09 | 0.20 | 0.09 | 0.10 | 0.11 | 0.20 | -0.01 | 0.09 | 0.00 | 0.12 | 0.15 | 0.26 |
| 34.01 | Homeostasis | + | - | + | + | 0.09 | -0.02 | 0.11 | 0.02 | 0.13 | 0.12 | 0.03 | 0.07 | 0.10 | 0.07 | 0.02 | 0.22 |
| 34.05 | Cell motility | - | + | 0.08 | 0.01 | -0.04 | -0.08 | -0.02 | -0.22 | -0.25 | 0.16 | 0.17 | 0.12 | 0.06 | -0.09 | -0.38 | -0.50 |
| 42.33 | Pilus/fimbria | x | x | x | x | 0.00 | 0 | 0.00 | + | + | 0 | -0.40 | 0.18 | - | 0.00 | + | 0.00 |
| 70.03 | Cytoplasm | x | x | x | x | 0.02 | -0.07 | 0.03 | 0.03 | 0.04 | 0.08 | -0.02 | 0.05 | -0.01 | 0.04 | 0.05 | 0.10 |
| 70.27 | Extracellular/secretion proteins | + | - | + | + | 0.34 | + | 0.36 | + | 0.18 | 0.22 | 0.39 | 0.28 | 0.40 | 0.44 | + | 0.26 |
| 70.34 | Prokaryotic cell envelope component | x | x | x | x | 0.05 | x | 0.07 | 0.02 | 0.01 | 0.04 | 0.09 | -0.04 | -0.14 | 0.17 | 0.07 | 0.12 |
For archaeal and bacterial genomes, the number of gene products contributing to FunCat categories of level 2 was determined (see Methods). The columns lists the ratio log(f(Cat)/f(Cat)) for the categories, if #(Cat) was at least 100 (compare Table 1). In all other cases a trend is given indicated by a "+" for overrepresentation, a "-" for underrepresentation and a 0 for a score value of 0.0. An "x" indicates categories not occurring in the respective dataset. Abbreviations of subsets: AB all Bacteria, HT hyperthermophilic, TH thermophilic, MS mesophilic, PS psychrophilic, AQU aquatic, TER terrestrial, HOA host-associated, AER aerobic, ANE anaerobic, NHAL non halophilic, MHAL moderately halophilic species, and HITR the subset of bacterial species possessing an extreme number of tRNA genes, i.e. MG_CUB(Bacteria_HITR).
Figure 3Habitat specific abundance of metabolic functions in bacterial effectomes. For FunCat categories of level 2, the Abund-values were deduced for specific bacterial subsets. Scores larger than zero indicate categories overrepresented in the effectomes. Underrepresented categories have negative values. The categories are numbered according to the FunCat scheme: "amino acid metabolism" (1.01), "secondary metabolism" (1.20), "extracellular metabolism" (1.25), "glycolysis and gluconeogenesis" (2.01), "glyoxylate cycle" (2.04), "pentose-phosphate pathway" (2.07), "pyruvate dehydrogenase complex" (2.08), "anaplerotic reactions" (2.09), "tricarboxylic-acid pathway (citrate cycle, Krebs cycle, TCA cycle)" (2.10), "electron transport and membrane-associated energy conservation" (2.11), "metabolism of energy reserves (e.g. glycogen, trehalose)" (2.19), "oxidation of fatty acids" (2.25), "photosynthesis" (2.30), "energy conversion and regeneration" (2.45), "DNA processing " (10.01), "RNA synthesis" (11.02), "RNA processing" (11.04), "RNA modification" (11.06), "ribosome biogenesis" (12.01), "translation" (12.04), "translational control" (12.07), "aminoacyl-tRNA synthetases" (12.10), "protein folding and stabilization" (14.01), "protein targeting, sorting and translocation" (14.04), "protein modification" (14.07), "protein/peptide degradation" (14.13), "transmembrane signal transduction" (30.05), "stress response" (32.01), "disease, virulence and defense" (32.05), "cellular sensing and response to external stimulus" (34.11), "prokaryotic cell envelope structures" (42.34), "fungal/microorganismic cell type differentiation" (43.01), "cell wall" (70.01). Abund-values were plotted if the number #All(Cat) was at least 100. Abbreviations of subsets: AB all, TH thermophilic, MS mesophilic, PS psychrophilic, AER aerobic, ANE anaerobic, AQU aquatic, TER terrestrial, MHAL moderately halophilic bacteria, and HITR the subset of Bacteria possessing an extreme number of tRNA genes represented by MG_CUB(Bacteria_HITR).