| Literature DB >> 22691113 |
Anindya Roy Chowdhury1, Chitra Dutta.
Abstract
BACKGROUND: Archaea evoke interest among researchers for two enigmatic characteristics -a combination of bacterial and eukaryotic components in their molecular architectures and an enormous diversity in their life-style and metabolic capabilities. Despite considerable research efforts, lineage- specific/niche-specific molecular features of the whole archaeal world are yet to be fully unveiled. The study offers the first large-scale in silico proteome analysis of all archaeal species of known genome sequences with a special emphasis on methanogenic and sulphur-metabolising archaea.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22691113 PMCID: PMC3416665 DOI: 10.1186/1471-2164-13-236
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1 Grouping of archaea according to their standardised amino acid usage. The left panel is a pictorial representation of relative amino acid usage in the respective archaea. The over- representation and underrepresentation of amino acid residues in the organisms are shown in green and red colored blocks, respectively. The right panel depicts the complete linkage clustering on the Euclidean distances between the relative abundances of different amino acid residues in the encoded proteins of organisms. Organism abbreviations are listed in Additional file 1. The panel in between shows the short names of the archaea, their genomic GC content, temperature adaptation and oxygen requirements respectively.
Figure 2 Isoelectric point distribution patterns in a) euryarchaeal classes b) crenarchaeal orders and c) the rest three phyla of archaea.
Figure 3 Distribution of COGs categories of the entire archaeal group under study and their comparison with E.coli (member from bacterial kingdom) and yeast (member from eukaryotic kingdom).
Figure 4 heat map and clustering on the unbiased amino acid usage profile in the methanogenic and the sulphur metabolising group. In the heat map, the column ‘other’ represents the sum of the frequencies of the rest of the amino acids i.e. those which are biased at their codon levels. The colour scheme and the clustering method is the same as described in Figure 1.
Highest and lowest values of amino acids in the methanogenic and sulphur metabolising orders
| Thermococcales | 4.40 | 2.88 | Methanopyrales | |
| Methanococcales | 9.92 | 4.81 | Methanopyrales | |
| Methanobacteriales | 5.88 | 1.92 | Methanopyrales | |
| Methanococcales | 9.49 | 3.99 | Methanopyrales | |
| Sulfolobales | 4.74 | 2.82 | Methanopyrales | |
| Methanocellales | 3.00 | 1.65 | Methanopyrales | |
| Desulfurococcales | 10.97 | 8.65 | Methanocellales | |
| Methanopyrales | 10.55 | 6.81 | Methanococcales | |
| Methanosarcinales | 6.83 | 4.64 | Methanopyrales | |
| Methanomicrobiales | 6.07 | 4.48 | Desulfurococcales | |
| Methanomicrobiales | 2.01 | 1.32 | Sulfolobales | |
| Methanomicrobiales | 2.87 | 1.41 | Methanopyrales | |
| Methanobacteriales | 6.18 | 4.39 | Thermoproteales | |
| Methanopyrales | 9.99 | 6.18 | Unclassified methanogen RC-I | |
| Methanomicrobiales | 1.34 | 0.56 | Thermococcales | |
| Thermoproteales | 1.35 | 0.66 | Methanococcales | |
| Methanopyrales | 8.33 | 3.35 | Methanococcales | |
| Methanopyrales | 5.48 | 3.39 | Methanococcales | |
| Thermoproteales | 9.40 | 5.55 | Methanococcales | |
| Methanopyrales | 8.05 | 6.60 | Sulfolobales |
Figure 5 Plot of all the methanogens and the sulphur metabolisers in the plane of the first two axes generated by the correspondence analysis on a) all amino acid usage and b) unbiased amino acid usage.
Figure 6 Isoelectric point and aromaticity distribution patterns in the two studied groups.
Figure 7 Surface charge distribution pattern of the protein glyceraldehydes-3-phosphate- dehydrogenase in M.jan (methanogen) and S.sol (sulphur metaboliser).
Top 20 amino acid pairs displaying highest bias in terms of differences and ratios in number of forward (methanogens → sulphur metaboliser) and reverse (sulphur metaboliser → methanogen) replacements in 213 orthologous proteins from M.del to T. onn
| R → K | 1.97## | 871 | 443 | 428 | Q → W | 8.00* | 8 | 1 | 7 |
| E → K | 2.03## | 615 | 303 | 312 | C → M | 7.50** | 15 | 2 | 13 |
| D → E | 1.58## | 723 | 458 | 265 | H → W | 6.00** | 18 | 3 | 15 |
| S → K | 3.91## | 274 | 70 | 204 | C → I | 6.00## | 30 | 5 | 25 |
| D → K | 2.84## | 281 | 99 | 182 | C → S | 4.75## | 38 | 8 | 30 |
| S → A | 1.65## | 442 | 268 | 174 | C → N | 4.67** | 14 | 3 | 11 |
| S → E | 2.41## | 296 | 123 | 173 | D → W | 4.33* | 13 | 3 | 10 |
| I → L | 1.20# | 838 | 701 | 137 | R → W | 4.25## | 34 | 8 | 26 |
| R → E | 1.40## | 436 | 312 | 124 | S → K | 3.91## | 274 | 70 | 204 |
| M → L | 1.45## | 379 | 262 | 117 | C → A | 3.83## | 92 | 24 | 68 |
| G → K | 2.22## | 182 | 82 | 100 | M → W | 3.60** | 18 | 5 | 13 |
| T → V | 1.48## | 275 | 186 | 89 | C → V | 3.55## | 71 | 20 | 51 |
| A → K | 1.77## | 188 | 106 | 82 | C → G | 3.50** | 21 | 6 | 15 |
| L → F | 1.30** | 328 | 252 | 76 | S → W | 3.50* | 14 | 4 | 10 |
| M → I | 1.51## | 204 | 135 | 69 | D → Y | 3.24## | 55 | 17 | 38 |
| C → A | 3.83## | 92 | 24 | 68 | C → T | 3.20# | 32 | 10 | 22 |
| S → T | 1.33** | 275 | 207 | 68 | C → F | 3.17** | 19 | 6 | 13 |
| D → N | 1.43# | 216 | 151 | 65 | D → K | 2.84## | 281 | 99 | 182 |
| T → L | 1.62## | 165 | 102 | 63 | V → W | 2.70** | 27 | 10 | 17 |
| H → Y | 2.21## | 106 | 48 | 58 | I → W | 2.44* | 22 | 9 | 13 |
* significant at p < 0.05, ** significant at p < 0.01, # significant at p <0.001, ## significant at p < 0.0001.
Figure 8 Plot of percentage occurrence of COGs categories in different groups of methanogens and sulphur metabolisers.
Methanogen-specific COGs ID and their descriptions
| COG4058 | Methyl coenzyme M reductase, alpha subunit | met-coenzyme M reductase | met-coenzyme M reductase |
| COG4054 | Methyl coenzyme M reductase, beta subunit | ||
| COG4056 | Methyl coenzyme M reductase, subunit C | ||
| COG4055 | Methyl coenzyme M reductase, subunit D | ||
| COG2710 | Nitrogenase molybdenum-iron protein, alpha and beta chains | methanogenesis marker protein 13 | oxidored-nitr |
| COG1348 | Nitrogenase subunit NifH (ATPase) | NifH | Fer4-NifH |
| COG4008 | Predicted metal-binding transcription factor | methanogenesis marker protein 9 | Not found |
| COG4070 | Predicted peptidyl-prolyl cis-trans isomerase (rotamase), cyclophilin family | methanogenesis marker protein 3 | |
| COG4002 | Predicted phosphotransacetylase | methanogenesis marker protein 4 | |
| COG4032 | Predicted thiamine-pyrophosphate-binding protein | sulfopyruvate decarboxylase | TPP enzyme |
| COG2144 | Selenophosphate synthetase-related proteins | methanogenesis marker protein 2 | AIRS |
| COG4063 | Tetrahydromethanopterin S-methyltransferase, subunit A | Methyl transferase (Mtr) | Methyl transferase (Mtr) |
| COG4062 | Tetrahydromethanopterin S-methyltransferase, subunit B | ||
| COG4061 | Tetrahydromethanopterin S-methyltransferase, subunit C | ||
| COG4060 | Tetrahydromethanopterin S-methyltransferase, subunit D | ||
| COG4059 | Tetrahydromethanopterin S-methyltransferase, subunit E | ||
| COG4050 | Uncharacterized protein conserved in archaea | methanogenesis marker protein 5 | Domain of unknown function (DUF) |
| COG4051 | Uncharacterized protein conserved in archaea | methanogenesis marker protein 17 | |
| COG4065 | Uncharacterized protein conserved in archaea | methanogenesis marker protein 14 | |
| COG4014 | Uncharacterized protein conserved in archaea | *** | |
| COG4029 | Uncharacterized protein conserved in archaea | methanogenesis marker protein 6 | |
| COG4052 | Uncharacterized protein related to methyl coenzyme M reductase subunit C | methanogenesis marker protein 7 |
Figure 9 Four-variable venn diagram of the COGs possession in a) four methanogenic classes and b) four sulphur metabolising orders. MM- methanomicrobia, MB- Methanobacteria, MP- Methanopyri, MC- methanococci, SL-Sulfolobales, AG-Archaealglobales, TP-Thermoproteales, DC- Desulfurococcales.