| Literature DB >> 24874434 |
Neetika Nath1, John B O Mitchell1, Gustavo Caetano-Anollés2.
Abstract
Phylogenomic analysis of the occurrence and abundance of protein domains in proteomes has recently showed that the α/β architecture is probably the oldest fold design. This holds important implications for the origins of biochemistry. Here we explore structure-function relationships addressing the use of chemical mechanisms by ancestral enzymes. We test the hypothesis that the oldest folds used the most mechanisms. We start by tracing biocatalytic mechanisms operating in metabolic enzymes along a phylogenetic timeline of the first appearance of homologous superfamilies of protein domain structures from CATH. A total of 335 enzyme reactions were retrieved from MACiE and were mapped over fold age. We define a mechanistic step type as one of the 51 mechanistic annotations given in MACiE, and each step of each of the 335 mechanisms was described using one or more of these annotations. We find that the first two folds, the P-loop containing nucleotide triphosphate hydrolase and the NAD(P)-binding Rossmann-like homologous superfamilies, were α/β architectures responsible for introducing 35% (18/51) of the known mechanistic step types. We find that these two oldest structures in the phylogenomic analysis of protein domains introduced many mechanistic step types that were later combinatorially spread in catalytic history. The most common mechanistic step types included fundamental building blocks of enzyme chemistry: "Proton transfer," "Bimolecular nucleophilic addition," "Bimolecular nucleophilic substitution," and "Unimolecular elimination by the conjugate base." They were associated with the most ancestral fold structure typical of P-loop containing nucleotide triphosphate hydrolases. Over half of the mechanistic step types were introduced in the evolutionary timeline before the appearance of structures specific to diversified organisms, during a period of architectural diversification. The other half unfolded gradually after organismal diversification and during a period that spanned ∼2 billion years of evolutionary history.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24874434 PMCID: PMC4038463 DOI: 10.1371/journal.pcbi.1003642
Source DB: PubMed Journal: PLoS Comput Biol ISSN: 1553-734X Impact factor: 4.475
Figure 1The history of biocatalytic mechanisms.
The heat map describes the distribution of presence (red) and absence (yellow) of mechanism step types (y-axis) over fold age (x-axis). Rows of the heat map (mechanisms) are ordered vertically according to the first appearance of the step type in time, with the oldest at the top. The row sidebars at the top of the heat map are used to describe the number of MACiE entries and CATH H-level domain structures (annotated as number of folds) appearing at each fold age, and presence of top-level EC classes that are associated with these H-level structures (see color key). The x-axis scale reflects the different nd values found in our dataset, arranged from the oldest on the left to the youngest on the right. Every unique nd value forms a separate column. The non-linear scale is defined by the number of unique nd values falling in each interval of nd. There are many distinct nd values between 0.0 and 0.3 found in our dataset, so the scale is expanded in this region. There are few distinct nd values between 0.7 and 1.0, so the scale is very condensed in that region. Geological time is taken to be approximately linear with nd, where nd = 0 represents the origin of the protein world approximately 3.8 billion years ago and nd = 1 corresponds to the present [4].
Figure 2Definition of the most ancient mechanistic step types, which include fundamental building blocks of enzyme chemistry: “Proton transfer”, “Bimolecular nucleophilic addition”, “Bimolecular nucleophilic substitution”, and “Unimolecular elimination by the conjugate base”.
We follow MACiE's terminology, though the latter could perhaps be better described as “Unimolecular elimination from the conjugate base”, being the second and last step of the E1cB “Unimolecular elimination via the conjugate base” mechanism.
Figure 3Cumulative plot describing the appearance of mechanistic step types in protein domain evolution.
The graph shows the proportion of mechanistic step types that are present at a particular time.
Figure 4Heat map representing the number of mechanistic step types (y-axis) used by H-level structures of each different fold age (x-axis).
Different colors indicate distinct structures which happen to share both the same number of mechanistic step types and an identical fold age. For example, in column 2 the black coloring of rows 4, 15 and 16 shows that four structures respectively accommodate 4, 15 and 16 different mechanistic step types to effect their reactions. The color code for the row sidebar is similar to that in Figure 1; the x-axis scale is also similar to that in Figure 1.
Discovery of MACiE's mechanistic step types according to the evolutionary timeline of domain structure innovation.
| Fold age | CATH | Description | Mechanisms discovered |
|
| 3.40.50.300 | P-loop containing nucleotide triphosphate hydrolases | Bimolecular nucleophilic addition |
| Bimolecular nucleophilic substitution | |||
| Intramolecular nucleophilic addition | |||
| Proton transfer | |||
| Unimolecular elimination by the conjugate base | |||
| Electron transfer | |||
|
| 3.40.50.150 | Vaccinia Virus protein VP39 | Bimolecular elimination |
|
| 3.40.50.720 | NAD(P)-binding Rossmann-like Domain | Bimolecular elimination |
| Aromatic bimolecular nucleophilic addition | |||
| Aromatic unimolecular elimination by the conjugate base | |||
| Assisted keto-enol tautomerisation | |||
| Aromatic intramolecular elimination | |||
| Bimolecular homolytic addition | |||
| Radical formation | |||
| Radical termination | |||
| Redox | |||
| Bimolecular electrophilic addition | |||
|
| 3.50.50.60 | FAD/NAD(P)-binding domain | Bimolecular elimination |
| Aromatic bimolecular nucleophilic addition | |||
| Aromatic unimolecular elimination by the conjugate base | |||
| Assisted keto-enol tautomerisation | |||
| Aromatic intramolecular elimination | |||
| Bimolecular homolytic addition | |||
| Radical formation | |||
| Radical termination | |||
| Colligation | |||
| Redox | |||
|
| 3.40.50.620 | HUPs | Intramolecular elimination |
|
| 3.20.20.70 | Aldolase class I | Heterolysis |
| Aldol addition | |||
| Assisted other tautomerisation | |||
| Aromatic bimolecular elimination | |||
| Other tautomerisation | |||
|
| 3.40.50.970 | Not Assigned (1-deoxy-D-xylulose-5-phosphate synthase -like domain 1/2/3) | Homolysis |
| Elimination reaction | |||
|
| 3.40.190.10 | Periplasmic binding protein-like II | Aromatic bimolecular nucleophilic substitution |
|
| 3.90.226.10 | 2-enoyl-CoA Hydratase; Chain A domain 1 | Keto-Enol tautomerisation |
| Intramolecular electrophilic addition | |||
|
| 3.40.47.10 | Peroxisomal Thiolase; Chain A, domain 1 | Claisen condensation |
|
| 3.40.30.10 | Glutaredoxin | Intramolecular nucleophilic substitution |
|
| 3.60.21.10 | Purple Acid Phosphatase; chain A, domain 2 | Coordination |
|
| 2.60.120.10 | Jelly Rolls | Radical propagation |
|
| 3.40.50.1820 | Not Assigned 4,9-DSHA hydrolase activity, (Carboxyesterase-related protein -like domain 1) | Substitution reaction |
|
| 3.20.70.20 | Anaerobic Ribonucleotide-triphosphate Reductase Large Chain | Bimolecular homolytic substitution |
| Hydrogen transfer | |||
| Unimolecular homolytic elimination | |||
|
| 1.10.600.10 | Farnesyl Diphosphate Synthase | Intramolecular electrophilic substitution |
| Intramolecular rearrangement | |||
|
| 2.40.100.10 | Cyclophilin | Isomerisation |
|
| 3.40.50.10090 | Not Assigned (Uroporphyrinogen-III synthase -like domain 1/2) | Aromatic intramolecular electrophilic substitution |
|
| 3.30.1130.10 | GTP Cyclohydrolase I, domain 2 | Amadori rearrangement |
|
| 1.10.520.10 | Not Assigned (Catalase-peroxidase -like domain 1/2) | Bond order change |
|
| 3.40.50.10230 | Precorrin-8X methylmutase CbiC/CobH | Sigmatropic rearrangement |
| Pericyclic reaction | |||
|
| 1.10.606.10 | Vanadium-containing Chloroperoxidase domain 2 | Acidic bimolecular nucleophilic substitution |
|
| 1.10.590.10 | Chorismate Mutase subunit A | Claisen rearrangement |
|
| 3.20.20.240 | TIM Barrel | Intramolecular homolytic addition |
| Bimolecular homolytic elimination | |||
|
| 1.25.40.80 | Serine Threonine Protein Phosphatase 5, Tetratricopeptide repeat | Photochemical activation |
|
| 1.10.800.10 | Phenylalanine Hydroxylase | Aromatic bimolecular electrophilic addition |
First column represents nd values, second CATH code, third CATH H-level structure names (in cases where the names were not assigned, we have given the FunFams description) and the last column represents mechanistic step types as described in MACiE.
Figure 5Heat map representing the similarity of mechanistic step types utilised by the H-level structures.
For this we have calculated the Jaccard similarity scores. Here the x and y axes in the plot are ordered using a hierarchical clustering algorithm in which the two most similar data points are linked together at each iteration. The colors of the heatmap represent the similarity scores where yellow suggests low or no (when 0) similarity and white (1) means that identical combinations of mechanistic steps are shared between two H-level structures. The top left corner represents the color key for the similarity scores and the distribution of the similarity scores.
Pattern 133, the mechanistic step types associated with CATH 3.20.20.70, Aldolase class I.
| Mechanistic step types with CATH 3.20.20.70, Aldolase class I |
| Unimolecular elimination by the conjugate base |
| Redox |
| Radical termination |
| Radical formation |
| Proton transfer |
| Other tautomerisation |
| Intramolecular nucleophilic addition |
| Intramolecular elimination |
| Hydride transfer |
| Heterolysis |
| Electron transfer |
| Bimolecular nucleophilic substitution |
| Bimolecular nucleophilic addition |
| Bimolecular elimination |
| Assisted other tautomerisation |
| Assisted keto-enol tautomerisation |
| Aromatic unimolecular elimination by the conjugate base |
| Aromatic bimolecular nucleophilic addition |
| Aromatic bimolecular elimination |
| Aldol addition |
MACiE enzymes for purine metabolism.
| MACiE | Enzyme Name | EC | Subnetwork | PDB | CATH H level Structure |
| Combinatorial patterns | Mechanistic step types |
|
| adenylate kinase | 2.7.4.3 | INT | 1zio | 3.40.50.300 | 0 | Pattern 2 | Bimolecular nucleophilic substitution |
|
| GMP synthase (glutamine- hydrolysing) | 6.3.5.2 | INT | 1gpm | 3.40.50.880 | 0.0980 | Pattern 4 (+2) | Proton transfer |
| Bimolecular nucleophilic substitution | ||||||||
| Unimolecular elimination by the conjugate base | ||||||||
| Bimolecular nucleophilic addition | ||||||||
|
| pyruvate kinase | 2.7.1.40 | INT | 1pkn | 3.20.20.60 | 0.1127 | Pattern 4 | Proton transfer |
| Bimolecular nucleophilic substitution | ||||||||
|
| pyruvate kinase | 2.7.1.40 | INT | 1pkn | 2.40.33.10 | 0.4118 | Pattern 4 | Proton transfer |
| Bimolecular nucleophilic substitution | ||||||||
|
| adenylosuccinate lyase | 4.3.2.2 | INT | 1c3c | 1.20.200.10 | 0.1667 | Pattern 6 | Proton transfer |
| Bimolecular elimination | ||||||||
|
| adenylosuccinate synthase | 6.3.4.4 | INT | 1gim | 3.40.440.10 | 0.2353 | Pattern 4 (+2) | Proton transfer |
| Bimolecular nucleophilic substitution | ||||||||
| Assisted other tautomerisation | ||||||||
| Aromatic bimolecular nucleophilic substitution | ||||||||
|
| nucleoside-diphosphate kinase | 2.7.4.6 | INT | 1ndp | 3.30.70.141 | 0.3186 | Pattern 4 | Proton transfer |
| Bimolecular nucleophilic substitution |
Table columns are: MACiE code, Enzyme name, EC number, Purine metabolic subnetwork [41], PDB code, CATH H-level Structure, nd value and mechanistic step types.