| Literature DB >> 19091081 |
Thomas D Otto1, Ana Carolina R Guimarães, Wim M Degrave, Antonio B de Miranda.
Abstract
BACKGROUND: Enzymes are responsible for the catalysis of the biochemical reactions in metabolic pathways. Analogous enzymes are able to catalyze the same reactions, but they present no significant sequence similarity at the primary level, and possibly different tertiary structures as well. They are thought to have arisen as the result of independent evolutionary events. A detailed study of analogous enzymes may reveal new catalytic mechanisms, add information about the origin and evolution of biochemical pathways and disclose potential targets for drug development.Entities:
Mesh:
Substances:
Year: 2008 PMID: 19091081 PMCID: PMC2628392 DOI: 10.1186/1471-2105-9-544
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Work flow of AnEnPi. Databases are represented as rectangles. Darker gray rectangles represent the five datasets of clusters. Light gray rectangles are the modular functions of AnEnPi, described in the text.
Figure 2Similarity matrix. Similarity matrix (central figure) of EC 4.2.1.2 (fumarate hydratase). Each point of the 508 × 508 matrix represents the blastp similarity score of two enzymes. All scores above 1500 are reset to 1500. Higher similarity scores yield darker points, (white represents a score below 120). Sequences were sorted by the similarity score, using the longest enzymes as reference to the other enzymes. The three arrows on the right site indicate the positions of enzymes of L. major (red arrows) and H. sapiens (blue arrow). Histograms on the left display the distribution of organisms represented in each cluster, for the three kingdoms: archaebacteria (A), bacteria (B) and eukaryotes (E). In each of the two main (analogous) clusters, subclusters can be observed. The graph at the top of the matrix displays the kingdom of the organism for every enzyme in the matrix.
Refinement of The initial Dataset (A) through the application of successive filters.
| Datasets | # Clusters | Max. Clusters | % Analogous | |||
| 1 | 2 | 3 | > 3 | |||
| A | 1447 | 459 | 199 | 328 | 131 | 40.5 |
| B | 1600 | 345 | 113 | 180 | 78 | 26.2 |
| C | 1560 | 316 | 91 | 97 | 46 | 20.7 |
| D | 1619 | 302 | 73 | 70 | 23 | 19.4 |
| E | 1897 | 142 | 23 | 1 | 5 | 8.1 |
Table 1: A, dataset obtained after clustering; B, dataset obtained after the exclusion of singlets (clusters With only one sequence); C, dataset obtained after the exclusion of EC's which are not defined up to the Fourth level (incomplete EC' s); D, dataset obtained after the joining of clusters where some sequences Were annotated as 'subunits'; E, dataset obtained after the joining of clusters with putative intragenomic Analogy. Max. Cluster, the maximum number of clusters found for one specific enzymatic activity; % analogous, fraction of enzymatic activities where analogy was detected. # Clusters: number of functions with, respectively, 1, 2, 3 or more than 3 clusters.
Examples of analogy Found in The literature and the Methods used.
| EC | Enzyme | Organism | Ref. | Method |
| 1.1.1.42 | Isocitrate dehydrogenase | [ | a | |
| 2.7.1.4 | Fructokinase | [ | a | |
| 3.2.1.86 | 6-phospho-beta-glucosidase | [ | a | |
| 3.4.21.72 | Immunoglobulin A (IgA) proteases | [ | a | |
| 2.1.1.- | N-methyltransferase 1 | [ | a, b | |
| 2.7.7.- | Adenylyltransferase | [ | a, b | |
| 3.1.3.11 | Fructose-1,6-bisphosphatase | [ | c | |
| 2.3.1.- | Enoyl thioester reductase | [ | d | |
| 1.13.11.2 | Catechol 2,3-dioxygenase | [ | e | |
| 2.7.2.3 | Glyceric acid 3-phosphate kinase | [ | f | |
| 5.3.1.1 | Triose phosphate isomerase | [ | f |
Table 2: a – computational, b – genetic complementation, c – genome sequencing, d – stereochemical assay, e – biochemical assay and f – isoelectric focusing
Figure 3Comparative analysis between H. sapiens and L. major. Brown: analogy between two genes with the same function; red: function present in both species; blue: Function present only in H. sapiens; green: function present only in L. major; gray: function not found in both species and white: no representative enzymes in KEGG.