| Literature DB >> 22069324 |
Michael Y Galperin1, Eugene V Koonin.
Abstract
Comparative analysis of the sequences of enzymes encoded in a variety of prokaryotic and eukaryotic genomes reveals convergence and divergence at several levels. Functional convergence can be inferred when structurally distinct and hence non-homologous enzymes show the ability to catalyze the same biochemical reaction. In contrast, as a result of functional diversification, many structurally similar enzyme molecules act on substantially distinct substrates and catalyze diverse biochemical reactions. Here, we present updates on the ATP-grasp, alkaline phosphatase, cupin, HD hydrolase, and N-terminal nucleophile (Ntn) hydrolase enzyme superfamilies and discuss the patterns of sequence and structural conservation and diversity within these superfamilies. Typically, enzymes within a superfamily possess common sequence motifs and key active site residues, as well as (predicted) reaction mechanisms. These observations suggest that the strained conformation (the entatic state) of the active site, which is responsible for the substrate binding and formation of the transition complex, tends to be conserved within enzyme superfamilies. The subsequent fate of the transition complex is not necessarily conserved and depends on the details of the structures of the enzyme and the substrate. This variability of reaction outcomes limits the ability of sequence analysis to predict the exact enzymatic activities of newly sequenced gene products. Nevertheless, sequence-based (super)family assignments and generic functional predictions, even if imprecise, provide valuable leads for experimental studies and remain the best approach to the functional annotation of uncharacterized proteins from new genomes.Entities:
Mesh:
Substances:
Year: 2011 PMID: 22069324 PMCID: PMC3249071 DOI: 10.1074/jbc.R111.241976
Source DB: PubMed Journal: J Biol Chem ISSN: 0021-9258 Impact factor: 5.157
Common features of proteins from several structural superfamilies
Root mean square deviation (r.m.s.d.) values of Cα traces were taken from the Dali and Molecular Modeling Databases (12, 82). An expanded version of this table that includes EC numbers, references, and hyperlinks to related databases is available in supplemental Table S1 as well as on the NCBI ftp site (ftp.ncbi.nih.gov/pub/galperin/EnzymeSuperfamilies.html). aa, amino acids; GPI, glycosylphosphatidylinositol; fGly, formylglycine.
| Member enzymes (Protein Data Bank code, where available) | Common traits of superfamily members | Refs. |
|---|---|---|
| Glutathione synthetase ( | Conserved structural core (≤4.3 Å Cα r.m.s.d. on ≥230 aa); common ATP-binding residues, which include two conserved Lys/Arg residues that bind α- and β-phosphates of ATP, Glx/Asp residue that interacts with adenine amino group and N6 atom, hydrophobic residues that bind adenine ring, and three Glx/Asx residues that coordinate Mg2+ ions; common catalytic mechanism that includes formation of phosphoacyl intermediate | |
| Alkaline phosphatase ( | Conserved structural core (≤3.6 Å Cα r.m.s.d. on ≥220 aa); conserved metal (Zn2+, Mn2+, or Mg2+)-binding His and Asp residues; common catalytic mechanism that includes phosphorylation (sulfatation) of active site Ser/Thr/fGly residue | |
| Oxalate oxidase ( | Conserved structural core (<4.6 Å Cα r.m.s.d. on >99 aa); partly conserved metal (Mn2+, Fe2+, Cu2+, Ni2+, or Zn2+)-binding His residues that often form G X5H XH X3,4E X6G and GD | |
| 3′,5′-cAMP/cGMP phosphodiesterase ( | Conserved structural core (<3.6 Å Cα r.m.s.d. on >105 aa); conserved metal (Mn2+, Mg2+, Co2+, or Fe2+)-binding His and Asp residues organized into H | |
| Penicillin acylase ( | Common structural core (<4.1 Å Cα r.m.s.d. on >96 aa) decorated with variety of structural elements; sequence conservation limited to N-terminal β-hairpin that contains catalytic Ser, Cys, or Thr residue | |
FIGURE 1.Conservation of structural core and active site residues in ATP-grasp ( Conserved structural elements, identified through VAST alignments (83), are shown in tan, active site residues are shown as sticks, the most conserved residues are shown in bright colors (with carbon atoms shown in green), and catalytic metal atoms are shown as pink spheres. A, inositol-1,3,4-trisphosphate 5/6-kinase (Protein Data Bank code 1z2p (29)) with bound ATP analog AMP-PCP. Carbon atoms are in silver. B, AlkP(H331Q) mutant with a phosphoserine intermediate (code 1hjk (84)). C, cysteine dioxygenase with a persulfenate intermediate (code 3eln (85)). D, 5′-deoxyribonucleotidase with bound dAMP (code 2pau (50)).