| Literature DB >> 17605815 |
A Maxwell Burroughs1, S Balaji, Lakshminarayan M Iyer, L Aravind.
Abstract
BACKGROUND: The beta-grasp fold (beta-GF), prototyped by ubiquitin (UB), has been recruited for a strikingly diverse range of biochemical functions. These functions include providing a scaffold for different enzymatic active sites (e.g. NUDIX phosphohydrolases) and iron-sulfur clusters, RNA-soluble-ligand and co-factor-binding, sulfur transfer, adaptor functions in signaling, assembly of macromolecular complexes and post-translational protein modification. To understand the basis for the functional versatility of this small fold we undertook a comprehensive sequence-structure analysis of the fold and developed a natural classification for its members.Entities:
Year: 2007 PMID: 17605815 PMCID: PMC1949818 DOI: 10.1186/1745-6150-2-18
Source DB: PubMed Journal: Biol Direct ISSN: 1745-6150 Impact factor: 4.540
Figure 1Topology diagrams of selected β-GF members. A generalized representative is shown in (A) with the key structural features found in certain lineages of the fold labeled, while (B) depicts idealized versions of specific lineages, the names of which are given above the diagrams. Strands are shown as arrows with the arrowhead at the C-terminal end. Strands belonging to the 4-stranded β-GF core are colored green, the additional strand found in the 5-stranded assemblage is colored yellow, strands forming a conserved insert within the β-GF scaffold are colored magenta, and other strands specific to a certain lineage are colored grey and outlined with a broken line. Helices are depicted as rectangles, with the core absolutely conserved helix colored orange and other helices specific to a certain lineage colored grey and outlined with a broken line. The diagrams are grouped and labeled in a manner consistent with the structural classes described in the text, with members of the eukaryotic UB-like superfamily nested within other members of the 5-stranded assemblage. The 2Fe-2S cluster of the ferredoxins is shown as four small ovals bound to cysteine residues represented by the letter "C".
Secondary structure features of major β-GF structural categories.
| Higher-order Classification | Lineage Name | Secondary Structural Features Common to the β-GF Fold1 | ||||||||||||
| S1 | L1 | S2 | L2 | H | L3/LS | S3 | L4 | S4 | L5/CA | S5 | tail | notes | ||
| Basal 4-stranded versions of the β-GF | IF3-N | S1 | -- | S2 | -- | H | -- | S3 | -- | O | O | S5 | -- | |
| Archeo-eukaryotic RNA poly. β-subunit | S1 | -- | S2 | -- | H | -- | S3 | -- | O | O | S5 | -- | ||
| Sporadically-distributed 4-stranded versions | Yml108w | S1 | cc | S2 | -- | H | -- | S3 | -- | O | O | S5 | h | |
| BofC | S1 | -- | S2 | -- | H | -- | S3 | -- | O | O | S5 | -- | ||
| Immunoglobulin-binding | S1 | -- | S2 | -- | H | -- | S3 | -- | O | O | S5 | -- | ||
| POZ | S1 | -- | S2 | -- | H | h | S3 | -- | O | O | S5 | -- | ||
| Nudix superfamily | Nudix (MutT) | S1 | -- | S(ee)2 | -- | H | * | S3 | -- | O | O | S5 | e | |
| Fasciclin-like assemblage | L25 | S1 | -- | S2 | -- | H | ee* | S3 | -- | O | O | S5 | -- | 3 |
| glutamine synthetase N-terminal | S1 | -- | S2 | -- | H | eee* | S3 | -- | O | O | S5 | -- | 3 | |
| fasciclin | S1 | hhh | S2 | -- | H | ee* | S3 | -- | O | O | S5 | -- | 3 | |
| phosphoribosyl AMP cyclohydrolase (HisI) | S1 | -- | S2 | -- | H | ee* | S3 | -- | O | O | S5 | -- | 3,4 | |
| 5-stranded assemblage: classical 5-stranded clade | MoaD | S1 | H | S2 | -- | H | h** | S3 | -- | S4 | * | S5 | -- | |
| ThiS | S1 | -- | S2 | -- | H | * | S3 | -- | S4 | * | S5 | -- | ||
| TmoB | S1 | -- | S2 | -- | H | * | S3 | -- | S4 | * | S5 | -- | ||
| Superantigen | S1 | -- | S2 | -- | H | * | S3 | -- | S4 | h* | S5 | -- | ||
| Strepto/Staphylokinase | S1 | -- | S2 | -- | H | * | S3 | -- | S4 | * | S5 | -- | ||
| YukD | S1 | -- | S2 | -- | H | * | S3 | -- | S4 | * | S5 | -- | ||
| TGS | S1 | -- | S2 | -- | H | h* | S3 | -- | S4 | * | S5 | -- | ||
| Aldehyde OR2 N-terminal domain | S1 | -- | S2 | -- | H | * | S3 | -- | S4 | eh* | S5 | -- | ||
| 5-stranded assemblage: Selected eukaryote UB-like superfamily members | classic UB-like | S1 | -- | S2 | -- | H | * | S3 | -- | S4 | * | S5 | -- | |
| PB1 | S1 | -- | S2 | -- | H | * | S3 | -- | S4 | h* | S5 | -- | ||
| CAD/Doublecortin (DCX) | S1 | -- | S2 | -- | H | * | S3 | -- | S4 | [h]* | S5 | -- | 6 | |
| RA | S1 | -- | S2 | -- | H | * | S3 | -- | S4 | h* | S5 | -- | ||
| Elongin | S1 | -- | S2 | -- | H | * | S3 | -- | S4 | * | S5 | -- | ||
| UBX | S1 | -- | S2 | -- | H | * | S3 | -- | S4 | * | S5 | -- | ||
| E1/UFD | O | -- | S2 | -- | H | * | S3 | -- | S4 | * | S5 | S6 | 7 | |
| 5-stranded assemblage: soluble ligand binding or metal ion chelating clade | molydopterin-dependent oxidoreductase | S1 | -- | S2 | hehee | H | * | S3 | -- | S4 | eee* | S5 | -- | |
| SLBB: Nqo1-type | S1 | -- | S2 | -- | H | * | S3 | -- | S4 | hh* | S5 | -- | 5 | |
| SLBB: transcobalamin-type | S1 | -- | S2 | -- | H | eee* | S3 | -- | S4 | * | S5 | -- | ||
| 2Fe-2S ferredoxin | S1 | -- | S2 | -- | H | cc* | S3 | -- | S4 | * | S5 | -- | ||
| L-proline DH-like OR2 N-terminal domain | S1 | -- | S2 | -- | H | ee* | S3 | -- | S4 | * | S5 | -- | ||
| Miscellaneous | WWE | S1 | -- | S2 | -- | H | e* | S3 | -- | O | O | S5 | e | 8 |
| FimD N-terminal | S1 | -- | S2 | ee | H | * | S3 | -- | O | O | S5 | -- | ||
| S4 | O | O | O | O | H | h* | S3 | -- | S4 | * | S5 | -- | ||
1. S: Strand, L: Loop, H: Helix, LS: Lateral Shelf, CA: Connector Arm, O: absence of given feature, --: presence of a loop feature, *: presence of LS or CA, h: insert in helical conformation, e: insert in extended conformation (strand-like), cc: long coil insert.
2. OR: oxidoreductase.
3. Versions form barrel through insertion of strands at the lateral shelf.
4. Barrel is less pronounced in this version, strands are inserted more upstream relative to the other 3 versions.
5. Two small helices are present in ascending arm.
6. Single helix found at ascending arm in several members.
7. Circular permutation results in new connections between strands; the S1 strand is found at C-terminus (See Figures 1, 2).
8. Additional strand at tail inserted between S1 and S5; lateral shelf forms strand that also stacks with central sheet.
Figure 2Cartoon representations of distinct β-GF domains. Critical residues in MutT and HisI that are involved in enzyme catalysis are also shown.
Figure 3Reconstructed evolutionary history of β-grasp fold. Individual lineages are listed to the left of the figure grouped according to classifications given in the text, with their inferred evolutionary depth traced by solid horizontal lines across the relative temporal epochs representing major evolutionary transitional periods shown as vertical lines. The horizontal lines are color-coded according to their observed phyletic distributions, the key for this coloring scheme is given at the bottom of the figure. Dashed lines indicate uncertainty in terms of the origins of a lineage, while grey ellipses group lineages of relatively restricted phyletic distribution with more broadly distributed lineages, indicating that the former likely underwent rapid divergence from the latter. Major predicted structural/functional transitions of the fold are marked by green ellipses with a brief description given. Colored, labeled squares immediately to the left of the lineage names represent broad functional categories: E, enzymatic activity; LMB, ligand or metal-binding; CO, conjugated versions; AD, mediator of protein-protein interactions; RNA, RNA metabolism-related.
Figure 4Reconstructed evolutionary history of eukaryotic ubiquitin superfamily. Similar to Figure 3, however, major evolutionary transitions are now shown as horizontal lines and the maximum depth to which these individual lineages can be traced is now shown with solid vertical lines. Functional categories are the same as described in Figure 3.
Figure 5A) Architectural complexity plot of β-grasp domains found in eukaryotes and prokaryotes. The complexity quotient for a given species (y-axis) is plotted against the total number of β-grasp domain containing proteins in the same species. Names of species are given next to plot points. B) Domain architectures of β-grasp domains. Only a small sample of architectures is shown. These mainly represent novel or recently reported architectures that are described in the text. The TRS4 C-terminal domain, also found fused to certain E1-enzymes that lack the C-terminal UFD has a highly conserved ExxxH implying enzymatic function (see Additional file 1 for an alignment). Orange ellipses represent the conserved cysteine clusters observed in the NPL4-N family (see Additional file 1). A straight line with a small green box in the Ddi1 family architecture represents a possible cleavage site located between the domains. The proteins are not drawn to scale as only globular segments are show. Explanation of abbreviations/domain names: B3, DNA-binding domain; Auxin response, auxin-responsive transcription factor domain; OTU, OTU-like family of cysteine proteases; Znf, zinc-finger; Znf_LF, little finger family of zinc finger domains; R, Ring-finger domain; β-P, β-propeller domain; X, previously uncharacterized BofC C-terminal domain also found fused to a serine/threonine phosphatase in actinobacteria (see Additional file 1 for alignment). Organism abbreviations: Ehis, Entamoeba histolytica; Ath, Arabidopsis thaliana; Hsap, Homo sapiens; Rnor, Rattus norvegicus; Blic, Bacillus licheniformis; Mmaz, Methanosarcina mazei; Ddis, Dictyostelium discoideum; Lmaj, Leishmania major; Tcru, Trypanosoma cruzi; Pfal, Plasmodium falciparum; Tthe, Tetrahymena thermophila; Ncra, Neurospora crassa; Drer, Danio rerio; Cele, Caenorhabditis elegans; Dmel, Drosophila melonogaster; Scer, Saccharomyces cerevisiae; Tvag, Trichomonas vaginalis; Uma, Ustilago maydis; Spom, Schizosaccharomyces pombe; Cneo, Cryptococcus neoformans; Glam, Giardia lamblia; Cpar, Cryptosporidium parva; Tmar, Thermotoga maritima; Mpne, Mycoplasma pneumoniae; Ecol, Escherichia coli; Vcho, Vibrio cholerae; Hpyl, Helicobacter pylori; Nmen, Neisseria meningitides; Msp., Mesorhizobium sp.; Ctet, Clostridium tetani; Aaeo, Aquifex aeolicus; Tden, Treponema denticola; Drad, Deinococcus radiodurans; Mtub, Mycobacterium tuberculosis; Save, Streptomyces avermitilis; Bfra, Bacteroides fragilis; Ctep, Chlorobium tepidum; Nsp., Nostoc sp.; Ssp., Synecococcus sp.; Cpneu, Chlamydophila pneumoniae.
Figure 6Diagram of relative location of β-grasp interacting partners. The strands and core helix of an idealized β-GF domain have been broken into interaction zones, and the names of representatives of the fold that interact using each of these zones is listed. The top view depicts the exposed face while the bottom view depicts the obscured face. Coloring of the boxes containing lists of specific β-GF domains interacting via a particular region correspond to coloring of structural elements (i.e. a particular strand or loop) involved in the interaction.