| Literature DB >> 20423905 |
Lakshminarayan M Iyer1, Saraswathi Abhiman, Robson F de Souza, L Aravind.
Abstract
Unlike classical 2-oxoglutarate and iron-dependent dioxygenases, which include several nucleic acid modifiers, the structurally similar jumonji-related dioxygenase superfamily was only known to catalyze peptide modifications. Using comparative genomics methods, we predict that a family of jumonji-related enzymes catalyzes wybutosine hydroxylation/peroxidation at position 37 of eukaryotic tRNAPhe. Identification of this enzyme raised questions regarding the emergence of protein- and nucleic acid-modifying activities among jumonji-related domains. We addressed these with a natural classification of DSBH domains and reconstructed the precursor of the dioxygenases as a sugar-binding domain. This precursor gave rise to sugar epimerases and metal-binding sugar isomerases. The sugar isomerase active site was exapted for catalysis of oxygenation, with a radiation of these enzymes in bacteria, probably due to impetus from the primary oxygenation event in Earth's history. 2-Oxoglutarate-dependent versions appear to have further expanded with rise of the tricarboxylic acid cycle. We identify previously under-appreciated aspects of their active site and multiple independent innovations of 2-oxoacid-binding basic residues among these superfamilies. We show that double-stranded β-helix dioxygenases diversified extensively in biosynthesis and modification of halogenated siderophores, antibiotics, peptide secondary metabolites and glycine-rich collagen-like proteins in bacteria. Jumonji-related domains diversified into three distinct lineages in bacterial secondary metabolism systems and these were precursors of the three major clades of eukaryotic enzymes. The specificity of wybutosine hydroxylase/peroxidase probably relates to the structural similarity of the modified moiety to the ancestral amino acid substrate of this superfamily.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20423905 PMCID: PMC2938197 DOI: 10.1093/nar/gkq265
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Predicted hydroxy/hydroperoxy wybutosine biosynthesis pathway and domain architecture of key enzymes involved in its synthesis. (A) Shown above each enzyme of the reaction pathway, in colored squares, is the phyletic profile of the protein family. The color intensity of the outline is used to depict its distribution within a particular lineage. Thus, darkly colored squares are used when an enzyme is present in most members of that eukaryotic lineage. Lighter coloring is used for enzymes that are present only in some members of a lineage. When an enzyme is completely absent from a lineage, the square is not colored. The Tyw4 enzyme in plants, apicomplexans and stramenopiles is a specialized case and marked with an asterisk in the phyletic profile. In these cases, the kelch domain of TYW4 is found between Tyw3 and Tyw2. These also have a solo LCMT domain that in trees groups with LCMT1 and may substitute for the LCMT2 seen in Tyw4. The eukaryotes are divided into eight major groups and the expansion of the abbreviations within the boxes are as follows: An, animals; Cf, choanoflagellates; Fu, fungi; Am, Amoebozoa; Pl, plants; Al, Alveolates, Ap, Apicomplexa; St, Stramenopiles; Ki, Kinetoplastids and Heterolobosea; He, Heterolobosea only; Ba, the basal lineages diplomonads and parabasalids. We predict that the hydroperoxywybutosine reaction seen in some lineages is catalyzed by the same wybutosine hydroxylase. (B) Domain architectures from representative eukaryotes are labeled with the gene name, species abbreviation and Genbank index number separated by underscores. Domains are labeled with standard names. X and Y represent poorly characterized globular domains. The phyletic profile for the domain architecture is shown next to them in colored boxes. The abbreviations are the same as in (A).
Figure 2.Multiple sequence alignment of the JOR/JmjC superfamily. The alignment shows the key sequence features that distinguish the JOR/JmjC superfamily from other 2OGFeDO-related enzymes and also brings to attention features that distinguish the major clades of the JOR/JmjC superfamily. Additionally, it highlights conserved residues shared between the predicted yW hydroxylase and other related members, and lineage-specific residues conserved in the yW hydroxylases that may have a substrate binding role. Sequences are labeled using the gene name, species abbreviation and GenBank gi number and sequence identifiers. Clades and families of representative sequences are shown on the right. The predicted secondary structure is shown above the alignment and family-specific features are highlighted under each alignment block. The color code used to highlight conserved amino acids is listed in the lower panel and is based on 85% consensus. Numbers between alignment columns indicate the length of variable inserts. The species abbreviations are: Aexc, Asticcacaulis excentricus; BPsyn9, Synechococcus phage syn9; Bpse, Burkholderia pseudomallei; Bsub, Bacillus subtilis; Ccre, Caulobacter crescentus; Cjap, Cellvibrio japonicus; Ddis, Dictyostelium discoideum; Gzea, Gibberella zeae; Hsap, Homo sapiens; Lmaj, Leishmania major; Mmus, Mus musculus; Mxan, Myxococcus xanthus; Ncra, Neurospora crassa; Nvec, Nematostella vectensis; Otau, Ostreococcus tauri; Pfal, Plasmodium falciparum; Phum, Pediculus humanus; Pput, Pseudomonas putida; Saur, Stigmatella aurantiaca; Sdeg, Saccharophagus degradans; Spom, Schizosaccharomyces pombe.
Figure 3.Evolutionary history of the DSBH fold and structural features. On the left is a temporal diagram of the reconstructed evolutionary history of the DSBH fold. Individual superfamilies and families are listed to the right of the diagram, with solid horizontal lines tracing the inferred evolutionary depth of each family across several key evolutionary transition events shown as labeled vertical lines. Horizontal lines connecting to a dashed ellipse indicates the family descended from any one of the lineages bundled by the ellipse. Dashed horizontal lines represent an uncertainty in the inferred point of origin. Horizontal lines are color coded (see key) by observed phyletic distributions. Numbered circles illustrate the key transition events within the DSBH fold (see key). A more detailed classification of the DSBH fold is available in the Supplementary Data. On the right, the structures of some DSBH superfamilies are shown as cartoon representations with key structural, substrate binding and catalytic features. The conserved strands of the core of the DSBH fold are colored green, whereas the strand conserved in the metal-binding cupins, classical 2OGFeDO superfamily and JOR/JmjC is colored sky blue. Structural features conserved in classical 2OGFeDO and JOR/JmjC superfamilies are colored magenta, whereas those conserved in JOR/JmjC families are colored orange. In the classical 2OGFeDO and JOR/JmjC proteins, the 2-oxoglutarate is colored orange. The structure of the HIF1 JOR/JmjC contains the positions of the secondary structural annotations used in the text and multiple alignment. The proteins displayed include: Brome Mosaic Virus capsid (PDB: 1yc6), peptidylglycine alpha-hydroxylating monooxygenase (PDB: 1yi9), N-terminal domain of the catabolite gene activator protein (PDB: 1i5z), RmlC-like sugar epimerase (PDB: 2ixk), cysteine dioxygenase (PDB: 2b5h), AlkB-like 2OGFeDO (PDB: 2fd8) and FIH1-like JOR/JmjC (PDB: 1h2l).
Phyletic distribution, relationships and shared sequence features of different JOR/JmjC families
| Family Name | Phyletic distribution | Additional comments | Clade synapomorphies |
|---|---|---|---|
| Histone demethylase clade | |||
| JMJD6a | An, Ch, Fu, Pl, My, Al, St, He, Kib | Conserved arginine just N-terminal to strand-1; sporadic fusions to F-box, AT-hook, Ankyrin repeats, C2, chromo and phospholipase domains. | (i) Two amino acid β bulge in connector between strands 2 and 3, (ii) Conserved asparagine residue in the middle of strand-3, (iii) HxxxN motif in strand-7 and (iv) Universally conserved C-terminal small residue (usually asparagine) displaced towards end of strand 8 in crystal structures. |
| JMJD4a | An, Ch, Fu, Pl, Ap, St, Ki | Conserved N[HR]NWhN motif in strand-8. | |
| Fu, Am, He | Although the active site basic residue in strand-4 is lost, there is a highly conserved lysine residue before strand-8 that probably serves as an alternative active site residue; fusions to ZZ, CxxC, PHD, Bmb domains. | ||
| Jumonji/JARID1/SMCX/KDM5a | An, Fu, My, Pl, Al, St, He | HxE in strand-2 catalytic site; Jumonji inactive; fusions to BRIGHT, PHD, SJA, C2H2 Zn-fingers and SANT domains. | |
| JMJD2/KDM4a | An, Ch, Fu, Plc, St, Ci, He | HxE in strand-2 catalytic site; A cysteine and histidine in insert between strand 4 and 5, and 2 cysteines after strand-8 form a Zn-binding cluster, Fused to PHD, TUDOR, C2H2, chromo, TAM/MBD and SET domains. | |
| JMJD3 (KDM6B)/UTX (KDM6A) | An, Ch, Fu, Ci | Basic residue in strand-4 is lost, there is a highly conserved lysine residue in strand-1 that probably serves as an alternative active site residue; HxE in strand-2 catalytic site; lacks asparagine of HxxxN motif, characteristic C-terminal domain with a C2C2 motif (treble clef fold); fused to N-terminal TPR repeats, expanded in | |
| Hairless/JMJD1 | An, Ba, Pl | Fused to an N-terminal ZZ finger, distinct N-terminal treble clef domain and CxC domains in plants. | |
| JMJD8 | An, Plc, St | Contain a HxH instead of a HxD in the catalytic site in strand-2. | |
| JHDM1/KDM2/PHF2/FBXL10 | An, Ch, Fu, St, Pl | HxxT instead of the HxxxN motif in strand-7; Fusions to CxxC, FBOX, LRR, PHD. | |
| Bacterial Histone-demethylase-like | Proteobacteria, Firmicutes, Bacteroidetes | See text and | |
| yW hydroxylase/FIH1 clade | |||
| yW hydroxylase/peroxidasea | An, Ch, Fu, Plc, Ap, Ki, He | Conserved arginine in strand-N-5 and strand N-1; Fused to TYW3+TYW2, TYW4 and the kelch repeats of TYW4 in some species ( | (i) Insert between strands N-3 and N-4, (ii) Highly conserved asparagine at beginning of strand-3 and (iii) Universally conserved C-terminal small residue (usually asparagine) in middle of strand 8 in structures. |
| FIH1a | An, Ch, My, Plc, St, He | Conserved HP motif in connector between strands 4 and 5 | |
| JMJD5a | An, Ch, Ba, Pl, My, Ap, St, He | Conserved glutamate in N-3 and aromatic between strands N-3 and N-4, DxxxP motif in Helix N-4; fusion to acetyltransferase in nematodes. | |
| Secreted JMD5-like family | An (lost in vertebrates, arthropods, nematodes) | Contains disulfide-bonding conserved cysteines and a unique N-terminal GHxxxhG-motif; three subfamilies each with lineage-specific expansions in various metazoans. | |
| JMJD7a | An, Ch, Fu, Pl, St, Ci, He | Conserved acidic N-terminal to strand-1; conserved asparagine in strand 1 shared with | |
| | Fu, Ch, Plc, St, Ap | Conserved histidine in strands 3 and 8; Fused to TPR repeats and DNAJ in oomycetes; shares conserved asparagine in strand 1 with JMJD7. | |
| HSPBAP1 | An | Conserved histidine between strands 6 and 7; always a stand-alone DSBH domain. | |
| Bacterial yW hydroxylase/FIH1 | Widespread in bacteria, see supplement | See text and | |
| MINA/No66 clade | |||
| MinA/No66a | An, Ch, Fu, Pl, My, Al, St, He, Kib | Expanded in | Share a conserved C-terminal α-helical extension. |
| Bacterial MinA/No66-like/YxbC/YcfD | Wide-spread in bacteria, see supplement | Versions closer to eukaryotic homologs additionally share several conserved motifs, e.g. HxT motif in strand-8 and arginine between strands 6 and 7. |
aAncient eukaryotic JOR/JmjC lineages.
bSpecies abbreviations: Al, Alveolates; An, Animals; Ap, Apicomplexa; Ba, Basidiomycete fungi; Ch, Choanoflagellates; Ci, Ciliates; Fun, Fungi; He, Heterolobosea; Ki, Kinetoplastids; My, Mycetozoa; Pl, Plants; St, Stramenopiles.
cOnly present in chlorophyte algae.
Figure 4.Gene neighborhoods and domain architectures of various JOR/JmjC domains and classical 2OGFeDO containing systems described in this study. Genes are represented as arrows pointing from the 5′ to the 3′ end of the coding frame. Predicted operons are labeled with the gi and species name of the primary JOR/JmjC or 2OGFeDO enzyme in that context. Gene neighborhoods and domain architectures are arranged based on the primary JOR/JmjC or 2OGFeDO enzymes being described in a block. Standard domain names are used in domain architectures. MSD1-5 domains represent distinct Monosiga-specific domains that are predicted to be extracellular. MSD2, MSD4 and MSD5 also posses conserved cysteine residues. GlyRR denotes a distinct domain with glycine rich repeats that is also found in some prokaryotes in standalone form. S denotes sushi repeats. Note that in some antibiotic biosynthesis clusters, we only emphasize the genes in the immediate neighborhood of the JOR/JmjC or classical 2OGFeDO genes, as some of these are much larger gene arrays.