| Literature DB >> 34094591 |
Michelle Langton1, Sining Sun1, Chie Ueda1, Max Markey1, Jiahua Chen1, Isaac Paddy1, Paul Jiang1, Natalie Chin1, Amy Milne1, Maria-Eirini Pandelia1.
Abstract
The histidine-aspartate (HD)-domain protein superfamily contains metalloproteins that share common structural features but catalyze vastly different reactions ranging from oxygenation to hydrolysis. This chemical diversion is afforded by (i) their ability to coordinate most biologically relevant transition metals in mono-, di-, and trinuclear configurations, (ii) sequence insertions or the addition of supernumerary ligands to their active sites, (iii) auxiliary substrate specificity residues vicinal to the catalytic site, (iv) additional protein domains that allosterically regulate their activities or have catalytic and sensory roles, and (v) their ability to work with protein partners. More than 500 structures of HD-domain proteins are available to date that lay out unique structural features which may be indicative of function. In this respect, we describe the three known classes of HD-domain proteins (hydrolases, oxygenases, and lyases) and identify their apparent traits with the aim to portray differences in the molecular details responsible for their functional divergence and reconcile existing notions that will help assign functions to yet-to-be characterized proteins. The present review collects data that exemplify how nature tinkers with the HD-domain scaffold to afford different chemistries and provides insight into the factors that can selectively modulate catalysis.Entities:
Keywords: HD-domain; diiron; hydratase; metalloprotein; nucleic acid; oxygenase; phosphatase; phosphodiesterase; phosphonate
Year: 2020 PMID: 34094591 PMCID: PMC8177086 DOI: 10.3390/catal10101191
Source DB: PubMed Journal: Catalysts ISSN: 2073-4344 Impact factor: 4.146
List of representative histidine–aspartate (HD)-domain proteins from the three known subclasses, oxygenases, phosphatases and phosphodiesterases (PDEs), that are crystallographically and biochemically characterized.
| Subclasses | Protein | Nuclearity | Active Metal | Chemistry | Substrate | PDB ID | Origin | References |
|---|---|---|---|---|---|---|---|---|
| MIOX | dinuclear | Fe | Oxygenase | myo-inositol | 2HUO | [ | ||
| PhnZ | dinuclear | Fe | Oxygenase | OH-AEP | 4MLM | bacterium HF130_AEPn_1 | [ | |
| TmpB | dinuclear | Fe | Oxygenase | OH-TMAEP | 6NPA | [ | ||
| YfbR | mononuclear | Co | Monophosphatase | dAMP | 2PAQ | [ | ||
| YGK1 | mononuclear | Mn | Monophosphatase | dNMP | 5YOX | [ | ||
| YqeK | dinuclear | Fe | Diphosphatase | Ap4A | 2O08 | [ | ||
| YpgQ | mononuclear | Mn | Diphosphatase | dNTP | 5DQV | [ | ||
| SpoT | mononuclear | Mn | Diphosphatase | (p)ppGpp | 1VJ7 | [ | ||
| SAMHD1 | mononuclear | Mg | Triphosphatase | dNTP | 3U1N | [ | ||
| EF1143 | mononuclear | Mg | Triphosphatase | dNTP | 4LRL | [ | ||
| OxsA | Mono/dinuclear | Co | Mono/Di/Triphosphatase | Oxetanocin-A | 5TK8 | [ | ||
| Cas3 | dinuclear | Co | PDE | ssDNA | 4QQW | [ | ||
| Cas3 | dinuclear | Ni | PDE | ssDNA | 4Q2C | [ | ||
| Cas3″ | dinuclear | Ca | PDE | ssDNA | 3S4L | [ | ||
| Cas10 | dinuclear | Ni, Mn | PDE | ssDNA | 4W8Y | [ | ||
| PDE1-3 | dinuclear [ | Mg, Mn | PDE | cAMP, cGMP | 1TAZ, 3ITU, 1SO2 | [ | ||
| PDE4 | dinuclear [ | Mg, Mn | PDE | cAMP | 1F0J | [ | ||
| PDE5 | dinuclear [ | Mg | PDE | cGMP | 1TBF | [ | ||
| PDE7-8 | dinuclear [ | Mn | PDE | cAMP | 4PM0, 3ECM | [ | ||
| PDE9 | dinuclear [ | Mn, Mg | PDE | cGMP | 3DY8 | [ | ||
| PDE10 | dinuclear [ | Mg | PDE | cAMP, cGMP | 2OUN | [ | ||
| PgpH | dinuclear | Mn | PDE | c-di-AMP | 4S1B | [ | ||
| Bd1817 | dinuclear | Fe | PDE | c-di-GMP | 3TM8 | [ | ||
| PmGH | trinuclear | Fe, Mn | PDE | c-di-GMP | 4MCW | [ | ||
| PA4781 | trinuclear [ | Mg | PDE | c-di-GMP | 4R8Z | [ | ||
| Ddi2 | mononuclear | Zn | Hydratase | Cyanamide | 6DKA | [ | ||
Denotes proteins for which the crystal structure shows two active site metal ions at an average interatomic distance of ≈3.8 Å. The primary sequence suggests a mononuclear binding site. In phosphodiesterases (PDEs), the second metal ion is stabilized by the aspartate of the HD motif, a bridging hydroxide and four terminally ligated water molecules.
Although no experimental evidence currently exists, PA4781 belongs the trinuclear clade of HD-GYP proteins as inferred from its primary amino acid sequence.
Bd1817 is inactive toward Bis-(3′-5′)-cyclic guanosine monophosphate (c-di-GMP); therefore, the active metal ion refers to the metal ion observed in the crystal structure.
OH-AEP stands for 1-hydroxy-2-aminoethylphosphonate, OH-TMAEP stands for 1-hydroxy-2-(trimethylammonio)ethylphosphonate, dNMP stands for deoxymonophosphate, in which N can be A, G, U, C, Ap4A stands for diadenosine tetraphosphate, cGMP stands for guanosine 3′,5′-cyclic monophosphate, cAMP stands for adenosine 3′,5′-cyclic monophosphate, c-di-AMP stands for Bis-(3′-5′)-cydic adenosine monophosphate.
Figure 1.(A) Helical structure of three HD-domain proteins. YqeK (PDB: 2O08) is a phosphatase, PhnZ (PDB: 4N6W) is an oxygenase, and Ddi2 (PDB: 6DKA) is a lyase. All exhibit a helical fold characteristic to HD-domain proteins despite their diverse functions. (B) Sequence similarity network (SSN) of the HD-domain superfamily depicting its size and functional clustering. The SSN was generated via the Enzyme Function Initiatives-Enzyme Similarity tool (EFI-EST) and visualized in Cytoscape. The SSN was generated by employing the IPR003607 family and tailored so that nodes represent sequences with ≥ 50% identity and an e-value of 5. The SSN was further refined to contain the major protein clusters (size-wise), which amount to 183,015 unique protein sequences. Edges between nodes represent an alignment score of 70. HD-domain phosphohydrolases (SpoT/RelA, SAMHD1, deoxyguanosine phosphatases (dGTPases), nucleotidyltransferases) are represented in green and blue, while hydratases are shown in yellow. PDEs are shown in red (HD-GYP proteins), light pink (exoribonucleases), orange (Cas proteins), and pink (PDEases). Oxygenases are shown in purple and their cluster, which consists of four nodes (372 sequences), is enlarged for visualization. Gray clusters contain proteins of unidentified function.
Figure 2.Known substrates of HD-domain proteins. Phosphatases can remove one to three terminal phosphate groups from (deoxy)ribonucleotides or cleave (a)symmetrically polyphosphate containing nucleotides (represented in gray). The position of cleavage has been highlighted in red for substrates with four phosphates. PDEs hydrolyze phosphodiester bonds of cyclic (di)nucleotide substrates via either one-step hydrolysis (cleavage of one side of the diester bond) releasing a linearized product or two-step hydrolysis releasing individual nucleotides. PDEs can also act on RNA and DNA substrates. HD-domain oxygenases cleave a C-X bond (indicated in red).
Figure 3.Active sites of HD-domain phosphohydrolases. Mononuclear HD-domain phosphohydrolases utilize a conserved motif “H…HD…D” to bind a variety of metals including cobalt (pink), zinc (purple), magnesium (light green), nickel (dark green), or iron (orange). Small red spheres represent water molecules. The dinuclear phosphatase YqeK harbors two extra histidines between the HD and D residues to stabilize the second metal ion. Phosphatases are classified into mono-, di-, or triphosphohydrolases, labeled in light green, dark green, and blue, respectively. All phosphohydrolases have a conserved arginine (shown in teal), which is located typically three residues prior to the first histidine of the HD motif and in the vicinity of the oxygens of the substrate phosphate group. Other important residues are shown in pale blue and are described in the text.
Figure 4.Active sites of HD-domain PDEs. HD-domain PDEs utilize the conserved HD motif “H…HD…H…HH…D” to bind a di- or trinuclear metal center. The metal ions coordinated in their active sites are zinc (purple), magnesium (light green), nickel (dark green), or iron (orange). Small red spheres represent water molecules.
Figure 5.Active site of HD-GYPs. HD-GYP proteins utilize an “H…HD…H…HH…D” motif that typically binds a dinuclear metal center. The third metal ion in the PmGH active site is stabilized by crystallization molecules shown in gray. In addition, these enzymes contain a GYP residue triad vicinal to the active site (shown in blue), the importance of which is currently unclear. Bd1817, which is inactive toward c-di-GMP, lacks the GYP tyrosine and the terminal aspartate is an asparagine.
Figure 6.Active sites of HD-domain oxygenases. Oxygenases utilize the “H … HD … H … H … D” motif to bind a diiron metal center. The substrate scissile bond is positioned above one of the iron sites, leaving the second site open for oxygen binding. PhnZ and TmpB contain an YxxE loop (green) in their primary sequence that is located vicinal to the active site, which upon substrate binding undergoes a conformational change to allow for oxygen binding and catalysis.
Figure 7.Reaction and active site of the HD-domain hydratase Ddi2. Ddi2 utilizes a Zn metal center to convert cyanamide to urea. The terminal aspartate residue found in HD-domain proteins is a replaced by a threonine in Ddi2. The role of this threonine, T157 (valine in the crystal structure), is predicted to allow for substrate positioning or metal specificity [37].