Literature DB >> 28979838

In silico analysis of heme oxygenase structural homologues identifies group-specific conservations.

Joseph Irvin1, Alexander J Ropelewski2, John Perozich1.   

Abstract

Heme oxygenases (HO) catalyze the breakdown of heme, aiding the recycling of its components. Several other enzymes have homologous tertiary structures to HOs, while sharing little sequence homology. These homologues include thiaminases, the hydroxylase component of methane monooxygenases, and the R2 component of Class I ribonucleotide reductases (RNR). This study compared these structural homologues of HO, using a large number of protein sequences for each homologue. Alignment of a total of 472 sequences showed little sequence conservation, with no residues having conservation in more than 80% of aligned sequences and only five residues conserved in at least 60% of the sequences. Fourteen additional positions, most of which were critical for hydrophobic packing, displayed amino acid similarity of 60% or higher. Ten conserved sequence motifs were identified in HOs and RNRs. Phylogenetic analysis verified the existence of the four distinct groups of HO homologues, which were then analyzed by group entropy analysis to identify residues critical to the unique function of each enzyme. Other methods for determining functional residues were also performed. Several common index positions identified represent critical evolutionary changes that resulted in the unique function of each enzyme, suggesting potential targets for site-directed mutagenesis. These positions included residues that coordinate ligands, form the active sites, and maintain enzyme structure. ENZYMES: Heme oxygenase (EC 1.14.14.18), methane monooxygenase (EC 1.14.13.25), ribonucleotide reductase (EC 1.17.4.1), thiaminase II (EC 3.5.99.2).

Entities:  

Keywords:  group entropy; heme oxygenase; methane monooxygenase; ribonucleotide reductase; thiaminase

Year:  2017        PMID: 28979838      PMCID: PMC5623701          DOI: 10.1002/2211-5463.12275

Source DB:  PubMed          Journal:  FEBS Open Bio        ISSN: 2211-5463            Impact factor:   2.693


Bacillus subtilis TenA thiaminase RNR beta chain from Escherichia coli human heme oxygenase 1 heme oxygenase methane monooxygenase hydroxylase methane monooxygenase thiaminase from Pyrococcus furiosus homologue of RNR beta chain from E. coli homologue of RNR R2F from Salmonella typhimurium ribonucleotide reductase transcriptional regulator Heme oxygenase (HO, EC 1.14.14.18) aids in recycling iron and porphyrin rings by catalyzing the decomposition of heme to biliverdin, CO, and free iron. The reaction requires three molecules of oxygen, NADPH, and cytochrome P450 reductase. In animals, biliverdin is converted to bilirubin, which acts as an antioxidant‐protecting neurons 1 and which is later combined with glucuronic acid and excreted 2. CO functions as a signaling molecule via guanylate cyclase, playing several roles in vasoregulation 3. The iron released is the main source of metal in regenerating heme and nonheme cofactors 4, 5. Two HO isoforms exist in vertebrates, heme oxygenase‐1 (HO1) and heme oxygenase‐2 (HO2). HO1 and HO2 are both membrane‐bound proteins and share 55% identity in humans 6. Human HO2 is expressed constitutively in the liver, brain, and testes. Human HO1 is an inducible protein found in most tissues, especially the spleen and liver 7, 8. HO1 is also expressed in cancer cells. HO1 has been associated with adhesion and signaling in cancer cells, enabling growth and spread of tumors 9. In addition, elevated HO1 levels in mice appear to correlate with insulin resistance, while HO1‐knockout mice were insusceptible to insulin resistance and exhibited lower levels of inflammation 10. A heme oxygenase‐3 enzyme has been identified in humans, but is an inferior catalyst for heme breakdown 8. In Arabidopsis, which possesses four HO genes, HO is essential for the synthesis of light‐sensing phytochrome chromophores 11, 12. Corynebacterium diphtheriae uses a HO homologue (HmuO), which shares 33% sequence identity to human HO1, to acquire iron for growth from its host's own heme molecules 13. There are several different enzymes with diverse functions that share structural homology to HOs. One of these enzymes is another oxidoreductase, methane monooxygenase (MMO, EC 1.14.13.25). MMO is found in aerobic methanogens and catalyzes the NAD(P)H‐dependent conversion of methane and oxygen to methanol 14, although it can act upon slightly longer hydrocarbons 15. There are three components to the structure of a soluble MMO: a reductase, a regulatory B component, and a hydroxylase (MMOH). The structure of MMOH has three subunits (α, β, and γ) and a hydroxo‐bridged diiron cluster. This is found in the α subunit only near the hydrophobic pocket that is the active site and is positioned between helices αB, αC, αE, and αF. Interaction of the β subunit with the reductase component of MMO affects the redox potential of this diiron cluster in the α subunit 16. Both the α and β subunits of MMOH show homology (approximately 12% sequence identity) to HOs and the other homologues addressed here 17. Ribonucleotide reductases (RNRs, EC 1.17.4.1) are catalysts in the process of forming deoxyribonucleotides from ribonucleotides. The four deoxyribonucleotides that are produced are important for DNA synthesis in living organisms. RNRs have been subdivided into three classes based upon the way they form a radical in the reaction mechanism. Class I RNRs have a diiron cluster, while Class II RNRs use adenosylcobalamin and Class III RNRs utilize a ironsulfur cluster and S‐adenosylmethionine. While members of the three classes share little sequence homology, there is a core tertiary structure common to each 18. However, it has been noted that Class I RNRs also share structural homology to HOs, thiaminases, and MMOHs 17. Class I RNR in Escherichia coli is made up of two dimeric proteins, R1 and R2, which differ in structure. R1's structure includes binding sites for substrates, allosteric sites (the specificity and activity sites), and critical cysteine residues. R2, which is a homologue of HOs, has a highly oxidized redox center that is formed by a diiron cluster and a critical tyrosyl radical on Tyr122 that activates the substrate ribose. The tyrosyl radical is needed for the protein to act as an enzyme 19. Like MMOH, it utilizes oxygen and a diiron cluster; however, RNRs reduce hydroxyls on ribose, while MMOH acts to form a hydroxyl on methane. Class I RNRs have been subdivided into two groups: Ia, which are represented by E. coli R2 subunits, and Ib, which are represented by the R2F subunit from RNR from Salmonella typhimurium. Ib RNRs (RNR1Bs) appear to be solely in bacteria, while Ia RNRs (RNR1As) are in both bacteria and eukaryotes. While sharing a similar structure to Ia, Ib differs functionally through a lack of allosteric regulation and the need for small redox protein produced from the same operon as the RNR 20. Another related enzyme is the aminopyrimidine aminohydrolase (thiaminase II; EC 3.5.99.2) TenA protein, which acts as a thiamin (vitamin B1) salvage enzyme that catalyzes hydrolysis of 4‐amino‐5‐aminomethyl‐2‐methylpyrimidine. TenA modifies the aminopyrimidine to hydroxypyrimidine and is found throughout all three domains of life 21. The TenA gene product was initially identified as a transcriptional regulator (Treg) of exoenzyme secretion in Bacillus subtilis 22. However, the TenA homologue in Pyrococcus furiosus lacks appropriate surface charges and therefore may not participate with DNA interactions 17. Pyrococcus furiosus TenA only shares 14% sequence identity to human HO1 17. The Saccharomyces cerevisiae THI20 protein is actually a trifunctional enzyme which includes a C‐terminal tetrameric TenA‐like domain fused to an N‐terminal ThiD domain which catalyzes another step in thiamin metabolism 23. French et al. 23 also identified two structural subclasses of TenAs: one group similar to B. subtilis TenA with a conserved cysteine and the other similar to P. furiosus TenA with a conserved glutamate instead. There has not been an extensive comparative study performed on HO and its homologues: thiaminases, MMOHs, and Class I RNRs. The goal of this research was to align a large number of HO protein sequences and those for these related homologues. We then attempted to identify the structural and possible functional roles of conserved residues and sequence motifs in these enzymes. Group entropy analysis and other methods indicated group‐specific conservations for each homologue, identifying key positions that may contribute to the unique function of each enzyme.

Results and Discussion

Structure and residue conservations

A total of 472 sequences were aligned (Fig. 1) using tertiary structural alignment as a guide (entire alignment available in Fig. S1). Above each amino acid position column is an index number, which is numbered concurrently from the beginning of the alignment; these index numbers will be used to reference each position throughout this manuscript. Two hundred of the sequences analyzed were HOs from animal, plant, and bacterial species; 141 of the sequences were thiaminases; 41 of the sequences were MMOHs (30 alpha chains and 11 beta chains); and 90 of the sequences were Class I RNRs. MAPSCI structural alignment 24 indicates that the tertiary structures of all four homologues (human HO1, PDB ID: 1NI6; P. furiosus thiaminase, PDB ID: 1RTW; Methylosinus trichosporium MMOH alpha chain, PDB ID: 1MHY; and RNR beta chain from E. coli, PDB ID: 1XIK) are conserved with a core RMSD value of 1.37 Å and core size of 77, which represents the percentage of the length of the shortest protein aligned (1RTW at 201 amino acids) that have 100% conservation and are within 4.0 Å of each other in the final tertiary alignment. Interestingly, one can see that the active sites of HOs and thiaminases utilize different spatial coordination of the hydrophobic pocket, perhaps due to the differences in their enzymatic function (Fig. 2).
Figure 1

Summary alignment showing a representative sequence for each group of HO homologues: human HO1 (HomSapHOx1) as a HO, Pyrococcus furiosus thiaminase (tactPyrFur) as a thiaminase, the alpha (MMOAMetTri) and beta chains (MMOxMetTri) of the MMOH from Methylosinus trichosporium as MMOH chains, the RNR beta chain from Escherichia coli (RDR1Esccol) as a RNR1A, and the RNR R2F protein from Salmonella typhimurium (RRR2SalEnt) as a RNR1B. The entire alignment, which contains 472 protein sequences, is found in Fig. S1. Residue positions are colored based upon their conservation in the entire alignment (Fig. S1) as follows: green = 60–79% conserved and blue = 40–59% conserved. Indel (gap) positions from the entire alignment (Fig. S1) are retained to allow correlation with index position numbers (numbers shown above the alignment columns) that are noted within the text.

Figure 2

Structural alignment of human HO1 (blue, PDB ID: 1N45) and TenA thiaminase from Bacillus subtilis (red, PDB ID: 2QCX) performed using jFATCAT, shown in front and back view (180° rotation). The bound heme ring in human HO1 is shown in black, and the bound 4‐amino‐5‐hydroxymethyl‐2‐methylpyrimidine product in TenA is shown in yellow. Note how the bound ligands lie in pockets on opposite faces of the molecules.

Summary alignment showing a representative sequence for each group of HO homologues: human HO1 (HomSapHOx1) as a HO, Pyrococcus furiosus thiaminase (tactPyrFur) as a thiaminase, the alpha (MMOAMetTri) and beta chains (MMOxMetTri) of the MMOH from Methylosinus trichosporium as MMOH chains, the RNR beta chain from Escherichia coli (RDR1Esccol) as a RNR1A, and the RNR R2F protein from Salmonella typhimurium (RRR2SalEnt) as a RNR1B. The entire alignment, which contains 472 protein sequences, is found in Fig. S1. Residue positions are colored based upon their conservation in the entire alignment (Fig. S1) as follows: green = 60–79% conserved and blue = 40–59% conserved. Indel (gap) positions from the entire alignment (Fig. S1) are retained to allow correlation with index position numbers (numbers shown above the alignment columns) that are noted within the text. Structural alignment of human HO1 (blue, PDB ID: 1N45) and TenA thiaminase from Bacillus subtilis (red, PDB ID: 2QCX) performed using jFATCAT, shown in front and back view (180° rotation). The bound heme ring in human HO1 is shown in black, and the bound 4‐amino‐5‐hydroxymethyl‐2‐methylpyrimidine product in TenA is shown in yellow. Note how the bound ligands lie in pockets on opposite faces of the molecules. The overall residue conservation within the HO structural homologues was low. No residues showed conservation of 80% or higher in the entire alignment, five residues exhibited conservation of 60% or higher, and 40 additional residues displayed conservation of 40% or higher. The majority of these conservations were found within the α‐helices. ConSurf analysis 25 was used to visualize these evolutionary conservations. In human HO1 (hHO1, sequence HomSapHOx1, PDB ID: 1N45) and the thiaminase from P. furiosus (PfThiaminase, sequence tactPyrFur, PDB ID: 1RTW), highly conserved residues were located in the core of the structure (Fig. 3). In hHO1, these residues conserved in the entire alignment line the heme‐binding site (Fig. 3A). Alignment index positions (Fig. 1) for each residue are provided in curved brackets. These conservations include His25{142} which coordinates the heme iron (Fig. 3C). Tyr134{325} and Arg183{384} form hydrogen bonds to the porphyrin ring. Mutations of the equivalent residue at index 384 in Corynebacterium diphtherae HO, Arg177{384}, caused a significant decrease in catalytic rate 27. Ala28{145} and Gly143{335} make hydrophobic contacts to the porphyrin ring. A G143H mutation inactivated both mouse and human HO1 28, 29. Tyr58{184} is part of a larger hydrogen bonding network, which will be discussed below 30. Note that the degree of conservation in the entire alignment, highlighted in Fig. 1, is influenced by the number of sequences for each homologue. For example, HOs represent 200 of the total 472 amino acid sequences aligned. In PfThiaminase, several conserved residues also lined the binding site for HMP‐P (Fig. 3B). These residues include Phe22{151} which packs with Phe36{163} near the end of the active site pocket. The hydroxyl of Tyr105{268}, which is near the back of the active site pocket, hydrogen bonds with the side chain carboxylate of Glu198{423}, which coordinates the bound HMP‐P (Fig. 3D) 17. Y112F and E205A mutations at indices 268 and 423, respectively, in B. subtilis TenA (BsThiaminase) both caused significant decreases in catalytic function 21. None of the other residues that contact HMP‐P are conserved above 60% in the entire alignment, perhaps because thiaminases represent only a quarter of all aligned sequences.
Figure 3

Evolutionarily conserved residue positions as determined by the CONSURF program 25. Shown are human HO1 (PDB ID: 1N45) with bound heme ring in black (A) and TenA thiaminase from Pyrococcus furiosus (PDB ID: 1RTW) with bound HMP‐P in black (B). Residue conservation scale is from CONSURF website. Note how most conserved positions surround the active site and protein core. The thiaminase is shown as a ribbon diagram as the HMP‐P is more buried and would be harder to see. ligplot+ diagrams 26 are shown to highlight residues that contact heme (designated Hem300) in human HO1 (C) and HMP‐P (designated Mp51213) in P. furiosus thiaminase (D) (similar to figure previously published 17). Blue boxes surrounding the residue names indicate that these residues are at least 40% conserved in the entire amino acid sequence alignment. Most of the residues in each active site are not conserved in the entire alignment due to the diversity of the structural homologues aligned.

Evolutionarily conserved residue positions as determined by the CONSURF program 25. Shown are human HO1 (PDB ID: 1N45) with bound heme ring in black (A) and TenA thiaminase from Pyrococcus furiosus (PDB ID: 1RTW) with bound HMP‐P in black (B). Residue conservation scale is from CONSURF website. Note how most conserved positions surround the active site and protein core. The thiaminase is shown as a ribbon diagram as the HMP‐P is more buried and would be harder to see. ligplot+ diagrams 26 are shown to highlight residues that contact heme (designated Hem300) in human HO1 (C) and HMP‐P (designated Mp51213) in P. furiosus thiaminase (D) (similar to figure previously published 17). Blue boxes surrounding the residue names indicate that these residues are at least 40% conserved in the entire amino acid sequence alignment. Most of the residues in each active site are not conserved in the entire alignment due to the diversity of the structural homologues aligned. Five residue positions are conserved above 60% in our entire alignment: Leu17{132}, Phe33{151}, Gly40{166}, Phe47{173}, and Tyr114{268} (hHO1 residue identity with alignment index position in curved brackets). Four of five residues are involved in hydrophobic packing, as observed in the hHO1 structure 30. Leu17{132} is 69% conserved overall and was leucine in thiaminases, RNR1As, and HOs except plant species. This position also has 84% similarity with aliphatic hydrophobic residues (I, V, L, and M) throughout the entire alignment. This residue is found in α‐1 and is involved with hydrophobic packing with Ala203{416} (not conserved) and Tyr134{325} (44% conserved), which is located in the heme‐binding pocket of HO. Phe33{151} is 63% conserved overall and is found in thiaminases and HOs except plant species. This residue is found in the proximal helix of HOs and functions in hydrophobic packing with Val50{176} (not conserved). Gly40{166} is 63% conserved overall and is found in thiaminases and HOs except plant species. This residue is found in the turn between α‐1 and α‐2 30. Conserved glycines are often found in turns in enzyme structures or at positions where a side chain would lead to steric conflicts in many enzyme families 31, 32, 33, 34. Phe47{173} is 71% conserved overall and is found in RNR1As, most thiaminases, and HOs except plant species. This residue is found in the beginning of α‐2 and is involved in hydrophobic packing with Ala151{348} (not conserved) that is found in the distal helix of HOs. Tyr114{268} is 66% conserved overall and is found in all HOs and most thiaminases except proteins identified as Tregs. This residue is found in α‐4 in hHO1 30. In hHO1, the side chain hydroxyl of Tyr114{268} is 3.1 Å from NH1 of Arg136{327} (not conserved) from the distal helix of HOs, and 3.0 Å from ND1 of Asn210{423}. In PfThiaminase, the side chain hydroxyl of Tyr105{268}, which is part of the catalytic triad 21, is 2.7 Å from the side chain carboxylate of Glu198{423}, which contacts the bound HMP‐P 17. Glutamate is 46% conserved at index 423 in the alignment, primarily in all RNRs and thiaminases except Tregs. Fourteen additional residue positions also display amino acid similarity of 60% or higher within these structural homologues. All but one of these positions are hydrophobic residues involved in hydrophobic packing. Index position 152 (Met34 in hHO1) exhibits 79% similarity of aliphatic hydrophobic residues (I, V, L, and M) in HOs except plant species, thiaminases except Tregs, and RNR1As. Met34{152} faces α‐meso‐face of heme in the hHO1 structure 30. Index 168 (Val42 in hHO1) shows 62% similarity of aliphatic residues in HOs except plants species and most thiaminases. Index 177 (Met51 in hHO1) displays 63% similarity of aliphatic residues in a few HOs, all thiaminases, and RNR1As. Index 230 (Leu93 in hHO1) shows 71% similarity of aliphatic residues in all HOs, a few thiaminases, MMOHs, and all RNRs. Index 272 (Leu118 in hHO1) exhibits 93% similarity of aliphatic residues in all HOs, most thiaminases, and all RNRs. Index 320 (Val130 in hHO1) shows 84% similarity of aliphatic residues in most HOs, most thiaminases, the beta chain of MMOHs, and RNR1As. Index 340 (Leu147 in hHO1) shows 68% of aliphatic residues in all HOs, some thiaminases, and the alpha chain of MMOHs. This residue is located on the distal helix and contacts with heme in HOs 30. The CD2 atom of Leu147{340} forms a hydrogen bond with the NE atom of Gln38{156} 35. Val139{340} in PfThiaminase is about 3.6 Å from Leu23{152}. Index 367 (Phe166 in hHO1) displays 71% similarity of aromatic residues with mostly phenylalanines in HOs and tryptophans in thiaminases. Index 387 (Met186 in hHO1) exhibits 85% similarity of aliphatic residues in HOs, MMOHs, RNR1As, and some thiaminases. Met186{387} is located about 6.6 Å from the heme molecule in hHO1. In PfThiaminase, Leu172{387} is located 4.2 Å from Phe122{320}. Index 388 (Asn187 in hHO1) shows 82% similarity of aspartate or asparagine (asparagine is 58% conserved in the entire alignment) in HOs, RNR1As, and some thiaminases. Index 388 appears to maintain protein folding. The side chain OD1 atom of Asn187{388} is 3.2 Å away from the main chain nitrogen of Leu13{128} in hHO1. In PfThiaminase, the ND atom of Asn173{388} is 3.0 Å from the hydroxyl of Ser3{129}, and the OD1 atom is 2.8 Å from the main chain nitrogen of Ser3{129}. Index 391 (Leu189 in hHO1) displays 85% similarity of aliphatic residues in HOs, MMOHs, RNR1As, and some thiaminases. Index 412 (Val199 in hHO1) exhibits 61% similarity of aliphatic residues in most HOs, some thiaminases, and the alpha chain of MMOHs. Index 427 (Phe214 in hHO1) shows 63% similarity of aromatic residues (phenylalanine or tryptophan) in HOs except plant species and most thiaminases. It is found on the wall opposite the α‐meso‐face of heme in hHO1 35. Index 430 (Leu217 in hHO1) displays 74% similarity of aliphatic residues in HOs, MMOHs, RNRs, and a few thiaminases.

Conserved sequence motifs

Ten conserved sequence motifs were identified using the meme program 36) (Table 1). Motifs 1, 2, 3, and 6 are only found in HOs. All four motifs line the heme‐binding pocket (Fig. 4A). Motif 1 is comprised of the distal helix, which includes the heme iron ligand His25{142}, and motif 2 constitutes the proximal helix of HOs 30. Motifs 4, 5, 7, 8, 9, and 10 are found only in the RNR1A enzymes. In EcRNR1A, motifs 4, 5, 9 and 10 from intersubunit contacts within the dimer, while motifs 5, which includes the iron ligands Asp84{184}, Glu115{219}, and His118{223}, and 9, which includes the iron ligand Glu204{328}, line the binding site with the diiron cluster 19 (Fig. 4B). None of the top ten motifs were found in thiaminases, MMOHs, or RNR1Bs.
Table 1

Ten most conserved sequence motifs in the HO homologues

Motif numberMotif regular expressionIndicesGroup
1EPELLVAHAxYTRY[LM]GDLSGxGQVLKK[VI]xxAQ[RK]ALKLPS[TS]GExxxGL[QA]F[YF]283–368HO
2TKE[VAS]H[DE][RQ]AExNTxxx[QE]F[VM]K[DN]F[LQ]KG[QRN][IV][KT][KR]xE[LG]FKL[AV][TM][AVT]x[AS]LY[FH][TI]YSALEEE138–190HO
3[DEP][NG][IV][SD][NS]A[QT][QK]FKQLYR[AS]R[ML]NxAL[ED]xxxxxxxxx[LM][DT]PExx[TE]KER[IV][VL]EEA[NK]KAF371–420HO
4KNDQLKEPMF[FL]GQ[PS]VNVARYDQQK[YFH][ED][IV]FE[KR]LIEKQLSFFxxWRPEEVDVS97–146RNR1A
5SNLKYQTLLDSIQxGRSPNVALLPL[VI]S[IL]PExxLETW[IV]ETxxW[AS]FSETIHS175–224RNR1A
6ALE[KQ]D[LM][EA][YF][FW]YxxxxxG[EP][DN]xxxxW[REKQ]E[QK]xx[IV][PQ][CY][ST][EP]AxxxxxT[QR][KR]YV[DE]R[IL]H[EY]V225–275HO
7G[AT][RKT][QS]NPIPWIN[TA]WL[SV]SDNVQVAPQE[VA]E[VI]SSYLVGQID[SNA]EV[DNS][AT]DD[LF][GS]DF[EQ]L481–532RNR1A
8AxAEQEKEWxA[ED]YLF[KRQ]DGSMIGLxxNKxxDILCQYVEYITxN[IL]RMQAVGL419–466RNR1A
9LRELKKxKLYLCLMSVxxNALEAIRFYVSFACxSFAxxFAERELMxxEGN309–354RNR1A
10DP[SA][VIQ]VFDDIV[TE]NE[QH]I[LQI]KRA[EG][DGS]I[SA][HC]YYDDLIE[AM]T[SNAQ][YLD][YW][HN][LR][LYH]GEGTHT[VI]NG[KE][TE]249–304RNR1A
Figure 4

Conserved motifs found in the monomer of human HO1 (A) and a dimer of R2 protein of Escherichia coli RNR (B). Motifs 1 (red), 2 (pink), 3 (orange), and 6 (dark green) comprise the active site of HO (A). Motifs 5 (purple) and 9 (cyan) line the binding site with the diiron cluster (red) (B). Motifs 4 (yellow), 5, 9, and 10 (blue) form intersubunit contacts within the RNR dimer. Motifs 7 (light green) and 8 (navy blue) are found at the surface.

Ten most conserved sequence motifs in the HO homologues Conserved motifs found in the monomer of human HO1 (A) and a dimer of R2 protein of Escherichia coli RNR (B). Motifs 1 (red), 2 (pink), 3 (orange), and 6 (dark green) comprise the active site of HO (A). Motifs 5 (purple) and 9 (cyan) line the binding site with the diiron cluster (red) (B). Motifs 4 (yellow), 5, 9, and 10 (blue) form intersubunit contacts within the RNR dimer. Motifs 7 (light green) and 8 (navy blue) are found at the surface. mast was used to search for matches in the protein database using the motifs. The strongest matches were RNRs (lowest E‐score = 3.4e‐154), followed by HOs (lowest E‐score = 7.5e‐30). As noted, the ten motifs identified in this study were only present in RNRs and HOs. Two weaker scoring matches that occurred multiple times were the prokaryotic MreB rod‐shape determining bacterial actin protein (A0A1H1VW64_9PSED) and the SecB protein‐export protein (A0A062MXT3_9GAMM). MreB showed homology to motifs 3 and 10; however, the E‐score was only 0.068. The MreB protein from Caulobacter crescentus (PDB ID: 4CZM) 37 did not show any structural homology to HOs or its related homologues. SecB showed homology to motifs 6 and 10; however, the E‐score was only 1.7. The SecB chaperone protein from E. coli (PDB ID: 1QYN) 38 also did not show any structural homology to HOs or its related homologues.

Phylogenetic analysis

An unrooted neighbor‐joining phylogenetic tree of the HO structural homologues (Fig. 5) was generated from the entire amino acid sequence alignment. This neighbor‐joining method was chosen because it has been shown to yield excellent evolutionary relationships in some families 34, 39. Maximum‐likelihood and parsimony methods require excessive computations for large data sets. A bootstrapped parsimony tree using only 50 data sets (Fig. S2) had similar sequence groupings and group arrangements to the neighbor‐joining tree using 250 replicates. The neighbor‐joining tree was subsequently used as the basis for assigning each sequence into a group for group entropy analysis. HOs, thiaminases, MMOHs, and RNRs cluster into distinct clades and these four groups were used for group entropy analysis. Groups were named based upon the representative tertiary structures present in each clade. MMOHs and RNRs are more closely related to each other, but both are more distantly related to HOs.
Figure 5

Unrooted phylogenetic tree of the HO homologues. Branches are color‐coded based on enzyme type: blue = HOs; purple = thiaminases; red = MMOHs; and green = RNRs. Specific subgroups within each enzyme group are labeled.

Unrooted phylogenetic tree of the HO homologues. Branches are color‐coded based on enzyme type: blue = HOs; purple = thiaminases; red = MMOHs; and green = RNRs. Specific subgroups within each enzyme group are labeled. Distinct subgroups were also evident for each group. For example, in HOs vertebrate HO1s and HO2s, as well as plant and bacterial HOs, were isolated in distinct clades. In thiaminases, three distinct subgroups were apparent. There was a group of sequences similar to PfThiaminase and another group similar to BsThiaminase, as previously observed 23. There was also a subgroup of Tregs identified in the thiaminase group. It is possible that these Tregs may not possess the enzymatic function of other thiaminases as they are missing certain residues found in other thiaminases, such as Glu205{418} and Cys135{322} in BsThiaminase 21, where Tregs have threonine and alanine in those positions, respectively. In MMOHs, the alpha and beta chains cluster separately. In the RNRs, both RNR1As and RNR1Bs are located in distinct clades.

Group entropy analysis of HOs

The GEnt program 40 was used to calculate ‘Group Entropy’ and ‘Family Entropy’ values for each position in the alignment. Residue positions studied here had the highest Group Entropy scores within each homologue group, indicating they are well conserved within its group, and low Family Entropy scores, indicating they are not well conserved throughout the entire alignment. These residues would indicate unique residue conservations within the enzyme group. For the current study, residue positions examined had a Group Entropy score of at least 10.0, with Family Entropy score of no more than 3.0. The GEnt program utilizes the Kullback–Leibler method to calculate a divergence measure to detect covariance. It was chosen as it allows the user to define each group, to which the user can assign sequences from an alignment. Also, GEnt does not consider positions with gaps 40. The GEnt program has been used to identify group‐specific conservations in Class 3 aldehyde dehydrogenases 40 and NDP‐sugar dehydrogenases 34. Eight residues were found to have the highest Group Entropy scores in the HO group (Table 2, complete GEnt results can be found in Table S1). Examination of residues was made with hHO1 (sequence HomSapHOx1, PDB ID: 1N45). The residue position with the highest Group Entropy score is index 142. His25{142}, which is fully conserved on the proximal helix in all HOs, is the critical heme iron ligand 30 (Fig. 3C). A H25A mutation in hHO1 led to inactivation of the enzyme 41. Next, Tyr58{184} interacts with several polar residues near pyrrole B 30. In hHO1, the hydroxyl of Tyr58{184} hydrogen bonds with the side chain carboxylate of Asp140{332} and is 3.5 Å from the NE atom of Arg136{327}. Index 322 had the third highest Group Entropy score. The NE2 position of His132{322}, which is invariant in HOs, forms a salt bridge to the side chain carboxylate of Glu202{415} 30. Next, Arg136{327} is a residue that is invariant in the HOs of vertebrates but is isoleucine for plant species. The side chain of Arg136{327} is part of a hydrogen bonding network that also includes Asp140{332}, Asn210{423}, Tyr58{184} (also identified by GEnt above), and Tyr114{268}, which is 66% conserved in the entire alignment. This network may serve to help activate the oxygen in the catalytic mechanism 35. A D140A mutation in hHO1 also converts the enzyme into a peroxidase, illustrating the critical nature of this hydrogen bond network 42.
Table 2

Group entropy analysis of HOs

IndexResidue identitya Group entropyFamily entropyHighest group residueCommon thiaminase residueCommon MMOH residueCommon RNR residue
142His2517.1642.800HisTrpLeuGlu, Lys
184Tyr5816.5261.812TyrLeu, ValGluAsp
322His13215.6542.012HisAlaAsnSer
327Arg13613.0711.497ArgCysGlyLeu
188Glu6212.6831.065GluAlaPhe, IleGly, Asn
383Tyr18212.3481.449TyrLeu, GlnGlyThr, Ile
325Tyr13412.3341.924TyrLeuLeuAsn, Val
371Pro17012.1501.094AspTyr, TrpIle, ValIle

a Residue identity in HomSapHOx1.

Group entropy analysis of HOs a Residue identity in HomSapHOx1. Glu62{188} is conserved in all but one HO sequence, which has a conservative replacement with an aspartate. The side chain carboxylate of Glu62{188} is located 3.4 Å from the main chain nitrogen of the conserved Arg85{222}, acting to maintain enzyme structure. Next, Tyr182{383} is conserved in the HOs of vertebrates but is valine in plants. Its side chain hydroxyl is nearly 4 Å from the main chain nitrogen of Tyr134{325} and the main chain oxygens of Ala133{323} and Val77{205}. Tyr134{325}, which is conserved in all HOs, is located on the distal helix and contacts the beta edge of heme 30, 35. Lastly, Pro170{371} lies on a surface loop and is a hydrophobic residue in the other groups.

Group entropy analysis of thiaminases

Eight residue positions in thiaminases had the highest Group Entropy scores (Table 3, complete GEnt results can be found in Table S2). Residues noted here were from PfThiaminase (sequence tactPyrFur, PDB ID: 1RTW). Index 142 had the highest Group Entropy score in thiaminases. Trp14{142} is involved in hydrophobic packing and is located where the heme‐binding site is in HOs. The NE1 position of Trp14{142} also hydrogen bonds to the side chain hydroxyl of Ser195{420}. Next, Val47{184} forms the wall of the HMP‐P binding pocket 17 (Fig. 3D). The side chain carboxylate of Asp43{180} forms hydrogen bonds with the amino groups of the HMP‐P ligand. A D44A mutation at index 180 in BsThiaminase caused a significant decrease in catalytic function 21. Index 371 had the fourth highest Group Entropy score. Trp156{371} also lines the HMP‐P binding pocket, making hydrophobic contact with the ligand. Next, Phe46{183} is at an index position of aromatic residues in thiaminases. The pyrimidine ring of HMP‐P is located between the side chains of Tyr132{332} and Phe46{183} 17. Interestingly, a Y47F mutation at index 183 in BsThiaminase leads to a significant impairment of catalytic function 21. Also, a F47Y mutation at index 183 in Helicobacter pylori TenA disrupts the helical folding of helices D and E 43.
Table 3

Group entropy analysis of thiaminases

IndexResidue identitya Group entropyFamily entropyHighest group residueCommon HO residueCommon MMOH residueCommon RNR residue
142Trp1417.9932.800TrpHisLeuGlu, Lys
184Val4714.1331.812LeuTyrGluAsp
180Asp4312.7461.399AspLeu, SerLeuGln, Leu
371Trp15612.7341.094TrpAspIle, ValIle
183Phe4611.9891.146TyrIle, ValGlyLeu
430Ala20511.1921.693AlaLeu, IleLeu, Ile, MetLeu
367Phe15211.1612.534TrpPheValIle, Leu
149His2010.3761.339HisAla, ThrArgAsp

a Residue identity in tactPyrFur.

Group entropy analysis of thiaminases a Residue identity in tactPyrFur. Ala205{430} is near the C terminus on the surface of the protein, but not at the subunit interface. Index 430 is a position that is an aliphatic residue in 74% of all aligned sequences. Next, Phe152{367} lies at a position in thiaminases that is predominantly aromatic. Its side chain is nearly 8.6 Å from the bound HMP‐P. Lastly, His20{149}, which is 90% conserved in the aligned thiaminases, forms a salt bridge with the side chain carboxylate of Asp203{428} 17.

Group entropy analysis of MMOHs

GEnt analysis of MMOHs revealed that ten residues have the highest Group Entropy scores (Table 4, complete GEnt results can be found in Table S3). Residues from both the alpha (sequence MMOAMetTri) and beta chains (sequence MMOxMetTri) were researched in the MMOH from M. trichosporium (PDB ID: 1MHY). Several residues identified here make intersubunit contact. The highest scoring index position is index 265. Trp181{265} from the alpha chain makes hydrophobic contact with Pro61{74} from the beta chain. The residue from the beta chain at index 265 is Tyr227{265}, which is involved in hydrophobic packing within the beta chain. Next, Leu82{142} is conserved in all MMOHs except for two that have alanine. This index position is involved in hydrophobic packing for both alpha and beta chains. The third highest Group Entropy position is Met184{268} and it lines the hydrophobic cavity where the diiron cluster is located in the alpha chain 16. Index 268 is a tyrosine in 66% of all sequences in the entire alignment. Glu114{184}, which is fully conserved in MMOHs, coordinates the FE1 iron atom and is one of four residues that coordinate the iron and hydroxo ligands 16. The fifth highest Group Entropy score is Tyr115{185}, which is invariant in MMOHs. The hydroxyl of Tyr115{185} in the alpha chain is located 3.4 Å from the main chain nitrogen of Asp176{202} in the beta chain, making intersubunit contact. Tyr160{180} in the beta chain is about 5 Å from Trp37{74} and Phe35{72} in the alpha chain.
Table 4

Group entropy analysis of MMOHs

IndexResidue identitya Group entropyFamily entropyHighest group residueCommon HO residueCommon thiaminase residueCommon RNR residue
265Trp18115.6581.167TrpMet, ThrAla, AsnIle, Leu
142Leu8214.4112.800LeuHisTrpGlu, Lys
268Met18414.1482.802MetTyrTyr, PheArg, Lys
184Glu11413.9461.812GluTyrVal, LeuAsp
185Tyr11513.5130.836TyrVal, SerLys, AspSer, Thr
229Phe15311.6191.376PheAspGluHis, Ser
365Pro23311.3960.621ProLeu, IleGlnAla
168Arg9811.2571.097TrpVal, IleLeuHis, Ala
169Trp9911.0670.887TrpThr, SerGlu, AspGlu
183Gly11310.0251.146GlyIle, ThrTyrLeu

a Residue identity in MMOAMetTri.

Group entropy analysis of MMOHs a Residue identity in MMOAMetTri. Next, Phe153{229} in the alpha chain is involved in hydrophobic packing with Leu55{68} from the beta chain, making intersubunit contact. The seventh highest scoring index position is Pro233{365}, which is fully conserved in both alpha and beta chains of MMOHs. This residue is at the beginning of α‐F that lines the diiron cluster‐binding site, likely positioning the helix. Next, the side chain of Arg98{168} in the alpha chain functions by forming a salt bridge to Asp365{515}. In the beta chain, Trp143{168} is involved in hydrophobic packing. The ninth highest scoring index position is index 169. Trp99{169} is a residue that is involved in hydrophobic packing in the alpha chain. The last residue examined is Gly113{183}. This residue is fully conserved in the alpha chain and lines the pocket for the diiron cluster. The lack of a side chain on this residue could be important for positioning of the iron atoms.

Group entropy analysis of RNRs

Eleven index positions displayed the highest Group Entropy scores in RNRs (Table 5, complete GEnt results can be found in Table S4). Residues were examined using EcRNR1A (sequence RDR1Esccol, PDB ID: 1XIK). Trp49{138} had the highest Group Entropy score and is invariant among all RNRs. Trp49{138} likely serves as the tryptophan radical, providing an electron to completely reduce the dioxygen. It also hydrogen bonds to the iron ligand His118{223} via Asp237{375} 19. Arg149{268} is the next residue and is also fully conserved in RNR1As but is lysine in RNR1Bs. The NE position of Arg149{268} is located 3.3 Å from the side chain carboxylate of Glu283{423}. Arg149{268} also stacks against Trp286{426} 19, which is also identified by GEnt. Trp286{426} is invariant in RNR1As but is tyrosine in RNR1Bs. The NE1 position of Trp286{426} hydrogen bonds with the hydroxyl of Ser75{175}. Trp286{426} is also involved in hydrogen bond changes in a S211A mutant 19.
Table 5

Group entropy analysis of Class I RNRs

IndexResidue identitya Group entropyFamily entropyHighest group residueCommon HO residueCommon thiaminase residueCommon MMOH residue
138Trp4820.4871.532TrpThrVal, AlaGln
268Arg14918.7742.802ArgTyrTyrMet, Ala
336Phe21217.4361.899PheGlyGlyIle
426Trp28617.0001.784TrpIle, Val, LeuPhePhe, Ala, Val
184Asp8416.7471.812AspTyrLeuGlu
142Glu5215.9012.800GluHis, AlaTrpLeu
135Phe4715.2110.834PheAla, GlyThr, His, ArgArg
187Gln8714.9111.548GlnLeuPheAla
179Tyr7914.3601.417TyrSer, AlaGlnAla, Phe
181Thr8114.1341.563ThrTyrTyr, SerGlu
327Leu20313.9301.497LeuArgCysGly

a Residue identity in RDR1Esccol.

Group entropy analysis of Class I RNRs a Residue identity in RDR1Esccol. Phe212{336}, which is fully conserved in all RNRs, forms a conserved hydrophobic pocket with residues Phe208{332} and Ile234{371} for the binding of dioxygen 19. Next, Asp84{184} is invariant in all RNRs and coordinates one of the iron ligands 19. The position with the sixth highest Group Entropy score is Glu52{142}, which is on the surface of the molecule near the subunit interface. Analysis of an E52Q mutant in the beta chain of RNR1A appears to suggest that Glu52{142} is critical for the conformational change at the α/β subunit interface that initiates radical formation 44. Next, Phe47{135}, which is fully conserved in all RNRs, ring stacks with Trp286{426} between subunits. Another residue identified by GEnt that is invariant in all RNRs is Gln87{187}. The OE1 position of Gln87{187} is located 2.9 Å from the side chain carboxylate of Glu204{328}, which is an iron ligand 19. Next, the hydroxyl of Tyr79{179} hydrogen bonds to the side chain carboxylate Glu283{423}, which also interacts with Arg149{268} noted above. The Cα of Thr81{181}, which is fully conserved in all RNRs, is 3.8 Å away from the side chain of Tyr122{227}, which is suggested to be the radical in the enzyme mechanism 19. Lastly, Leu203{327} is involved in hydrophobic packing with the side chain of Gln87{187}, also identified by GEnt.

Alternate methods for determining functional residues

The evolutionary trace method was developed to identify critical residues in active sites and clusters of residues at functional interfaces in proteins. The program generates a dendrogram of aligned sequences and creates evolutionary partitions that divide the tree into distinct families and subgroups. Residues unique to each group, both buried and on the surface, can be identified 45, 46. For HOs, partition 2 groups all HOs in one group, but does not identify any group‐specific residues. It is not until partition 8, where HOs are divided into 16 subgroups, that six group‐specific residues are identified (Table 6). Of these, only His25{142} and Tyr58{184} are among the residues in HOs with the highest Group Entropy scores (above). After the tenth and final partition, all but one, Pro170{371}, of the highest scoring residues in GEnt are identified by evolutionary trace, but there are an additional 28 residues also identified in the 21 HO subgroups. This highlights one of the advantages of GEnt that one can define each group with representative sequences, instead of following arbitrary partitions.
Table 6

Index positions identified by functional residue identification programs in HOs

GEntEvolutionary traceJensen–Shannon divergenceProperty entropyVN entropyRelative entropyShannon entropySum of pairs
142142142184173142142142
184173184187184321184184
188179229188188322187187
322184321229198323188198
325328322230229325222222
327332323319232327226229
371325320268328229268
383327321284332268322
328322322334321325
329325325335322328
330326328336325335
332328330337326336
333335332367327337
334336334381328367
335337335384330368
336338336332370
337340337333384
367348367334415
384367368335420
368370336
370381337
384384367
387388381
413415384
415420415
Index positions identified by functional residue identification programs in HOs Evolutionary trace also does not result in any group‐specific residues from partition 2 of thiaminases, where thiaminases were divided into only two groups. It is not until partition 8, where thiaminases are divided into 20 subgroups, that six group‐specific residues are identified (Table 7). Of these, only Trp14{142} and Val47{184} are among the residues in thiaminases with the highest Group Entropy scores (above). It is also interesting to note that the residue positions identified by evolutionary trace in HOs are the same ones identified in thiaminases. This coincides with the fact that HOs and thiaminases are on neighboring clades in the phylogenetic tree (Fig. 5).
Table 7

Index positions identified by functional residue identification programs in thiaminases

GEntEvolutionary traceJensen–Shannon divergenceProperty entropyVN entropyRelative entropyShannon entropySum of pairs
142142142142142142142142
149173149149149149149149
180179151151150151151151
183184152152151152152152
184328166155152166166166
367332176166166179168173
371179168173180173176
430180173176326176177
181176177327177179
183177179329179180
268178180332180181
326179181364181183
327180183367183268
332181268426184327
367183271427267332
423184326428268364
426268327429326367
427271332327423
428272364332426
323367364427
327376367
423423423
426426426
427427427
428428428
Index positions identified by functional residue identification programs in thiaminases Evolutionary trace partition 2 places MMOH alpha and beta chains in separate groups. Evolutionary trace partition 4, the last partition to have the alpha and beta chains in one group each, identified five group‐specific positions in the beta chain: Trp221{258}, Trp238{276}, Phe259{331}, Phe267{339}, and Asp277{351}. However, Trp221{258}, Trp238{276}, and Phe267{339} are at alignment positions that are primarily indels in other families. In the M. trichosporium, MMOH alpha chain partition 4 identified 18 positions (Table 8). However, 15 of these residues are at or above alignment index 468, which constitute the C terminus, and residues that extend beyond the length of all of the other homologues (beyond index 540 of the summary alignment in Fig. 1). Thus, similar to the residues in the beta chain at indel positions, most of these positions do not have residues in other homologues to which to compare. Of the residues identified in both chains by evolutionary trace, only Leu82{142} and Glu114{184} identified in the alpha chain score highly for Group Entropy in GEnt. Another feature of GEnt analysis is that it does not consider positions with gaps. Thus, most positions identified by evolutionary trace for MMOHs were not identified by GEnt.
Table 8

Index positions identified by functional residue identification programs in MMOHs

GEntEvolutionary tracea Jensen–Shannon divergenceProperty entropyVN entropyRelative entropyShannon entropySum of pairs
142142184173125168171184
168173185184171169184185
169179222185184224185218
183184224187185263186263
184211263206186265187282
185236265218206351218350
229247282220218354219351
265276326255255365222353
268277328264263454224354
365324329265265455263365
328331267282460282457
331332282285285459
332350285328326460
346351319329328504
351354325330329521
458365326331331526
460328332332527
461330333334529
331350349603
350351350
351353351
353354353
354365354
365409365
431431411

a 1MHY alpha chain.

Index positions identified by functional residue identification programs in MMOHs a 1MHY alpha chain. Evolutionary trace partition 2 places all RNRs in the same group, while partition 8 is the last to group RNR1A and RNR1B in one group each. Partition 2 did not identify any group‐specific residues in all RNRs. Partition 8 identified 12 group‐specific residues in RNR1As (Table 9). Partition 8 identified 11 group‐specific residues in RNR1Bs, all of which were also identified in RNR1As (Table 9). Of the residue positions identified in both RNR groups, indices 142, 179, and 184 were highly scoring for Group Entropy for all RNRs in GEnt.
Table 9

Index positions identified by functional residue identification programs in RNRs

GEntEvolutionary trace RNR1AEvolutionary trace RNR1BJensen–Shannon divergenceProperty entropyVN entropyRelative entropyShannon entropySum of pairs
135142142135138133135133135
138173173138140135138135138
142179179140143138140138140
179184184181182140187140141
181211211187183141223141184
184296317217184184225181202
187317328223185185336182223
268328331225186187345183225
327331332226222202378184227
336332346332223217426186333
426346351336225225490187336
351345226226491223345
373227227225373
375228228226375
376365333227376
379367336333379
368338336490
369345345
370375365
371376368
372422371
373423373
375451375
376454376
378490377
Index positions identified by functional residue identification programs in RNRs In addition, six other algorithms were used to identify functional residues in HOs, thiaminases, MMOHs, and RNRs: Jensen–Shannon divergence, property entropy, VN entropy, relative entropy, Shannon entropy, and sum of pairs analysis 47. Each algorithm was used with combinations of all possible backgrounds and matrices. Only residues that were identified for all 21 combinations of backgrounds and matrices were reported as results for each method (Tables 6, 7, 8, 9).

Common positions of group‐specific residues

Despite the fact that these homologues had different mechanisms, several common index positions were identified by various methods in multiple groups. These positions would represent critical sites of evolutionary differences between the homologues. Seven alignment index positions were identified in all four homologues: 142, 173, 179, 184, 268, 328, and 332. Index 142 had the highest Group Entropy score in GEnt in HOs and thiaminases, second highest in MMOHs, and sixth highest in RNRs. In HOs, His25{142} is the essential heme iron ligand 30. In thiaminases, Trp14{142} is involved in hydrophobic packing where the heme‐binding site is in HOs, but it also hydrogen bonds to the side chain hydroxyl of Ser195{420}. Leu82{142} in MMOHs is involved in hydrophobic packing for both alpha and beta chains. Thus, in thiaminases and MMOHs that lack a bound heme ring, this index position is involved in hydrophobic packing in the core of the protein. However, as indicated by evolutionary tracing, Glu52{142} in RNR1A and Lys35{142} in RNR1B are class‐specific, surface‐exposed residues. Both residues are near the subunit interfaces. Glu52{142} in the beta chain of RNR1A appears necessary for the conformational change at the α/β subunit interface that initiates radical formation 44. The residues at index position 173 are involved in hydrophobic packing. Phe47{173} in hHO1 is at a position that is 71% conserved in the entire alignment and is a phenylalanine in RNR1As, most thiaminases, and HOs except plant species. MMOH alpha chains have Met103{173}. However, RNR1Bs have a polar Thr56{173}, which contacts Leu168{338}. The residue at index 179 in hHO1, Ser53{179}, lies at the bottom of the heme‐binding pocket. In PfThiaminase, the side chain of Asn42{179} hydrogen bonds to the main chain nitrogen of Asp99{262}, maintaining protein structure. In the MMOH alpha chain, Phe109{179} packs with Trp181{265}. In EcRNR1A, the hydroxyl of Tyr79{179} hydrogen bonds to the side chain carboxylate of Glu283{423}. In Sal. typhimurium RNR1B, Gly62{179} lies in a helix preceding the iron ligand Asp67{184} 20. Index 184 had the second highest Group Entropy score in HOs and thiaminases, fourth highest in MMOHs, and fifth highest in RNRs. In HOs, Tyr58{184} is involved in a hydrogen bonding network close to pyrrole B that may activate the oxygen in the catalytic mechanism 35. In thiaminases, Val47{184} forms the wall of the HMP‐P binding pocket 17. The residues in MMOH alpha chain, Glu114{184}, and RNR1A, Asp84{184}, both coordinate iron ligands 16, 19. Thus, this index position is critical for the unique function of each homologue. Index 268 has the third highest Group Entropy score in MMOHs and second in RNRs. In the entire alignment, index 268 is a 66% conserved tyrosine. In MMOHs, Met184{268} lines the hydrophobic cavity where the diiron cluster is located in the alpha chain 16. In RNR1As, Arg149{268} forms a salt bridge with Glu283{423}, and also stacks against Trp286{426} 19, which is a residue position also identified by GEnt in RNRs. This might indicate a compensatory mutation in RNRs. Interestingly, although index position 268 did not score highly in group entropy in either HOs or thiaminases, several other functional residue identification methods identified index 268 in both homologues (Tables 6 and 7). This may have been due to the presence of a conserved tyrosine in both HOs and thiaminases. The side chain of Tyr114{268} in hHO1 also participates in the crucial hydrogen bond network 35. In PfThiaminase, the hydroxyl of Tyr105{268} hydrogen bonds with the side chain carboxylate of Glu198{423}, which coordinates the bound HMP‐P (Fig. 3D) 17. A Y112F mutation at index 268 in BsThiaminase caused a significant impairment of catalytic function 21. The residue at index 328 serves critical roles in the active sites of most homologues. In PfThiaminase, the side chain carboxylate of Glu129{328} hydrogen bonds to phosphate of HMP‐P 17. The glutamate residues at index 328 in the MMOH alpha chain (Glu209{328}) and both RNR1A (Glu204{328}) and RNR1B (Glu158{328}) all coordinate iron ligands 16, 19, 20. However, in hHO1, the side chain of Tyr137{328} hydrogen bonds to the side chain carboxylate of Glu62{188}, maintaining enzyme structure. The residue at index 332 also lies in the active sites of the homologues. In hHO1, Asp140{332} is part of the critical hydrogen bonding network 35. In PfThiaminase, the pyrimidine ring of HMP‐P is packed between the side chains of Tyr132{332} and Phe46{183} 17. In the M. trichosporium MMOH alpha chain, Thr213{332} is within 7 Å of the two iron atoms 16. Phe208{332} in RNR1As and Phe162{332} in RNR1Bs form a conserved hydrophobic pocket between the diiron cluster and tyrosyl radical 19, 20. Two more index positions that scored high in Group Entropy in multiple homologues are indices 183 and 327. Index 183 scored fifth highest for Group Entropy in thiaminases and tenth highest in MMOHs. The pyrimidine ring of HMP‐P is packed between the side chains of Phe46{183} and Tyr132{332} in PfThiaminase 17. Mutations at this index position in BsThiaminase and H. pylori TenA thiaminase decrease catalytic activity and disrupt helical folding 21, 43. In MMOHs, Gly113{183} in the alpha chain lines the pocket for the diiron cluster 16. Index 183 was identified in RNRs by two of the eight methods performed. The Cβ of Leu83{183} in RNR1A is nearly 6.5 Å from the diiron cluster 19. In hHO1, Ile57{183} is approximately 10 Å from the bound heme 35. Index 327 is fourth highest in Group Entropy for HOs and eleventh in RNRs. In hHO1, the side chain of Arg136{327} is part of the same hydrogen bonding network as Tyr58{184}, Tyr114{268}, and Asp140{332} (all noted above) that may help to activate the oxygen in the catalytic mechanism 35, demonstrating the critical nature of this network. However in RNRs, Leu203{327} is involved in hydrophobic packing with the side chain of Gln87{187}, another residue identified by GEnt in RNRs, perhaps indicating a compensatory mutation. Although index 327 did not score highly for Group Entropy for thiaminases in GEnt, Cys135{327} in BsThiaminase serves as the catalytic nucleophile for the enzyme. A C135A mutation completely inactivates the enzyme 21.

Conclusions

While sharing some structural homology, HOs, thiaminases, methane monooxygenase hydrolases, and Class I RNRs share little sequence homology, with only five residue positions conserved in at least 60% of all 472 aligned sequences and fourteen more with amino acid similarities above 60% conservation. The most well‐conserved sequence motifs were present in HOs and RNRs. Phylogenetic analysis revealed that the four homologues were distinct, and these four groups were used to perform group entropy analysis, as well as several other methods to predict functional residues. Despite the amino acid sequence and functional differences in each homologue, several common residue positions appeared critical in determining the unique function of each homologue. In HOs, the residue at index 142 coordinated the heme iron, while residues at indices 184, 268, 327, and 332 all participated in an essential hydrogen bonding network. Residues at indices 173 and 328 helped to maintain enzyme structure, and the residues at indices 179 and 183 line the heme‐binding pocket 35. In thiaminases, the residues at indices 142 and 173 are involved in hydrophobic packing. Index 179 maintains enzyme structure through a hydrogen bond. The residue at index 184 lines HMP‐P binding site. The residues at indices 183, 268, 328, and 332 help to bind HMP‐P, while the residue at index 327 acts as the catalytic nucleophile 17, 21. In MMOHs, the residues at indices 142, 173, and 179 contribute to hydrophobic packing. Residues at indices 184 and 328 coordinate the iron atoms, while residues at indices 183, 268, and 332 line the active site 16. In Class I RNRs, the charged residue at index 142 is located on the surface of the enzyme at the subunit interface and may be important for a critical conformation change. The residues at indices 173 and 327 are involved in hydrophobic packing. The residues at indices 179 and 268 form bonds that maintain enzyme structure in RNR1As. The residues at indices 184 and 328 coordinate the diiron cluster, while the residues at indices 183 and 332 line the active site 19, 20. The common index positions identified here represent potential evolutionary evidence that may explain the diversity of these enzymes despite being structurally homologous. These group‐specific residues, as well as the conserved residues throughout all four homologues, could also serve as excellent targets for site‐directed mutagenesis by others.

Materials and methods

The procedure used here was similar to that previously published 34. The project began by downloading the amino acid sequences and tertiary structures of human HO1 (PDB IDs: 1NI6 & 1N45), human HO2 (PDB ID: 2RGZ), TenA from P. furiosus (PDB ID: 1RTW), Treg from Streptococcus pneumoniae (PDB ID: 1UDD), TenA from B. subtilis (PDB ID: 2QCX), HO‐like domain from Shewanella denitrificans (PDB ID: 3DDE), THI20 from Sac. cerevisiae (PDB ID: 3RM5), MMOH from M. trichosporium (PDB ID: 1MHY), RNR R2F from Sal. typhimurium (PDB ID: 1R2F), and RNR beta chain from E. coli (PDB ID: 1XIK) from the RCSB Protein Data Bank. Homologous sequences were then identified using psi‐blast 48 searches of the nonredundant protein database at the National Center for Biotechnology Information (NCBI). 472 related HO, thiaminase, MMOH, and RNR amino acid sequences were collected with percentage identities ranging from 99% to 4%. These sequences were initially aligned using t‐coffee 49. The alignment was manually adjusted using tertiary structure comparison of all structures using mapsci (http://www.geom-comp.umn.edu/mapsci/) 24 and through the RCSB PDB Protein Comparison Tool‐jFATCAT method 50, 51 using pairs of structures as a guide. The alignment editor used was genedoc 52. Tertiary structures were used to assess the structural or functional roles of conserved residues within the alignment. Molecular visualization was performed using rasmol 53. Analysis of conserved sequence motifs was facilitated by meme program, and these motifs were searched against a protein database using mast 36. Group entropy analysis (GEnt ) 40 was performed on the entire alignment to compare HO, MMOH, RNR, and thiaminase groups to each other. Evolutionary trace (http://mordred.bioc.cam.ac.uk/~jiye/evoltrace/evoltrace.html) 45, 46 was also performed on the entire alignment. Protein residue conservation prediction (http://compbio.cs.princeton.edu/conservation/score.html) 47 was performed on subalignments of HO, MMOH, RNR, and thiaminase groups. Each algorithm was used using combinations of all three possible backgrounds (BLOSUM62, SwissProt, and PF) and all seven possible matrices (BLOSUM62, BLOSUM35, BLOSUM40, BLOSUM45, BLOSUM50, BLOSUM80, and BLOSUM100) distributed with the program. The phylip suite of programs was used to generate phylogenetic trees 54. First, the alignment was trimmed using trimal 55. 250 bootstrapped data sets of the trimmed alignment were then generated using the seqboot program. Next, evolutionary distances for each data set were determined by the protdist program using the Jones–Taylor–Thornton matrix. Phylogenetic trees for each data set were generated using the neighbor program. Lastly, the unrooted consensus tree was generated using the consense program. The tree graphic was generated using figtree (available at http://tree.bio.ed.ac.uk/software/figtree). A parsimony tree was generated using 50 bootstrapped data sets using the protpars program, followed by consense.

Author contributions

JI performed the alignment of 472 HO homologue sequences, analyzed the alignment for functional, structural, and phylogenetic conservations, and drafted the manuscript. AR performed functional residue prediction analyses and interpreted the results. JP supervised the project, participated in the analysis of the alignment, helped to write the initial and final drafts of the manuscript, and addressed reviewer's comments. Fig. S1. Complete alignment of 472 heme oxygenase homologue sequences (MSF format). Click here for additional data file. Fig. S2. Unrooted bootstrapped parsimony tree of the heme oxygenase homologues. Branches are color‐coded based on enzyme type: blue = HOs, purple = thiaminases, red = MMOHs and green = RNRs. Specific subgroups within each enzyme group are labeled. Click here for additional data file. Table S1. Complete GEnt results of HOs. Click here for additional data file. Table S2. Complete GEnt results of thiaminases. Click here for additional data file. Table S3. Complete GEnt results of MMOHs. Click here for additional data file. Table S4. Complete GEnt results of RNRs. Click here for additional data file.
  52 in total

1.  Evolutionary trace analysis of TGF-beta and related growth factors: implications for site-directed mutagenesis.

Authors:  C A Innis; J Shi; T L Blundell
Journal:  Protein Eng       Date:  2000-12

2.  T-Coffee: A novel method for fast and accurate multiple sequence alignment.

Authors:  C Notredame; D G Higgins; J Heringa
Journal:  J Mol Biol       Date:  2000-09-08       Impact factor: 5.469

3.  Mutagenesis studies on TenA: a thiamin salvage enzyme from Bacillus subtilis.

Authors:  Amy L Jenkins; Yang Zhang; Steven E Ealick; Tadhg P Begley
Journal:  Bioorg Chem       Date:  2007-12-03       Impact factor: 5.275

4.  Differences between alcohol dehydrogenases. Structural properties and evolutionary aspects.

Authors:  H Jörnvall
Journal:  Eur J Biochem       Date:  1977-02

5.  The effect of the Gly139His, Gly143His, and Ser142His mouse heme oxygenase-1 mutants on the HO reaction in vivo and in vitro.

Authors:  Lingyun Zhou; Ying Liu; Chaoxia Zou; Ning Ma; Yang Hui; Guixiang Lv; Hongyu Zhang; Hong Zhou; Xu Gao
Journal:  Anat Rec (Hoboken)       Date:  2010-11-16       Impact factor: 2.064

6.  Glutamate 52-β at the α/β subunit interface of Escherichia coli class Ia ribonucleotide reductase is essential for conformational gating of radical transfer.

Authors:  Qinghui Lin; Mackenzie J Parker; Alexander T Taguchi; Kanchana Ravichandran; Albert Kim; Gyunghoon Kang; Jimin Shao; Catherine L Drennan; JoAnne Stubbe
Journal:  J Biol Chem       Date:  2017-04-04       Impact factor: 5.157

7.  Disruption of an active site hydrogen bond converts human heme oxygenase-1 into a peroxidase.

Authors:  L K Lightning; H Huang ; P Moenne-Loccoz; T M Loehr; D J Schuller; T L Poulos; P R de Montellano
Journal:  J Biol Chem       Date:  2000-12-19       Impact factor: 5.157

Review 8.  Biochemistry of the soluble methane monooxygenase.

Authors:  J D Lipscomb
Journal:  Annu Rev Microbiol       Date:  1994       Impact factor: 15.500

9.  Mixed regioselectivity in the Arg-177 mutants of Corynebacterium diphtheriae heme oxygenase as a consequence of in-plane heme disorder.

Authors:  Yuhong Zeng; Rahul Deshmukh; Gregori A Caignan; Richard A Bunce; Mario Rivera; Angela Wilks
Journal:  Biochemistry       Date:  2004-05-11       Impact factor: 3.162

10.  trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses.

Authors:  Salvador Capella-Gutiérrez; José M Silla-Martínez; Toni Gabaldón
Journal:  Bioinformatics       Date:  2009-06-08       Impact factor: 6.937

View more
  1 in total

1.  In silico analysis of class I adenylate-forming enzymes reveals family and group-specific conservations.

Authors:  Louis Clark; Danielle Leatherby; Elizabeth Krilich; Alexander J Ropelewski; John Perozich
Journal:  PLoS One       Date:  2018-09-04       Impact factor: 3.240

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.