Literature DB >> 31890941

Substrate specificity of thioredoxins and glutaredoxins - towards a functional classification.

Manuela Gellert1, Md Faruq Hossain1, Felix Jacob Ferdinand Berens1,2, Lukas Willy Bruhn1,2, Claudia Urbainsky1, Volkmar Liebscher2, Christopher Horst Lillig1.   

Abstract

The spatio-temporal reduction and oxidation of protein thiols is an essential mechanism in signal transduction in all kingdoms of life. Thioredoxin (Trx) family proteins efficiently catalyze thiol-disulfide exchange reactions and the proteins are widely recognized for their importance in the operation of thiol switches. Trx family proteins have a broad and at the same time very distinct substrate specificity - a prerequisite for redox switching. Despite of multiple efforts, the true nature for this specificity is still under debate. Here, we comprehensively compare the classification/clustering of various redoxins from all domains of life based on their similarity in amino acid sequence, tertiary structure, and their electrostatic properties. We correlate these similarities to the existence of common interaction partners, identified in various previous studies and suggested by proteomic screenings. These analyses confirm that primary and tertiary structure similarity, and thereby all common classification systems, do not correlate to the target specificity of the proteins as thiol-disulfide oxidoreductases. Instead, a number of examples clearly demonstrate the importance of electrostatic similarity for their target specificity, independent of their belonging to the Trx or glutaredoxin subfamilies.
© 2019 The Author(s).

Entities:  

Keywords:  Biocomputational method; Biomolecules; Electrostatics; Glutaredoxin; Gromov-Wasserstein distance; Mathematical biosciences; Molecular docking; Protein-protein interaction; Redox signaling; Thioredoxin

Year:  2019        PMID: 31890941      PMCID: PMC6928294          DOI: 10.1016/j.heliyon.2019.e02943

Source DB:  PubMed          Journal:  Heliyon        ISSN: 2405-8440


Introduction

Redox modifications of cysteinyl and also methionyl side chains are a vital part of numerous signal transduction pathways as well as the reaction cycle of essential metabolic enzymes [1, 2, 3, 4]. Many of these redox reactions are directly or indirectly catalyzed by members of the Trx family of proteins. This group of proteins share a common basic structural motif – the Trx fold. Their active sites, in most cases consisting of two cysteinyl residues separated by two amino acids (Cys-X-X-Cys), are the basis of their redox activity [5]. Proteins of this family are known to catalyze the reduction of disulfides in target proteins, the de- or glutathionylation of proteins, they catalyze the oxidative folding of proteins and are able to reduce redox modifications like sulfenic acids [4]. Trx family proteins are encoded in essentially all genomes and are localized in all compartments of eukaryotic cells, e.g. the cytosol, ER, mitochondria, nucleus, and plastids – often in multiple isoforms. Most members of the family have a broad, but distinct substrate specificity. The nature of this specificity is the focus of this work. The Trx family divides into subfamilies, two of the major groups are the Trxs themselves and the glutaredoxins (Grxs). Trxs catalyze thiol-disulfide exchange reactions and the trans-nitrosylation of thiol groups [4, 6]. The disulfide formed in their consensus Cys-Gly-Pro-Cys active site during their reaction cycle is reduced by specific reductases named thioredoxin reductases (TrxRs) [7, 8]. The members of one of the Grx subfamilies (dithiol Grxs, with a consensus active site of Cys-Pro-Tyr-Cys) catalyze thiol-disulfide oxidoreductions as well, however, when oxidized these proteins are reduced by the tripeptide glutathione (GSH). The mechanisms of these reactions have been discussed in great detail before, see for instance [4, 9, 10, 11]. The members of a second subclass of the Grxs (monothiol Grxs, with a consensus Cys-Gly-Phe-Ser active site) do not catalyze thiol-disulfide exchange reactions at significant rates. Instead, they function in the regulation of iron metabolism or in the transfer of iron-sulfur centers [12, 13, 14]. Traditionally, Grxs and Trxs were named in each organism in order of their discovery, for instance in mammals, the firstly discovered cytosolic Trx1 [15] and the later discovered mitochondrial Trx2 [11]. An other example are the yeast Grxs 1–8, summarized in [16]. This historical naming, however, does not include any information on structural or functional differences with the Trx family of proteins. A more advanced classification and naming system based on the active site sequences was presented for plants Grxs and became widely accepted [17]. This nomenclature defines three classes, i.e. class I, that contains the largely redox-active dithiol Grxs, class II, including all monothiol Grxs, and class III, including the land plant-specific CC-type Grxs, also known as ROXYs [18]. In most species, including bacteria, fungi, mammals, and plants, multiple Trx family proteins are present in the same compartment, prompting questions on overlapping functions. Various proteomic studies, also summarized in this work, indicate a rather high degree of substrate specificity for each member of the family, with only some overlapping substrate/target proteins. Various attempts have been made to understand the substrate specificity and reactivity of the Trx family proteins. Previous suggestions primarily addressed the thermodynamics of the reaction, including the nucleophilicity of the more N-terminal active site thiol, the differences in redox potential, and entropic changes during the reaction [19, 20, 21]. Recently, based on the analysis of E. coli phosphoadenylyl sulfate (PAPS) reductase that can react with many, but not all Trxs and Grxs [22, 23], we proposed that different electrostatic properties of the redoxins govern their target specificity and reactivity [24]. In this work, we provide a detailed comparison on the coherence between the similarity of redoxins in (1st) primary structure, (2nd) tertiary structure, and (3rd) electrostatic properties. Where possible, we correlate these different methods of clustering/classification to known functions of the redoxins. We have focused on the redoxins encoded in the human genome and all redoxins from various species with experimentally determined structures deposited in the protein data base. Our results provide further evidence for the importance of the electrostatic properties of the proteins for their distinct target specificity. This clustering may allow a new functional classification of the redoxins and may enable the prediction of common functions and interactions partners.

Methods and procedures

Structures and molecular modeling

Structures, when available, were obtained from the protein data bank (https://www.rcsb.org); the PDB entries used are listed in the supplementary table. Molecular modeling was performed using the Swiss Model web server [25, 26, 27, 28, 29]. The final model was chosen from structures modeled with different templates that displayed the highest sequence identity with the target protein based on the quality assessment provided, i.e. the lowest QMEAN with no major outlier in the global or local quality estimates of Cβ, all atom, solvation, or torsion. The individual template structures used and the QMEAN values are summarized in the supplementary table (structures).

Sequence and structure comparison

Sequences of the proteins were obtained from the uniprot resources. In case of multi-domain proteins, the sequences encoding the redoxin domains only were mostly extracted as annotated in the respective uniprot entries, i.e. based on PROSITE-ProRule [30]. In some critical cases, for instance human nucleoredoxin, multiple sequence alignments were performed including sequences from various species. These typically share a higher degree of homology within the functional domains and a lower in the joining peptides. Primary structure alignments and the generation of the corresponding distance trees were performed with the CLC sequence viewer (Qiagen bioinformatics, Hilden, Germany) and Clustal omega [31]. The three dimensional structures were aligned using UCSF chimera [32] (MatchMaker) including structure-based multiple sequence alignments. From these primary structure and 3-D structure alignments, the corresponding distance trees were generated with the CLC sequence viewer applying the neighbor joining method and the Jukes-Cantor protein distance measure.

Electrostatic calculations

The structures in the PDB files were aligned in the desired orientation using UCSF chimera. The electrostatic properties of the proteins were computed from the pdb files as follows: the reconstruction of any missing atoms, the addition of hydrogens, the assignment of atomic charges and radii was performed using pdb2pqr with the amber force field [33]. The electrostatic parameters were calculated using the Adaptive Poisson-Boltzmann Solver (APBS) [34] within the vmd (visual molecular dynamics) software package [35]. The following parameters were used: 150 mM mobile ions, solvent dielectric constant: 78.54, temperature: 298.15 K. Images were rendered depicting the secondary structures of the proteins (with the N-terminal active site cysteinyl residues facing towards the camera perspective), the electrostatic potential mapped to the surface of the proteins (from -4 in red to 4 K T·e−1 in blue), and the isosurfaces of the electrostatic potential at -1 in red and 1 K T·e−1 in blue. These pictures were used to generate a summary picture using ImgageMagick. All steps following the 3D alignment of the structures were automatized with the help of scripts and a graphical interface. These can be obtained from: https://github.com/WillyBruhn/MutComp.

Electrostatic distances and clustering

Here, we compared the 3-dimensional isosurfaces of both the negative and positive electrostatic potential using the Gromov-Wasserstein-distance [36, 37]. Solving this problem, is NP-hard as the objective function is not convex. However, three lower bounds for the Gromov-Wasserstein-distance can be calculated in polynomial time. Our empirical tests demonstrated that not all points of the isosurface shall be calculated, instead we limited the sample to n points randomly distributed on the isosurfaces and calculate the lower bound for them. This was repeated m times, the obtained values were summarized in form of a histogram. This comparison was performed pairwise for all proteins. To get a measure of similarity between the histograms, the earth-mover's-distance was used [38]. For the hierarchical clustering, the unweighted pair group method with arithmetic mean (UPGMA) was used. This method yields the mean distance between all points from the new cluster to all points of another cluster. The result of this clustering were displayed in form of a dendrogram. Further details of this mathematical approach will be presented elsewhere. All code of the software produced here (C++, R, scripts) can be obtained free of charge and open source here: https://github.com/BerensF/ComparingProteins.

Interactome data and comparison

Interactome data were retrieved from the IntAct database [39] (as of Sept. 2018) as well as the BioGRID resources [40] (Vers. 3.4, as of Sept. 2018). ID mapping was performed using the uniprot resources (https://www.uniprot.org). All entries are listed with their unique UniprotKB ID. When available, additional resources for interactions – not yet listed in the upper mentioned databases – were included, for instance from some dedicated publications. All entries identified are listed in the supplementary tables (interactome). The matrix of common interactions as well as the Venn diagrams were computed using R (https://www.r-project.org) from RStudio (https://www.rstudio.com).

Results and discussion

All presently established classifications of Trx family proteins are based on the comparison of, and the clustering according to primary structures. We know, however, that these systems do not, or at best partially, reflect the various functions of the proteins. As examples, some only distantly related Trxs and Grxs share overlapping functions, while some closely related Grxs do not. The disulfide formed in E. coli PAPS reductase during its catalytic cycle can be reduced by the distantly related Trx1 and Grx1 (primary sequence identity: 19.6%), but not by Grx3 (sequence identity to Grx1: 41.7%) [23, 41]. The primary aim of this study is to provide a thorough comparison between the clustering and classification according to primary structure, 3-D structure, and electrostatic characteristics. Further more, we aim to evaluate the practicality of the comparison of electrostatic properties for the functional classification of Trx family proteins and the prediction of functions. In the first part, we focus on the Trx family proteins from human (see Table 1). Functions of the proteins were deduced from the literature, identified interacting proteins were collected from the literature, our own data sets, and some major interaction databases, i.e. IntAct and BioGRID (see methods section). These sources summarize studies dedicated to single interactions as well as proteomic approaches. If available, 3-D structures were obtained from the protein data base (pdb). Missing structures were obtained by homology modeling. The electrostatic calculations were performed using pdb2pqr and APBS implemented in vmd. For the comparison and clustering of the electrostatic similarities, we have adopted new strategies based on Gromov-Wasserstein distances of lower boundaries and Earth-Movers distances. These computational strategies are outlined in the methods section.
Table 1

Redoxins and redoxin domains encoded in the human genome. loc: localization, c: cytosol, e: endoplamic reticulum, g: golgi apparatus, l: lipid membrane, m: mitochondria, n: nucleus, o: outside of the cell, secreted; ‘.d’ marks individual domains. n.a.: not analyzed (as no information on the individual domains were available).

Protein nameIDActive sitepdbFunctionsloc.Interactome (number)
Homo sapiens glutaredoxins
Grx1P35754CPYC1b4q, ...(de)-glutathionylationc/n24
Grx2a/cQ9NS18CSYC2fls, ...Fe/S, redox sensor [52]c/m/n64 [53]
Grx3.d2O76003CGFS3zywFe/S, iron metabolism [54, 55]c98
Grx3.d3CGFS2yan
Grx5Q86SX6CGFS2wul, ...Fe/S biogenesis [56, 57]m22
TrxR1v3.d1Q16881-1CTRCcn.a.
TrxR3.d1Q86VQ6CPHS2h8qcn.a.
GrxCR1.d2A8MXD5FERCc9
PTGES2.d1Q9H7Z7CPFC2pbjprostaglandin synthesis [58]c/l31
SH3BGRL3Q9H299KSQQ1sj6cancer progression [59]c/n10
Homo sapiens thioredoxins
Trx1P10599CGPC1ert, ...electron donor, redox signaling, ... [4]c/n/o247 [60]
Trx2Q99757CGPC1wh4, ...m102
Grx3.d1O76003APQC2diy, ...Fe/S, iron metabolism [54, 55]csee above
Nrx.d2Q6DKJ4CPPCredox signalingc/n629 [61]
Nxnl1Q96CM4CPQCn/l4
Nxnl2Q5VZ03CAPSc2
Txnl1O43396CGPC1gh2, ...c/n62
Txnl4AP83876DPTC1qgv, ...n39
Txnl4BQ9NX01DPVC3gix, ...n21
Tmx1Q9H3N1CPAC1x5ee/l/o79
Tmx2Q9Y320SNDC2dj0l/o103
Tmx4Q9H1E5CPSCl/o25
Txndc2Q86VQ3CGPCspermatogenesis [62, 63]c4
Txndc3Q8N427CGPCspermatogenesis [64]c8
Txndc6Q86XW9CGPCmicrotuble dynamics, cancer progression [65, 66]c-
Txndc8Q6A555CGPCspermatogenesis [67]c/g-
Txndc9O14530TFRCprotein complex assembly [68]c/n91
Txndc11.d1Q6PKC3CGQSe/l83
Txndc11.d2CGFC
Txndc12O95881CGAC1sen, ...PDIe30
Txndc15Q96J42CRFSciliogenesis [69]l/o44
Txndc16Q9P2K2QAVSmeningioma-associated antigen [70]e/s29
Txndc17Q9BRA2CPDC1woudisulfide and cystine reduction, denitrosylation [71, 72]c39
Qsox1O00391CGHC3q6q, ...disulfide formation [73]g/o17
Qsox2Q6ZRP7CGHCdisulfide formation [73]c/l/n/o28
Redoxins and redoxin domains encoded in the human genome. loc: localization, c: cytosol, e: endoplamic reticulum, g: golgi apparatus, l: lipid membrane, m: mitochondria, n: nucleus, o: outside of the cell, secreted; ‘.d’ marks individual domains. n.a.: not analyzed (as no information on the individual domains were available).

Human Grxs and Trxs

The human genome encodes about 9 Grxs or Grx domain- containing proteins and 24 Trxs or Trx domain-containing proteins (see Table 1). From the in total 35 Trx-fold domains in these 33 proteins, the 3-D structure of 20 had already been determined experimentally. The remaining 15 were predicted by homology modeling, see supplementary Table, sheet 1 – modeling. Similar to the comparison of the plant redoxins, the human redoxins divide into similar groups in the trees generated from primary and tertiary structure (Figure 1A and B). The subgroup of the Grxs both show the separate monothiol and dithiol Grx groups. The Trx group is quite diverse, however, the Nrxs and the Trxs with the consensus active site motif Cys-Gly-Pro-Cys (minus mitochondrial Trx2) clearly separate from the others in both analyses. The electrostatic characteristics (Figure 2) define two major groups (Figure 1C). Group ‘I’ contains Trxs and Trx domain-containing proteins exclusively. Group ‘II’ contains all of the Grxs and Grxs domains next to some Trx-related proteins and domains.
Figure 1

Clustering of human redoxins. (A) Phylogram based on primary structure comparison, computed by Clustal Omega and CLC sequence viewer. (B) Similarity tree based on the similarity of the 3D structures extracted from the pdb and generated by homology modeling; the tree was computed using UCSF Chimera and the CLC sequence viewer. (C) The electrostatic similarity of the whole proteins was computed as outlined in the methods section; the tree was generated using ‘R’. The protein abbreviations highlighted in green are referred to in the main text. The Trx proteins with a Cys-Gly-Pro-Cys active site motif were highlighted with a red circle in B.

Figure 2

Electrostatic features of the active site contact areas of the human redoxins. The first rows depict the electrostatic potential isosurfaces at +/- 1 K T·e−1. The second row depicts the electrostatic potential at +/- 4 K T·e−1 mapped to the water-accessible surface of the proteins. Blue: positive, red negative potential. The third row depicts the proteins in cartoon models, helices are colored in purple, sheets in yellow. The proteins were arranged with the N-terminal active site thiol in the middle of the models. The electrostatic similarity of the whole proteins was computed as outlined in the methods section.

Clustering of human redoxins. (A) Phylogram based on primary structure comparison, computed by Clustal Omega and CLC sequence viewer. (B) Similarity tree based on the similarity of the 3D structures extracted from the pdb and generated by homology modeling; the tree was computed using UCSF Chimera and the CLC sequence viewer. (C) The electrostatic similarity of the whole proteins was computed as outlined in the methods section; the tree was generated using ‘R’. The protein abbreviations highlighted in green are referred to in the main text. The Trx proteins with a Cys-Gly-Pro-Cys active site motif were highlighted with a red circle in B. Electrostatic features of the active site contact areas of the human redoxins. The first rows depict the electrostatic potential isosurfaces at +/- 1 K T·e−1. The second row depicts the electrostatic potential at +/- 4 K T·e−1 mapped to the water-accessible surface of the proteins. Blue: positive, red negative potential. The third row depicts the proteins in cartoon models, helices are colored in purple, sheets in yellow. The proteins were arranged with the N-terminal active site thiol in the middle of the models. The electrostatic similarity of the whole proteins was computed as outlined in the methods section. Using various sources, see Table 1, we collected almost 2000 potential interactions partners of these 33 proteins (all entries are available in the supplementary tables, sheet H.s. interactome). The pair-wise comparison of common targets in these sets is summarized in Figure 3A. The largest degree of overlapping interacting proteins was found between the thioredoxins Trx1, Txnl1, Txndc9, Txndc17, and Nrx, also summarized in the Venn diagram depicted in Figure 3B. This suggests some overlapping functions of various pairs of these five proteins. Triple and quadruple overlaps, on the other hand, are rare and no common potential interaction partner of all five of these redoxins was identified nor suggested so far. In the trees generated from the primary structure and tertiary structure comparisons, these five proteins of the Trx subfamily are localized in three different branches, separated by large distances (Figure 1A and B, green labels), only two of them contain the consensus Cys-Gly-Pro-Cys active site motif (Trx1 and Txnl1, see Table 1). In the tree based on their similarity in electrostatic properties, these five proteins are all localized close to each other in the terminal branches of cluster ‘I’ (Figure 1C, green labels). The glutaredoxins Grx1, Grx2, Grx3, and Grx5 share only three potential interaction partners, one between Grx1 and Grx2, two between Grx3 and Grx5, out of a total of 210 suggested target proteins (see Figure 3C). The primary reasons for this may be their different subcellular localization, dithiol Grx1 and monothiol Grx3 are cytosolic, dithiol Grx2 and monothiol Grx5 primarily mitochondrial.
Figure 3

Common interaction partners between the human redoxins. (A) Pair-wise comparison between all human redoxins. The total numbers of potential interactions partners collected from various data sources is depicted with gray background in the diagonal; yellow background: 3–4 common interaction partners; light green background: 5–9 common interaction partners; green background: ≥ 10 common interaction partners. The full list of interaction partners can be found in the supplementary tables. (B–C) Venn diagrams of the overlapping potential interactions partners between Trx1, Nrx, Txndc9, Txndc17, and Txnl1 (B) as well as Grx1, Grx2, Grx3, and Grx5 (B).

Common interaction partners between the human redoxins. (A) Pair-wise comparison between all human redoxins. The total numbers of potential interactions partners collected from various data sources is depicted with gray background in the diagonal; yellow background: 3–4 common interaction partners; light green background: 5–9 common interaction partners; green background: ≥ 10 common interaction partners. The full list of interaction partners can be found in the supplementary tables. (B–C) Venn diagrams of the overlapping potential interactions partners between Trx1, Nrx, Txndc9, Txndc17, and Txnl1 (B) as well as Grx1, Grx2, Grx3, and Grx5 (B). This study primarily focused on redox-interactions of the redoxins with target proteins. The main reason for this was the availability of data. However, it has been suggested early on that the Trx-fold domains might act as platform for protein-protein interactions, e.g. as processivity factor of T7 DNA polymerase [42] or as the basis for their redox-independent chaperon activity, summarized in [43]. Our study implies that redox-inactive redoxins (mostly domains) will show a similar target specificity as electrostatically similar redox-active redoxins. The N-terminal Trx domain of human Grx3 (Grx3_d1), for instance, contains the ‘active’ site motif Ala-Pro-Gln-Cys, hence it cannot catalyze thiol-disulfide exchange reactions. Electrostatically, this domain is most similar to the redox-active Grx1 and Grx2, as well as the Grx-domains of TrxR1 transcript variant 3 [44] and TrxR3 [45], see Figure 1C (cluster II). It remains to be established whether these proteins and domains share a similar pattern of interacting proteins or other substrates as implied by our study.

All representative redoxin structures from the pdb

For a more comprehensive clustering of Trx-family proteins, we have selected all Trx- and Grx-fold structures from the pdb for analysis. For the electrostatic analysis, we selected only non-mutated proteins and, if required, extracted the Trx-/Grx-domains. Other molecules included in some of the structures, i.e. cofactors or water, were excluded. For the present analysis, we did not include other Trx-family members, such as DsbA/B/C proteins, protein disulfide isomerases, GSH peroxidases, or arsenate reductases. Our final collection included 119 structures (see supplementary Table, sheet 5 – pdb structures) from all domains of life, including some phage/virus-encoded proteins. For these structures, we generated both primary structure and electrostatic similarity trees, see Figure 4. The sequence-based tree clearly separates the structures into the Grx and Trx subfamilies (Figure 4A), independent of their sometimes confusing annotation in the pdb and sequence databases. Within the Grx branch, the genuine monothiol Grxs (Cys-Gly-Phe-Ser) form a well defined side branch, as well as the dithiol Grxs. The remaining structures include redoxins such as mycoredoxin, methanoredoxin, and NrdHs proteins. The Trx subfamily contains six distinct groups, two eukaryotic, three bacterial, and the tryparedoxin branches (Figure 4A). In contrast to sequence similarity, the electrostatic similarity tree separates into eleven branches, marked as I-XI in Figure 4B, see also Figure 5, of which most include structures from both the Trx and Grx subfamilies. None of these groups overlap with a branch of the sequence-based tree.
Figure 4

Clustering of all representative redoxins in the pdb. (A) Phylogram based on primary structure comparison, computed by Clustal Omega and CLC sequence viewer. The dashed red line separates the Trx and Grx subfamilies. (B) The electrostatic similarity of the whole proteins was computed as outlined in the methods section; the tree was generated using ‘R’. The red asterisks mark proteins interacting with E. coli RNR, the black asterisks proteins interacting with E. coli PAPS reductase. The color code is included in the figure. Further information on the protein structures can be obtained from the supplementary Table, sheet 5.

Figure 5

Electrostatic features of all representative redoxins in the pdb. The first rows depict the electrostatic potential isosurfaces at +/- 1 K T·e−1. The second row depicts the electrostatic potential at +/- 4 K T·e−1 mapped to the water-accessible surface of the proteins. Blue: positive, red negative potential. The third row depicts the proteins in cartoon models, helices are colored in purple, sheets in yellow. The pdb entry code of the structures is indicated in the fourth row. The proteins were arranged with the N-terminal active site thiol in the middle of the models. The electrostatic similarity of the whole proteins was computed as outlined in the methods section.

Clustering of all representative redoxins in the pdb. (A) Phylogram based on primary structure comparison, computed by Clustal Omega and CLC sequence viewer. The dashed red line separates the Trx and Grx subfamilies. (B) The electrostatic similarity of the whole proteins was computed as outlined in the methods section; the tree was generated using ‘R’. The red asterisks mark proteins interacting with E. coli RNR, the black asterisks proteins interacting with E. coli PAPS reductase. The color code is included in the figure. Further information on the protein structures can be obtained from the supplementary Table, sheet 5. Electrostatic features of all representative redoxins in the pdb. The first rows depict the electrostatic potential isosurfaces at +/- 1 K T·e−1. The second row depicts the electrostatic potential at +/- 4 K T·e−1 mapped to the water-accessible surface of the proteins. Blue: positive, red negative potential. The third row depicts the proteins in cartoon models, helices are colored in purple, sheets in yellow. The pdb entry code of the structures is indicated in the fourth row. The proteins were arranged with the N-terminal active site thiol in the middle of the models. The electrostatic similarity of the whole proteins was computed as outlined in the methods section. So what can be said about the correlation between the two clustering methods and the functional interaction of the different redoxins with target proteins? The two first discovered functions of Trxs and Grxs offer some insights. Trx1 (pdb: 1xoa) from E. coli was first discovered as electron donor for ribonucleotide reductase (RNR) [46]. Later studies demonstrated that this function could also be fulfilled by Grx1 (1egr) [47, 48], Grx3 (1fov) [49], and NrdH (1h75) [50] from E. coli. In the sequence based tree, these four proteins can be found in three distant branches (Figure 4A, red asterisks). In the electrostatic similarity tree, however, these four proteins can be found in the very close neighbouring branches IX and X (Figures 4B and 5, red asterisks). The requirement of Trxs for sulfate reduction was first described in yeast [51]. Subsequently, Trx1 was also demonstrated to be a cosubstrate of PAPS reductase in E. coli. Similar to RNR, Grx1 can replace Trx1 in vivo and in vitro in this function [23, 41], however, Grx3 and NrdH failed to do so [23, 24]. E. coli PAPS reductase can be reduced by a number of redoxins from various species, while some others cannot interact with the protein [22, 24]. Positive interactions partners are, for instance human and Arabidopsis thaliana Trx1 (1ert) and TrxH1 (1xfl), respectively. Again, these functionalities and non-functionalities cannot be predicted or explained by their similarities in primary sequence (Figure 4A, black asterisks). In the electrostatic similarity tree, three of the four proteins are located in cluster ‘X’, one (A.t. TrxH1) in cluster ‘XI’. It should also be mentioned that cluster ‘XI’ contains proteins that cannot interact productively with PAPS reductase, i.e. human Grx2 (2fls), and T4 Grx (1aba) [24]. It appears that – at least some – proteins included in the electrostatic characteristics cluster ‘X’ can interact with both PAPS reductase and RNR, proteins of cluster ‘IX’ also with RNR, but not PAPS reductase. These two examples clearly support our hypothesis that the electrostatic similarity between the redoxins correlates to their profile of interacting proteins or may even guide these specific interactions [24]. For this study, the points analyzed for electrostatic similarity were distributed randomly on the surface of the proteins. One might argue that more weight on the properties of the immediate contact surface surrounding the active site could even further improve the model. To test this, we have developed a strategy to extract the electrostatic isosurfaces of the areas surrounding the N-terminal active site residue, that is usually the cysteinyl residue that forms an intermediate mixed-disulfide with the (redox-) target proteins and analyzed them in the same way as the features of the whole proteins. This strategy and the results obtained when it was applied to the ‘representative structures from the pdb’ data set were summarized in the supplementary material (suppl. Figure 1 and text). In brief, the hierarchical clustering of the electrostatic similarities in the area close to the active site did not reflect the functions of some redoxins as electron donor of PAPS reductase or RNR anywhere near as good as the clustering of the electrostatic similarities of the whole proteins. The redoxins that were previously shown to donate electrons to PAPS reductase clustered all within the distance of 9.8% of the maximum distance of all redoxins, the ones donating electrons to RNR within 12.7% of the maximum distance (Figure 4B) when the features of the whole proteins were compared. When only the electrostatic features of the active site faces of the proteins were compared, these redoxins did not cluster in close proximity and the relative distances increased to 37.4% and 62.6% of the maximum distance for PAPS reductase and RNR, respectively (suppl. Figure 1, nodes a and b). These results indicate that the global properties of the proteins may play a more important role than previously assumed. Moreover, they favor a model of redoxin-target interaction in which the recognition of the two proteins is controlled by attractive and repulsive electrostatic forces that, presumably, take part in pre-orientation of the two proteins before they can form a productive encounter complex, rather than contact surface complementarity only. Further experimental studies will have to address this hypothesis.

Conclusions

Here, we evaluated the practicality of a mathematical model for the automated clustering of the electrostatic properties of proteins for the functional classification of Trx family proteins and the prediction of functions. The analysis of the human, and pdb-wide redoxin structures clearly demonstrate that primary and tertiary structure (backbone) similarity do not correlate to the target specificity of the proteins as thiol-disulfide oxidoreductases and neither does their redox potential, see [24]. Instead, the examples of the human and pdb-wide redoxins clearly demonstrate the importance of the electrostatic properties of the whole protein for target specificity and discrimination. The mathematical model evaluated here is the first step towards an automated analysis and comparison of electrostatic properties of a large number of protein structures.

Declarations

Author contribution statement

Manuela Gellert: Conceived and designed the experiments; Performed the experiments; Analyzed and interpreted the data. Md Faruq Hossain: Performed the experiments; Analyzed and interpreted the data. Felix Jacob Ferdinand Berens, Lukas Willy Bruhn: Conceived and designed the experiments; Performed the experiments; Analyzed and interpreted the data; Contributed reagents, materials, analysis tools or data. Claudia Urbainsky: Performed the experiments. Volkmar Liebscher: Conceived and designed the experiments; Analyzed and interpreted the data; Contributed reagents, materials, analysis tools or data. Christopher Horst Lillig: Conceived and designed the experiments; Performed the experiments; Analyzed and interpreted the data; Wrote the paper.

Funding statement

This work was supported by Deutsche Forschungsgemeinschaft, German Research Foundation (DFG): Li984/3-2, GRK1947-A1. We acknowledge support for the Article Processing Charge from the DFG (393148499) and the Open Access Publication Fund of the University of Greifswald.

Competing interest statement

The authors declare no conflict of interest.

Additional information

Data associated with this study has been deposited at GitHub under the URLs: https://github.com/WillyBruhn/MutComp and https://github.com/BerensF/ComparingProteins. Supplementary content related to this article has been published online at https://doi.org/10.1016/j.heliyon.2019.e02943.
  70 in total

Review 1.  Plant-specific CC-type glutaredoxins: functions in developmental processes and stress responses.

Authors:  Nora Gutsche; Corinna Thurow; Sabine Zachgo; Christiane Gatz
Journal:  Biol Chem       Date:  2015-05       Impact factor: 3.915

2.  PAPS-reductase from Escherichia coli: characterization of the enzyme as probe for thioredoxins.

Authors:  J D Schwenn; U Schriek
Journal:  Z Naturforsch C J Biosci       Date:  1987 Jan-Feb

3.  Cytosolic monothiol glutaredoxins function in intracellular iron sensing and trafficking via their bound iron-sulfur cluster.

Authors:  Ulrich Mühlenhoff; Sabine Molik; José R Godoy; Marta A Uzarska; Nadine Richter; Andreas Seubert; Yan Zhang; JoAnne Stubbe; Fabien Pierrel; Enrique Herrero; Christopher Horst Lillig; Roland Lill
Journal:  Cell Metab       Date:  2010-10-06       Impact factor: 27.287

4.  Cloning and expression of a novel mammalian thioredoxin.

Authors:  G Spyrou; E Enmark; A Miranda-Vizuete; J Gustafsson
Journal:  J Biol Chem       Date:  1997-01-31       Impact factor: 5.157

5.  Cellular prostaglandin E2 production by membrane-bound prostaglandin E synthase-2 via both cyclooxygenases-1 and -2.

Authors:  Makoto Murakami; Karin Nakashima; Daisuke Kamei; Seiko Masuda; Yukio Ishikawa; Toshiharu Ishii; Yoshihiro Ohmiya; Kikuko Watanabe; Ichiro Kudo
Journal:  J Biol Chem       Date:  2003-06-30       Impact factor: 5.157

6.  SH3BGRL3 Protein as a Potential Prognostic Biomarker for Urothelial Carcinoma: A Novel Binding Partner of Epidermal Growth Factor Receptor.

Authors:  Cheng-Yao Chiang; Chin-Chen Pan; Hong-Yi Chang; Ming-Derg Lai; Tzong-Shin Tzai; Yuh-Shyan Tsai; Pin Ling; Hsiao-Sheng Liu; Bi-Fang Lee; Hong-Ling Cheng; Chung-Liang Ho; Shu-Hui Chen; Nan-Haw Chow
Journal:  Clin Cancer Res       Date:  2015-08-18       Impact factor: 12.531

Review 7.  Understanding the pK(a) of redox cysteines: the key role of hydrogen bonding.

Authors:  Goedele Roos; Nicolas Foloppe; Joris Messens
Journal:  Antioxid Redox Signal       Date:  2012-09-20       Impact factor: 8.401

8.  Escherichia coli thioredoxin: a subunit of bacteriophage T7 DNA polymerase.

Authors:  D F Mark; C C Richardson
Journal:  Proc Natl Acad Sci U S A       Date:  1976-03       Impact factor: 11.205

Review 9.  Thioredoxin--a fold for all reasons.

Authors:  J L Martin
Journal:  Structure       Date:  1995-03-15       Impact factor: 5.006

Review 10.  Thiol-disulfide balance: from the concept of oxidative stress to that of redox regulation.

Authors:  Pietro Ghezzi; Valentina Bonetto; Maddalena Fratelli
Journal:  Antioxid Redox Signal       Date:  2005 Jul-Aug       Impact factor: 8.401

View more
  7 in total

Review 1.  Matrix Redox Physiology Governs the Regulation of Plant Mitochondrial Metabolism through Posttranslational Protein Modifications.

Authors:  Ian Max Møller; Abir U Igamberdiev; Natalia V Bykova; Iris Finkemeier; Allan G Rasmusson; Markus Schwarzländer
Journal:  Plant Cell       Date:  2020-01-06       Impact factor: 11.277

Review 2.  Redox and Thiols in Archaea.

Authors:  Mamta Rawat; Julie A Maupin-Furlow
Journal:  Antioxidants (Basel)       Date:  2020-05-05

3.  Can thiol-based redox systems be utilized as parts for synthetic biology applications?

Authors:  Ché S Pillay; Nolyn John
Journal:  Redox Rep       Date:  2021-12       Impact factor: 4.412

4.  New insights on thioredoxins (Trxs) and glutaredoxins (Grxs) by in silico amino acid sequence, phylogenetic and comparative structural analyses in organisms of three domains of life.

Authors:  Soumila Mondal; Shailendra P Singh
Journal:  Heliyon       Date:  2022-09-27

Review 5.  Selective redox signaling shapes plant-pathogen interactions.

Authors:  Jade R Bleau; Steven H Spoel
Journal:  Plant Physiol       Date:  2021-05-27       Impact factor: 8.340

6.  Molecular Basis for the Interactions of Human Thioredoxins with Their Respective Reductases.

Authors:  Md Faruq Hossain; Yana Bodnar; Calvin Klein; Clara Ortegón Salas; Elias S J Arnér; Manuela Gellert; Christopher Horst Lillig
Journal:  Oxid Med Cell Longev       Date:  2021-06-01       Impact factor: 6.543

7.  Inhibition of microglial receptor-interacting protein kinase 1 ameliorates neuroinflammation following cerebral ischaemic stroke.

Authors:  Yang Jiao; Jianjian Wang; Huixue Zhang; Yuze Cao; Yang Qu; Siyu Huang; Xiaotong Kong; Chang Song; Jie Li; Qian Li; Heping Ma; Xiaoyu Lu; Lihua Wang
Journal:  J Cell Mol Med       Date:  2020-09-29       Impact factor: 5.310

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.