| Literature DB >> 32151723 |
Alexander I Denesyuk1, Mark S Johnson2, Outi M H Salo-Ahen3, Vladimir N Uversky4, Konstantin Denessiouk3.
Abstract
(Chymo)trypsin-like serine fold proteases belong to the serine/cysteine proteases found in eukaryotes, prokaryotes, and viruses. Their catalytic activity is carried out using a triad of amino acids, a nucleophile, a base, and an acid. For this superfamily of proteases, we propose the existence of a universal 3D structure comprising 11 amino acids near the catalytic nucleophile and base - Nucleophile-Base Catalytic Zone (NBCZone). The comparison of NBCZones among 169 eukaryotic, prokaryotic, and viral (chymo)trypsin-like proteases suggested the existence of 15 distinct groups determined by the combination of amino acids located at two "key" structure-functional positions 54T and 55T near the catalytic base His57T. Most eukaryotic and prokaryotic proteases fell into two major groups, [ST]A and TN. Usually, proteases of [ST]A group contain a disulfide bond between cysteines Cys42T and Cys58T of the NBCZone. In contrast, viral proteases were distributed among seven groups, and lack this disulfide bond. Furthermore, only the [ST]A group of eukaryotic proteases contains glycine at position 43T, which is instrumental for activation of these enzymes. In contrast, due to the side chains of residues at position 43T prokaryotic and viral proteases do not have the ability to carry out the structural transition of the eukaryotic zymogen-zyme type.Entities:
Keywords: (Chymo)trypsin-like proteases; Catalytic triad; Structural framework; Structural motif
Mesh:
Substances:
Year: 2020 PMID: 32151723 PMCID: PMC7124590 DOI: 10.1016/j.ijbiomac.2020.03.025
Source DB: PubMed Journal: Int J Biol Macromol ISSN: 0141-8130 Impact factor: 8.025
Fig. 1Structure of the active site in (chymo)trypsin-like serine fold proteases. Amino acid numbers are taken as in Trypsin (PDB ID: 4I8H_A). The catalytic triad includes Asp102 (the catalytic acid), His57 (the catalytic base) and Ser195 (the catalytic nucleophile). The PROSITE “TRYPSIN_SER” pattern (PS00135; G–[DE]–S–G–[GS]) includes Gly193-Asp194-Ser195(cat. nucleophile)-Gly196-Gly197. The PROSITE “TRYPSIN_HIS” pattern (PS00134; [LIVM]-[ST]-A-[STAG]-H-C) includes Val53-Ser54-Ala55-Ala56-His57(cat. base)-Cys58. Two main-chain nitrogens, N/Gly193 and N/Ser195, are the two canonical oxyanions “N(oxyI)” and “N(oxyII)”. Gly43 and Val213 simultaneously interact with both the TRYPSIN_SER and TRYPSIN_HIS pattern, and thus constitute the “43/213 Nucleophile-Base Catalytic Zone” (43/213-NBCZone) of (chymo)trypsin-like serine fold proteases. The disulfide bond Cys42-Cys58 joins the elements of the “42/43 Base Catalytic Zone”, which includes the TRYPSIN_HIS pattern and the Cys42-Gly43 dipeptide. Two conserved structural water molecules in positions X and Y, HOH X and HOH Y, interact with the TRYPSIN_SER pattern and form the “Nucleophile-Base Catalytic Zone Conserved Extension” in eukaryotic serine (chymo)trypsin-like fold proteases. Structural data were visualized and analyzed using Discovery Studio [76] and Bodil [77]. Figures were drawn with MolScript [78] and Raster 3D [79].
Fig. 2(A) shows the “43/213 Nucleophile-Base Catalytic Zone” (43/213-NBCZone) of the “[ST]A Group” of (chymo)trypsin-like serine fold proteases (see Table 1). The 43/213-NBCZone, shown in (A), together with the “42/43 Base Catalytic Zone” shown in (B) constitute the entire Nucleophile-Base Catalytic Zone (NBCZone) of trypsin, which is the representative structure of the “[ST]A Group” (see Table 1). Unlike the “[ST]A Group”, shown in (A), where the 55T position of the 43/213-NBCZone is occupied by an alanine (Ala55 in panel A), in the “TN Group” enzymes, shown in (C), (D), (E) and (F), the 55T position is occupied by an asparagine (Asn196 in panel C; Asn218 in panel D; Asn171 in panel E and Asn38 in panel F), whose conformation is different in four different groups, named Sets I to IV, of the TN Group enzymes. In Sets I and II, respectively shown in (C) and (D), the ND2 atom of Asn55T takes part in the formation of the 43/213-NBCZone, while the OD1 atom of Asn55T does either form an Asx-turn with the catalytic histidine (as in C) or interacts with the main-chain oxygen atom of the catalytic acid (as in D). In Sets III and IV, respectively shown in (E) and (F), the OD1 atom of Asn55T takes part in the formation of the 43/213-NBCZone, while the ND2 atom of Asn55T interacts with either the catalytic acid (as in E) or base (as in F).
Geometrical parameters of interactions within the amino acid sets forming NBCZones in representative structures of nine (chymo)trypsin-like serine fold proteases groups.
| Protein | Organism | PDB ID resolution | Hydrogen bonds of amino acid at position 43T | Hydrogen bonds of amino acid at position 213T | Interactions of amino acids at positions 42T&58T | Ref. |
|---|---|---|---|---|---|---|
| Eukaryotic proteases | ||||||
| [ST]A group | ||||||
| Trypsin | 4I8H_A | N/G43-O/S195, 2.8 | N/V213-O/G197, 2.9 | CB/C42-O/S195, 3.3 (2.7) | ||
| Trypsinogen | 1TGT_A | N/G43-O/S195, 2.7 | N/V213-O/G197, 2.7 | CA/C42-O/S195, 3.4 (2.8) | ||
| Mannan-binding lectin serine protease 2 | 3TVJ_B | N/A469-O/S633, 2.9 | N/V653-O/G635, 2.9 | CB/A468-O/S633, 3.7 (2.8) | ||
| TN group | ||||||
| Serine protease HTRA2, mitochondrial | 5M3N_A | N/S183-O/S306, 3.0 | N/N321-O/G308, 2.9 | CA/G182-O/S306, 3.6 (2.9) | ||
| Serine protease HTRA1 | 3TJN_B | N/S205-O/S328, 3.1 | N/N343-O/G330, 3.0 | CA/G204-O/S328, 4.0 (3.6) | ||
| Protease Do-like 1, chloroplastic | 3QO6_A | N/S158-O/S282, 3.1 | N/N297-O/G284, 2.9 | CA/G157-O/S282, 4.1 (3.6) | ||
| Prokaryotic proteases | ||||||
| TN group | ||||||
| Serine protease Spl | 2AS9_A | N/T26-O/S158, 2.8 | N/V173-O/S160, 2.8 | CB/A25-O/S158, 3.6 (2.7) | ||
| 43&[STG]V group | ||||||
| Immunoglobulin A1 protease | 3H09_A | CG2/I86-O/S288, 3.4 (2.6) | N/Y308-O/S290, 2.8 | CG2/V101-O/S288, 3.6 (2.6) | ||
| Viral serine proteases | ||||||
| [KR]P group | ||||||
| Sindbis virus capsid protein | Sindbis virus | 1SVP_A | N/H128-O/S215A, 2.9 | N/V230-O/R217, 3.0 | CA/G127-O/S215A, 3.3 (2.7) | |
| Viral cysteine proteases | ||||||
| [TA]N group | ||||||
| Nuclear inclusion protein A | Tobacco etch virus | 1LVM_A | N/Y33-O/C151, 2.8 | N/H167-O/S153, 2.9 | CB/L32-O/C151, 3.5 (2.5) | |
| [ΨC][PQ] group | ||||||
| Hepatitis A protease 3C | Human hepatitis A virus | 2HAL_A | N/N30-O/C172, 3.0 | N/H191-O/G174, 2.9 | CB/M29-O/C172, 3.5 (2.4) | |
| 3Cl protease | Alpha-mesoni-virus 1 | 5LAC_B | N/R35-O/C153, 3.0 | N/H168-O/G155, 2.7 | CB/L34-O/C153, 3.6 (2.5) | |
| 43&[VR]N group | ||||||
| 2A proteinase | Coxsac-kievirus A16 | 4MG3_A | N/A | N/V124-O/G112, 2.8 | CD1/L22-O/C110, 3.9 (3.6) | |
| Inactive proteases | ||||||
| Eukaryotic proteases | ||||||
| T[TG] group | ||||||
| Propheno-loxidase activating factor-II | Holo-trichia diom-phalia | 2B9L_A | N/G186-O/G353, 2.6 | N/V374-O/S355, 2.9 | CB/C185-O/G353, 3.1 (2.5) | |
Sets “I-IV” refer to four subgroups of TN groups proteases with different orientation of Asn55T. The values within the parentheses indicate distances to hydrogen atoms.
Fig. 3(A) shows the 43/213-NBCZone of the “43&[STG]V Group” of (chymo)trypsin-like serine fold proteases, which is not found in eukaryotes (example of immunoglobulin A1 protease; see Tables 1 and S1). In these prokaryotic and viral proteins, the change in the course of the polypeptide chain at the position 43T leads to the replacement of the “key” canonical NH...O hydrogen bond (N/Gly43-O/cat.nucleophile in Fig. 2A) with a weak CH...O hydrogen bond (CG2/Ile86-O/cat.nucleophile in panel A), and the impossibility of forming a Cys42T-Cys58T disulfide bridge within the 42/43 Base Catalytic Zone. In (B), the viral “[KR]P group” is shown, where at the position 54T, a lysine or an arginine is found instead of a threonine or a serine. In (C) and (D), extension of the NBCZone in trypsin and trypsinogen, respectively, is shown due to either inclusion of two conserved structural water molecules at the positions X and Y of trypsin (as in C), or a side-chain oxygen atom (OD1 atom of Asp194 in trypsinogen) and one water molecule at the same spatial positions X and Y (as in D). (E) shows the extension of the NBCZone in the “TN Group” of (chymo)trypsin-like serine fold proteases (example of the chloroplastic protease Do-like 2; PDB ID: 5ILB), where the OG atom of Ser43T is located at position X instead of the structural water molecule found in the NBCZone extension of trypsin. (F) shows the NBCZone extension in the five “inactive” proteases (example of Heparin binding protein; PDB ID: 1A7S; see Tables S1 and S2), where instead of a glycine at position 197T there is a serine, threonine, or aspartate (Thr177 in F), whose side-chain OG1 atom substitutes the HOH Y water molecule at the NBCZone extension.
Geometrical parameters of interactions within the positions X and Y of active sites in representative structures of (chymo)trypsin-like serine fold proteases.
| Protein | Organism | PDB ID resolution | Nuc195 | Hydrogen bonds of water molecule or amino acid at position X | Hydrogen bonds of water molecule or amino acid at position Y | Ref. |
|---|---|---|---|---|---|---|
| Eukaryotic proteases | ||||||
| [ST]A group | ||||||
| Trypsin | 4I8H_A | Ser195 | HOH1015-O/G193, 2.8 | HOH1003-O/D194, 2.9 | ||
| Trypsinogen | 1TGT_A | Ser195 | OD1/D194-HOH701, 2.9 | HOH701-CB/D194, 3.4 (2.5) | ||
| Mannan-binding lectin serine protease 2 | 3TVJ_B | Ser633 | HOH6-O/G631, 2.8 | HOH10-O/D632, 2.9 | ||
| Kallikrein-4 | 4K8Y_A | S195 | HOH549-O/G193, 2.8 | HOH301-O/D194, 2.8 | ||
| Complement factor C2 | 2ODP_A | Ser659 | NH1/R473-O/G657, 3.1 | HOH960-O/E658, 2.8 | ||
| TN group | ||||||
| Protease Do-like 2, chloroplastic | 5ILB_A | Ser268 | OG/S145-O/G266, 2.7 | HOH798-O/N267, 2.8 | ||
| Prokaryotic proteases | ||||||
| TA group | ||||||
| Trypsin | 5KWM_A | Ser179 | HOH490-O/G177, 2.9 | HOH480-O/D194, 2.9 | ||
| VESB protease | 4LK4_A | S221A | OD1/D194-HOH455, 3.0 | HOH455-O/D220, 3.3 | ||
| 43&[STG][AV] group | ||||||
| Immunoglobulin A1 protease | 3H09_A | S288 | HOH1038-O/G286, 2.9 | OG/S290-O/D287, 2.6 | ||
| Viral serine proteases | ||||||
| TA group | ||||||
| Putative serine protease | Human astrovirus-1 | 2W5E_A | S551 | HOH2022-O/G549, 3.2 | CB/A553-O/M550, 3.1 (2.7) | |
| Viral cysteine proteases | ||||||
| [ΨC][PQ] group | ||||||
| EV71 3C protease | Enterovirus A71 | 3R0F_A | C147 | HOH186-O/G145, 2.8 | No HOH | |
| Inactive proteases | ||||||
| Eukaryotic proteases | ||||||
| TA group | ||||||
| Heparin binding protein | 1A7S_A | G175 | HOH451-O/G173, 2.9 | CB/T177-O/D174, 3.3 (2.7) | ||