| Literature DB >> 27898009 |
Abstract
Posttranslational modifications are an important feature of most proteases in higher organisms, such as the conversion of inactive zymogens into active proteases. To date, little information is available on the role of glycosylation and functional implications for secreted proteases. Besides a stabilizing effect and protection against proteolysis, several proteases show a significant influence of glycosylation on the catalytic activity. Glycans can alter the substrate recognition, the specificity and binding affinity, as well as the turnover rates. However, there is currently no known general pattern, since glycosylation can have both stimulating and inhibiting effects on activity. Thus, a comparative analysis of individual cases with sufficient enzyme kinetic and structural data is a first approach to describe mechanistic principles that govern the effects of glycosylation on the function of proteases. The understanding of glycan functions becomes highly significant in proteomic and glycomic studies, which demonstrated that cancer-associated proteases, such as kallikrein-related peptidase 3, exhibit strongly altered glycosylation patterns in pathological cases. Such findings can contribute to a variety of future biomedical applications.Entities:
Keywords: Michaelis constant; N-glycosylation; O-glycosylation; core glycan; enzyme kinetics; flexible loops; secreted protease; sequon; substrate recognition; turnover number
Mesh:
Substances:
Year: 2016 PMID: 27898009 PMCID: PMC5187769 DOI: 10.3390/ijms17121969
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Figure 1Examples of the most relevant types of glycosylation according to the literature [3,27]. (A) N-glycosylation of asparagine in sequons with the consensus sequence Asn-Xaa-Ser/Thr. N-glycans are generated by trimming and extending the common precursor GlcNAc2Man9Glu3. Small core glycans are mostly intermediates in mammalian glycan synthesis, but often occur in more primitive eukaryotes and insects, as used for recombinant expression. Mammalian N-glycans exhibit an enormous diversity, due to many possible combinations of branching sugars; (B) O-glycosylation at Ser and Thr is found in all kingdoms of life. There is no distinct consensus sequence, but proline-rich regions are favored, e.g., a typical O-glycan site would be Pro-Ser/Thr-Xaa-Yaa-Pro. A very common mammalian O-glycan is the mucin-type that starts with GalNAc and is extended by galactose and sialic acids or GlcNAc, with eight different cores known. In addition, the O-xylose linked, non-branched glucosamine glycans (GAG) or proteoglycans are a large and diverse glycan family. The displayed chondroitin can be phosphorylated and heavily sulfated, comprising up to fifty disaccharide units. O-GlcNAc glycans occur inside cells, even in the nucleus, while O-galactosylation is found at hydroxylysine residues (Hyl) of collagens.
Human aspartic and cysteine proteases with experimentally confirmed or predicted glycosylation sites (O-glycosylation indicated in parentheses as “o”). MEROPS families are A1, A28 (clan AA), A22 (clan AD), C1, C12 (clan CA), C13 (clan CD), and C26 (PC). UniProtKB entries refer to the first example, following ones continue numerical or alphabetical.Selected structures are shown with Protein Data Bank (PDB) code in bold and numbers of N- and O-glycans per monomer.
| Family | UniProtKB | Aspartic Proteases | PDB ( |
|---|---|---|---|
| A1 | BACE1_HUMAN | β-Secretase 1, 2 | - |
| RENI_HUMAN | Renin | ||
| CATD_HUMAN | Cathepsin D( | D: | |
| NAPSA_HUMAN | Napsin-A | - | |
| A22 | SPP2A_HUMAN | Signal peptide peptidase-like 2A, 2B, 2C | - |
| HM13_HUMAN | Minor histocompatibility antigen H13 | - | |
| A28 | APRV1_HUMAN | Retroviral-like aspartic protease 1 | - |
| C1 | CATB_HUMAN | Cathepsin B, C, F, H, K, L2, S, W, Z | C: |
| C12 | UCHL1_HUMAN | Ubiquitin C-terminal hydrolase isozyme L1( | - |
| C13 | LGMN_HUMAN | Legumain | |
| C26 | GGH_HUMAN | γ-Glutamyl hydrolase | - |
Human metalloproteases with glycosylation sites (o: O-glycans) according to UniProtKB. MEROPS families are M1, M2, M10, M12, M13, M43 (clan MA), M14 (MC), M16 (ME), M19 (MJ), M20, M28 (MH), M24 (MG) and M50 (MM). M87 and M96 have not been assigned to a clan. UniProtKB entries refer to the first example, following ones continue numerical or alphabetical. Selected structures are shown with PDB code in bold and numbers of N- and O-sites per monomer. MB: membrane-bound.
| Family | UniProtKB | Metallo Proteases | PDB ( |
|---|---|---|---|
| M1 | AMPE_HUMAN | Aminopeptidase E, N( | E: |
| ERAP1_HUMAN | Endoplasmic reticulum aminopeptidase 1, 2 | 1: | |
| LCAP_HUMAN | Leucyl-cystinyl aminopeptidase | ||
| TRHDE_HUMAN | TRH-degrading ectoenzyme | - | |
| M2 | ACE_HUMAN | Angiotensin-converting enzyme 1, ACE 2 | |
| M10 | MMP1_HUMAN | MMP 1–3, 8, 9( | - |
| M12 | ADAM2_HUMAN | ADAM 2, 7–10, 12, 15, 17–23, 28–30, 32, 33 | 33: |
| ADEC1_HUMAN | ADAM DEC1 | - | |
| ATS1_HUMAN | ADAM-TS 1–4, 5( | 5: | |
| BMP1_HUMAN | Bone morphogenetic protein 1 | - | |
| TLL1_HUMAN | Tolloid-like protein 1, 2 | - | |
| MEP1A_HUMAN | Meprin A subunit α, β( | ||
| M13 | ECE1_HUMAN | Endothelin-converting enzyme 1, 2, -like 1 | - |
| NEP_HUMAN | Neprilysin | ||
| KELL_HUMAN | Kell protein | - | |
| MMEL1_HUMAN | Membrane metallo-endopeptidase-like 1 | - | |
| PHEX_HUMAN | Phosphate-regulating neutral endopeptidase | - | |
| M14 | CBPA4_HUMAN | Carboxypeptidase A4, A6, B2, D, E | A4: |
| Carboxypeptidase M, N( | M: | ||
| M16 | IDE_HUMAN | Insulin-degrading enzyme | - |
| M19 | DPEP1_HUMAN | Dipeptidase 1, 2, 3 | 1: |
| M20 | P20D1_HUMAN | Carboxypeptidase PM20D1 | - |
| CNDP1_HUMAN | β-Ala-His dipeptidase | - | |
| M24 | MAP2_HUMAN | Methionine aminopeptidase 2( | - |
| XPP2_HUMAN | Xaa-Pro aminopeptidase 2 | - | |
| M28 | CBPQ_HUMAN | Carboxypeptidase Q | - |
| ERMP1_HUMAN | Endoplasmic reticulum metallopeptidase 1 | - | |
| FOLH1_HUMAN | Glutamate carboxypeptidase 2 | ||
| NALD2_HUMAN | NAALADase 2, NAALADaseL | 2: | |
| M43 | PAPP1_HUMAN | Pappalysin 1, 2 | - |
| M50 | MBTP1_HUMAN | MB transcription factor site protease 1, 2 | - |
| M87 | CLCA1_HUMAN | Ca-activated chloride channel regulator 1, 2, 4 | - |
| M96 | TIKI1_HUMAN | Metalloprotease TIKI1, TIKI2 | - |
Glycosylated human serine protease family S1 (clan PA) members (o: O-glycans). UniProtKB entries refer to the first example, following ones continue numerical or alphabetical. Selected structures are shown with PDB code in bold and numbers of N- or O-glycosylation sites per monomer. SP: serine protease.
| Family | UniProtKB | Serine Proteases | PDB ( |
|---|---|---|---|
| S1 | C1R_HUMAN | Complement C1r, C1s, C2 C1r subcomponent-like protein | 1r: |
| CFAB_HUMAN | Complement factor B, I | B: | |
| CELA1_HUMAN | Chymotrypsin-like elastase family A1, 3A, 3B | - | |
| CMA1_HUMAN | Chymase | ||
| CORIN_HUMAN | Atrial natriuretic peptide-converting enzyme | - | |
| CTRC_HUMAN | Chymotrypsin-C, CTR-like protease 1 | - | |
| ELNE_HUMAN | Neutrophil elastase | ||
| ENTK_HUMAN | Enteropeptidase (Enterokinase) | ||
| FA7_HUMAN | Coagulation factor V, VII( | 7: | |
| Coagulation factor XI, XII( | 11: | ||
| GRAA_HUMAN | Granzyme A, B, H, M | B: | |
| HABP2_HUMAN | Hyaluronan-binding protein 2 | - | |
| HEPS_HUMAN | Serine protease hepsin | - | |
| HGFA_HUMAN | Hepatocyte growth factor activator | ||
| KLK1_HUMAN | Kallikrein-related peptidase 1( | 1: | |
| 6–13, 15 | 5: | ||
| KLKB1_HUMAN | Plasma kallikrein | - | |
| MASP1_HUMAN | Mannan-binding lectin serine protease 1 | ||
| NETR_HUMAN | Neurotrypsin | - | |
| OVCH1_HUMAN | Ovochymase-1, 2 | - | |
| PLMN_HUMAN | Plasminogen( | ||
| POLS2_HUMAN | Polyserase-2 | - | |
| PROC_HUMAN | Vitamin K-dependent protein C( | - | |
| PRS23_HUMAN | SP 23, 27, 29, 35, 38, 41, 42, 44, 45, 47, 48, 55–58 | 57: | |
| PRSS8_HUMAN | Prostasin | - | |
| PRTN3_HUMAN | Myeloblastin | ||
| ST14_HUMAN | Matriptase | - | |
| TEST_HUMAN | Testisin | - | |
| THRB_HUMAN | Prothrombin | ||
| TMPS2_HUMAN | Transmembrane (TM) protease serine 2–7, 9 | - | |
| TMPSC_HUMAN | TM protease serine 12, 13 | - | |
| TM11A_HUMAN | TM protease serine 11A, 11B, 11D, 11E | - | |
| TPA_HUMAN | Tissue-type plasminogen activator( | - | |
| TRYB1_HUMAN | Tryptase α/β-1, β-2, γ, δ | B1: | |
| UROK_HUMAN | Urokinase-type plasminogen activator( |
Glycosylated human serine and threonine proteases (o: O-glycans). Included are families S8, S53 (clan SB), S9, S10, S28 (SC), S60 (SR), T1 and T3 (clan PB) according MEROPS. UniProtKB entries refer to the first example, following ones continue numerical or alphabetical. Selected PDBs with glycans are indicated with numbers of N- or O-glycosylation sites per monomer. PPC: Proprotein convertase.
| Family | UniProtKB | Serine Proteases | PDB ( |
|---|---|---|---|
| S8 | FURIN_HUMAN | Furin | - |
| NEC1_HUMAN | Neuroendocrine convertase 1( | - | |
| PCSK4_HUMAN | PPC subtilisin/kexin type 4–7, 9 | 9: | |
| S9 | DPP4_HUMAN | Dipeptidyl peptidase 4 | |
| SEPR_HUMAN | Prolyl endopeptidase FAP | ||
| S10 | PPGB_HUMAN | Cathepsin A | |
| RISC_HUMAN | Retinoid-inducible serine carboxypeptidase | - | |
| S28 | DPP2_HUMAN | Dipeptidyl peptidase | |
| PCP_HUMAN | Lysosomal Pro-X carboxypeptidase | ||
| TSSP_HUMAN | Thymus-specific serine protease | - | |
| S53 | TPP1_HUMAN | Tripeptidyl-peptidase 1 | |
| S60 | TRFL_HUMAN | Lactotransferrin | |
| T1 | PSA1_HUMAN | Proteasome subunit α type-1( | - |
| PSB1_HUMAN | Proteasome subunit β type-1( | - | |
| T3 | GGT1_HUMAN | γ-Glutamyltransferase 1, 3, 5–7 | 1: |
Figure 2Substrate recognition and catalytic steps in proteases. (A) Two-dimensional interaction map of the model peptide AWTRVR-SILMHY with the specificity subsites S6-S6′ of a KLK protease, calculated with the MOE software [114]. The tryptic specificity is based on the electrostatic interaction of P1-Arg and Asp189 in the S1 pocket. Chymotryptic proteases prefer hydrophobic P1 side chains, such as Tyr or Phe; (B) The aspartic protease mechanism requires a pair of Asp residues, with one being the general base that activates a water molecule as in BACE-1 [115]. After substrate binding, the nucleophilic water attacks the scissile bond between P1 and P1′ at the carbonyl C atom; (C) Three major metalloprotease mechanisms are known, such as the favored one for MMP-3 [116]; and (D) Catalysis of a serine protease with chymotrypsinogen numbering [117]. Other serine protease clans exhibit different arrangements of the triad with similar mechanisms. The catalytic triad activates the Ser Oγ as nucleophile via an acid (Asp102) and a general base (His57), which activates a water molecule for hydrolysis of the acyl intermediate [118]. In addition, cysteine protease mechanisms are related, but often require only Cys-His dyads, since the Sγ is more nucleophilic than the Ser Oγ [119]. Similarly, threonine proteases have a nucleophilic Thr Oγ, while the N-terminus and/or a Lys side chain serve as bases [120]. MEROPS lists variations of catalytic residues and rare protease types [45].
Figure 3Selected human proteases shown as molecular surface with glycans as spheres according to the standard sugar coloring scheme (Figure 1). Active site residues are colored dark red, while inhibitors or substrates are shown as ball-and-stick models. (A) The aspartic protease cathepsin D with two N-glycans distant from the active sit cleft (1LYA/1LYB); (B) The papain-like cysteine protease cathepsin C has four N-glycans, one is located near the substrate binding region (1K3B/2DJF); (C) The metalloprotease meprin β carries seven N-glycans, with the one at Asn254 located at the substrate binding cleft near the catalytic Zn2+ (4GWM); (D) Carboxypeptidase N is another Zn2+-dependent protease, with three O-glycans distant to the active site (2NSM); (E) The trypsin-like factor VII is an important activator in the blood coagulation cascade, exhibiting an N-glycan in the catalytic domain and two O-glycans in the EGF-like domain 1 (1QFK); (F) Plasmin, another trypsin-like protease degrades fibrin clots and is O-glycosylated between kringle domains 3 and 4 (4DUR); (G) The trypsin-like KLK1 exhibits one N-linked GlcNAc of a core glycan (1SPJ), which is sufficient to rigidify the flexible 99-loop (yellow) at the active sit cleft, filled with the PPACK inhibitor of the closely related KLK2 (4NFF), in order to explain the effect of glycosylation on substrate binding. In glycan-free KLK2, the 99-loop is more flexible and open; (H) The chymotrypsin-like KLK3/PSA carries a triantennary N-glycan, which may enhance binding of natural substrates and inhibitors (3QUM/2ZCK). The flexible 99-loop resembles the one of KLK1 (Figure 3G) and is depicted in yellow; (I) DPP4 is a membrane anchored, dimeric α/β-hydrolase, with the shielded active site located inside a cavity. The glycans may prevent aggregation and could play a role in the receptor function of DPP4 (1N1M).
Figure 4Free energy profile for substrate turnover by the serine protease trypsin [118,144,148]. Various approaches find shifted energy levels and additional energy minima of intermediates and the enzyme product complex [149,150]. The y-axis represents the Gibb´s free energy of the process (∆G), which is temperature dependent and related to the reaction enthalpy and entropy: ∆G = ∆H − T∆S. The reaction coordinate represents the progress of the reaction, not a real time process. E, S, and P denote enzyme, substrate and products, while TS1 to TS4 are transition states. Direct conversion of k and KM into free energy values is not feasible, while they show an inverse correlation, e.g., a high kcat with a lower free activation energy ∆G‡. Currently, the influence of glycosylation on the single mechanistic steps can only be estimated, but differences in the binding energy of the transition states can be calculated as for mutant enzymes [151]. For the substrate Bz-Pro-Phe-Arg-pNA with the kcat/KM values of N-glycosylated KLK2 (76,930 M−1·s−1) and of glycan-free KLK2 (5780 M−1·s−1) with ∆G = −RT·ln([kcat/KM]glyc/[kcat/KM]) the result is−6.7 KJ·mol−1 [140].
Figure 5The effect of N-glycosylation at Asn95 on the active site conformation of KLK2 according to crystal structure derived models and kinetic data [140]. (A) Glycan-free KLK2 expressed in E. coli exhibits a wide open 99-loop and access of substrates, depicted as green ball-and-stick model bound to the specificity subsites (S4 to S2′ specificity subsites are labeled); (B) N-glycosylation at Asn95 favors a closed 99-loop, which covers the non-prime side region, left to Ser195 (dark red) in the standard orientation, which prevents substrate binding; (C) The N-glycosylated 99-loop of KLK2 opens to a lesser extent than in the glycan-free variant. Thus, substrate binding to glycosylated KLK2 requires more free energy, resulting in a lower kon rate and higher KM.
Figure 6Effects of glycosylation on human proteases. After protein synthesis (A) unfolded proteases enter the endoplasmic reticulum (ER), N-glycans are linked (B); which supports folding (C); The N-glycans are trimmed and extended, accompanied by quality control (D); N-glycans are further modified and O-glycans are attached in the Golgi (E); Sorting leads to membrane anchoring or storage in secretory vesicles (F); After secretion, glycosylation prevents aggregation and unspecific binding (G); as well as proteolysis, which increases the stability and lifetime of proteases (H); Glycosylation regulates binding of: activators (I); cofactors (J) oligomer partners (K); inhibitors (L); and substrates (M); Eventually, glycosylation fine tunes turnover and kinetic parameters in enzymatic reactions (N) [183].