| Literature DB >> 22384068 |
Maike Gallwitz1, Mattias Enoksson, Michael Thorpe, Lars Hellman.
Abstract
Thrombin is one of the most extensively studied of all proteases. Its central role in the coagulation cascade as well as several other areas has been thoroughly documented. Despite this, its consensus cleavage site has never been determined in detail. Here we have determined its extended substrate recognition profile using phage-display technology. The consensus recognition sequence was identified as, P2-Pro, P1-Arg, P1'-Ser/Ala/Gly/Thr, P2'-not acidic and P3'-Arg. Our analysis also identifies an important role for a P3'-arginine in thrombin substrates lacking a P2-proline. In order to study kinetics of this cooperative or additive effect we developed a system for insertion of various pre-selected cleavable sequences in a linker region between two thioredoxin molecules. Using this system we show that mutations of P2-Pro and P3'-Arg lead to an approximate 20-fold and 14-fold reduction, respectively in the rate of cleavage. Mutating both Pro and Arg results in a drop in cleavage of 200-400 times, which highlights the importance of these two positions for maximal substrate cleavage. Interestingly, no natural substrates display the obtained consensus sequence but represent sequences that show only 1-30% of the optimal cleavage rate for thrombin. This clearly indicates that maximal cleavage, excluding the help of exosite interactions, is not always desired, which may instead cause problems with dysregulated coagulation. It is likely exosite cooperativity has a central role in determining the specificity and rate of cleavage of many of these in vivo substrates. Major effects on cleavage efficiency were also observed for residues as far away as 4 amino acids from the cleavage site. Insertion of an aspartic acid in position P4 resulted in a drop in cleavage by a factor of almost 20 times.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22384068 PMCID: PMC3288055 DOI: 10.1371/journal.pone.0031756
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Comparison of selected studies since 1981 establishing the substrate recognition sequence of thrombin.
| Study(Reference) | Method | P4 | P3 | P2 | P1 | P1′ | P2′ | P3′ | P4′ | Remarks |
| Pozsgay 1981 (7) | 35 pNA chromogenicsubstrates | n.d. |
|
| (R) | n.d. | n.d. | n.d. | n.d. | Subsite coopera-tivity |
| Lottenberg 1983 (8) | 24 pNA chromogenicsubstrates | n.d. |
|
|
| n.d. | n.d. | n.d. | n.d. | |
| Chang 1985 (12) | Polypeptide hormones and derivatives |
|
|
|
|
|
| - | - | Natural peptides selected by homology |
| Chang 1985 (36) | Digestion of mouse kappa light chains | - | - |
|
|
| - | - | - | to chromo-genic substrates |
| Kawabata 1988 (11) | Boc-XZR-NH-Med(X:12, Z:15) | n.d. |
|
| (R) | n.d. | n.d. | n.d. | n.d. | Not all aa represented |
| Le Bonniec 1991 (13) | 1) 17 pNA chromogenic substrates | n.d. |
|
|
| n.d. | n.d. |
| n.d. | |
| 2) Mutagenesis of peptides corresponding to protein C P7-P5′ | (V) |
| (P) | (R) | (L) | (I) |
| (G) | Subsite coopera-tivity | |
| Ebert 1991 (20) | Mutagenesis of R to S or N in fibrinogen Aα | (G) | (G) | (V) | (R) | (G) | (P) |
| (V) | Only R, S or N in P3′ |
| Theunissen 1993 (17) | Mutagenesis of antithrombin-III (pseudosubstrate) | (I) |
| (G) | (R) |
| (L/V) |
| (P) | Subsite coopera-tivity |
| Le Bonniec 1996 (18) | 21 fluorescence-quenched substrates(Abz-VGPRSXXLK(Dnp)D) | (V) | (G) | (P) | (R) | (S) |
|
| (L) | R not among 10 repre-sented aa |
| Vindigni 1997 (37) | 8 pNA chromogenic substrates (P1: R/K; P2: P/G; P3: V/F) | n.d. |
|
|
| n.d. | n.d. | n.d. | n.d. | Subsite coopera-tivity |
| Marque 2000 (19) | 38 fluorescence-quenched substrates(Abz-VGPRSXXLK(Dnp)D) | (V) | (G) | (P) | (R) | (S) |
|
| (L) | X is not C |
| Backes 2000 (15) | Fluorogenic substrate library, 6859 members(Ac-XXXK-AMC) |
|
|
| (K) | n.d. | n.d. | n.d. | n.d. | P1: K |
| Petrassi 2005 (14) | 1) Fluorogenic positional scanning; 6 wells à 361 substrates |
|
|
|
| n.d. | n.d. | n.d. | n.d. | X is not C |
| Petrassi 2005 (14) | 2) Biased donor-quencher library; 19 sublibraries à 6859 members (LTPRXXXX) | (L) | (T) | (P) | (R) |
|
|
|
| No specific P3′ and P4′ preference found |
|
| Phage-displayed 9-mer library, ∼5×107 members |
|
|
|
|
|
|
|
| Subsite coopera-tivity |
Letters in bold indicates investigated positions, residues that were held constant are in parentheses. The preferred amino acids are denoted in the order of preference. Equally favorable residues are indicated by the absence of a slash (/). n.d., not determined; -, not applicable; pNA, para-nitroanilide.
Figure 1Alignment of sequences obtained after five selection rounds with 1 U of thrombin or 0.2 U of thrombin, compared to natural substrates.
Panel A shows the result with 1 U of thrombin, panel B the result with 0.2 U of thrombin and panel C a panel of natural substrates. The P1 residue in natural substrates (after which cleavage occurs) is denoted in parentheses. Substrate sequences refer to Homo sapiens where not indicated otherwise.! marks phage sequences that have LTP instead of LTP in the N-terminal flank. Residues from the non-randomized phage region are in italics. *, from Rattus norvegicus; IGFBP, insulin-like growth factor-binding protein; PAR, protease-activated receptor. The cleavage site of thrombin in the natural substrates listed in panel C is numbered from the N terminal of the pre-pro protein, from the first methionine. This list of natural substrates is a selection of a few of the most well known substrates of this enzyme. However, the list of potential in vivo substrates is much longer and includes many other proteins such as protein S, TAFI, antithrombin, heparin cofactor II and nexin I.
Figure 2Amino acid frequency in positions P4 to P4′ of thrombin-susceptible phage sequences.
This analysis is based on the alignments shown in Figure 1A and 1B. For clarity, amino acids are displayed in functional groups, starting to the left with aromatic residues, and ending with acidic residues to the right.
Figure 3Analysis of the cleavage specificity by the use of new types of recombinant protein substrate.
Panel A shows the overall structure of the recombinant protein substrates used for analysis of the efficiency in cleavage by thrombin. In these substrates two thioredoxin molecules are positioned in tandem and the proteins have a His6-tag positioned in their C termini. The different cleavable sequences are inserted in the linker region between the two thioredoxin molecules by the use of two unique restriction sites, one Bam HI and one SalI site, which are indicated in the bottom of panel A. Panels B to E shows the cleavage of a number of substrates by thrombin, where individual amino acids has been changed from the thrombin consensus sequence. The name and sequence of the different substrates are indicated above the pictures of the gels. The time of cleavage (in minutes) is also indicated above the corresponding lanes of the different gels. The uncleaved substrates have a molecular weight of approximately 25 kDa and the cleaved substrates appear as two closely located bands with a size of 12–13 kDa.
Amino acid summary of thrombin cleavage sequences.
| Variant | Cleavable sequence | Variant | Cleavable sequence |
| Consensus |
| 11 |
|
| 1 |
| 12 |
|
| 2 |
| 13 |
|
| 3 |
| 14 |
|
| 4 |
| 15 |
|
| 5 |
| 16 |
|
| 6 |
| 17 |
|
| 7 |
| 18 |
|
| 8 |
| 19 |
|
| 9 |
| ||
| 10 |
|
Amino acids shown in bold and larger font are deviations from the preferred thrombin consensus sequence. The cleavage efficiency compared to the consensus is shown as a percentage.
Figure 4Analysis of the cleavage specificity by the use of new types of recombinant protein substrate.
Panels A to D shows the cleavage of a number of substrates by thrombin, where individual amino acids has been changed from the thrombin consensus sequence. The name and sequence of the different substrates are indicated above the pictures of the gels. The time of cleavage (in minutes) is also indicated above the corresponding lanes of the different gels. Variants 16 is the Protein C cleavage site R211, variant 17 is the Prothrombin site R327, variant 18 is the Prothrombin site R200 and variant 19 is the Fibrinogen A alpha site R35 (see Figure 1).
Potential novel thrombin substrates.
| ID/Functional grouping | Name | (Presumable) function | Motif | Positions | Location in protein | Protein expression |
| P11230A | Acetylcholin receptor subunit β | Synaptic transmission | PRGGR | 224–228 | 24–244 ED (multi-pass) | Neurons, muscle |
| Q6UY14A B C | ADAMTS-like protein 4 | Stimulates apoptosis | PRGIR | 424–428 | Secreted | Lung, plasma, placenta, |
| Q9UKB5 | Adherens junction-associated protein 1 | Cell adhesion and migration | PRARR | 96–100 | 1–282 ED | Uterus, pancreas |
| O00253A | Agouti-related protein | Weight homeostasis | PRSSR | 81–85 | Secreted | Brain, testis lung, kidney |
| Q9BXJ7A B C | Amnionless protein | Vit. B12 absorption, directs trunk mesoderm | PRSSR | 195–299 | 20–357 ED | Kidney, testis, thymus, PBL, colon, small intestine |
| Q9Y5L1 | Angiopoietin-related protein 3 | Cell-matrix adhesion, lipid metabolism, angiogenesis | PRAPR | 220–224 | Secreted | Liver (kidney) |
| Q6SPF0A B | Atherin | Atherogenesis by immobilizing LDL in arterial wall | PRAPR | 112–116 | Cytoplasmic/secreted | Atherosclero-tic lesions |
| P01160A B | Atrial natriuretic factor | Cardiovascular homeostasis | PRSLR | 122–126 | Secreted; 56–122 propeptide | |
| O14514 | Brain-specific angiogenesis inhibitor 1 | Inhibits angiogenesis in brain; cell adhesion, signal transduction | PRSLR | 861–865 | 31–948 ED(7-TM) | Brain |
| Q9NYQ6 | Cadherin EGF LAG seven-pass G-type receptor 1 | Cell-cell signalling during formation of the nervous system, planar polarity | PRAPR | 58–62 | 22–2469 ED | |
| Q9NYQ7 | Cadherin EGF LAG seven-pass G-type receptor 3 | Cell-cell signalling during formation of the nervous system | PRTARPRGAR | 249–2532337–2341 | 33–2540 ED | |
| Q96IY4A B | Carboxypeptidase B2 (Thrombin-activable fibrinolysis inhibitor) | Cleaves kinins and anaphylatoxins | PRTSR | 33–37 | Secreted; 23–114 activation peptide | Plasma; synthesized in liver |
| Q16619A B | Cardiotrophin-1 | Induces cardiac myocyte hypertrophy | PRAPR | 113–117 | Secreted | Heart, ovary, prostate, skeletal muscle |
| Q9H2X0A B C | Chordin | Key developmental dorsalizing factor | PRGCR | 869–873 | Secreted | Early vertebrate tissues |
| P02452 | Collagen α1(I) chain | Epidermis and skeletal development | PRGPR | 119–123 | Secreted | Brain, spleen, tendon, ligaments, bones |
| P02461 | Collagen α1(III) chain (Goodpasture antigen) | Homotrimers in most soft connective tissues | PRGNR | 1165–1169 | Secreted | Skin, placenta, liver |
| P20908 | Collagen α1(V) chain | Binds DNA, heparan sulfate, heparin, thrombospondin, insulin | PRGQR | 908–912 | Secreted | Nearly ubiquitous |
| P12107 | Collagen α1(XI) chain | Trimer α1(XI), α2(XI), α3(XI) may control growth of collagen II fibrils | PRGQRPRGSR | 878–882887–891 | Secreted | Cartilage, placenta |
Hits holding the consensus P-R-[AGST]-[not DE]-R in an extra-cellular or secreted part are listed alphabetically by name. In the column to the left, A in bold and larger font indicates (presumable) involvement in cell adhesion, B in the nervous system, C in the cardiovascular system, and D in development/differentiation. ED, extra-cellular domain; 7-TM, seven-transmembrane receptor; PBL, peripheral blood leukocytes.
Potential novel thrombin substrates.
| ID/Functional grouping | Name | (Presumable) function | Motif | Positions | Location in protein | Protein expression |
| P12110 | Collagen α2(VI) chain | Cell-binding protein | PRGPR | 568–572 | Secreted | Fibroblasts, placenta, uterus |
| P113942 | Collagen α2(XI) chain | Trimer α1(XI), α2(XI), α3(XI) may control growth of collagen II fibrils | PRSARPRGQR | 176–180845–849 | Secreted | Cartilage |
| P12111 | Collagen α3(VI) chain | Cell-binding protein | PRGNR | 2366–2370 | Secreted | Fibroblasts, placenta, plasma |
| Q14050 | Collagen α3(IX) chain | Structural component of hyaline cartilage | PRGLR | 230–234 | Secreted | Skin, cartilage |
| Q9UQ03A | Coronin-2B | Reorganization of neuronal actin structure | PRAAR | 397–401 | Brain | |
| Q14118 | Dystroglycan | Cell-matrix interaction; laminin receptor; target for M. leprae | PRTPR | 453–57 | Secreted;30–653 α-dystroglycan | Skeletal muscle |
| P27539A | Embryonic growth/differentiation factor 1 | Embryonic tissue differentiation | PRSLR | 210–214 | Secreted; 30–253 propeptide | Brain |
| P05305A B | Endothelin-1 | Vasoconstriction | PRSKR | 88–92 | Secreted;90 end of big endothelin 1 | Lung, placenta |
| Q06828 | Fibromodulin | Rate of fibrils formation, binds to collagen type I and II | PRSLR | 175–179 | Secreted;168–188LRR-5 2 | |
| P41439A B C | Folate receptor γ | Binds folate | PRSAR | 22–26 | Secreted,1–23 signal sequence | Spleen, thymus, bone marrow; ovary and uterine carcinoma |
| P09681A B C D | Gastric inhibiory polypeptide | Stimulates insulin release, inhibits gastric acid secretion | PRGPR | 47–51 | Secreted;22–50 propetide | |
| O60391A | Glutamate (NMDA receptor subunit 3B | Ion channel in prosynaptic membrane | PRALR | 469–473 | 23–564 ED | Brain, motorneurons |
| Q9NZ20A B C D | Group 3 secretory phospholipase A2 | Phospholipid metabolism | PRAIR | 444–448 | Secreted | Kidney, heart, liver, skeletal muscle |
| Q96RW7 | Hemicentin-1 (Fibulin-6) | Part of ECM | PRGYR | 5301–5305 | 5272–5307 EGF-like 5; Ca2+-binding | Fibroblasts, retinal pigment epithelium |
| Q86UW8 | Hyaluron and proteoglycan link protein 4 | Binds to hyaluronic acid, formation of ECM | PRGGR | 169–173 | Secreted;163–268 link 1 | Brain |
| Q9UMF0 | ICAM-5 | Binds to LFA-1 | PRAPR | 629–633 | 32–825 ED | Brain |
| Q969P0A | Immunoglobulin superfamily member 8 | Cell motility and proliferation, NS development, fertilization | PRSHR | 524–528 | 28–579 ED; 431–560 Ig-like C2 type 4 | Brain, kidney, testis, liver, placenta |
| Q13349 | Integrin αD (CD11d) | Receptor for ICAM-3 and VCAM1, lipoprotein clearance, antigen clearance | PRGQR | 497–501 | 18–1100 ED | Blood cells |
| P38570 | Integrin αE pre-cursor | PRTKR | 58–62 | 19–1124 ED | Intraepithelial T cells | |
| P20701 | Integrin αL (CD11a) | αL/β2 is receptor for ICAM-1, -2, -3 and –4; cytotoxicity | PRAGR | 39–43 | 26–1090 ED | Leukocytes |
| P06756 | Integrin αV(CD51) | Receptor for vitronectin, fibronectin, fibrinogen, prothrombin, laminin, thrombospondin | PRAAR | 274–278 | 31–992 ED |
Hits holding the consensus P-R-[AGST]-[not DE]-R in an extra-cellular or secreted part are listed alphabetically by name. In the column to the left, A in bold and larger font indicates (presumable) involvement in cell adhesion, B in the nervous system, C in the cardiovascular system, and D in development/differentiation. ED, extra-cellular domain; 7-TM, seven-transmembrane receptor; PBL, peripheral blood leukocytes.
Potential novel thrombin substrates.
| ID/Functional grouping | Name | (Presumable) function | Motif | Positions | Location in protein | Protein expression |
| P20702 | Integrin αX (CD11c) | Receptor for fibrinogen | PRGWR | 498–502 | 20–1107 ED | Monocytes, granulocytes |
| P16144 | Integrin β4 precursor (CD104) | α6/β4 is receptor for laminin | PRGLR | 378–382 | 28–710 ED | Epithelia |
| Q9Y6N6 | Laminin subunit γ3 | Organization of embryonic cells into tissues | PRSGR | 443–447 | Secreted; 430–479EGF-like 4 | Skin, heart, reproductive tracts, lung |
| O15230 | Laminin subunit α5 | Organization of embryonic cells into tissues | PRSSR | 3372–3376 | Secreted; 3340–3513 laminin-like 4 | Heart, muscle, lung, placenta, kidney, retina, pancreas |
| O75610A B C | Left-right determination factor 1 | Left-right axis determination | PRSAR | 138–142 | Secreted; 132–135 R-X-X-R site | Colon, pancreas, spleen |
| Q9NT99A B C D | Leucin-rich repeat-containing protein 4B | PRSSR | 543–547 | 36–576 ED | ||
| Q9NZU1 | Leucin-rich repeat transmembrane protein FLRT1 | Cell adhesion, receptor signaling | PRSLR | 98–102 | 21–524 ED78–98 LRR299–121 LRR3 | Kidney, brain |
| Q9NZR2A | Low-density lipoprotein receptor-related protein 1B | Receptor-mediated endocytosis | PRSAR | 2605–2609 | ED 25–4444; 2590–2626 LDL receptor class A13 | Thyroid and salivary gland; adult and fetal brain |
| Q9NPA2 | Matrix metalloproteinase-25 | Activate progelatinase 1 | PRAPR | 515–519 | Mature form 108–539 | Leukocytes, lung, spleen |
| P58417A | Neurexophilin-1 | Resembles neuropeptides | PRAKR | 102–106 | Secreted; 98–176. region III | Brain |
| Q9UM47A B C | Neurogenic locus notch homolog protein 3 (Notch 3) | Regulates cell-fate determination | PRGFRPRGPRPRARR | 109–1131308–13121567–1571 | ED 40–1643 78–118 EGF-like 21289–1325 EGF-like 33 | ubiquitous |
| Q99466A B | Neurogenic locus notch homolog protein 4 (Notch 4) | Regulates cell-fate determination. branching in the vascular system | PRGRR | 1911–1915 | 1432–2003 extracellular. truncation | Heart, lung, placenta |
| Q8N729A | Neuropeptide W | Regulates neuroendocrine signaling, stimulates water and food intake | PRSPR | 115–119 | Secreted | Substantia nigra, fetal kidney and trachea |
| Q14112 | Nidogen-2 | Cell-ECM interactions; binds to collagens I and IV, perlecan, laminin 1 | PRSAR | 145–49 | Secreted;198–273 NIDO domain | Heart, placenta, bone |
| Q8NG85A | Olfactory receptor 2L3 | Putative odorant receptor | PRSLR | 261–265 | 259–271 ED | |
| Q8NG80A | Olfactory receptor 2L5 | Putative odorant receptor | PRSLR | 261–265 | 259–271 ED | |
| Q8NGY9 | Olfactory receptor 2L8 | Putative odorant receptor | PRSLR | 261–265 | 259–271 ED | |
| A | ||||||
| O60542A | Persephin | Neurotropic activity, development of the CNS | PRGAR | 98–102 | Secreted | |
| Q8TCZ9A B C | Polycystic kidney and hepatic disease 1 | Differentiation of bile collecting duct, biliary differentiation | PRGGR | 3236–3240 | 24–3858 ED | Kidney, liver, pancreas |
| Q9P2E7 | Protocadherin-10 | Ca2+-dependent cell adhesion | PRTGR | 395–399 | 19–715 ED; 251–358 cadherin 3 | Brain. testis, ovary |
| Q9H158 | Protocadherin alpha C1 | Ca2+-dependent cell adhesion; specific neuronal connections in the brain | PRSAR | 571–575 | 19–683 ED; 570–667 cadherin 6 | Brain |
| O95206 | Protocadherin-8 | Ca2+-dependent cell adhesion | PRSGR | 301–305 | 30–749 ED; 247–354 cadherin 3 | Brain |
Hits holding the consensus P-R-[AGST]-[not DE]-R in an extra-cellular or secreted part are listed alphabetically by name. In the column to the left, A in bold and larger font indicates (presumable) involvement in cell adhesion, B in the nervous system, C in the cardiovascular system, and D in development/differentiation. ED, extra-cellular domain; 7-TM, seven-transmembrane receptor; PBL, peripheral blood leukocytes.
Potential novel thrombin substrates.
| ID/Functional grouping | Name | (Presumable) function | Motif | Positions | Location in protein | Protein expression |
| Q9Y6N7A | Roundabout homolog 1 (H-Robo-1) | Guides cellular migration, axon development | PRSHR | 115–19 | 26–897 ED;68–164 Ig-like C2-type 1 | Widely expressed (not in kidney) |
| Q9HCK4A | Roundabout homolog 2 (ROBO2) | Guides cellular migration, axon development | PRSHR | 78–82 | 22–859 ED;31–127 Ig-like C2 type 1 | Ovary, brain |
| Q96MS0A | Roundabout homolog 3 (ROBO3) | Guides cellular migration, axon development, spinal chord development | PRAHR | 115–119 | 21–891 ED;64–160 Ig-like C2 type 1 | |
| P09683A B C D | Secretin | Stimulates secretion of NaHCO3 from pancreas, inhibits release of gastric acid | PRARR | 23–27 | Secreted;19–26 propeptde | |
| O94933A | SLIT and NTRK-like protein 3 | Suppresses neurite outgrowth | PRTPR | 339–343 | 29–654 ED | Cerebral cortex |
| O60721A | Sodium/potassium/cal-cium exchanger 1 | Sensory transduction, vision | PRGRR | 192–196 | 1–452 ED | Retinal rod |
| Q99523A | Sortilin | Clearance receptor on cell surface, promotes neuronal apoptosis, receptor for neurotensin, osteogenesis | PRGGR | 70–74 | 34–77 propeptide | Brain, spinal chord, heart, thyroid, testis, placenta, skeletal muscle |
| A2VEC9 | SCO-spondin | Formation of the CNS, neuronal aggregation | PRGWR | 1579–1583 | Secreted; 1564–1600 LDL-receptor class A5 | |
| P22105 | Tenascin X | Cell-ECM interactions | PRAVR | 1500–1504 | Secreted; 1459–1540fibronectin type II7 | Fetal adrenal, testis, muscle |
| O95407A | Tumor necrosis factor receptor super family member 6B | Inhibits apoptosis | PRAGR | 246–250 | Secreted | Fetal brain, lung, liver, adult tissues |
| Q8WY21A | VPS10 domain-containing receptor SorCS1 | Neuropeptide receptor | PRTPR | 797–801 | ED 34–1099 | Fetal/infant brain, fetal retina |
| Q96PQ0A | VPS10 domain-containing receptor SorCS2 | Neuropeptide receptor | PRGVR | 419–423 | ED 51–1878 | Brain, kidney |
| Q9Y493 | Zonadhesin | Binding of sperm to egg | PRGLR | 2622–2626 | ED 18–2757 | Sperms |
| Q9BS86 | Zona pellucida-binding protein 1 | Gamete interaction | PRAFR | 50–54 | Secreted (45–351) | Testis |
Hits holding the consensus P-R-[AGST]-[not DE]-R in an extra-cellular or secreted part are listed alphabetically by name. In the column to the left, A in bold and larger font indicates (presumable) involvement in cell adhesion, B in the nervous system, C in the cardiovascular system, and D in development/differentiation. ED, extra-cellular domain; 7-TM, seven-transmembrane receptor; PBL, peripheral blood leukocytes.