| Literature DB >> 32920969 |
Neil D Rawlings1, Alex Bateman1.
Abstract
The MEROPS website (https://www.ebi.ac.uk/merops) and database was established in 1996 to present the classification and nomenclature of proteolytic enzymes. This was expanded to include a classification of protein inhibitors of proteolytic enzymes in 2004. Each peptidase or inhibitor is assigned to a distinct identifier, based on its biochemical and biological properties, and homologous sequences are assembled into a family. Families in which the proteins share similar tertiary structures are assembled into a clan. The MEROPS classification is thus a hierarchy with at least three levels (protein-species, family, and clan) showing the evolutionary relationship. Several other data collections have been assembled, which are accessed from all levels in the hierarchy. These include, sequence homologs, selective bibliographies, substrate cleavage sites, peptidase-inhibitor interactions, alignments, and phylogenetic trees. The substrate cleavage collection has been assembled from the literature and includes physiological, pathological, and nonphysiological cleavages in proteins, peptides, and synthetic substrates. In this article, we make recommendations about how best to analyze these data and show analyses to indicate peptidase binding site preferences and exclusions. We also identify peptidases where co-operative binding occurs between adjacent binding sites.Entities:
Keywords: binding site; enzyme classification; peptidase; proteolytic enzyme; substrate specificity
Year: 2020 PMID: 32920969 PMCID: PMC7737757 DOI: 10.1002/pro.3948
Source DB: PubMed Journal: Protein Sci ISSN: 0961-8368 Impact factor: 6.725
FIGURE 1An example peptidase summary page from the MEROPS website. The peptidase summary page for glutamyl endopeptidase I is shown
Counts of identifiers, families and clans in the MEROPS database
| Proteolytic enzymes | Protein inhibitors | |
|---|---|---|
|
| 4,684 | 734 |
| Families | 276 | 83 |
| Clans | 47 | 34 |
Counts of other features in the MEROPS database
| Item | Total |
|---|---|
| References | 70,628 |
| Substrate cleavages | 98,378 |
| Peptidase–inhibitor interactions | 6,854 |
| Small‐molecule inhibitors | 1,345 |
Percentages of amino acids in substrate‐binding sites P4‐P4'
| Amino acid | P4 | P3 | P2 | P1 | P1' | P2' | P3' | P4' |
|---|---|---|---|---|---|---|---|---|
| ‐ | 10.5 | 8.2 | 6.5 | 0 | 0 | 4.3 | 5.3 | 6.2 |
| Ala | 7.6 | 9 | 8.5 | 7 | 11.1 | 8.9 | 7.9 | 7.3 |
| Arg | 4.6 | 4 | 3.7 | 17 | 3.1 | 4.1 | 3.3 | 3 |
| Asn | 2.9 | 2.8 | 2.8 | 4 | 2.9 | 3 | 3.5 | 3.5 |
| Asp | 4.9 | 3.8 | 2.7 | 8.5 | 4.4 | 4.6 | 5.6 | 7.1 |
| Cys | 1.2 | 1.3 | 1.2 | 0.9 | 1.1 | 1 | 1.3 | 1.1 |
| Glu | 6.5 | 6.9 | 6 | 8.2 | 4.9 | 6.7 | 7.6 | 8.7 |
| Gln | 3.9 | 4.1 | 4.1 | 2.8 | 3.1 | 4.4 | 4.5 | 4.5 |
| Gly | 6.7 | 6.5 | 6.3 | 5.8 | 7.8 | 6.9 | 8.3 | 7.6 |
| His | 1.9 | 1.8 | 1.6 | 1.2 | 1.8 | 2 | 2 | 1.9 |
| Ile | 4.4 | 4.7 | 4.6 | 0.7 | 5.7 | 5.2 | 4.4 | 4.2 |
| Leu | 7.8 | 8.4 | 11.7 | 5.6 | 11 | 8.9 | 7.9 | 6.9 |
| Lys | 5.2 | 5.1 | 5 | 15.7 | 6.8 | 5.2 | 5.1 | 5 |
| Met | 1.9 | 2 | 2 | 6 | 2.4 | 1.8 | 1.7 | 1.5 |
| Phe | 3.1 | 3.5 | 4.3 | 3.8 | 4.5 | 3.4 | 3.2 | 2.9 |
| Pro | 5.8 | 5.6 | 5.6 | 2 | 1.5 | 5.2 | 6.2 | 7.3 |
| Ser | 6 | 6 | 5.9 | 3.6 | 9.3 | 6.8 | 6.8 | 6.5 |
| Thr | 4.4 | 4.7 | 4.4 | 2.6 | 4.7 | 5.6 | 5.2 | 5.3 |
| Trp | 0.8 | 0.7 | 0.9 | 0.7 | 0.7 | 0.8 | 0.8 | 0.8 |
| Tyr | 2.3 | 2.2 | 2.7 | 2.5 | 2.8 | 2.7 | 2.4 | 2.1 |
| Val | 6.5 | 7.6 | 8.6 | 1.4 | 7.2 | 8 | 6.6 | 6.1 |
| X | 1 | 0.9 | 0.9 | 0.2 | 3 | 0.3 | 0.3 | 0.4 |
Note: Standard amino acids are included, plus “‐” to indicate an unoccupied site and “X” to indicate a nonstandard amino acid or other moiety. An amino acid that occupies a substrate‐binding site in 10% or more cleavages is highlighted in yellow. An amino acid that occupies a substrate‐binding site in less than 1% of cleavages is highlighted in orange.
Co‐operative binding in substrate‐binding sites of proteolytic enzymes
|
| Recommended name | Cleavages | Frequency | Site | Residue n | Residue n + 1 |
|---|---|---|---|---|---|---|
| A01.002 | pepsin B | 25 | 12 | P2 | Gly | Phe |
| A01.002 | pepsin B | 25 | 17 | P2' | Arg | Leu |
| A01.014 | candidapepsin SAP1 | 25 | 15 | P4 | Leu | Val |
| A01.014 | candidapepsin SAP1 | 25 | 14 | P2' | Ala | Glu |
| A01.060 | candidapepsin SAP2 | 21 | 15 | P4 | Leu | Val |
| A01.060 | candidapepsin SAP2 | 21 | 15 | P3 | Val | Glu |
| A01.060 | candidapepsin SAP2 | 21 | 15 | P2' | Tyr | Leu |
| A01.060 | candidapepsin SAP2 | 21 | 15 | P3' | Leu | Val |
| A02.002 | HIV‐2 retropepsin | 26 | 10 | P1' | Pro | Ile |
| A02.063 | walleye dermal sarcoma virus retropepsin | 23 | 10 | P3' | Val | Gln |
| C01.100 | cruzipain 2 | 26 | 16 | P3' | Lys | Gln |
| C14.001 | caspase‐1 | 171 | 18 | P2 | Pro | Asp |
| C14.001 | caspase‐1 | 171 | 15 | P2 | Thr | Asp |
| C14.004 | caspase‐7 | 466 | 13 | P2 | Phe | Asp |
| C14.004 | caspase‐7 | 466 | 14 | P2 | Met | Asp |
| C30.005 | SARS coronavirus picornain 3C‐like peptidase | 24 | 13 | P2 | Gly | Gly |
| G01.001 | scytalidoglutamic peptidase | 35 | 28 | P3 | Lys | Leu |
| G01.001 | scytalidoglutamic peptidase | 35 | 16 | P1' | Ser | Ser |
| G01.001 | scytalidoglutamic peptidase | 35 | 28 | P2' | Ser | Lys |
| M01.018 | endoplasmic reticulum aminopeptidase 1 | 47 | 13 | P3' | Asn | Lys |
| M03.002 | neurolysin | 118 | 14 | P2 | Gly | Phe |
| M03.011 | tropolysin | 31 | 19 | P3 | Phe | Ser |
| N10.002 | intein‐containing replicative DNA helicase precursor | 80 | 42 | P2 | His | Asn |
| N10.002 | intein‐containing replicative DNA helicase precursor | 80 | 10 | P1 | Pro | Cys |
| N10.004 | intein‐containing translation initiation factor IF‐2 precursor | 30 | 15 | P2 | His | Asn |
| N10.004 | intein‐containing translation initiation factor IF‐2 precursor | 30 | 11 | P3 | Val | His |
| S01.136 | granzyme B, rodent‐type | 360 | 20 | P2 | Gln | Asp |
| S01.217 | thrombin | 163 | 16 | P2 | Ala | Arg |
| S01.217 | thrombin | 163 | 18 | P2 | Gly | Arg |
| S01.251 | kallikrein‐related peptidase 4 | 116 | 21 | P2 | Lys | Arg |
| S01.251 | kallikrein‐related peptidase 4 | 116 | 12 | P2 | Arg | Arg |
| S01.267 | streptogrisin E | 23 | 14 | P1' | Val | Phe |
| S08.070 | kexin | 184 | 13 | P3 | Lys | Lys |
| S09.010 | oligopeptidase B | 35 | 10 | P2 | Arg | Arg |
| S50.004 | blotched snakehead birnavirus Vp4 peptidase | 30 | 12 | P2 | Ala | Ala |
| S50.004 | blotched snakehead birnavirus Vp4 peptidase | 30 | 11 | P1' | Glu | Ala |
| S53.011 | scytalidolisin | 20 | 12 | P1' | Ala | Ala |