| Literature DB >> 31100077 |
Heli A M Mönttinen1, Janne J Ravantti1,2, Minna M Poranen1.
Abstract
Specific cleavage of proteins by proteases is essential for several cellular, physiological, and viral processes. Chymotrypsin-related proteases that form the PA clan in the MEROPS classification of proteases is one of the largest and most diverse group of proteases. The PA clan comprises serine proteases from bacteria, eukaryotes, archaea, and viruses and chymotrypsin-related cysteine proteases from positive-strand RNA viruses. Despite low amino acid sequence identity, all PA clan proteases share a conserved double β-barrel structure. Using an automated structure-based hierarchical clustering method, we identified a common structural core of 72 amino acid residues for 143 PA clan proteases that represent 12 protein families and 11 subfamilies. The identified core is located around the catalytic site between the two β-barrels and resembles the structures of the smallest PA clan proteases. We constructed a structure-based distance tree derived from the properties of the identified common core. Our structure-based analyses support the current classification of these proteases at the subfamily level and largely at the family level. Structural alignment and structure-based distance trees could thus be used for directing objective classification of PA clan proteases and to strengthen their higher order classification. Our results also indicate that the PA clan proteases of positive-strand RNA viruses are related to cellular heat-shock proteases, which suggests that the exchange of protease genes between viruses and cells might have occurred more than once.Entities:
Mesh:
Substances:
Year: 2019 PMID: 31100077 PMCID: PMC6524800 DOI: 10.1371/journal.pone.0216659
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Structural fold of chymotrypsinogen A of Bos taurus (PDBid: 2CGA, member of S1A subfamily).
The catalytic triad located in the interface of the two β-barrels is shown in purple.
Protease families and subfamilies used in this study.
| Protease family | Protease subfamily | Number of structures in the study | Type peptidase | Catalytic amino acids | Activity/Function | Organisms |
|---|---|---|---|---|---|---|
| C3A | 5 | C3 protease | His/Asp or Glu/ Cys | Processing of the viral polyprotein, inhibition of host cell protein synthesis. | (+) ssRNA viruses ( | |
| C3B | 2 | 2A enterovirus peptidase | His/Asp or Glu/ Cys | Processing of the viral polyprotein, inhibition of host cell protein synthesis. | (+) ssRNA viruses | |
| C3C | 1 | foot-and-mouth disease virus C3 protease | His/Asp or Glu /Cys, | Processing of the viral polyprotein | (+) ssRNA viruses | |
| C3E | 1 | Hepatitis A C3 protease | His/Asp/Cys (aspartate may not be functional) | Processing of the viral polyprotein | (+) ssRNA viruses | |
| 2 | Nuclear-inclusion-a peptidase of plum pox virus | Catalytic triad, His/Asp/Cys | The NIa proteases are required for the processing of the potyviral polyproteins | (+) ssRNA viruses | ||
| 7 | Porcine transmissible gastroenteritis virus-type main peptidase | Catalytic dyad, His/Cys | Processing of the viral polyprotein | (+) ssRNA viruses | ||
| 2 | Calicivirin | Catalytic dyad, His/Cys | Processing of the viral polyprotein | (+) ssRNA viruses | ||
| S1A | 79 | Chymotrypsin A | Catalytic triad, His/Asp/Ser | Many functions, | Eukaryotes, Bacteria | |
| S1B | 8 | Glutamyl peptidase I | Catalytic triad, His/Asp/Ser | Bacteria | ||
| S1C | 8 | DegP peptidase | Catalytic triad, His/Asp/Ser | Heat-shock proteases, activated in response to various stress reactions | Bacteria, chloroplasts, and mitochondria | |
| S1D | 4 | Lysyl endopeptidase | Catalytic triad, His/Asp/Ser | Bacteria | ||
| S1E | 7 | Streptogrisin A | Catalytic triad, His/Asp/Ser | Bacteria | ||
| S1F | 1 | Astrovirus serine peptidase | Catalytic triad, His/Asp/Ser | Processing of the viral polyprotein | (+) ssRNA viruses ( | |
| 3 | Togavirin | Catalytic triad, His/Asp/Ser | Capsid protein, cleaves itself from the polyprotein | (+) ssRNA virus ( | ||
| 5 | IgA1-specific serine peptidase | Catalytic triad, His/Asp/Ser | Interference of mucosal immunity | Bacteria | ||
| 2 | Flavivirin | Catalytic triad, His/Asp/Ser | Processing of the viral polyprotein | (+) ssRNA virus | ||
| 1 | Hepacivirin | Catalytic triad, His/Asp/Ser | Processing of the viral polyprotein | (+) ssRNA virus | ||
| 2 | Equine arteritis virus serine peptidase | His/Asp/Ser | Processing of the viral polyprotein | (+) ssRNA virus ( | ||
| S39A | 1 | Sobemovirus peptidase | His/Asp/Ser | Processing of the viral polyprotein | (+) ssRNA virus | |
| 2 | dipeptidyl-peptidase 7 | His/Asp/Ser | Cleavage of peptide for metabolism | Bacteria |
aAccording to the MEROPS database. C in the family name indicates cysteine and S serine proteases.
Fig 2Common structural fold for PA clan proteases.
The common structural core for the PA clan proteases was identified using the HSF program. The 72 equivalent residues deduced from the structural clustering are mapped in green on the structures of the 3C protease of the hepatitis A virus (left; family C3, subfamily C3E, PDBid: 1HAV), MamO protease of Magnetospirillum magneticum (middle; family S1, subfamily S1C, PDBid: 5HMA), and trypsin of Fusarium oxysporum (right; family S1A, PDBid: 1GDQ). The other parts of the protein structures are shown in grey.
Fig 3Cartoon of the secondary structures in the canonical two β-barrel structure of PA clan proteases and the identified structurally conserved core.
The secondary structures observed in the identified core are shown in green. Elements observed only in the canonical β-barrel structure are shown in light purple. The catalytic amino acids are indicated with dark purple spheres (positions according to trypsin). The secondary structures are numbered. N-terminal secondary structures start with N and C-terminal with C.
Fig 4A structure-based distance tree for members of PA clan proteases.
The structure-based distance tree was deduced based on the 72 equivalent amino acid residues located close to the catalytic site. The colors indicate the different families of the PA clan according to the MEROPS database. The five clusters (I−V) are indicated. The split subfamily S1D groups are labeled with “S1Dtype” and “S1Dnew”. The branches corresponding to the protease paralogs SMIPP-S-D1 and SMIPP-S-I1 (PDBids: 3H7T and 3H7O, respectively) are marked with asterisks. The names of the families/subfamilies that comprise viral proteases are in bold.