| Literature DB >> 25697094 |
Karen McLuskey1, Jeremy C Mottram1.
Abstract
Clan CD forms a structural group of cysteine peptidases, containing seven individual families and two subfamilies of structurally related enzymes. Historically, it is most notable for containing the mammalian caspases, on which the structures of the clan were founded. Interestingly, the caspase family is split into two subfamilies: the caspases, and a second subfamily containing both the paracaspases and the metacaspases. Structural data are now available for both the paracaspases and the metacaspases, allowing a comprehensive structural analysis of the entire caspase family. In addition, a relative plethora of structural data has recently become available for many of the other families in the clan, allowing both the structures and the structure-function relationships of clan CD to be fully explored. The present review compares the enzymes in the caspase subfamilies with each other, together with a comprehensive comparison of all the structural families in clan CD. This reveals a diverse group of structures with highly conserved structural elements that provide the peptidases with a variety of substrate specificities and activation mechanisms. It also reveals conserved structural elements involved in substrate binding, and potential autoinhibitory functions, throughout the clan, and confirms that the metacaspases are structurally diverse from the caspases (and paracaspases), suggesting that they should form a distinct family of clan CD peptidases.Entities:
Mesh:
Substances:
Year: 2015 PMID: 25697094 PMCID: PMC4357240 DOI: 10.1042/BJ20141324
Source DB: PubMed Journal: Biochem J ISSN: 0264-6021 Impact factor: 3.857
The structural availability and phylogenetic distribution of the clan CD families
The availability (✓) and absence (×) of clan CD families in the phylogenetic kingdom. The year that the first structure became available is shown for each family (year).
| FAMILY | Representative member | Structural data? (year) | Bacteria | Achaea | Protozoa | Fungi | Plants | Viruses | Animals |
|---|---|---|---|---|---|---|---|---|---|
| C11 | Clostripain | Yes (2013) | ✓ | ✓ | ✓ | × | ✓ | × | × |
| C13 | Legumain | Yes (2013) | ✓ | ✓ | ✓ | ✓ | ✓ | × | ✓ |
| C14A | Caspase | Yes (1994) | × | × | × | × | × | ✓ | ✓ |
| C14B(P) | Paracaspase | Yes (2011) | ✓ | ✓ | × | × | × | × | ✓ |
| C14B(M) | Metacaspase | Yes (2012) | ✓ | ✓ | ✓ | ✓ | ✓ | × | × |
| C25 | Gingipain R | Yes (1999) | ✓ | ✓ | × | × | × | × | × |
| C50 | Separase | No | × | × | ✓ | ✓ | ✓ | × | ✓ |
| C80 | MARTX-CPD | Yes (2008) | ✓ | × | × | × | × | × | ✓ |
| C84 | PrtH peptidase | No | ✓ | × | × | × | × | × | × |
| Clan CD | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ |
Enzymatic properties of the clan CD peptidases
| Family | Representative member | Specificity in P1 | Requirement for activation | Self-inhibition observed? | Region of self-inhibition |
|---|---|---|---|---|---|
| C11 | Clostripain | Arginine | Ca2+ | Unknown | – |
| C13 | Legumain | Asparagine and aspartate | Change in pH | Yes | C-terminal domain |
| C14A | Caspase | Aspartate | Dimerization or proteolysis | Unclear | N-terminal region |
| C14B(P) | Paracaspase | Arginine | Dimerization | Yes | Substrate-binding loop (L5) |
| C14B(M) | Metacaspase | Arginine and lysine | Ca2+ and/or proteolysis | Yes | N-terminal region |
| C25 | Gingipain R | Arginine or lysine | Proteolysis and/or Ca2+ | Yes | N-terminal prodomain |
| C50 | Separase | Arginine | Ca2+ | Unknown | – |
| C80 | MARTX-CPD | Leucine | Ligand binding | Yes | N-terminal region |
| C84 | PrtH peptidase | Arginine | Unknown | Unknown | – |
*Legumain will accept asparagine or aspartate residues depending on the pH.
**Cleavage of the proform of gingipain is required for full activation of the enzyme and while Ca2+ is not reported as a prerequisite for activation, all active forms appear to have Ca2+ present.
†Caspases are activated by dimerization or proteolysis depending on the type; typically initiator caspases are activated by dimerization whereas the effector caspases are activated by cleavage (proteolysis).
‡Self-inhibition using the N-terminal region has been suggested in the effector caspases but there are no structural data to date.
§Metacaspases are known to accept arginine and lysine in P1.
║Type I metacaspases generally activated by Ca2+; this is also true for type II metacaspases but, in addition, proteolysis has also been shown to be important in some cases.
¶Gingipain will accept arginine or lysine depending on the enzyme (gingipain R and K have a strict specificities for arginine and lysine, respectively).
Figure 1The topology and structure of the caspase dimer
Caspase-7 (PDB ID 1F1J, see Supplementary Table S1) is used to represent a typical caspase with all β-strands coloured blue and α-helices grey. Structural elements from the second monomer in the dimer are coloured paler than the first. (A) The topology and simplified nomenclature of the caspases. The central β-strands and major α-helices are named from the N-terminus (β1–β6 and α1–α5, respectively); the important loop regions (L) are named according to the strands that they follow (L1–L5); and the small sections of β-turns on L3 and L5 are named β1–β3 and β1–β2, respectively. The position of the catalytic dyad histidine (H) and cysteine (C) is highlighted in red on L3 and L4, respectively, and the position of the conserved aspartate cleavage site (D) is shown on L4 (/). A vertical dashed line represents the dimer interface and the C-terminus of the second monomer is shown (′). The CHF SSEs are highlighted (*). (B) Ribbon diagram of a caspase dimer. Topdraw [90] and PyMOL (http://pymol.sourceforge.net, Schrodinger) were used for topology diagrams and molecular images throughout the present review, respectively.
Figure 2The S1-binding pockets of the clan CD family members
The catalytic dyad is shown in red and conserved aromatic residues are shown in green. With the exception of TbMCA2, residues that form hydrogen bonds to the P1 residue of a bound inhibitor are shown in blue (the darker shade of blue represents interactions through functional groups, whereas the lighter blue shows interactions from main chain atoms). Residues and SSEs involved in P1 binding are labelled and SSEs structurally homologous (but topologically diverse) to those found in the caspases are highlighted (H). (A) Caspase-7 PDB (ID 1F1J). (B) Inhibitor-free TbMCA2 (PDB ID 4AFR) in which residues shown to be important in substrate binding are highlighted in blue, with those responsible for specificity in P1 [11] shown in navy blue. (C) MALT1 paracaspase domain (MALT1-P) (PDB IB 3UOA). (D) Legumain (PDB ID 4AW9). (E) Gingipain R (PDB ID 1CVR). (F) MARTX-CPD (PDB ID 3GCD). Inhibitors used in complex structures are shown in Supplementary Table S1.
Figure 3The structural topologies of the clan CD enzymes
(A) Caspase-7; (B) TbMCA2; (C) MALT1-P; (D); legumain; (E) gingipain R; and (F) MARTX-CPD. The S1-binding pockets are highlighted as in Figure 2 and the topologies are based on the PDB codes described in the same Figure. Strands in the central β-sheet are numbered from the N-terminus in black. Black numbering is also used for the five major α-helices and important S1-binding loops (L) when they are located in the structure in the same order as they are in the caspases. SSEs that are structurally homologous to those found in the caspases, but appear in the structure in a different order, are highlighted with an (H), followed by the caspase numbering, and shown in purple (α and β have been omitted as a result of space constraints but are used in the text). The position of the catalytic dyad (H/C) is shown in red on loops L3 and L4 (or HL3 and HL4), respectively.
Three-dimensional superposition of clan CD families with caspase 7
The Table is ordered in terms of the quality of the Cα alignment (Q score, QS), in which %SSEQ-C7 is the percentage of the SSEs in the query (Q) that can be identified in caspase-7 (where Q=MALT1-P, legumain, TbMCA2, PmC11, gingipain R and MARTX-CPD); %SSEC7-Q is the percentage of the SSEs in caspase-7 that can be identified in Q (see above); % Seq. ID is the percentage of the sequence identity found after structural alignment; Nalign is the number of matched residues; and RMSD is the root-mean-square deviation on the Cα positions of the matched residues.
| Enzyme | Family | PDB ID | %SSEQ-C7 | %SSEC7-Q | % Seq. ID | RMSD (Å) | ||
|---|---|---|---|---|---|---|---|---|
| Caspase-7 | C14A | 1F1J | 1.00 | 100 | 100 | 100 | 230 | 0.00 |
| MALT1-P | C14B(P) | 3V4O | 0.41 | 79 | 73 | 19 | 177 | 1.94 |
| Legumain | C13 | 4AW9 | 0.34 | 65 | 87 | 13 | 173 | 2.05 |
| TbMCA2 | C14B(M) | 4AFR | 0.22 | 59 | 67 | 13 | 175 | 2.69 |
| PmC11 | C11 | 3UWS | 0.14 | 38 | 73 | 11 | 151 | 3.03 |
| Gingipain R | C25 | 1CVR | 0.13 | 32 | 67 | 9 | 161 | 2.97 |
| MARTX-CPD | C60 | 3GCD | 0.10 | 47 | 47 | 6 | 109 | 3.60 |
Figure 4The structural diversity in the central β-sheet of the clan CD enzymes
(A) The β-sheet topologies exhibited by the families in the clan. The β-strands described by the CHF [23] are shown in grey whereas the other strands are shown in blue; the N- and C-terminal ends of the enzymes are labelled accordingly. (B) A phylogenetic tree based on structure, in which QH [85] is a measure of structural homology. This Figure was produced using a STAMP [87] structural alignment and VMD [87].