| Literature DB >> 31053072 |
Thommas Mutemi Musyoka1, Joyce Njoki Njuguna1, Özlem Tastan Bishop2.
Abstract
BACKGROUND: Falcipains are major cysteine proteases of Plasmodium falciparum involved in haemoglobin degradation and remain attractive anti-malarial drug targets. Several inhibitors against these proteases have been identified, yet none of them has been approved for malaria treatment. Other Plasmodium species also possess highly homologous proteins to falcipains. For selective therapeutic targeting, identification of sequence and structure differences with homologous human cathepsins is necessary. The substrate processing activity of these proteins is tightly controlled via a prodomain segment occluding the active site which is chopped under low pH conditions exposing the catalytic site. Current work characterizes these proteases to identify residues mediating the prodomain regulatory function for the design of peptide based anti-malarial inhibitors.Entities:
Keywords: Binding affinity; Cysteine protease; Falcipain; Homology modelling; Prodomain inhibitory segment; Zymogen
Mesh:
Substances:
Year: 2019 PMID: 31053072 PMCID: PMC6500056 DOI: 10.1186/s12936-019-2790-2
Source DB: PubMed Journal: Malar J ISSN: 1475-2875 Impact factor: 2.979
Fig. 1Clan CA cysteine protease zymogen prodomain-catalytic domain interaction modes. Surface representation of a human Cat-K and b FP-2. c FP-2 prodomain structural elements (pink; in cartoon representation) interacting with the S1 (red), S2 (blue), S3 (green) and S1′ (cyan) subsites of the catalytic domain
Fig. 2A graphical workflow of the methods and tools (in brackets) used in sequence and structural analysis of FP-2, FP-3 and their homologs
Details of all protein sequences retrieved from PlasmoDB and NCBI databases
| Source organism | Host | Accession number | Common name (abbreviation) | aa | % SI |
|---|---|---|---|---|---|
|
| Human | PF3D7_1115700 | Falcipain-2 (FP-2) | 484 | QS |
| PF3D7_1115400 | Falcipain-3 (FP-3) | 492 | QS | ||
|
| PVX_091415 | Vivapain-2 (VP-2) | 487 | 56a | |
| PVX_091410 | Vivapain-3 (VP-3) | 493 | 55b | ||
|
| Human/monkey | PKH_091250 | Knowlesipain-2 (KP-2) | 495 | 53a |
| PVX-091260 | Knowlesipain-3 (KP-3) | 479 | 58b | ||
|
| Rodents | PY00783 | Yoelipain-2 (YP-2) | 472 | 47a |
|
| PCHAS_091190 | Chabaupain-2 (CP-2) | 471 | 48b | |
|
| PBANKA_093240 | Berghepain-2 (BP-2) | 470 | 50a | |
|
| – | NP_000387.1 | Cathepsin-K (Cat-K) | 329 | 34a |
| AAA66974.1 | Cathepsin-L (Cat-L) | 333 | 33a | ||
| AAC37592.1 | Cathepsin-S (Cat-S) | 331 | 32a |
Percentage sequence identity (SI) is calculated based on the partial zymogen of query sequence (QS) and that of corresponding homolog
aFP-2 homolog, bFP-3 homolog
Homology model quality validation scores of partial zymogen complexes using different assessment tools
| Protein | Z-DOPE | Verify3D | ProSA | QMEAN | RAMACHANDRAN (%) | ||
|---|---|---|---|---|---|---|---|
| Favoured | Allowed | Disallowed | |||||
| FP-2 | − 1.05 | 78.48 | − 8.27 | 0.69 | 88.90 | 10.50 | 0.60 |
| FP-3 | − 0.94 | 84.64 | − 7.84 | 0.67 | 89.70 | 10.30 | 0.00 |
| VP-2 | − 0.64 | 81.27 | − 6.94 | 0.62 | 85.40 | 13.90 | 0.70 |
| VP-3 | − 0.74 | 79.58 | − 7.31 | 0.70 | 88.60 | 11.40 | 0.00 |
| KP-2 | − 0.63 | 85.89 | − 7.23 | 0.62 | 89.30 | 9.70 | 1.00 |
| KP-3 | − 0.92 | 90.63 | − 7.88 | 0.63 | 86.10 | 13.90 | 0.00 |
| BP-2 | − 0.62 | 86.85 | − 7.75 | 0.63 | 86.60 | 12.40 | 1.00 |
| CP-2 | − 0.60 | 84.45 | − 7.02 | 0.63 | 83.90 | 13.80 | 2.30 |
| YP-2 | − 0.42 | 75.54 | − 7.28 | 0.62 | 84.90 | 14.40 | 0.70 |
| Cat-L | − 1.47 | 87.38 | − 7.94 | 0.87 | 89.80 | 9.80 | 0.40 |
| Cat-S | − 1.61 | 85.39 | − 8.57 | 0.79 | 89.20 | 10.40 | 0.20 |
| 1BY8 | * | 85.16 | − 8.62 | 0.78 | 65.80 | 34.20 | 0.00 |
| 2O6X | * | 94.77 | − 7.00 | 0.77 | 90.50 | 9.50 | 0.00 |
| 2OUL | * | 98.13 | − 7.90 | 0.75 | 88.10 | 11.20 | 0.70 |
| 3BWK | * | 93.51 | − 7.35 | 0.65 | 86.10 | 13.30 | 0.60 |
* Template
Fig. 3Homology models of different plasmodial proteases and human Cat-L together with the templates used in homology modelling. Colour code ranges from blue (accurate modelling) to red (poorly modelled regions)
A summary of physicochemical properties of FP-2 and FP-3 and homologs partial zymogen sequences
| Protein | Aromaticity | GRAVY | Instability index | Mwgta | p | ||
|---|---|---|---|---|---|---|---|
| Complex | Catalytic | Complex | Catalytic | ||||
| FP-2 | 0.12 | − 0.59 | 40.31 | 38,021.06 | 27,176.69 | 6.50 | 4.94 |
| FP-3 | 0.14 | − 0.53 | 33.90 | 38,029.91 | 27,348.62 | 5.47 | 4.72 |
| VP-2 | 0.14 | − 0.40 | 23.78 | 37,941.11 | 27,388.93 | 5.49 | 4.74 |
| VP-3 | 0.14 | − 0.47 | 28.92 | 38,405.76 | 28,088.95 | 5.60 | 5.00 |
| KP-2 | 0.14 | − 0.54 | 29.55 | 38,583.83 | 27,947.73 | 5.69 | 4.93 |
| KP-3 | 0.14 | − 0.51 | 23.16 | 38,284.39 | 27,685.36 | 5.89 | 5.13 |
| BP-2 | 0.14 | − 0.52 | 39.73 | 38,014.94 | 27,367.90 | 5.44 | 4.77 |
| CP-2 | 0.13 | − 0.47 | 52.14 | 37,883.92 | 27,140.51 | 5.14 | 4.54 |
| YP-2 | 0.14 | − 0.46 | 40.05 | 38,120.25 | 27,454.06 | 5.66 | 4.78 |
| Cat-K | 0.10 | − 0.62 | 33.12 | 34,566.05 | 23,495.47 | 8.83 | 8.92 |
| Cat-L | 0.13 | − 0.70 | 38.17 | 35,074.20 | 24,298.86 | 5.33 | 4.64 |
| Cat-S | 0.12 | − 0.56 | 37.20 | 34,986.63 | 23,963.97 | 8.44 | 7.64 |
aSignificant variation between partial zymogen and catalytic sequence
Fig. 4Structural-based multiple sequence alignment of FP-2, FP-3 and homologs prodomain-catalytic domains. Actual residue numbering per protein is given on the side, and the top numbering is based on partial zymogen alignment. The papain family characteristic prodomain ERFNIN and GNFD motif residues are indicated with an asterisk. Bold short lines depict the prodomain-catalytic domain border. Dashed green lines indicate the position of α-helix and arrows β-sheet structural elements. Fully conserved residues in all the proteins are marked with red while residues only conserved in plasmodial proteases with blue. Position of subsite residues is shown with filled circles (Red = S1, Blue = S2, Green = S3 and S1′ = black)
Fig. 5A phylogenetic tree of plasmodial and human FP-3, and FP-3 homologs prodomain-catalytic protein sequences using MEGA5.2.2. The evolutionary history was inferred by using the Maximum Likelihood method based on the Whelan and Goldman model (WAG) model with a γ discrete distribution (+G) parameter of 2.4 and an evolutionary invariable ([+I]) of 0.1. All positions with gaps were completely removed (100% deletion) and bootstrap value set at 1000. The scale bar represents the number of amino acid substitutions per site. Toxoplasma gondii CAT-L is used as the outgroup
Fig. 6Motif analysis of plasmodial proteases and human cathepsins partial zymogen domains. a A heat map showing the distribution, level of conservation and information of different motifs found in plasmodial and human proteases studied. A cartoon presentation showing the location of all motifs within the prodomain-catalytic structural fold. Labelled in green boxes are motifs present in both (b) human cathepsins and (c) plasmodial proteases
Fig. 7Intra-prodomain and prodomain-catalytic residue interaction network in a FP-2 and b Cat-K. For each protein full length residue numbering is used. Enclosed in black are residue interactions involved in anchoring the prodomain onto the catalytic domain with the rest being those involved in mediating the inhibitory effect. Shown in lines are the different interaction types between the prodomain and catalytic domains
Fig. 8A heatmap for residue interaction energies between prodomain inhibitory segment and the catalytic subsite residues per protein. The inhibitory segment starts from the conserved Asn residue in the GNFD motif (Fig. 4)
Fig. 9Sequence alignment of the prodomain inhibitory segment for the plasmodial and human cathepsin proteases studied. Marked sequence sections indicate the portions used to design different oligopeptides for docking studies and their conservation as determined by WebLogo server. Actual residue numbering per protein is given on the side
Amino acid sequences of proposed peptides, their predicted binding affinity values (ΔG—kcalmol−1) and dissociation constant (Kd) with individual catalytic domains of the different proteins studied
| Protein | Peptide | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 2 | 3 | 4 | 5 | ||||||
| ΔG | Kd (M) | ΔG | Kd (M) | ΔG | Kd (M) | ΔG | Kd (M) | ΔG | Kd (M) | |
| FP-2 | − 11.8 | 2.2E−09 | − 11.2 | 8.4E−09 | − 8.1 | 1.2E−06 | − 9 | 2.7E−07 | − 11.4 | 4.4E−09 |
| FP-3 | − 12.2 | 1.2E−09 | − 10.5 | 2.0E−08 | − 9.6 | 9.4E−08 | − 9.7 | 7.8E−08 | − 12.3 | 8.8E−10 |
| VP-2 | − 11.0 | 8.3E−09 | − 10.1 | 3.8E−08 | − 12.3 | 8.8E−10 | − 9.0 | 2.7E−07 | − 10.9 | 1.1E−08 |
| VP-3 | − 11.7 | 2.7E−09 | − 11.0 | 8.3E−09 | − 9.9 | 5.7E−08 | − 9.2 | 1.7E−07 | − 11.8 | 2.1E−09 |
| KP-2 | − 12.7 | 4.7E−10 | − 11.8 | 2.1E−09 | − 6.7 | 1.3E−05 | − 8.1 | 1.2E−06 | − 10.6 | 1.8E−08 |
| KP-3 | − 11.8 | 2.1E−09 | − 12.7 | 5.2E−10 | − 9.3 | 1.4E−07 | − 9.2 | 1.9E−07 | − 12.7 | 5.2E−10 |
| BP-2 | − 11.5 | 3.7E−09 | − 12.2 | 1.2E−09 | − 10.9 | 9.9E−09 | − 7.9 | 1.7E−06 | − 11.7 | 2.7E−09 |
| CP-2 | − 11.9 | 2.2E−09 | − 11.7 | 2.7E−09 | − 8.7 | 3.9E−07 | − 8.3 | 1.1E−07 | − 12.7 | 4.7E−10 |
| YP-2 | − 11.7 | 2.7E−09 | − 14.3 | 5.5E−11 | − 12.4 | 7.9E−10 | − 10.4 | 2.5E−08 | − 9.3 | 1.5E−07 |
| Cat-K | − 13.9 | 6.3E−11 | − 11.5 | 3.5E−09 | − 11.9 | 2.0E−09 | − 11.4 | 4.2E−09 | − 8.3 | 1.1E−07 |
| Cat-L | − 12.2 | 1.0E−09 | − 11.8 | 5.1E−09 | − 12.3 | 9.7E−10 | − 10.2 | 3.5E−08 | − 8.7 | 9.2E−07 |
| Cat-S | − 10.4 | 2.4E−09 | − 11.5 | 3.5E−09 | − 12.4 | 7.7E−10 | − 9.9 | 5.2E−08 | − 8.6 | 4.8E−07 |
Peptide 1NRFGDLSFEEFKKKYLNLKLFD, 2LTYHEFKNKYLSLRSSK, 3MTFEEFKQKYLTLKSKD, 4EFKKKYLTLK, 5EFKKKYLTLKSKD
Fig. 10A heatmap for residue interaction energies between peptide 5 and the catalytic subsite residues per protein
Peptide 5-catalytic domain residue interaction fingerprint
| Protein | Subsite | Non-subsite residues | |||
|---|---|---|---|---|---|
| S1 | S2 | S3 | S1′ | ||
| FP-2 | Q279, C282, G283, C323 | L327, | N320, G325, G326 | V395, S396, A400, | K280, W286, R470, C285 |
| FP-3 | Q287, C290, G291, Y332 | K327, N328 | A402, A403, S404, A408, | L289, C293, W294, N338 | |
| VP-2 | Q282, G286, Y327 | F330, I331, P418, | Q322, N323, G328, G329 | I396, A397, V398, A403, H420, N451 | D282, F331, P332, Y351, E367, F389 |
| VP-3 | Q288, C291, G292, C332, D333 | N336, I337, S401, P424, | G334, G335, N329 | I402, C403, A404, N405, | D287, K289, C294, D406, S423, S457, G459, K458, W462 |
| KP-2 | Q288, C291, G292, C332, D333 | L336, I337, S401, P424, | N329, G333, G334 | N403, A404, N405, T409, H426, N456, W458 | |
| KP-3 | Q274, C277, G278, C318, D319 | F322, N387, T410, N411, | Q314, N3145, N316, G320, G321 | I388, A389, V390, S391, A395, | G275, S279, C280, P324, R325, E368, N387, D392, T409 |
| BP-2 | Q264, A268, C308, E309 | I312, A377, | N304, N305, F306, G311 | V378, G379, D381, H402, N432, W434 | K266, A272, P314, Y334, E349, A367, I376 |
| CP-2 | Q267, C270, A271, E312 | I315, L316, A380, A303, | N307, N308, D309, G314 | V381, G382, A383, S384, | K265, W274, Q306, P337, K351 |
| YP-2 | C271, A272, D313 | I316, L317, A381, A404, | N308, N309, F310, G314 | V382, G383, V384, A385, H406, N438 | Q269, W275, F310, E353, I380 |
| Cat-K | C136, G137 | Y181, M182, A248, L274, N275, A277 | G179 | D250, A251, S252, H276 | C139, W140, F186, Y224, F256, W302, E226 |
| Cat-L | C135, N179 | L182, A248, D275, G277 | N175 | I249, A251, L257, H276 | P172, G224, A234, I263 |
| Cat-S | C134, | F182, G249, V274, | N175, K176, G180 | R253, F258, W298 | Y173, Y225, E227, P229, Y230 |
Italics are residues forming hydrogen bonds with the peptide
Fig. 11Peptide 5 binding mode with catalytic domain of various proteins (Red = S1, Blue = S2, Green = S3 and Cyan = S1′)