| Literature DB >> 22355773 |
Sun Tian, Wang Huajun, Jianhua Wu.
Abstract
Furin cleaves diverse types of protein precursors in the secretory pathway. The substrates for furin cleavage possess a specific 20-residue recognition sequence motif. In this report, based on the functional characterisation of the 20-residue sequence motif, we developed a furin cleavage site prediction tool, PiTou, using a hybrid method composed of a hidden Markov model and biological knowledge-based cumulative probability score functions. PiTou can accurately predict the presence and location of furin cleavage sites in protein sequences with high sensitivity (96.9%) and high specificity (97.3%). PiTou's prediction scores are biological meaningful and reflect binding strength and solvent accessibility of furin substrates. A prediction result is interpreted within cellular contexts: subcellular localisation, cellular function and interference by other dynamic protein modifications. Combining next-generation sequencing, PiTou can help with elucidating the molecular mechanism of furin cleavage-associated human diseases. PiTou has been made freely available at the associated website.Entities:
Year: 2012 PMID: 22355773 PMCID: PMC3281273 DOI: 10.1038/srep00261
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Comparison of performance and design feature of furin cleavage site prediction tools
| Prediction tool | Sensitivity | Specificity | Design method | Reference |
|---|---|---|---|---|
| PiTou | 96.9% | 97.3% | Biological knowledge-based: combination of a hidden Markov model and biological knowledge- based cumulative probability score functions | Results section |
| ProP | 94.7% | 83.7% | Pure machine learning neural network | Duckert |
Figure 1The design of PiTou algorithm: the PiTou score function is biological knowledge-based and it comprises of a machine learning-based hidden Markov model and a set of biological mechanism-based cumulative probability score functions. is the sum of two parts: the core region binding score (calculated from eight amino acids at P6–P2′) and the flanking region solvent accessible score (calculated from eight amino acids at P7–P14 and four amino acids at P3′–P6′). and incorporate the physical properties of the 20-residue furin cleavage motif and the binding of substrates to the furin catalytic domain. The analysis of the 20-residue furin cleavage motif is described in previous publication8.
Predictions of PiTou on furin cleavage sites to elucidate the molecular mechanisms of furin cleavage-associated human diseases.
| Protein | Furin cleavage site motif | PiTou prediction score | Genetic mutation | Resulting amino acid mutation | Position of resulting amino acid mutation on furin cleavage site motif | Mutated furin cleavage site motif | PiTou prediction score of mutated furin cleavage site motif | Cellular consequence | Associated Human Disease |
|---|---|---|---|---|---|---|---|---|---|
| Ectodysplasin-A isoform II | EKPYSEEESRRVRR↓NKRSKS | 15.9 (cleaved by furin) | C704->T | R155->C | P2: R->C | EKPYSEEESRRVCR↓ NKRSKS | −9.8 | Loss of furin cleavage site | X-linked hypohydrotic ectodermal dysplasia |
| C707->T | R156->C | P1: R->C | EKPYSEEESRRVCC↓ NKRSKS | Loss of P1 arginine | Loss of furin cleavage site | ||||
| G708->A | R156->H | P1: R->H | EKPYSEEESRRVCH↓ NKRSKS | Loss of P1 arginine | Loss of furin cleavage site | ||||
| YSEEESRRVRRNKR↓SKSNEG | 17.0 (cleaved by furin) | C704->T | R155->C | P5: R->C | YSEEESRRVCRNKR↓ SKSNEG | −8.3 | Loss of furin cleavage site | ||
| C707->T | R156->C | P4: R->C | YSEEESRRVRCNKR↓SKSNEG | −10.3 | Loss of furin cleavage site | ||||
| G708->A | R156->H | P4: R->H | YSEEESRRVRHNKR↓SKSNEG | 9.3 | Reduced furin cleavage efficiency | ||||
| Desmoglein 2 | KLLPKHPHLVRQKR↓ AWITAP | 10.6 (cleaved by furin) | G143->A | R48->H | P1: R->H | KLLPKHPHLVRQKH↓ AWITAP | Loss of P1 arginine | Loss of furin cleavage site | Arrhythmogenic right ventricular dysplasia/cardiomyopathy
disorder |
| G134->A | R45->Q | P4: R->Q | KLLPKHPHLVQQKR↓ AWITAP | −14.6 | Loss of furin cleavage site | ||||
| Fibrinogen alpha chain precursor | GDFLAEGGGVRGPR↓ VVERHQ | −14.6 (not cleaved by furin) | T1215->A | V39->D | P1: V->D | GDFLAEGGGVRGPR↓ DVERHQ | 8.1 | Gain of unwanted furin cleavage site | The mutated fibrinogen alpha chain was cleaved by furin.
Prolonged thrombin time and a mild bleeding tendency |
Figure 2The hydrophobicity scale of the P3′–P6′ region of viral substrates (filled black box) is much higher than that of mammalian and bacterial substrates (white box), student t-test pvalue = 1.3E-004.The hydrophobicity is calculated using the physical property EISD840101 consensus normalized hydrophobicity scale for amino acids14.
List of 12 functions that evaluate binding strength of the core region (P6–P2′) and solvent accessibility of two flanking regions (P7–P14 and P3′–P6′)
| Physical property | Position on the furin cleavage site motif | Description | |
|---|---|---|---|
|
| ZIMJ680104 | P2 P4 P5 P6 | Positive charge and isoelectric point |
|
| ZIMJ680104 | P2 P3 | Positive charge and isoelectric point |
|
| Cysteine | P2–P6 | Disulfide bond formation potential and negative charge |
|
| ZIMJ680104 | P4 | Aliphatic residue or positively charged residue |
|
| FAUJ880111 | P4–P6 | Positive charge compensation |
|
| BULH740102 | P1′ | Volume |
|
| BULH740102 | P1′ – P3′ | Volume |
|
| KARP850103 | P1′ P2 P4 P5 P6 | Flexibility |
|
| KARP850103 | P1′ – P3′ | Flexibility |
|
| EISD840101 | P7–P10 | Hydrophobicity |
|
| EISD840101 | P3′–P6′ | Hydrophobicity |
|
| EISD840101 | P11–P14 | Hydrophobicity |