| Literature DB >> 17254298 |
Lawrence J K Wee1, Tin Wee Tan, Shoba Ranganathan.
Abstract
BACKGROUND: Caspases belong to a class of cysteine proteases which function as critical effectors in apoptosis and inflammation by cleaving substrates immediately after unique sites. Prediction of such cleavage sites will complement structural and functional studies on substrates cleavage as well as discovery of new substrates. Recently, different computational methods have been developed to predict the cleavage sites of caspase substrates with varying degrees of success. As the support vector machines (SVM) algorithm has been shown to be useful in several biological classification problems, we have implemented an SVM-based method to investigate its applicability to this domain.Entities:
Mesh:
Substances:
Year: 2006 PMID: 17254298 PMCID: PMC1764470 DOI: 10.1186/1471-2105-7-S5-S14
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Different subsequence segments for SVM training and testing. For human Mcl-1 [Swiss-Prot:Q07820], a sequence window of 24 amino acids in length centred on the tetrapeptide cleavage site, TSTD (underlined) is shown. Amino acids to the left of the scissile bond (shown by the inverted triangle) are labelled from P1 (D) to P14 (L). Amino acids to the right of the scissile bond are labelled from P1' (G) to P10' (A). Curly brackets indicate the subsequence segments extracted for SVM implementation. The sequences spanning P4 to P1 (TSTD), P4 to P2' (TSTDGS) and P14 to P10' (LELVGEGSNNTSTDGSLPSTPPPA) are labelled as P4P1, P4P2' and P14P10' respectively.
Figure 2Schematic layout of the datasets used for SVM training and testing. The primary dataset consist of non-redundant tetrapeptide caspase substrate cleavage sites obtained from literature (see Additional File 1) and an equal number of non-cleavage sites. 1The P4P1 sequences consist of all the sequences in the primary tetrapeptide cleavage site dataset. P4P2' and P14 P10' datasets were derived by extracting subsequence segments from the parent protein chains in the vicinity of the tetrapeptide cleavage sites, as shown in Figure 1. All datasets contain equal number of positive and negative examples.
Results of SVM prediction for various test datasets.
| 0.01 | 100 | 95.83 | 95.83 | 95.83 | 0.92 | |
| 0.1 | 100 | 97.92 | 95.83 | 100.00 | 0.96 | |
| 0.1 | 100 | 97.92 | 95.83 | 100.00 | 0.96 | |
| 0.01 | 1 | 81.25 | 62.50 | 100.00 | 0.67 | |
| 1 | 100 | 89.58 | 79.17 | 100.00 | 0.81 | |
| 0.1 | 1 | 93.75 | 87.50 | 100.00 | 0.88 | |
The SVM parameters (γ and C) were obtained from the cross-validation conducted on the training datasets.
GraBCas prediction on the P4P1 training dataset (positive sequences only)
| 87.43 | |
| 69.46 | |
| 40.72 | |
| 28.14 | |
| 19.76 |
SVM prediction of caspase substrate cleavage sites in Livin and mutants.
| Wild type Livin | Cleaved | Cleaved |
| LE Δ52–61 | Not cleaved | Cleaved |
| Δ53–55 | Cleaved | Cleaved |
| Δ55–57 | Cleaved | Cleaved |
| Δ57–59 | Cleaved | Cleaved |
| Δ60–62 | Cleaved | Cleaved |
| Δ52–61 | Not cleaved | Not cleaved |
| Δ53–61 | Not cleaved | Cleaved |
| Δ52 | Not cleaved | Not cleaved |
| Δ51–53 | Not cleaved | Cleaved |
a. Wild type Livin and various deletion mutants as reported in Yan et al. b Experimentally verified cleavage (cleaved) or non-cleavage (not cleaved) of Livin and deletion mutants. c. SVM prediction of caspase cleavage sites on Livin and deletion mutants (Cleaved – presence of cleavage site; Not cleaved -absence of cleavage site).