| Literature DB >> 24995852 |
Szu-Chin Fu1, Kenichiro Imai, Tatsuya Sawasaki, Kentaro Tomii.
Abstract
Because of its wide range of substrates, caspase-3, a main executioner among apoptosis-related caspases, is thought to have many unknown substrates that have remained unidentified. This report describes our predictive method to facilitate the discovery of novel caspase-3 substrates. To develop a more reliable prediction method, we specifically examined improvement of the data quantity and quality of caspase-3 cleavage sites. The ScreenCap3 method is based on machine learning and on information not only of experimentally verified positive examples but also of negative examples, which were not cleaved by caspase-3. Using information of experimentally verified noncleavage sites, we elucidate novel patterns of amino acids around "actual" cleavage sites. Results show that ScreenCap3 provides substantial improvement in terms of precision, compared with existing methods. Therefore, ScreenCap3 is anticipated for use with proteomic screening and identification of novel caspase-3 substrates and their cleavage sites. ScreenCap3 is available at http://scap.cbrc.jp/ScreenCap3/.Entities:
Keywords: Apoptosis; Bioinformatics; Caspase-3; High-throughput proteomics
Mesh:
Substances:
Year: 2014 PMID: 24995852 PMCID: PMC4282780 DOI: 10.1002/pmic.201400002
Source DB: PubMed Journal: Proteomics ISSN: 1615-9853 Impact factor: 3.984
Figure 1Two iceLogos generated using the same positive examples (experimentally verified cleavage sites) but different negative examples: (A) our experimentally verified noncleavage D-sites and (B) the “plausible” noncleavage D-sites, as most previous methods used.
Performance comparison with existing methods
| Method (cut-off) | #TPs (recall) | #FPs (precision) | MCC |
|---|---|---|---|
| SitePrediction (99%) | 42 (0.79) | 188 (0.18) | 0.32 |
| Pripper | 38 (0.72) | 128 (0.23) | 0.35 |
| SitePrediction (99.9%) | 18 (0.34) | 19 (0.49) | 0.37 |
| CAT3 (30) | 23 (0.43) | 38 (0.38) | 0.37 |
| ScreenCap3 (0.7) | 26 (0.49) | 38 (0.41) | 0.41 |
Pripper is not a score-based predictor.
Figure 2(A) Comparison of the number of false positives while attaining an equal number of true-positive results. Corresponding cut-off values are shown in parentheses. (B) Comparison of the number of true positives while attaining an equal number of false-positive results. Corresponding cut-off values are shown in parentheses. (C) PR curve of three tools at the site level. Recall is defined as TP/(TP + FN). Precision is defined as TP/(TP + FP), here TP = number of true positives, FN = number of false negatives, and FP = number of false positives.