Yu Xue, Hu Chen, Changjiang Jin, Zhirong Sun, Xuebiao Yao.
Abstract
BACKGROUND: Protein palmitoylation, an essential and reversible post-translational modification (PTM), has been implicated in cellular dynamics and plasticity. Although numerous experimental studies have explored the molecular mechanisms underlying palmitoylation, the intrinsic basis of substrate specificity has remained elusive. Computational approaches for predicting palmitoylation sites are therefore highly desirable to guide further experimental design.
Year: 2006 PMID: 17044919 PMCID: PMC1624852 DOI: 10.1186/1471-2105-7-458
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Detailed description of the data sets.
| Data set | Old | New (original) | New (redundancy cleared) |
| Proteins | 84 | 111 | 105 |
| Sites | 209 | 266 | 245 |
| Non-sites | 720 | 1017 | 977 |
Top five Gene Ontology (GO) groups of biological processes, molecular functions and cellular components in palmitoylated proteins.
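| Category | GO ID | Term | Count |
| Biological process | GO:0007165 | signal transduction | 26 |
| Biological process | GO:0007186 | G-protein coupled receptor protein signaling pathway | 21 |
| Biological process | GO:0006810 | transport | 16 |
| Biological process | GO:0006811 | ion transport | 7 |
| Biological process | GO:0007155 | cell adhesion | 7 |
| Molecular function | GO:0005515 | protein binding | 41 |
| Molecular function | GO:0004872 | receptor activity | 27 |
| Molecular function | GO:0004871 | signal transducer activity | 25 |
| Molecular function | GO:0004930 | G-protein coupled receptor activity | 15 |
| Molecular function | GO:0001584 | rhodopsin-like receptor activity | 14 |
| Cellular component | GO:0016020 | membrane | 70 |
| Cellular component | GO:0016021 | integral to membrane | 54 |
| Cellular component | GO:0005886 | plasma membrane | 24 |
| Cellular component | GO:0005887 | integral to plasma membrane | 19 |
| Cellular component | GO:0005783 | endoplasmic reticulum | 9 |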
Comparison of the prediction performance for three machine learning algorithms on old data set.
| Algorithm (see notes) | Window length | 3-fold CV accuracy | 3-fold CV sensitivity | 3-fold CV specificity | 3-fold CV MCC | 8-fold CV accuracy | 8-fold CV sensitivity | 8-fold CV specificity | 8-fold CV MCC | Jack-Knife accuracy | Jack-Knife sensitivity | Jack-Knife specificity | Jack-Knife MCC | Mean MCC | MCC range |
| 1 | 3 | 85.25% | 54.39% | 94.21% | 0.5438 | 86.01% | 55.98% | 94.72% | 0.5679 | 86.33% | 57.42% | 94.72% | 0.5795 | 0.5637 | 0.0357 |
| 1 | 4 | 85.97% | 56.46% | 94.54% | 0.5677 | 86.29% | 56.94% | 94.81% | 0.5776 | 86.44% | 58.85% | 94.44% | 0.5851 | 0.5768 | 0.0174 |
| 1 | 5 | 85.86% | 58.53% | 93.80% | 0.569 | 86.11% | 58.53% | 94.12% | 0.5757 | 86.22% | 58.37% | 94.31% | 0.5783 | 0.5743 | 0.0093 |
| 1 | 7 | 86.15% | 60.61% | 93.56% | 0.5811 | 86.19% | 60.61% | 93.61% | 0.5820 | 86.44% | 61.72% | 93.61% | 0.5909 | 0.5847 | 0.0098 |
| 1 | 8 | 86.01% | 60.29% | 93.47% | 0.5766 | 86.08% | 60.13% | 93.61% | 0.5781 | 86.01% | 62.20% | 92.92% | 0.5811 | 0.5786 | 0.0045 |
| 2 | 3 | 84.39% | 49.92% | 94.40% | 0.5104 | 85.15% | 53.91% | 94.21% | 0.5399 | 83.85% | 50.72% | 93.47% | 0.4975 | 0.5159 | 0.0424 |
| 2 | 4 | 85.11% | 53.59% | 94.26% | 0.5382 | 85.58% | 55.18% | 94.40% | 0.5543 | 86.11% | 57.89% | 94.31% | 0.5745 | 0.5557 | 0.0363 |
| 2 | 5 | 85.97% | 57.10% | 94.35% | 0.569 | 85.61% | 57.26% | 93.84% | 0.5596 | 85.36% | 57.42% | 93.47% | 0.5534 | 0.5607 | 0.0156 |
| 2 | 6 | 86.04% | 57.42% | 94.35% | 0.5716 | 86.11% | 59.01% | 93.98% | 0.5767 | 85.79% | 59.33% | 93.47% | 0.5689 | 0.5724 | 0.0078 |
| 2 | 7 | 85.36% | 58.53% | 93.15% | 0.556 | 85.65% | 57.89% | 93.70% | 0.5620 | 86.65% | 58.85% | 94.72% | 0.591 | 0.5697 | 0.0350 |
| 2 | 8 | 85.07% | 57.26% | 93.15% | 0.5456 | 85.43% | 57.10% | 93.66% | 0.5545 | 86.76% | 58.85% | 94.86% | 0.594 | 0.5647 | 0.0484 |
| 3 | 3 | 84.46% | 56.62% | 92.55% | 0.5285 | 85.15% | 56.14% | 93.56% | 0.5448 | 86.44% | 59.81% | 94.17% | 0.5869 | 0.5534 | 0.0584 |
| 3 | 4 | 83.82% | 59.97% | 90.74% | 0.5229 | 84.03% | 58.37% | 91.48% | 0.5231 | 82.24% | 52.15% | 90.97% | 0.4616 | 0.5025 | 0.0615 |
| 3 | 5 | 82.99% | 59.81% | 89.72% | 0.5041 | 83.24% | 59.97% | 90.00% | 0.5100 | 80.52% | 53.11% | 88.47% | 0.4272 | 0.4804 | 0.0828 |
| 3 | 6 | 83.10% | 62.52% | 89.07% | 0.5156 | 83.75% | 63.48% | 89.63% | 0.5326 | 82.13% | 60.77% | 88.33% | 0.4893 | 0.5125 | 0.0433 |
| 3 | 7 | 81.45% | 61.72% | 87.18% | 0.4793 | 83.21% | 63.48% | 88.94% | 0.5212 | 83.53% | 63.16% | 89.44% | 0.5269 | 0.5091 | 0.0476 |
| 3 | 8 | 80.95% | 61.88% | 86.48% | 0.4702 | 82.10% | 63.16% | 87.59% | 0.4974 | 82.45% | 63.16% | 88.06% | 0.5046 | 0.4907 | 0.0344 |
1. Default parameters of the WEKA software package were used.
2. The ridge value was set to 10E-8.
3. A polynomial kernel was used, with the exponent set to 1 and the complexity value C set to 0.1.
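The four values reported for each validation scheme in the table above are read here as accuracy, sensitivity, specificity and Matthews correlation coefficient (MCC). That reading is an assumption, but it is consistent with the numbers: with the 209 sites and 720 non-sites of the old data set, accuracy should equal the class-weighted average of sensitivity and specificity. A minimal Python sketch of the standard definitions follows; the function names and the toy counts are illustrative and do not come from the paper.

```python
import math

def confusion_metrics(tp, fn, tn, fp):
    """Standard accuracy, sensitivity, specificity and MCC from confusion counts."""
    ac = (tp + tn) / (tp + fn + tn + fp)
    sn = tp / (tp + fn)
    sp = tn / (tn + fp)
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0
    return ac, sn, sp, mcc

def accuracy_from_rates(sn, sp, n_pos, n_neg):
    """Accuracy as the class-weighted average of sensitivity and specificity."""
    return (sn * n_pos + sp * n_neg) / (n_pos + n_neg)

# Consistency check on the first row of the old-data table:
# second and third values 54.39% and 94.21%, 209 sites, 720 non-sites.
ac_check = accuracy_from_rates(0.5439, 0.9421, 209, 720)
print(f"{ac_check:.2%}")  # matches the first value reported in that row

# Toy counts (hypothetical) to exercise the definitions.
toy = confusion_metrics(tp=50, fn=50, tn=90, fp=10)
```

The check recovers the accuracy printed in the table from the other two rates, which supports the assumed column order but does not prove it.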
Comparison of the prediction performance for three machine learning algorithms on new data set.
| Algorithm (see notes) | Window length | 3-fold CV accuracy | 3-fold CV sensitivity | 3-fold CV specificity | 3-fold CV MCC | 8-fold CV accuracy | 8-fold CV sensitivity | 8-fold CV specificity | 8-fold CV MCC | Jack-Knife accuracy | Jack-Knife sensitivity | Jack-Knife specificity | Jack-Knife MCC | Mean MCC | MCC range |
| 1 | 3 | 84.64% | 43.95% | 94.85% | 0.4629 | 85.08% | 44.08% | 95.36% | 0.4767 | 85.27% | 45.31% | 95.29% | 0.4857 | 0.4751 | 0.0228 |
| 1 | 4 | 85.43% | 49.52% | 94.44% | 0.5017 | 85.62% | 51.29% | 94.23% | 0.512 | 85.76% | 51.43% | 94.37% | 0.5162 | 0.51 | 0.0145 |
| 1 | 5 | 85.49% | 51.97% | 93.89% | 0.5101 | 86.17% | 53.74% | 94.30% | 0.5339 | 86.58% | 54.69% | 94.58% | 0.5479 | 0.5306 | 0.0378 |
| 1 | 7 | 85.65% | 54.42% | 93.48% | 0.5216 | 86.52% | 56.87% | 93.96% | 0.5519 | 86.58% | 57.14% | 93.96% | 0.5541 | 0.5425 | 0.0325 |
| 1 | 8 | 85.79% | 55.37% | 93.42% | 0.528 | 86.20% | 57.55% | 93.38% | 0.545 | 86.25% | 56.73% | 93.65% | 0.5442 | 0.5391 | 0.0170 |
| 2 | 3 | 84.12% | 45.17% | 93.89% | 0.4516 | 85.05% | 45.44% | 94.98% | 0.4794 | 84.62% | 46.12% | 94.27% | 0.4684 | 0.4665 | 0.0278 |
| 2 | 4 | 84.83% | 46.39% | 94.47% | 0.4755 | 85.57% | 49.12% | 94.71% | 0.5046 | 85.68% | 49.80% | 94.68% | 0.5095 | 0.4965 | 0.0340 |
| 2 | 5 | 85.32% | 48.98% | 94.44% | 0.4971 | 85.35% | 48.57% | 94.58% | 0.4967 | 86.91% | 54.29% | 95.09% | 0.5565 | 0.5168 | 0.0598 |
| 2 | 6 | 85.38% | 51.29% | 93.93% | 0.5051 | 85.76% | 50.61% | 94.58% | 0.514 | 85.92% | 50.20% | 94.88% | 0.5178 | 0.5123 | 0.0127 |
| 2 | 7 | 85.08% | 51.16% | 93.59% | 0.4965 | 86.33% | 53.47% | 94.58% | 0.5379 | 86.99% | 53.88% | 95.29% | 0.558 | 0.5308 | 0.0615 |
| 2 | 8 | 85.43% | 50.48% | 94.20% | 0.5043 | 86.42% | 54.01% | 94.54% | 0.5416 | 87.15% | 55.51% | 95.09% | 0.5664 | 0.5374 | 0.0621 |
| 3 | 3 | 84.72% | 48.44% | 93.82% | 0.4785 | 85.73% | 48.57% | 95.05% | 0.5081 | 85.35% | 47.35% | 94.88% | 0.4935 | 0.4934 | 0.0296 |
| 3 | 4 | 82.84% | 49.66% | 91.16% | 0.4349 | 84.89% | 51.16% | 93.35% | 0.4914 | 84.78% | 52.24% | 92.94% | 0.4919 | 0.4727 | 0.0570 |
| 3 | 5 | 83.17% | 52.65% | 90.82% | 0.4541 | 85.32% | 55.24% | 92.87% | 0.5155 | 86.82% | 60.41% | 93.45% | 0.5694 | 0.513 | 0.1153 |
| 3 | 6 | 82.87% | 52.79% | 90.41% | 0.4478 | 85.30% | 57.55% | 92.26% | 0.5221 | 84.37% | 57.55% | 91.10% | 0.4999 | 0.4899 | 0.0743 |
| 3 | 7 | 81.53% | 53.33% | 88.60% | 0.4213 | 84.70% | 58.37% | 91.30% | 0.5104 | 88.46% | 64.49% | 94.47% | 0.6234 | 0.5184 | 0.2021 |
| 3 | 8 | 81.01% | 53.33% | 87.96% | 0.4108 | 84.48% | 60.54% | 90.48% | 0.5132 | 85.52% | 61.22% | 91.61% | 0.5393 | 0.4878 | 0.1285 |
1. Default parameters of the WEKA software package were used.
2. The ridge value was set to 10E-8.
3. A polynomial kernel was used, with the exponent set to 1 and the complexity value C set to 0.1.
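The last two numeric columns of each row in the table above appear to be the mean and the range (maximum minus minimum) of the three MCC values in that row. This is an inference from the numbers rather than something the record states. A short Python sketch, with a hypothetical helper name, reproduces them for two rows of the new data set.

```python
def mcc_summary(mccs):
    """Mean and range (max - min) of a list of MCC values, rounded to 4 places."""
    mean = sum(mccs) / len(mccs)
    spread = max(mccs) - min(mccs)
    return round(mean, 4), round(spread, 4)

# First row of the new-data table: MCC values 0.4629, 0.4767, 0.4857.
first_row = mcc_summary([0.4629, 0.4767, 0.4857])

# A row with a large spread: MCC values 0.4213, 0.5104, 0.6234.
wide_row = mcc_summary([0.4213, 0.5104, 0.6234])

print(first_row, wide_row)
```

A small range suggests that the MCC is stable across validation schemes, so the two summary columns can be read together when comparing rows.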
Figure 1. Sequence logos of palmitoylation sites. Both logos show a leucine/cysteine-rich region around the palmitoylation sites. A taller letter indicates a residue that occurs more frequently at that position. (a) Old data set; (b) new data set.
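The idea behind such a logo can be sketched in Python: count how often each residue occurs at each aligned position, then scale by the information content of that position. The peptides below are hypothetical and only illustrate the calculation. The formula is the common uncorrected one for a 20-letter alphabet and may differ from what the logo software behind the figure uses.

```python
import math
from collections import Counter

def column_frequencies(peptides):
    """Per-position residue frequencies for equal-length aligned peptides."""
    n = len(peptides)
    length = len(peptides[0])
    return [
        {res: count / n for res, count in Counter(p[i] for p in peptides).items()}
        for i in range(length)
    ]

def information_content(freqs, alphabet_size=20):
    """Bits of information at one position: log2(alphabet size) minus entropy."""
    entropy = -sum(f * math.log2(f) for f in freqs.values())
    return math.log2(alphabet_size) - entropy

# Hypothetical aligned peptides with a conserved cysteine at index 1.
peptides = ["LCCLA", "GCCLS", "LCCKA", "MCCLA"]
freqs = column_frequencies(peptides)
bits = [information_content(f) for f in freqs]

# Letter height in a logo is frequency times the information of the column.
height_of_c = freqs[1]["C"] * bits[1]
```

A fully conserved column reaches the maximum of log2(20) bits, so its single letter is drawn at full height; a mixed column yields shorter letters.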
Comparison of prediction performances between NBA-Palm and CSS-Palm.
| Method | Cut-off | Accuracy | Sensitivity | Specificity | MCC | Cut-off | Accuracy | Sensitivity | Specificity | MCC |
| CSS-Palm | 4 | 43.34% | 40.84% | 100.00% | 0.1667 | 4 | 87.00% | 50.65% | 97.62% | 0.5946 |
| CSS-Palm | 2.6 | 68.75% | 67.78% | 88.89% | 0.2398 | 2.6 | 82.94% | 82.16% | 83.17% | 0.5877 |
| CSS-Palm | 1.5 | 88.70% | 90.66% | 44.44% | 0.2247 | 1.5 | 56.67% | 97.12% | 44.86% | 0.3672 |
| NBA-Palm* | 0.869 | 86.61% | 40.70% | 100.00% | 0.5891 | 0.745 | 86.66% | 50.24% | 97.23% | 0.5809 |
| NBA-Palm* | 0.359 | 85.78% | 67.90% | 91.00% | 0.5916 | 0.406 | 86.67% | 67.46% | 92.25% | 0.6102 |
| NBA-Palm* | 0.011 | 60.78% | 90.90% | 52.00% | 0.3630 | 0.016 | 56.41% | 94.73% | 45.28% | 0.3475 |
* Cut-off values of NBA-Palm were chosen for convenience of performance comparison to CSS-Palm.
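One plausible way to choose a cut-off "for convenience of comparison" is to pick the score threshold whose sensitivity comes closest to that of the other predictor and then compare specificities. The Python sketch below illustrates that idea only; it is not the authors' documented procedure, and the scores, labels and function names are hypothetical.

```python
def rates_at_cutoff(scores, labels, cutoff):
    """Sensitivity and specificity when scores >= cutoff are called positive."""
    tp = sum(1 for s, y in zip(scores, labels) if y == 1 and s >= cutoff)
    fn = sum(1 for s, y in zip(scores, labels) if y == 1 and s < cutoff)
    tn = sum(1 for s, y in zip(scores, labels) if y == 0 and s < cutoff)
    fp = sum(1 for s, y in zip(scores, labels) if y == 0 and s >= cutoff)
    return tp / (tp + fn), tn / (tn + fp)

def cutoff_matching_sensitivity(scores, labels, target_sn):
    """Cut-off whose sensitivity is closest to target_sn; ties favour specificity."""
    def key(cutoff):
        sn, sp = rates_at_cutoff(scores, labels, cutoff)
        return (abs(sn - target_sn), -sp)
    return min(sorted(set(scores)), key=key)

# Hypothetical classifier scores (1 = palmitoylation site, 0 = non-site).
scores = [0.95, 0.80, 0.60, 0.40, 0.30, 0.20, 0.10, 0.05]
labels = [1, 1, 0, 1, 0, 1, 0, 0]

chosen = cutoff_matching_sensitivity(scores, labels, target_sn=0.5)
sn, sp = rates_at_cutoff(scores, labels, chosen)
print(chosen, sn, sp)
```

Matching on sensitivity fixes one axis of the comparison, so any difference in specificity between two predictors can be attributed to the predictors rather than to the threshold.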
Figure 2. ROC curves for potential palmitoylated peptides with a window length of six. "3-fold CV" stands for 3-fold cross-validation, "8-fold CV" for 8-fold cross-validation, and "Jack-Knife" for jack-knife validation. "AUC" stands for the area under the curve. (a) ROC curves on the old data set; (b) ROC curves on the new data set.
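An ROC curve and its AUC can be computed from ranked prediction scores as in the Python sketch below. The scores and labels are hypothetical, and the trapezoid rule used here is a common choice, not necessarily the one used to produce the figure.

```python
def roc_points(scores, labels):
    """(false positive rate, true positive rate) pairs, lowering the cut-off stepwise."""
    n_pos = sum(labels)
    n_neg = len(labels) - n_pos
    ranked = sorted(zip(scores, labels), key=lambda pair: -pair[0])
    points = [(0.0, 0.0)]
    tp = fp = 0
    i = 0
    while i < len(ranked):
        j = i
        # Consume all examples that share the same score before emitting a point.
        while j < len(ranked) and ranked[j][0] == ranked[i][0]:
            if ranked[j][1] == 1:
                tp += 1
            else:
                fp += 1
            j += 1
        points.append((fp / n_neg, tp / n_pos))
        i = j
    return points

def auc(points):
    """Area under the curve by the trapezoid rule."""
    return sum((x1 - x0) * (y0 + y1) / 2
               for (x0, y0), (x1, y1) in zip(points, points[1:]))

# Hypothetical scores (1 = palmitoylation site, 0 = non-site).
scores = [0.95, 0.80, 0.60, 0.40, 0.30, 0.20, 0.10, 0.05]
labels = [1, 1, 0, 1, 0, 1, 0, 0]

area = auc(roc_points(scores, labels))
print(area)
```

The AUC equals the fraction of site/non-site pairs in which the site receives the higher score, which is why it summarizes ranking quality independently of any single cut-off.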