| Literature DB >> 29914358 |
Bin Yu1,2,3, Shan Li4,5, Wenying Qiu4,5, Minghui Wang4,5, Junwei Du6, Yusen Zhang7, Xing Chen8.
Abstract
BACKGROUND: Apoptosis is associated with some human diseases, including cancer, autoimmune disease, neurodegenerative disease and ischemic damage, etc. Apoptosis proteins subcellular localization information is very important for understanding the mechanism of programmed cell death and the development of drugs. Therefore, the prediction of subcellular localization of apoptosis protein is still a challenging task.Entities:
Keywords: Apoptosis proteins; Detrended cross-correlation analysis coefficient; Local fisher discriminant analysis; Pseudo-position specific scoring matrix; Subcellular localization; Support vector machine
Mesh:
Substances:
Year: 2018 PMID: 29914358 PMCID: PMC6006758 DOI: 10.1186/s12864-018-4849-9
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Flowchart of PsePSSM-DCCA-LFDA prediction method
Prediction results of selecting different ξ on CL317 by jackknife test
|
| |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Locations | Jackknife test (%) | ||||||||||
| 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | |
| Cy | 88.4 | 90.2 | 90.2 | 92.9 | 92.0 | 92.9 | 92.9 | 92.9 | 92.9 | 92.9 | 92.9 |
| Me | 76.4 | 83.6 | 81.8 | 83.6 | 83.6 | 81.8 | 83.6 | 83.6 | 87.3 | 87.3 | 87.3 |
| Mi | 38.2 | 55.9 | 58.8 | 70.6 | 64.7 | 64.7 | 64.7 | 64.7 | 64.7 | 64.7 | 67.7 |
| Se | 47.1 | 76.5 | 76.5 | 76.5 | 76.5 | 76.5 | 76.5 | 76.5 | 76.5 | 76.5 | 76.5 |
| Nu | 51.9 | 57.7 | 63.5 | 73.1 | 76.9 | 73.1 | 73.1 | 71.2 | 71.2 | 71.2 | 73.1 |
| En | 85.1 | 85.1 | 85.1 | 85.1 | 85.1 | 85.1 | 85.1 | 85.1 | 85.1 | 85.1 | 85.1 |
| OA | 72.2 | 78.5 | 79.5 | 83.6 | 83.3 | 82.6 | 83.0 | 82.6 | 83.3 | 83.3 | 83.9 |
Prediction results of selecting different ξ on ZW225 by jackknife test
|
| |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Locations | Jackknife test (%) | ||||||||||
| 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | |
| Cy | 74.3 | 81.4 | 80.0 | 81.4 | 81.4 | 80.0 | 81.4 | 80.0 | 80.0 | 80.0 | 80.0 |
| Me | 93.3 | 91.0 | 87.6 | 88.8 | 86.5 | 85.4 | 85.4 | 85.4 | 85.4 | 85.4 | 85.4 |
| Mi | 16.0 | 32.0 | 24.0 | 40.0 | 44.0 | 44.0 | 44.0 | 40.0 | 36.0 | 36.0 | 36.0 |
| Nu | 48.8 | 63.4 | 61.0 | 68.3 | 68.3 | 75.6 | 75.6 | 75.6 | 75.6 | 75.6 | 75.6 |
| OA | 70.7 | 76.4 | 73.3 | 77.3 | 76.9 | 77.3 | 77.8 | 76.9 | 76.4 | 76.4 | 76.4 |
Fig. 2Effect of selecting different values of ξ on CL317 and ZW225 datasets by jackknife test
Prediction results of selecting different S on CL317 by jackknife test
|
| ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Locations | Jackknife test (%) | |||||||||
| 5 | 10 | 15 | 20 | 25 | 30 | 35 | 40 | 45 | 49 | |
| Cy | 96.4 | 90.2 | 92.0 | 90.2 | 95.5 | 97.3 | 97.3 | 95.5 | 95.5 | 96.4 |
| Me | 83.0 | 83.0 | 83.0 | 83.0 | 83.0 | 83.0 | 83.0 | 83.0 | 83.0 | 83.0 |
| Mi | 80.0 | 87.3 | 83.6 | 87.3 | 83.6 | 87.3 | 89.1 | 90.9 | 92.7 | 92.7 |
| Se | 44.1 | 52.9 | 82.4 | 79.4 | 88.2 | 85.3 | 85.3 | 82.4 | 82.4 | 82.4 |
| Nu | 46.2 | 63.5 | 63.5 | 65.4 | 61.5 | 61.5 | 63.5 | 65.4 | 65.4 | 67.3 |
| En | 64.7 | 52.9 | 52.9 | 52.9 | 70.6 | 70.6 | 76.5 | 76.5 | 64.7 | 58.8 |
| OA | 76.0 | 78.2 | 81.4 | 81.4 | 83.9 | 84.9 | 85.8 | 85.5 | 85.2 | 85.5 |
Prediction results of selecting different S on ZW225 by jackknife test
|
| ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Locations | Jackknife test (%) | |||||||||
| 5 | 10 | 15 | 20 | 25 | 30 | 35 | 40 | 45 | 49 | |
| Cy | 85.7 | 81.4 | 80.0 | 85.7 | 84.3 | 85.7 | 84.3 | 85.7 | 85.7 | 87.1 |
| Me | 84.3 | 86.5 | 86.5 | 91.0 | 91.0 | 89.9 | 89.9 | 91.0 | 89.9 | 89.9 |
| Mi | 28.0 | 36.0 | 48.0 | 52.0 | 56.0 | 56.0 | 56.0 | 56.0 | 56.0 | 56.0 |
| Nu | 48.8 | 68.3 | 70.7 | 61.0 | 58.5 | 58.5 | 70.7 | 75.6 | 75.6 | 73.2 |
| OA | 72.0 | 76.0 | 77.3 | 79.6 | 79.1 | 79.1 | 80.9 | 82.7 | 82.2 | 82.2 |
Fig. 3Effect of selecting different values of S on CL317 and ZW225 datasets by jackknife test
Prediction results of subcellular localization of the CL317 dataset by selecting different dimensionality reduction methods and different dimensions
| Dimensions | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Algorithms | Jackknife test (%) | |||||||||
| 10 | 20 | 30 | 40 | 50 | 60 | 70 | 80 | 90 | 100 | |
| PCA | 79.5 | 84.9 | 89.0 | 89.3 | 87.7 | 86.1 | 85.8 | 83.9 | 82.6 | 80.4 |
| Laplacian | 63.4 | 73.8 | 79.2 | 82.6 | 84.9 | 84.5 | 82.3 | 80.8 | 78.5 | 74.4 |
| AKPCA | 78.2 | 84.5 | 87.7 | 89.0 | 89.6 | 88.6 | 87.4 | 88.3 | 86.8 | 86.4 |
| LFDA | 99.7 | 98.7 | 98.7 | 98.4 | 98.4 | 97.8 | 97.5 | 97.5 | 97.2 | 97.2 |
Prediction results of subcellular localization of the ZW225 dataset by selecting different dimensionality reduction methods and different dimensions
| Dimensions | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Algorithms | Jackknife test (%) | |||||||||
| 10 | 20 | 30 | 40 | 50 | 60 | 70 | 80 | 90 | 100 | |
| PCA | 74.2 | 79.1 | 85.8 | 84.9 | 83.6 | 81.3 | 78.7 | 78.2 | 77.8 | 74.7 |
| Laplacian | 69.3 | 80.0 | 82.2 | 80.9 | 82.2 | 77.8 | 73.8 | 72.0 | 68.0 | 67.1 |
| AKPCA | 74.7 | 82.2 | 84.0 | 86.2 | 84.9 | 83.1 | 80.9 | 80.9 | 77.3 | 78.2 |
| LFDA | 99.6 | 99.6 | 99.6 | 99.6 | 99.6 | 99.6 | 99.6 | 99.6 | 99.1 | 99.1 |
Fig. 4Effects of selecting four different dimensionality reduction methods and different dimensions on the overall prediction results of subcellular localization in CL317 dataset
Fig. 5Effects of selecting four different dimensionality reduction methods and different dimensions on the overall prediction results of subcellular localization in ZW225 dataset
Prediction results of different feature extraction methods on CL317 by jackknife test
| Locations | |||||||
|---|---|---|---|---|---|---|---|
| Algorithms | Jackknife test (%) | ||||||
| Cy | Me | Mi | Se | Nu | En | OA | |
| PsePSSM | 92.9 | 85.1 | 83.6 | 70.6 | 73.1 | 76.5 | 83.6 |
| DCCA | 95.5 | 83.0 | 90.9 | 82.4 | 65.4 | 76.5 | 85.5 |
| PsePSSM-DCCA | 93.8 | 85.1 | 90.9 | 82.4 | 76.9 | 70.6 | 86.8 |
| PsePSSM-DCCA-LFDA | 99.1 | 100 | 100 | 100 | 100 | 100 | 99.7 |
Prediction results of different feature extraction methods on ZW225 by jackknife test
| Locations | |||||
|---|---|---|---|---|---|
| Algorithms | Jackknife test (%) | ||||
| Cy | Me | Mi | Nu | OA | |
| PsePSSM | 81.4 | 88.8 | 40.0 | 68.3 | 77.3 |
| DCCA | 85.7 | 91.0 | 56.0 | 75.6 | 82.7 |
| PsePSSM-DCCA | 88.6 | 88.8 | 56.0 | 87.8 | 84.9 |
| PsePSSM-DCCA-LFDA | 100 | 98.9 | 100 | 100 | 99.6 |
Fig. 6The ROC curves of four different feature extraction methods on dataset CL317
Fig. 7The ROC curves of four different feature extraction methods on dataset ZW225
Prediction performance of datasets CL317 and ZW225 protein subcellular localization on the jackknife test method
| Jackknife test | ||||||
|---|---|---|---|---|---|---|
| Locations | CL317 | ZW225 | ||||
| Sens (%) | Spec (%) | MCC | Sens (%) | Spec (%) | MCC | |
| Cy | 99.1 | 100 | 0.99 | 100 | 99.4 | 0.99 |
| Me | 100 | 100 | 1 | 98.9 | 100 | 0.99 |
| Mi | 100 | 99.6 | 0.99 | 100 | 100 | 1 |
| Se | 100 | 100 | 1 | – | – | – |
| Nu | 100 | 100 | 1 | 100 | 100 | 1 |
| En | 100 | 100 | 1 | – | – | – |
| OA (%) | 99.7 | 99.6 | ||||
Prediction results of different methods on CL317 dataset by jackknife test
| Jackknife test (%) | |||||||
|---|---|---|---|---|---|---|---|
| Methods | Sensitivity for each class (%) | OA (%) | |||||
| Cy | Me | Mi | Se | Nu | En | ||
| ID [ | 81.3 | 81.8 | 85.3 | 88.2 | 82.7 | 83.0 | 82.7 |
| ID_SVM [ | 91.1 | 89.1 | 79.4 | 58.8 | 73.1 | 87.2 | 84.2 |
| DF_SVM [ | 92.9 | 85.5 | 76.5 | 76.5 | 93.6 | 86.5 | 88.0 |
| FKNN [ | 93.8 | 92.7 | 82.4 | 76.5 | 90.4 | 93.6 | 90.9 |
| PseAAC_SVM [ | 93.8 | 90.9 | 85.3 | 76.5 | 90.4 | 95.7 | 91.1 |
| EN_FKNN [ | 98.2 | 83.6 | 79.4 | 82.4 | 90.4 | 97.9 | 91.5 |
| DWT_SVM [ | 100 | 98.2 | 82.4 | 94.1 | 100 | 100 | 97.5 |
| Auto_Cova [ | 86.4 | 90.7 | 93.8 | 85.7 | 92.1 | 93.8 | 90.0 |
| APSLAP [ | 99.1 | 89.1 | 85.3 | 88.2 | 84.3 | 95.8 | 92.4 |
| Liu et al. [ | 98.2 | 96.4 | 94.1 | 82.4 | 96.2 | 95.7 | 95.9 |
| PSSM_AC [ | 93.8 | 90.9 | 91.2 | 82.4 | 86.5 | 95.7 | 91.5 |
| DCCA coefficient [ | 91.1 | 92.7 | 82.4 | 76.5 | 80.8 | 93.6 | 88.3 |
| PsePSSM-DCCA-LFDA | 99.1 | 100 | 100 | 100 | 100 | 100 | 99.7 |
Prediction results of different methods on ZW225 dataset by jackknife test
| Jackknife test (%) | |||||
|---|---|---|---|---|---|
| Methods | Sensitivity for each class (%) | OA (%) | |||
| Cy | Me | Mi | Nu | ||
| EBGW_SVM [ | 90.0 | 93.3 | 60.0 | 63.4 | 83.1 |
| ID_SVM [ | 92.9 | 91.0 | 68.0 | 73.2 | 85.8 |
| DF_SVM [ | 87.1 | 92.1 | 64.0 | 73.2 | 84.0 |
| FKNN [ | 84.3 | 93.3 | 72 | 85.5 | 85.8 |
| EN_FKNN [ | 94.3 | 94.4 | 60.0 | 80.5 | 88.0 |
| DWT_SVM [ | 87.1 | 93.2 | 64 | 90.2 | 87.6 |
| Liu et al. [ | 97.1 | 98.9 | 96.0 | 97.6 | 97.8 |
| PSSM_AC [ | 82.9 | 92.1 | 68.0 | 78.0 | 84.0 |
| Auto_Cova [ | 81.3 | 93.3 | 85.7 | 84.6 | 87.1 |
| PsePSSM-DCCA-LFDA | 100 | 98.9 | 100 | 100 | 99.6 |
Prediction results of different methods on the independent testing dataset ZD98 by jackknife test
| Jackknife test (%) | |||||
|---|---|---|---|---|---|
| Methods | Sensitivity for each class (%) | OA (%) | |||
| Cy | Me | Mi | Other | ||
| ID [ | 90.7 | 90.0 | 92.3 | 91.7 | 90.8 |
| ID_SVM [ | 95.3 | 93.3 | 84.6 | 58.3 | 88.8 |
| DF_SVM [ | 97.7 | 96.7 | 92.3 | 75.0 | 93.9 |
| FKNN [ | 95.3 | 96.7 | 100 | 91.7 | 95.9 |
| PseAAC_SVM [ | 95.3 | 93.3 | 92.3 | 83.3 | 92.9 |
| DWT_SVM [ | 95.4 | 93.3 | 53.9 | 91.7 | 88.8 |
| APSLAP [ | 95.3 | 90.0 | 100 | 91.7 | 94.9 |
| Liu et al. [ | 95.3 | 100 | 100 | 91.7 | 96.9 |
| PSSM_AC [ | 97.7 | 96.7 | 100 | 83.3 | 95.9 |
| EBGW_SVM [ | 97.7 | 90.0 | 92.3 | 83.3 | 92.9 |
| DCCA coefficient [ | 93.0 | 86.7 | 92.3 | 75.0 | 88.9 |
| PsePSSM-DCCA-LFDA | 100 | 100 | 100 | 100 | 100 |