| Literature DB >> 35571059 |
Xiujin Wu1, Wenhua Zeng1, Fan Lin1,2, Peng Xu3, Xinzhu Li1,2,3.
Abstract
Background: Modern lifestyles mean that people are more likely to suffer from some form of cancer. As anticancer peptides can effectively kill cancer cells and play an important role in fighting cancer, they have been a subject of increasing research interest.Entities:
Keywords: anticancer peptide; attention mechanism; classfication; multi-CNN; prediction
Year: 2022 PMID: 35571059 PMCID: PMC9092594 DOI: 10.3389/fgene.2022.887894
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.772
Summary of datasets.
| Datasets | Dataset Type | Total Number | Number of Positive Samples | Number of Negative Samples |
|---|---|---|---|---|
| ACPs500 | Training set | 500 | 250 | 250 |
| ACPs164 | Test set | 164 | 82 | 82 |
| NPs1400 | Training set | 1400 | 700 | 700 |
| NPs350 | Test set | 350 | 175 | 175 |
| AFPs2336 | Training set | 2336 | 1168 | 1168 |
| AFPs582 | Test set | 582 | 291 | 291 |
FIGURE 1The framework of the proposed ACP-MCAM. (A) Embedding layer. (B) Multi-kernel CNN layer. (C) Position embedding layer. (D) Encoding layer. (E) Task-output layer.
FIGURE 2Multi-kernel CNN to extract anticancer peptide sequence features.
Cross validation results of ACP-MCAM and existing models.
| Methods | SE (%) | SP (%) | Accuracy (%) | MCC (%) | AUC (%) |
|---|---|---|---|---|---|
| iACP | 57.2 | 84.0 | 70.6 | 42.8 | 80.9 |
| ACPred-FL | 71.6 | 84.4 | 78.0 | 56.5 | 84.6 |
| PEPred-Suite | 72.8 | 88.0 | 80.4 | 61.5 | 86.0 |
| ACPred-Fuse | 77.2 | 87.6 | 82.4 | 65.2 | 88.2 |
| AntiCP_ACC | 66.8 | 78.4 | 72.6 | 45.5 | 82.4 |
| AntiCP_DC | 71.6 | 77.6 | 74.6 | 49.3 | 82.5 |
| Hajisharifi’s | 67.2 | 83.6 | 75.4 | 51.5 | 83.1 |
| ACP-MCAM |
|
|
|
|
|
Note: The best results are marked in bold and the second best results are underlined.
Independent test results of ACP-MCAM and existing models.
| Methods | SE (%) | SP (%) | Accuracy (%) | MCC (%) | AUC (%) |
|---|---|---|---|---|---|
| iACP | 54.9 | 88.8 | 87.7 | 22.6 | 76.1 |
| ACPred-FL | 69.5 | 85.8 | 85.3 | 25.9 | 85.1 |
| PEPred-Suite | 68.3 | 90.6 | 89.9 | 32.0 | 86.1 |
| ACPred-Fuse | 72 | 89.5 | 89 | 32.0 | 86.8 |
| AntiCP_ACC | 68.3 | 88.5 | 87.9 | 28.8 | 85.3 |
| AntiCP_DC | 68.3 | 82.6 | 82.2 | 22.3 | 83.0 |
| Hajisharifi’s | 69.5 | 88.4 | 87.9 | 29.2 | 85.5 |
| ACP-MCAM |
|
|
|
|
|
Note: The best results are marked in bold and the second best results are underlined.
The performance of the ACP-MCAM model affected by the learning rate.
| Learning Rate | Accuracy | Precision | Recall | F1-Score | AUC |
|---|---|---|---|---|---|
| 1e-4 | 0.8292 | 0.8 | 0.878 | 0.8372 | 0.9225 |
| 2e-4 | 0.9085 | 0.9589 | 0.8536 | 0.9032 | 0.9479 |
| 3e-4 | 0.8841 | 0.8888 | 0.878 | 0.8834 | 0.9341 |
| 4e-4 | 0.8902 | 0.9 | 0.878 | 0.8888 | 0.9388 |
| 5e-4 | 0.8841 | 0.8705 | 0.9024 | 0.8862 | 0.9375 |
| 6e-4 | 0.8902 | 0.8902 | 0.8902 | 0.8902 | 0.9298 |
| 7e-4 | 0.9085 | 0.9135 | 0.9024 | 0.9079 | 0.9301 |
| 8e-4 | 0.8902 | 0.9102 | 0.8658 | 0.8875 | 0.9144 |
| 9e-4 | 0.8841 | 0.8987 | 0.8658 | 0.8819 | 0.9207 |
| 1e-3 | 0.8658 | 0.8571 | 0.878 | 0.8674 | 0.9162 |
Note: The best results are highlighted in bold.
The performance of the ACP-MCAM model affected by kernel combination.
| Kernel | Accuracy | Precision | Recall | F1-Score | AUC |
|---|---|---|---|---|---|
| 1 | 0.8536 | 0.8372 | 0.878 | 0.8571 | 0.9177 |
| 3 | 0.8475 | 0.8275 | 0.878 | 0.852 | 0.932 |
| 5 | 0.8353 | 0.8021 | 0.8902 | 0.8439 | 0.9143 |
| 7 | 0.8475 | 0.8131 | 0.9024 | 0.8554 | 0.9158 |
| 1 + 3 | 0.8658 | 0.8488 | 0.8902 | 0.869 | 0.9439 |
| 1 + 5 | 0.8597 | 0.8831 | 0.8292 | 0.8553 | 0.9244 |
| 1 + 7 | 0.8109 | 0.8591 | 0.7439 | 0.7973 | 0.8856 |
| 3 + 5 | 0.8536 | 0.8536 | 0.8536 | 0.8536 | 0.9118 |
| 3 + 7 | 0.7987 | 0.7752 | 0.8414 | 0.807 | 0.9015 |
| 5 + 7 | 0.817 | 0.8095 | 0.8292 | 0.8192 | 0.9028 |
| 1 + 3+5 |
|
|
|
|
|
| 1 + 3+7 | 0.8597 | 0.8831 | 0.8292 | 0.8553 | 0.9074 |
| 1 + 5+7 | 0.8353 | 0.8666 | 0.7926 | 0.828 | 0.9131 |
| 3 + 5+7 | 0.8719 | 0.8765 | 0.8658 | 0.8711 | 0.9434 |
| 1 + 3+5 + 7 | 0.8719 | 0.8961 | 0.8414 | 0.8679 | 0.9158 |
Note: The best results are highlighted in bold.
The performance of three different models on three different peptide datasets.
| Model | Accuracy | Precision | Recall | F1 | AUC |
|---|---|---|---|---|---|
| Embedding + ACP | 0.8719 | 0.9178 | 0.817 | 0.8645 | 0.8719 |
| Embedding_cnn + ACP | 0.8658 | 0.8947 | 0.8292 | 0.8607 | 0.9321 |
| Embedding_multicnn + ACP | 0.9085 | 0.9589 | 0.8536 | 0.9032 | 0.9479 |
| Embedding + NPs | 0.8343 | 0.8197 | 0.8571 | 0.8380 | 0.8840 |
| Embedding_cnn + NPs | 0.8229 | 0.8192 | 0.8286 | 0.8239 | 0.8894 |
| Embedding_multicnn + NPs | 0.8400 | 0.8479 | 0.8285 | 0.8381 | 0.9063 |
| Embedding + AFPs | 0.8625 | 0.8371 | 0.9003 | 0.8675 | 0.9161 |
| Embedding_cnn + AFPs | 0.9038 | 0.8803 | 0.9347 | 0.9067 | 0.9580 |
| Embedding_multicnn + AFPs | 0.8762 | 0.8119 | 0.9793 | 0.8878 | 0.9677 |
Note: The best results of different dataset are highlighted in bold.
FIGURE 3Performance comparison of ACP-MCAM and existing methods. The left figure is the ROC curves of different models on the ACP dataset. The right figure is the PR curve of different models on the ACP dataset.
FIGURE 4Dimension reduction of each samples on ACP500 and ACP164 dataset by TSNE and PCA.