| Literature DB >> 32657386 |
Gopalakrishnan Venkatesh1, Aayush Grover1, G Srinivasaraghavan1, Shrisha Rao1.
Abstract
MOTIVATION: Accurate prediction of binding between a major histocompatibility complex (MHC) allele and a peptide plays a major role in the synthesis of personalized cancer vaccines. The immune system struggles to distinguish between a cancerous and a healthy cell. In a patient suffering from cancer who has a particular MHC allele, only those peptides that bind with the MHC allele with high affinity, help the immune system recognize the cancerous cells.Entities:
Mesh:
Substances:
Year: 2020 PMID: 32657386 PMCID: PMC7355292 DOI: 10.1093/bioinformatics/btaa479
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.Steps to synthesize personalized cancer vaccine
Fig. 2.An overview of the MHCAttnNet architecture
The 5-fold cross-validation performance of MHCAttnNet with different hyper-parameters on class I MHC alleles
| Bi-LSTM | Number of layers | Number of layers | Context | AUC-PRC | AUC-ROC |
|
|---|---|---|---|---|---|---|
| hidden dimension | in peptide Bi-LSTM | in MHC Bi-LSTM | dimension | score | ||
| 64 | 3 | 3 | 16 |
|
|
|
| 64 | 3 | 1 | 16 | 0.9418 | 0.8893 | 0.9416 |
| 64 | 3 | 3 | 32 | 0.9409 | 0.8874 | 0.9398 |
| 64 | 3 | 1 | 32 | 0.9416 | 0.8887 | 0.9404 |
Note: The best performance is indicated in bold. The hyper-parameter setting, corresponding to this, was used to run all the experiments.
Comparison of performance of MHCAttnNet with the state-of-the-art methods for class I MHC alleles
| Model | AUC- | AUC- | PCC | PPV |
| Sensitivity |
|---|---|---|---|---|---|---|
| ROC | PRC | score | ||||
| NetMHC4.0 | 0.8237 | 0.9048 | 0.5738 | 0.9486 | 0.8531 | 0.7757 |
| MHCflurry | 0.8234 | 0.9074 | 0.5624 |
| 0.8230 | 0.7149 |
| PUFFIN | 0.8185 | 0.9129 | 0.5398 | 0.9668 | 0.8264 | 0.7215 |
| MHCAttnNet (no attention) | 0.8822 | 0.9380 | 0.7463 | 0.9643 | 0.9402 | 0.9303 |
| MHCAttnNet |
|
|
| 0.9683 |
|
|
Note: The best performances are indicated in bold.
Comparison of performance of MHCAttnNet with the state-of-the-art methods for class II MHC alleles
| Model | Accuracy | AUC-ROC | PCC | Sensitivity | PPV |
|
|---|---|---|---|---|---|---|
| NetMHCIIpan | 0.6822 | 0.6751 | 0.4297 | 0.3888 | 0.9039 | 0.5435 |
| PUFFIN |
|
|
| 0.6853 | 0.8644 | 0.7645 |
| MHCAttnNet (no attention) | 0.7503 | 0.7530 | 0.5032 | 0.7275 | 0.8732 | 0.7636 |
| MHCAttnNet | 0.7549 | 0.7579 | 0.5130 |
|
|
|
Note: The best performances are indicated in bold.
Fig. 3.Attention weights (2018) are highlighted on class I MHC allele (HLA-A*02:01) and peptide. respectively
Fig. 4.Attention weights (2018) are highlighted on class II MHC allele (HLA-DRB1*04:01) and peptide, respectively
Fig. 5.Number of trigrams needed to collectively convey 100% of information. The average number of trigrams needed to convey the complete information is 258 and is indicated with the dotted line
Fig. 6.Relevance of particular trigrams of MHC alleles for some amino acids in peptides