| Literature DB >> 27340658 |
Zhao Li1, Yilei Zhao1, Gaofeng Pan1, Jijun Tang2, Fei Guo1.
Abstract
MHC molecule plays a key role in immunology, and the molecule binding reaction with peptide is an important prerequisite for T cell immunity induced. MHC II molecules do not have conserved residues, so they appear as open grooves. As a consequence, this will increase the difficulty in predicting MHC II molecules binding peptides. In this paper, we aim to propose a novel prediction method for MHC II molecules binding peptides. First, we calculate sequence similarity and structural similarity between different MHC II molecules. Then, we reorder pseudosequences according to descending similarity values and use a weight calculation formula to calculate new pocket profiles. Finally, we use three scoring functions to predict binding cores and evaluate the accuracy of prediction to judge performance of each scoring function. In the experiment, we set a parameter α in the weight formula. By changing α value, we can observe different performances of each scoring function. We compare our method with the best function to some popular prediction methods and ultimately find that our method outperforms them in identifying binding cores of HLA-DR molecules.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27340658 PMCID: PMC4906198 DOI: 10.1155/2016/3832176
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
Details of 39 MHC molecules and peptide binding complexes.
| PDB ID | DRB allele | Peptide sequence |
|---|---|---|
| 1AQD | DRB1 | VGSDWRFLRGYHQYA |
| 1PYW | DRB1 | XFVKQNAAALX |
| 1KLG | DRB1 | GELIGILNAAKVPAD |
| 1KLU | DRB1 | GELIGTLNAAKVPAD |
| 2FSE | DRB1 | AGFKGEQGPKGEPG |
| 1SJH | DRB1 | PEVIPMFSALSEG |
| 1SJE | DRB1 | PEVIPMFSALSEGATP |
| 1T5W | DRB1 | AAYSDQATPLLLSPR |
| 1T5X | DRB1 | AAYSDQATPLLLSPR |
| 2IAN | DRB1 | GELIGTLNAAKVPAD |
| 2IAM | DRB1 | GELIGILNAAKVPAD |
| 2IPK | DRB1 | XPKWVKQNTLKLAT |
| 1FYT | DRB1 | PKYVKQNTLKLAT |
| 1R5I | DRB1 | PKYVKQNTLKLAT |
| 1HXY | DRB1 | PKYVKQNTLKLAT |
| 1JWM | DRB1 | PKYVKQNTLKLAT |
| 1JWS | DRB1 | PKYVKQNTLKLAT |
| 1JWU | DRB1 | PKYVKQNTLKLAT |
| 1LO5 | DRB1 | PKYVKQNTLKLAT |
| 2ICW | DRB1 | PKYVKQNTLKLAT |
| 2OJE | DRB1 | PKYVKQNTLKLAT |
| 2G9H | DRB1 | PKYVKQNTLKLAT |
| 1A6A | DRB1 | PVSKMRMATPLLMQA |
| 1J8H | DRB1 | PKYVKQNTLKLAT |
| 2SEB | DRB1 | AYMRADAAAGGA |
| 1BX2 | DRB1 | ENPVVHFFKNIVTPR |
| 1YMM | DRB1 | ENPVVHFFKNIVTPRGGSGGGGG |
| 1FV1 | DRB5 | NPVVHFFKNIVTPRTPPPSQ |
| 1H15 | DRB5 | GGVYHFVKKHVHES |
| 1ZGL | DRB5 | VHFFKNIVTPRTPGG |
| 4E41 | DRB1 | GELIGILNAAKVPAD |
| 1DLH | DRB1 | PKYVKQNTLKLAT |
| 1KG0 | DRB1 | PKYVKQNTLKLAT |
| 3L6F | DRB1 | APPAYEKLSAEQSPP |
| 3PDO | DRB1 | KPVSKMRMATPLLMQALPM |
| 3PGD | DRB1 | KMRMATPLLMQALPM |
| 3S4S | DRB1 | PKYVKQNTLKLAT |
| 3S5L | DRB1 | PKYVKQNTLKLAT |
| 1HQR | DRB5 | VHFFKNIVTPRTP |
Figure 1The architecture of our approach to MHC II and peptide binding problem.
30 HLA-complexes binding pockets.
| PDB ID | Pocket 1 | Pocket 2 | Pocket 3 | Pocket 4 | Pocket 5 | Pocket 6 | Pocket 7 | Pocket 8 | Pocket 9 |
|---|---|---|---|---|---|---|---|---|---|
| 1AQD | 82N 85V 86G | 77T 78Y 81H 82N | 78Y | 13F 74A 78Y | 13F 71R | 11L | 47Y 61W 67L 70Q 71R | 60Y 61W | 9W 57D 61W |
|
| |||||||||
| 1PYW | 82N 85V 86G 89F | 77T 78Y 81H 82N | 78Y | 13F 70Q 71R 74A 78Y | 13F 71R | 11L | 11L 28E 61W 71R | 60Y 61W | 57D 61W |
|
| |||||||||
| 1KLG | 82N 85V | 78Y 81H 82N | 78Y | 13F 71R 78Y | 13F 71R | 11L | 61W | 60Y 61W | 57D 61W |
|
| |||||||||
| 2FSE | 82N 85V 86G 89F | 77T 78Y 82N | 13F 28E 70Q 71R 74A 78Y | 13F 71R | 71R | 28E 47Y 61W 67L | 61W | 57D | |
|
| |||||||||
| 1KLU | 82N 85V | 78Y 81H 82N | 13F 71R 78Y | 13F 71R | 11L | 61W | 60Y 61W | 57D 61W | |
|
| |||||||||
| 1SJH | 82N | 78Y 81H 82N | 13F 26L 70Q 71R 74A 78Y | 71R | 11L | 61W | 60Y 61W | 57D 61W | |
|
| |||||||||
| 1SJE | 82N | 78Y 81H 82N | 78Y | 13F 26L 70Q 71R 74A 78Y | 71R | 11L | 61W | 60Y 61W | 57D 60Y 61W |
|
| |||||||||
| 1T5W | 82N 86G 89F | 78Y 81H 82N | 78Y | 13F 70Q 71R 74A 78Y | 13F 71R | 11L | 61W 71R | 60Y 61W | 9W 57D 61W |
|
| |||||||||
| 1T5X | 82N 86G 89F | 78Y 81H 82N | 78Y | 13F 70Q 71R 74A 78Y | 71R | 11L | 61W 71R | 61W | 57D 61W |
|
| |||||||||
| 2IAN | 82N 85V | 78Y 81H 82N | 78Y | 13F 70Q 74A 78Y | 13F 70Q 71R | 11L | 61W 71R | 61W | 57D 61W |
|
| |||||||||
| 2IPK | 82N 85V 86G 89F | 77T 78Y 81H 82N | 13F 70Q 71R 74A 78Y | 71R | 11L | 47Y 61W 67L 71R | 60Y 61W | 9W 57D 61W | |
|
| |||||||||
| 1FYT | 82N 85V 86G 89F | 78Y 81H 82N | 78Y | 13F 70Q 71R 74A 78Y | 13F 71R | 11L | 28E 47Y 61W 67L 71R | 60Y 61W | 9W 57D 61W |
|
| |||||||||
| 1R5I | 82N 85V 86G 89F | 77T 78Y 81H 82N | 78Y | 13F 70Q 71R 74A 78Y | 70Q 71R | 11L | 47Y 61W 67L 71R | 61W | 9W 57D 61W |
|
| |||||||||
| 1HXY | 82N 85V 86G 89F | 78Y 81H 82N | 13F 70Q 71R 74A 78Y | 71R | 11L | 28E 47Y 61W 67L 71R | 60Y 61W | 9W 57D 61W | |
|
| |||||||||
| 1JWM | 82N 85V 86G 89F | 78Y 81H 82N | 78Y | 13F 70Q 71R 74A 78Y | 71R | 11L | 28E 47Y 61W 67L 71R | 61W | 57D 61W |
|
| |||||||||
| 1JWS | 82N 85V 86G 89F | 78Y 81H 82N | 78Y | 13F 70Q 71R 74A 78Y | 13F 71R | 11L | 47Y 61W 67L 71R | 61W | 9W 57D 61W |
|
| |||||||||
| 1JWU | 82N 85V 86G 89F | 78Y 81H 82N | 78Y | 13F 70Q 71R 74A 78Y | 13F 71R | 11L | 28E 47Y 61W 67L 71R | 61W | 9W 57D 61W |
|
| |||||||||
| 1LO5 | 82N 85V 86G 89F | 78Y 81H 82N | 78Y | 13F 70Q 78Y | 13F 71R | 11L | 47Y 61W 67L 71R | 61W | 9W 57D 60Y 61W |
|
| |||||||||
| 2ICW | 82N 85V 86G 89F | 78Y 81H 82N | 78Y | 13F 70Q 71R 74A 78Y | 13F 71R | 11L | 28E 47Y 61W 67L 71R | 61W | 9W 57D 61W |
|
| |||||||||
| 2OJE | 82N 85V 86G | 77T 78Y 81H 82N | 78Y | 13F 70Q 71R 74A 78Y | 70Q 71R | 11L | 28E 47Y 61W 67L 71R | 61W | 9W 57D 61W |
|
| |||||||||
| 2G9H | 82N 85V 86G 89F | 77T 78Y 81H 82N | 78Y | 13F 70Q 71R 74A 78Y | 71R | 11L 13F | 28E 47Y 61W 67L 71R | 60Y 61W | 9W 57D 61W |
|
| |||||||||
| 2IAM | 82N | 78Y 81H 82N | 78Y | 13F 70Q 71R 74A 78Y | 70Q 71R | 11L | 61W 67L | 60Y 61W | 57D 61W |
|
| |||||||||
| 1A6A | 82N 85V 86V | 77T 78Y 81H 82N | 78Y | 13S 26Y 74R 78Y | 71K 74R | 11S 30Y | 30Y 47F 61W 67L 71K | 61W | 9E 30Y 57D 61W |
|
| |||||||||
| 1J8H | 82N 85V 86G 89F | 77T 78Y 81H 82N | 78Y | 13H | 13H 70Q 71K | 11V 13H 30Y | 30Y 47Y 61W 67L | 60Y 61W | 37Y 57D 61W |
|
| |||||||||
| 2SEB | 82N | 77T 78Y 81H 82N | 13H | 13H 71K | 30Y | 30Y 47Y 61W | 60Y 61W | 61W | |
|
| |||||||||
| 1BX2 | 82N 85V | 77T 78Y 81H 82N | 78Y | 13H | 70Q | 13R | 57D 60Y 61W | ||
|
| |||||||||
| 1YMM | 82N | 77T 78Y 81H 82N | 78Y | 13R | 70Q | 13R | 61W 67I | 61W | 57D 61W |
|
| |||||||||
| 1FV1 | 82N 85V 86G 89F | 78Y 81H 82N | 78Y | 13Y 71R 78Y | 71R | 13Y | 61W 67L | 61W | 57D |
|
| |||||||||
| 1H15 | 82N 89F | 77T 78Y 81H 82N | 78Y | 13Y 71R 78Y | 71R | 11D 13Y 30D | 61W | 57D 60Y | |
|
| |||||||||
| 1ZGL | 82N 85V 89F | 77T 78Y 81H 82N | 13Y 26F 71R 78Y | 13Y | 13Y 28H 61W 71R | 61W | 57D 60Y 61W | ||
Important positions at the binding core for MHC II molecules.
| Important positions | |
|---|---|
| Pocket 1 | 82 85 86 89 |
| Pocket 2 | 77 78 81 82 |
| Pocket 3 | 78 |
| Pocket 4 | 11 13 26 28 70 71 74 78 |
| Pocket 5 | 11 13 28 70 71 74 |
| Pocket 6 | 11 13 28 70 71 74 |
| Pocket 7 | 11 28 30 47 61 67 70 71 |
| Pocket 8 | 60 61 |
| Pocket 9 | 9 30 37 57 60 61 |
Figure 2Predicted results by different score functions. x-axis represents different α values, and the y-axis refers to predicted results of different score functions.
Comparison of our binding prediction with other approaches. The 5th column is the result of our method, and 6th to 8th columns are results of TEPITOPE, MultiRTA, and NetMHCIIpan. The bold cell means one error.
| PDB ID | Allele | Peptide | Core | Ours | TEPITOPE | MultiRTA | NetMHCIIpan-2.0 |
|---|---|---|---|---|---|---|---|
| 1AQD | DRB1 | VGSDWRFLRGYHQYA | WRFLRGYHQ | WRFLRGYHQ | WRFLRGYHQ | WRFLRGYHQ | WRFLRGYHQ |
| 1PYW | DRB1 | XFVKQNAAALX | FVKQNAAAL | FVKQNAAAL | FVKQNAAAL | FVKQNAAAL | FVKQNAAAL |
| 1KLG | DRB1 | GELIGILNAAKVPAD | IGILNAAKV | IGILNAAKV | IGILNAAKV | IGILNAAKV |
|
| 2FSE | DRB1 | GELIGTLNAAKVPAD | IGTLNAAKV | IGTLNAAKV | IGTLNAAKV | IGTLNAAKV | IGTLNAAKV |
| 1KLU | DRB1 | AGFKGEQGPKGEPG | FKGEQGPKG | FKGEQGPKG | FKGEQGPKG | FKGEQGPKG | FKGEQGPKG |
| 1SJH | DRB1 | PEVIPMFSALSEG | VIPMFSALS | VIPMFSALS | VIPMFSALS | VIPMFSALS | VIPMFSALS |
| 1SJE | DRB1 | PEVIPMFSALSEGATP | VIPMFSALS | VIPMFSALS | VIPMFSALS | VIPMFSALS | VIPMFSALS |
| 1T5W | DRB1 | AAYSDQATPLLLSPR | YSDQATPLL |
| YSDQATPLL |
| YSDQATPLL |
| 1T5X | DRB1 | AAYSDQATPLLLSPR | YSDQATPLL |
| YSDQATPLL |
| YSDQATPLL |
| 2IAN | DRB1 | GELIGTLNAAKVPAD | IGTLNAAKV | IGTLNAAKV | IGTLNAAKV | IGTLNAAKV | IGTLNAAKV |
| 2IPK | DRB1 | GELIGILNAAKVPAD | IGILNAAKV | IGILNAAKV | IGILNAAKV | IGILNAAKV |
|
| 1FYT | DRB1 | XPKWVKQNTLKLAT | WVKQNTLKL | WVKQNTLKL | WVKQNTLKL | WVKQNTLKL | WVKQNTLKL |
| 1R5I | DRB1 | PKYVKQNTLKLAT | YVKQNTLKL | YVKQNTLKL | YVKQNTLKL | YVKQNTLKL | YVKQNTLKL |
| 1HXY | DRB1 | PKYVKQNTLKLAT | YVKQNTLKL | YVKQNTLKL | YVKQNTLKL | YVKQNTLKL | YVKQNTLKL |
| 1JWM | DRB1 | PKYVKQNTLKLAT | YVKQNTLKL | YVKQNTLKL | YVKQNTLKL | YVKQNTLKL | YVKQNTLKL |
| 1JWS | DRB1 | PKYVKQNTLKLAT | YVKQNTLKL | YVKQNTLKL | YVKQNTLKL | YVKQNTLKL | YVKQNTLKL |
| 1JWU | DRB1 | PKYVKQNTLKLAT | YVKQNTLKL | YVKQNTLKL | YVKQNTLKL | YVKQNTLKL | YVKQNTLKL |
| 1LO5 | DRB1 | PKYVKQNTLKLAT | YVKQNTLKL | YVKQNTLKL | YVKQNTLKL | YVKQNTLKL | YVKQNTLKL |
| 2ICW | DRB1 | PKYVKQNTLKLAT | YVKQNTLKL | YVKQNTLKL | YVKQNTLKL | YVKQNTLKL | YVKQNTLKL |
| 2OJE | DRB1 | PKYVKQNTLKLAT | YVKQNTLKL | YVKQNTLKL | YVKQNTLKL | YVKQNTLKL | YVKQNTLKL |
| 2G9H | DRB1 | PKYVKQNTLKLAT | YVKQNTLKL | YVKQNTLKL | YVKQNTLKL | YVKQNTLKL | YVKQNTLKL |
| 2IAM | DRB1 | PKYVKQNTLKLAT | YVKQNTLKL | YVKQNTLKL | YVKQNTLKL | YVKQNTLKL | YVKQNTLKL |
| 1A6A | DRB1 | PVSKMRMATPLLMQA | MRMATPLLM | MRMATPLLM | MRMATPLLM | MRMATPLLM | MRMATPLLM |
| 1J8H | DRB1 | PKYVKQNTLKLAT | YVKQNTLKL | YVKQNTLKL | YVKQNTLKL | YVKQNTLKL | YVKQNTLKL |
| 2SEB | DRB1 | AYMRADAAAGGA | MRADAAAGG | MRADAAAGG | MRADAAAGG | MRADAAAGG |
|
| 1BX2 | DRB1 | ENPVVHFFKNIVTPR | VHFFKNIVT | VHFFKNIVT | VHFFKNIVT | VHFFKNIVT |
|
| 1YMM | DRB1 | ENPVVHFFKNIVTPRGGSGGGGG | VHFFKNIVT | VHFFKNIVT | VHFFKNIVT | VHFFKNIVT | VHFFKNIVT |
| 1FV1 | DRB5 | NPVVHFFKNIVTPRTPPPSQ | FKNIVTPRT |
| FKNIVTPRT |
|
|
| 1H15 | DRB5 | GGVYHFVKKHVHES | YHFVKKHVH | YHFVKKHVH | YHFVKKHVH | YHFVKKHVH | YHFVKKHVH |
| 1ZGL | DRB5 | VHFFKNIVTPRTPGG | FKNIVTPRT |
| FKNIVTPRT |
|
|
|
| |||||||
| Results | 4 errors | 0 errors | 4 errors | 6 errors | |||
Figure 3Comparison of different methods by sequence logos of peptides on HLA-DRB10101.
Other prediction results of nine MHC molecules. This table shows the prediction result of our method on 9 MHC molecules. The 5th column is the result. There is only one error result, which is shown using bold font.
| PDB ID | Allele | Peptide | Core | Ours |
|---|---|---|---|---|
| 4E41 | DRB1 | GELIGILNAAKVPAD | IGILNAAKV | IGILNAAKV |
| 1DLH | DRB1 | PKYVKQNTLKLAT | YVKQNTLKL | YVKQNTLKL |
| 1KG0 | DRB1 | PKYVKQNTLKLAT | YVKQNTLKL | YVKQNTLKL |
| 3L6F | DRB1 | APPAYEKLSAEQSPP | YEKLSAEQS | YEKLSAEQS |
| 3PDO | DRB1 | KPVSKMRMATPLLMQALPM | MRMATPLLM |
|
| 3PGD | DRB1 | KMRMATPLLMQALPM | MRMATPLLM | MRMATPLLM |
| 3S4S | DRB1 | PKYVKQNTLKLAT | YVKQNTLKL | YVKQNTLKL |
| 3S5L | DRB1 | PKYVKQNTLKLAT | YVKQNTLKL | YVKQNTLKL |
| 1HQR | DRB5 | VHFFKNIVTPRTP | FKNIVTPRT | FKNIVTPRT |
|
| ||||
| Results | 1 error | |||