Yilin Ye1,2, Jian Wang1, Yunwan Xu1, Yi Wang1, Youdong Pan1, Qi Song1, Xing Liu3, Ji Wan4. 1. Shenzhen Neocura Biotechnology Co. Ltd., Shenzhen, 518055, China. 2. School of Computer Science and Technology, Heilongjiang University, Harbin, 150080, China. 3. The Center for Microbes, Development and Health, Key Laboratory of Molecular Virology and Immunology, Institut Pasteur of Shanghai, Chinese Academy of Sciences, Shanghai, 200031, China. 4. Shenzhen Neocura Biotechnology Co. Ltd., Shenzhen, 518055, China. jiw@neocura.net.
Abstract
BACKGROUND: Accurate prediction of binding between class I human leukocyte antigen (HLA) and neoepitope is critical for target identification within personalized T-cell based immunotherapy. Many recent prediction tools developed upon the deep learning algorithms and mass spectrometry data have indeed showed improvement on the average predicting power for class I HLA-peptide interaction. However, their prediction performances show great variability over individual HLA alleles and peptides with different lengths, which is particularly the case for HLA-C alleles due to the limited amount of experimental data. To meet the increasing demand for attaining the most accurate HLA-peptide binding prediction for individual patient in the real-world clinical studies, more advanced deep learning framework with higher prediction accuracy for HLA-C alleles and longer peptides is highly desirable. RESULTS: We present a pan-allele HLA-peptide binding prediction framework-MATHLA which integrates bi-directional long short-term memory network and multiple head attention mechanism. This model achieves better prediction accuracy in both fivefold cross-validation test and independent test dataset. In addition, this model is superior over existing tools regarding to the prediction accuracy for longer ligand ranging from 11 to 15 amino acids. Moreover, our model also shows a significant improvement for HLA-C-peptide-binding prediction. By investigating multiple-head attention weight scores, we depicted possible interaction patterns between three HLA I supergroups and their cognate peptides. CONCLUSION: Our method demonstrates the necessity of further development of deep learning algorithm in improving and interpreting HLA-peptide binding prediction in parallel to increasing the amount of high-quality HLA ligandome data.
BACKGROUND: Accurate prediction of binding between class I human leukocyte antigen (HLA) and neoepitope is critical for target identification within personalized T-cell based immunotherapy. Many recent prediction tools developed upon the deep learning algorithms and mass spectrometry data have indeed showed improvement on the average predicting power for class I HLA-peptide interaction. However, their prediction performances show great variability over individual HLA alleles and peptides with different lengths, which is particularly the case for HLA-C alleles due to the limited amount of experimental data. To meet the increasing demand for attaining the most accurate HLA-peptide binding prediction for individual patient in the real-world clinical studies, more advanced deep learning framework with higher prediction accuracy for HLA-C alleles and longer peptides is highly desirable. RESULTS: We present a pan-allele HLA-peptide binding prediction framework-MATHLA which integrates bi-directional long short-term memory network and multiple head attention mechanism. This model achieves better prediction accuracy in both fivefold cross-validation test and independent test dataset. In addition, this model is superior over existing tools regarding to the prediction accuracy for longer ligand ranging from 11 to 15 amino acids. Moreover, our model also shows a significant improvement for HLA-C-peptide-binding prediction. By investigating multiple-head attention weight scores, we depicted possible interaction patterns between three HLA I supergroups and their cognate peptides. CONCLUSION: Our method demonstrates the necessity of further development of deep learning algorithm in improving and interpreting HLA-peptide binding prediction in parallel to increasing the amount of high-quality HLA ligandome data.
Entities:
Keywords:
Cancer immunotherapy; Deep learning; HLA-peptide binding prediction
Authors: Maria Bonsack; Stephanie Hoppe; Jan Winter; Diana Tichy; Christine Zeller; Marius D Küpper; Eva C Schitter; Renata Blatnik; Angelika B Riemer Journal: Cancer Immunol Res Date: 2019-03-22 Impact factor: 11.151
Authors: David Gfeller; Philippe Guillaume; Justine Michaux; Hui-Song Pak; Roy T Daniel; Julien Racle; George Coukos; Michal Bassani-Sternberg Journal: J Immunol Date: 2018-11-14 Impact factor: 5.422
Authors: Timothy J O'Donnell; Alex Rubinsteyn; Maria Bonsack; Angelika B Riemer; Uri Laserson; Jeff Hammerbacher Journal: Cell Syst Date: 2018-06-27 Impact factor: 10.304
Authors: Ugur Sahin; Evelyna Derhovanessian; Matthias Miller; Björn-Philipp Kloke; Petra Simon; Martin Löwer; Valesca Bukur; Arbel D Tadmor; Ulrich Luxemburger; Barbara Schrörs; Tana Omokoko; Mathias Vormehr; Christian Albrecht; Anna Paruzynski; Andreas N Kuhn; Janina Buck; Sandra Heesch; Katharina H Schreeb; Felicitas Müller; Inga Ortseifer; Isabel Vogler; Eva Godehardt; Sebastian Attig; Richard Rae; Andrea Breitkreuz; Claudia Tolliver; Martin Suchan; Goran Martic; Alexander Hohberger; Patrick Sorn; Jan Diekmann; Janko Ciesla; Olga Waksmann; Alexandra-Kemmer Brück; Meike Witt; Martina Zillgen; Andree Rothermel; Barbara Kasemann; David Langer; Stefanie Bolte; Mustafa Diken; Sebastian Kreiter; Romina Nemecek; Christoffer Gebhardt; Stephan Grabbe; Christoph Höller; Jochen Utikal; Christoph Huber; Carmen Loquai; Özlem Türeci Journal: Nature Date: 2017-07-05 Impact factor: 49.962
Authors: Binbin Chen; Michael S Khodadoust; Niclas Olsson; Lisa E Wagar; Ethan Fast; Chih Long Liu; Yagmur Muftuoglu; Brian J Sworder; Maximilian Diehn; Ronald Levy; Mark M Davis; Joshua E Elias; Russ B Altman; Ash A Alizadeh Journal: Nat Biotechnol Date: 2019-10-14 Impact factor: 54.908
Authors: Siranush Sarkizova; Susan Klaeger; Phuong M Le; Letitia W Li; Giacomo Oliveira; Hasmik Keshishian; Christina R Hartigan; Wandi Zhang; David A Braun; Keith L Ligon; Pavan Bachireddy; Ioannis K Zervantonakis; Jennifer M Rosenbluth; Tamara Ouspenskaia; Travis Law; Sune Justesen; Jonathan Stevens; William J Lane; Thomas Eisenhaure; Guang Lan Zhang; Karl R Clauser; Nir Hacohen; Steven A Carr; Catherine J Wu; Derin B Keskin Journal: Nat Biotechnol Date: 2019-12-16 Impact factor: 54.908