| Literature DB >> 31798635 |
Anja Mösch1,2, Silke Raffegerst2, Manon Weis2, Dolores J Schendel2, Dmitrij Frishman1.
Abstract
In the last years, immunotherapies have shown tremendous success as treatments for multiple types of cancer. However, there are still many obstacles to overcome in order to increase response rates and identify effective therapies for every individual patient. Since there are many possibilities to boost a patient's immune response against a tumor and not all can be covered, this review is focused on T cell receptor-mediated therapies. CD8+ T cells can detect and destroy malignant cells by binding to peptides presented on cell surfaces by MHC (major histocompatibility complex) class I molecules. CD4+ T cells can also mediate powerful immune responses but their peptide recognition by MHC class II molecules is more complex, which is why the attention has been focused on CD8+ T cells. Therapies based on the power of T cells can, on the one hand, enhance T cell recognition by introducing TCRs that preferentially direct T cells to tumor sites (so called TCR-T therapy) or through vaccination to induce T cells in vivo. On the other hand, T cell activity can be improved by immune checkpoint inhibition or other means that help create a microenvironment favorable for cytotoxic T cell activity. The manifold ways in which the immune system and cancer interact with each other require not only the use of large omics datasets from gene, to transcript, to protein, and to peptide but also make the application of machine learning methods inevitable. Currently, discovering and selecting suitable TCRs is a very costly and work intensive in vitro process. To facilitate this process and to additionally allow for highly personalized therapies that can simultaneously target multiple patient-specific antigens, especially neoepitopes, breakthrough computational methods for predicting antigen presentation and TCR binding are urgently required. Particularly, potential cross-reactivity is a major consideration since off-target toxicity can pose a major threat to patient safety. The current speed at which not only datasets grow and are made available to the public, but also at which new machine learning methods evolve, is assuring that computational approaches will be able to help to solve problems that immunotherapies are still facing.Entities:
Keywords: MHC binding affinity prediction; T cell receptor; cancer immunotherapy; cross-reactivity; neoantigen; neoepitope
Year: 2019 PMID: 31798635 PMCID: PMC6878726 DOI: 10.3389/fgene.2019.01141
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Figure 1Major histocompatibility complex (MHC) class I antigen presentation pathway for peptides recognized by CD8+ cytotoxic T cells.
Figure 2Workflow to analyze of major histocompatibility complex (MHC)-eluted peptides by mass-spectrometric (MS). A sample is lysed, pMHC complexes are captured and peptides are purified by immunoaffinity purification using MHC-specific immobilized antibodies. Eluted peptides are separated by high pressure liquid chromatography (HPLC), analyzed by MS, and the resulting data are computationally processed.
Publications describing the application of machine learning approaches to neoepitope prediction.
| Publication | Indication | Sample type and number | number of HLAs used | Estimated ratio of predicted neoepitopes from mutations | Estimated ratio of experimentally confirmed neoantigens | Number of features | Algorithms |
|---|---|---|---|---|---|---|---|
| ( | BRCA/CRC | 11 patients | 1 | 0.17 | N/A | 1 | NetMHC, SYFPEITHI, BIMAS, RANKPEP |
| ( | MEL | 1 murine cell line | N/S | 0.05 | 0.32T | 2 | NetMHC |
| ( | various | 312 genes (COSMIC) | 57 | 1.40 | N/A | 2 | NetMHC 3.2 |
| ( | MEL | 3 patients | 2 | 0.18 | 0.03 T | 3 | NetMHCpan 2.4 |
| ( | MEL | 1 patient | 4 | 0.42 | <0.01 T | 3 | NetChop, NetMHC 3.2 |
| ( | various | 167 cancer cell lines | 6 | 0.44 | N/A | 1 | IEDB 2.9 |
| ( | SARC | 2 murine tumors | 3 | 0.75 | 0.56 T | 2 | NetMHC 3.0 |
| ( | MEL | 64 patients | 6 | 0.42 | <0.01 T | 3 | NetMHC 3.4, RANKPEP, IEDB immunogenicity, CTLPred |
| ( | CRC/PRAD | 2 murine cell lines | 2 | 0.03 | 0.02 T | 3 | NetMHC 3.4 |
| ( | CRC | 552 TCGA patients | 6 | 0.41 | N/A | 2 | NetMHCpan |
| ( | MEL | 7 samples/3 patients | 1 | 0.04 | 0.43 B | 3 | NetMHC 3.4 |
| ( | MEL | 8 patients | 2 | 0.02 | 0.02 T | 2 | IEDB |
| ( | NSCLC | 34 patients | 6 | 0.62 | <0.01 T | 2 | NetMHC 3.4 |
| ( | various | 4250 TCGA patients | 6 | 0.14 | N/A | 2 | NetMHCpan 2.4 |
| ( | GIC | 10 patients | 12 | 0.03 | 0.21 T | 2 | NetMHCpan 2.8, NetMHCIIpan 3.0 |
| ( | MEL | 110 patients | 6 | 1.56 | N/A | 2 | NetMHCpan 2.4 |
| ( | UCEC | 245 TCGA patients | 1 | 0.06 | N/A | 3 | NetMHCpan 2.8 |
| ( | MEL | 1 patient | 6 | 1.43 | <0.01 B | 1 | NetMHCpan 2.8 |
| ( | MCC | 49 patients | 4 | 0.09 | N/A | 1 | NetMHC 3.4 |
| ( | MEL | 3 patients | 6 | 0.03 | 0.55 T | 2 | IEDB |
| ( | MEL | 38 patients | 12 | 0.06 | N/A | 3 | NetMHCpan 2.8, NetMHCIIpan 3.0 |
| ( | MEL | 1 patient | 6 | 5.30 | <0.01 B | 1 | NetMHCpan 2.8 |
| ( | NSCLC | 15 patients | 6 | 0.62 | N/A | 1 | NetMHCpan 2.8 |
| ( | CHOL | 1 patient | 6 | 3.68 | 0 B | 2 | NetMHC 3.4, NetMHCpan 2.8, SYFPEITHI |
| ( | MEL | 3 patients | 1 | 0.05 | 0.19 T | 4 | NetChop, NetMHC 3.2, NetMHCpan 2.0 |
| ( | NSCLC | 10 patients | 6 | 0.76 | <0.01 T | 4 | SYFPEITHI, NetMHCpan, NetCTLpan |
| ( | PED | 540 patients | 6 | 0.42 | N/A | 2 | NetMHCcons 1.1 |
| ( | NSCLC | 4 patients | 6 | 0.20 | N/A | 2 | NetMHCpan 2.8 |
| ( | BRCA | 5 patients | 6 | 0.47 | N/A | 2 | NetMHC 3.4, NetMHCpan 2.8 |
| ( | MM | 664 patients | 6 | 0.16 | N/A | 3 | NetMHC 4.0 |
| ( | MEL | 6 patients | 6 | 0.01 | 0.60 T | 3 | NetMHCpan 2.4 |
| ( | MEL | 13 patients | 10 | 0.02 | 0.60 T | 2 | IEDB 2.5 (MHC class I & II) |
| ( | BRCA | 3 patients | 6 | 0.01 | 0.16 T | 3 | NetMHC 3.2 |
| ( | MEL | 15 patients/cell lines | 6 | 9.57 | 0.15 T | 2 | NetMHCpan 3.0 |
| ( | PAAD | 1 murine cell line | 2 | 0.27 | 0.16 T | 2 | NetMHC 3.2/3.4, NetMHCpan 2.8 |
| ( | OV | 1 patient | 6 | 1.57 | 0,09 T | 2 | NetMHCpan 2.4 |
| ( | OV | 92 patients | 6 | 0.02 | N/A | 2 | NetMHCpan 2.8 |
| ( | PDAC | 1 patient | 10 | 2.00 | 0.75 T | 3 | NetMHC, NetMHCIIpan 3.1, SYFPEITHI |
| ( | various | 8546 TCGA patients | 6 | 0.74 | N/A | 2 | NetMHCpan 3.0, pVAC-Seq 4.0.8 |
| ( | HCC | 1 patient | 3 | 0.05 | 0.15 T | 2 | SYFPEITHI, IEDB (MHC class II) |
| ( | various | 7748 TCGA samples | 100 | 1.18 | N/A | 1 | NetMHCpan 4.0 |
| ( | NSCLC | 7 patients | 6 | 0.10 | 0.08 T | >4 | EDGE |
| ( | GBM | 10 patients | 1 | 0.03 | 0.85 T | 3 | IEDB 2.5 |
| ( | GBM | 8 patients | 6 | 0.20 | 0.07 T | 3 | NetMHCpan 2.4 |
| ( | various | 10186 TCGA patients | 1 | 0.02 | N/A | 2 | NetMHC 4.0 |
| ( | OV | 20 patients | 12 | 0.15 | 0.24 T | 3 | NetMHCpan 3.0, NetMHCIIpan 3.1 |
| ( | HCC | 16 patients | 6 | 1.79 | 0 B | 2 | NetMHC 4.0, NetMHCpan 3.0, SYFPEITHI |
| ( | NSCLC | 164 samples/64 patients | 6 | 0.86 | N/A | 2 | NetMHC 4.0, NetMHCpan 2.8 |
| ( | PNMN | 113 patients | 6 | 2.53 | 0.66 B | 2 | NetMHCpan |
N/S means not specified. Cancer type abbreviations: adenocarcinoma (AC), breast cancer (BRCA), cholangiocarcinoma (CHOL), colorectal cancer (CRC), glioblastoma (GBM), gastrointestinal cancer (GIC), hepatocellular carcinoma (HCC), merkel cell carcinoma (MCC), melanoma (MEL), multiple myeloma (MM), non-small cell lung cancer (NSCLC), ovarian cancer (OV), pancreatic ductal adenocarcinoma (PDAC), pediatric cancers (PED), Ph-negative myeloproliferative neoplasms (PNMN), prostate adenocarcinoma (PRAD), sarcoma (SARC) and uterine corpus endometrial cancer (UCEC). T indicates experimentally confirmed T cell responses (e.g., IFNγ ELISPOT), B indicates experimentally confirmed major histocompatibility complex (MHC) binding (e.g., mass spectrometric [MS] of eluted peptides), and N/A indicates that no experimental validation was done. Features are mutated peptide binding prediction, wild-type peptide binding prediction, gene expression, sequence-based features like sequence similarity scores, and immunogenicity predictions. If available, version information of algorithms is included.
Figure 3(A) Neoepitopes per mutation grouped by the number of features used for neoepitope selection. Data based on publications that offered comparable data, e.g., not obviously counting a neoepitope predicted to be presented by multiple major histocompatibility complexes (MHCs) multiple times (n = 38). (B) Ratio of confirmed to predicted neoepitopes grouped by the number of features used for neoepitope selection. Data based on publications that experimentally validated all predicted neoepitopes (n = 30)
Neoepitope prediction pipelines based on mutation data input. Additional features are cancer driver status of the mutated gene used by MuPeXI; differential agretopicity index (DAI), sequence-based immunogenicity score, and more used by Neopepsee; DAI, cleavage, and stability prediction used by pVACtools.
| MuPeXI | CloudNeo | Neopepsee | pVACTools | |
|---|---|---|---|---|
|
| NetMHCpan | NetMHCpan | NetCTLpan, IEDB Bayes classifier | 8 MHC class I predictors 4 MHC class II predictors |
|
| VCF gene expression TSV | VCF BAM | VCF RNA-Seq FASTQ | VCF BAM (RNA and DNA) |
|
| user input | integrated | user input or integrated | user input or integrated |
|
| SNVs indels frameshifts | SNVs | SNVs | SNVs indels fusions (additional input) |
|
| yes | yes | yes | yes |
|
| yes (optional) | no | yes | yes |
|
| yes | no | yes | yes |
|
| local, webserver | cloud | local | local |
|
| ( | ( | ( | ( |
Figure 4T cell receptor (TCR) binding to a peptide presented by major histocompatibility complex (MHC) class I.
Figure 5IEDB and VDJdb contents of CDR3α and CDR3β sequences of human origin. IEDB contains 386 unique epitopes linked to CDR3α sequences and 426 unique epitopes linked to CDR3β sequences. For VDJdb there are 93 and 177 unique epitopes, respectively. IEDB data was downloaded from https://www.iedb.org on September 30th, 2019 with the following query parameters: Current Filters: No B cell assays, No major histocompatibility complex (MHC) ligand assays, Restriction Type: Class I, Host: Homo sapiens (human). VDJdb data was taken from https://vdjdb.cdr3.net/overview (last updated on August 7th, 2019).