| Literature DB >> 31614809 |
Elizabeth Gensterblum-Miller1,2, J Chad Brenner3,4,5,6.
Abstract
Recent developments in bioinformatics technologies have led to advances in our understanding of how oncogenic viruses such as the human papilloma virus drive cancer progression and evade the host immune system. Here, we focus our review on understanding how these emerging bioinformatics technologies influence our understanding of how human papilloma virus (HPV) drives immune escape in cancers of the head and neck, and how these new informatics approaches may be generally applicable to other virally driven cancers. Indeed, these tools enable researchers to put existing data from genome wide association studies, in which high risk alleles have been identified, in the context of our current understanding of cellular processes regulating neoantigen presentation. In the future, these new bioinformatics approaches are highly likely to influence precision medicine-based decision making for the use of immunotherapies in virally driven cancers.Entities:
Keywords: HNSCC; human papilloma virus; neoantigen bioinformatics
Year: 2019 PMID: 31614809 PMCID: PMC6826432 DOI: 10.3390/cancers11101543
Source DB: PubMed Journal: Cancers (Basel) ISSN: 2072-6694 Impact factor: 6.639
Bioinformatics methods available for neoantigen prediction.
| Program Name | Input Data Type | Summary | Website |
|---|---|---|---|
| CloudNeo | WES or WGS | Integrates neoantigen peptide sequence calling, HLA typing, and peptide-MHC binding affinity predictions |
|
| INTEGRATE-neo | RNA-seq | Integrates gene fusion identification |
|
| NeoPredPipe | variant call set | Integrates putative neoantigen peptide sequence identification and MHC binding affinity prediction |
|
| NetChop | Peptide sequences | Predicts peptide cleavage sites |
|
| NetMHC | Peptide sequences, MHC haplotype | Predicts neoantigen binding affinity in an MHC type-dependent manner |
|
| NetMHCcons | Peptide sequences, MHC DNA sequence | Predicts antigen binding affinities of rare MHC haplotypes |
|
| NetMHCpan | Peptide sequences, MHC haplotype | Similar to NetMHC, with more MHC types included in training data |
|
| NetTepi | Peptide sequences, MHC haplotype | Predicts neoantigen activity by combining peptide-MHC binding affinity and stability, and T cell propensity |
|
| Pcleavage | Protein sequences | Predicts peptide cleavage sites |
|
| PRED(TAP) | Peptide sequences | Predicts peptide-TAP binding patterns | antigen.i2r.a-star.edu.sg/predTAP (currently unavailable) |
| pVAC-Seq | WES or WGS and RNA-seq | Combines variant calling and RNA-seq to identify transcribed putative antigens |
|
| ScanNeo | RNA-seq | Neoantigen sequence prediction, optimized for indel mutations |
|
| TIminer | RNA-seq, somatic mutation calling | Integrates RNA-seq and somatic mutations to predict expressed neoantigens |
|
Each algorithm described in this review is included in the table. Peptide sequences are typically generated from nonsynonymous coding mutations, identified from variant call sets. WES: whole exome sequencing. WGS: whole genome sequencing. TAP Transporter associated with antigen processing. Indel: Insertion-deletion mutation.
Figure 1The predicted HPV antigen load of all common MHC class I haplotypes. All possible antigens were generated from the canonical peptide sequences encoded from HPV16, and each antigen’s binding affinity to each HLA haplotype was predicted using NetMHC 4.0 [28]. Predicted HPV antigen load is the sum of all HPV antigens predicted to strongly bind to each MHC class I protein (binding affinity < 500 nM). All MHC haplotype available from the immune epitope database are included in this analysis [53]. Specific HLA alleles are highlighted. Red points correspond to HLA alleles associated with increased risk of cervical cancer, and blue points are protective alleles for cervical cancer [54].
Figure 2Mutation frequency and localization of MHC class I complex and related proteins. (A) mutation frequency of members of the MHC. HLA-A, B, and C, are paralogs of the MHC heavy chain, and β-2 microglobulin (B2M) forms the MHC light chain. 8% of HNSCC tumors have a somatic mutation in one or more HLA gene, and 2% have a mutation in B2M. 21% of all tumors have a somatic mutation in one or more genes associated with either MHC class I transcriptional regulation or the antigen loading pathway. NFKB, IRF1/2, SP1, NLRC5, RFX1, and USF1 are each transcription factors that have been shown to interact with the HLA promoter region in various cell types [57,58,59]. The transcription factors that regulate MHC class I have not been described in HNSCC. BCAP31 is necessary for shuttling the MHC class I from the ER to the GA, and each of the remaining proteins has a role in MHC class I folding and neoantigen loading within the ER [60,61,62,63]. Transcriptional regulation of the MHC class I has been shown to depend on a complex network of cis and trans-acting regulation, so a comprehensive list of transcription factors with a defined role in MHC class I regulation has not been fully determined [64]. (B–D) Localization of mutations detected in HNSCC patients in HLA-A (B), HLA-B (C), and HLA-C (D). Black dots represent a nonsense mutation; green dots represent a missense mutation. Peptide domains are marked on each MHC class I peptide. Green domain: MHC class I alpha 1 and 2 domains. Red: C1-set domain. Blue: C terminal domain. (E–G) Location of somatic missense mutations within the HLA protein detected in HNSCC tumors. The HLA protein consists of three α domains: α-1 and -2 contain the antigen binding groove, while the α-3 domain interacts with β-2 microglobulin. Purple: β-2 microglobulin. Grey: HLA heavy chain, including the α-1–3 domains. Blue: neoantigen peptide, located in the antigen binding groove. Red: amino acids that are subject to somatic missense mutation in one or more HNSCC patient. Mutations in HLA-A (E) are localized to the antigen binding groove, eight total missense mutations. Mutations in HLA-B (F) are present in each α domain, seven total missense mutations. Mutations in HLA-C (G) are present in the α-3 domain as well as the antigen binding groove, two total missense mutations. Mutation data retrieved from the TCGA PanCancer Atlas.