Literature DB >> 27703532

Overlap of the cancer genome atlas and the immune epitope database.

Shaimaa Sait1, Timothy Fawcett2, George Blanck3.   

Abstract

Mutant peptides resulting from cancer drivers or passenger mutations are expected to have the potential to serve as a basis for cancer vaccines. However, a number of parameters regulate vaccine-associated immunogenicity, including the suitability of a peptide for binding to an antigen-presenting molecule or antibody. In order to obtain a basic indication of the prospect of human cancer epitope identification via current database development strategies, an overlap of the mutant Homo sapiens epitopes listed on the Immune Epitope Database (IEDB) and the mutant peptides indicated by The Cancer Genome Atlas (TCGA) somatic mutation database was obtained. No putative TCGA mutant peptides were detected among the 8,890 14-18 amino acid (AA) IEDB peptides available. In total, 3 IEDB mutant epitopes that encompassed a TCGA mutant AA position, but did not overlap the exact position of the TCGA mutant AA, were detected. The results of the present analysis confirm that verification of certain aspects of cancer epitope function can be obtained via the continued and systematic expansion of databases representing human protein epitopes. However, the analysis also indicates that there is relatively limited systematic information available regarding antigen-presenting molecule epitopes and cancer-related mutant peptides.

Entities:  

Keywords:  Immune Epitope Database; The Cancer Genome Atlas; bioinformatics; cancer immunology; collagen type II α 1; extra-cellular matrix proteins; immuno-peptidome project; integrin subunit β 3

Year:  2016        PMID: 27703532      PMCID: PMC5038905          DOI: 10.3892/ol.2016.4991

Source DB:  PubMed          Journal:  Oncol Lett        ISSN: 1792-1074            Impact factor:   2.967


Introduction

The development of cancer vaccines has become a high priority in the field of cancer treatment. However, there are numerous parameters that affect the success of an immune response against an antigen, including the binding of a T-cell receptor to a major histocompatibility complex (MHC)-bound antigen and antigen processing. Empirical evaluations of vaccine efficacy parameters are costly and time-consuming. Thus, bioinformatic approaches may provide a useful alternative. The Immune Epitope Database (IEDB) includes >35,000 human peptides known to either bind to human leukocyte antigen (HLA) class I or II, or to have other immune receptor binding properties (1,2). Knowledge of the capacity of a peptide to bind to antigen-presenting molecules could potentially improve the selection of cancer vaccine candidates that are based on mutant peptides, whether these result from cancer drivers or passenger mutations. In addition, sufficient database development may allow for a better understanding of any presumed selection against the binding of cancer peptide neoantigens to MHC molecules as an aspect of cancer development. The present study focused on searching for overlaps of The Cancer Genome Atlas (TCGA) mutant peptides (3,4) and peptides in the IEDB, in order to discover cancer-related peptides that have the demonstrable capability to bind to MHC molecules.

Materials and methods

The overview of the approach is provided in Fig. 1. Supporting online material (SOM) representing each stage of the approach were also used in the present study (http://www.universityseminarassociates.com/Supporting_online_material_for_scholarly_pubs.php) (5). Briefly, human epitopes were downloaded from the IEDB (www.iedb.org) using the following search terms: Epitope, linear epitope; antigen, Homo sapiens (human) (ID: 9606, Homo sapiens); host, humans; assay, all assays; MHC restriction, MHCI and MHCII; disease, any disease. The results comprised ~35,000 epitopes and were downloaded as an Excel file. Epitopes that ranged between 14–18 amino acids (AAs) in size were used to determine mismatches with the human genome version 19 (hg19) reference genome at genome.ucsc.edu. The nucleotide spans of the mismatches were obtained by using the BLAT search (http://genome.ucsc.edu/cgi-bin/hgBlat?command=start) to search a local database, which was created from the TCGA download portal and consisted of a collection of all TCGA cancer mutations (https://tcga-data.nci.nih.gov/tcga/tcgaDownload.jsp). The recovered nucleotide positions were extended with hg19 nucleotides using The Extract Genomic DNA tool at https://usegalaxy.org/. The extended regions were then translated into all possible reading frames (including forward and reverse) using the European Molecular Biology Laboratory-European Bioinformatics Institute to generate a database for screening the IEDB 14–18 AA set, in order to verify the IEDB matches and to detect mismatches at the location of the TCGA mutation. All HLA candidates were removed due to overly extensive sequence variation. Gene family members that were originally inaccurately regarded as hg19 mismatches, may be found in the SOM files by Sait et al (5) (Table I).
Figure 1.

Overview of the procedure used to determine whether any IEDB peptides, which did not match the hg19, matched putative TCGA mutant peptides. The file numbers (1–4) in the figure refer to the supporting online material files by Sait et al (http://www.universityseminarassociates.com/Supporting_online_material_for_scholarly_pubs.php), which include results and details on each step of the procedure. IEDB, Immune Epitope Database; TCGA, The Cancer Genome Atlas; hg19, human genome version 19; EMBL-EBI, European Molecular Biology Laboratory-European Bioinformatics Institute.

Table I.

Identification of IEDB peptides that overlap the position of a mutant amino acid in the TCGA database.

Hugo gene symbolNormal peptide sequence (hg19)IEDB ID numberIEDB peptide (hg19 mismatched amino acid in large type, bold)hg19 translation of TCGA mutation on either side of mutation positionTCGA mutation chromosome numberTCGA cancer datasetTCGA nucleotide numberMHC
ATP1A2NPREAKACVVHGSDLK103447NPRDAKACVVHGSDLKVVHGSDL  1LUAD/LIHC160104960HLA-DQB
ACVVHG  1LIHC160104952
CVVHGS  1LIHC160104955
VVHGSDL  1LUAD/LIHC160104960
COL2A1GEPGIAGFKGEQGPKG107398GKPGIAGFKGEQGPKGIAGFKG12READ  48380651HLA-DRB1
TROVE2LQEMPLTALLRNLGKM118499LQEMPTLALLRNLGKMALLRNL  1SKCM193045674NA

Data were obtained from the supporting online material files by Sait et al (5) (http://www.universityseminarassociates.com/Supporting_online_material_for_scholarly_pubs.php) using the procedure described in Fig. 1. IEDB, Immune Epitope Database; TCGA, The Cancer Genome Atlas; hg19, human genome version 19; ID, identification; MHC, major histocompatibility complex; ATP1A2, ATPase Na+/K+ transporting subunit α 2; LUAD, lung adenocarcinoma; LIHC, liver hepatocellular carcinoma; READ, rectum adenocarcinoma; SKCM, skin cutaneous melanoma; COL2A1, collagen type II α 1; TROVE2, Telomerase, Ro and Vault domain family member 2; LUAD, Lung Adenocarcinoma; HLA-DQB, major histocompatibility complex, class II, DO β; HLA-DRB1, major histocompatibility complex, class II, DR β 1; NA, not applicable.

Results and Discussion

The present study was required to determine whether detecting an IEDB peptide that had a mismatch at the exact position of a TCGA mutant AA was possible. Therefore, a search was performed among the 8,890 IEDB human peptides consisting of 14–18 AAs, with translated AAs on either side of all TCGA point mutations, to check for overlap with an IEDB epitope that had a mismatch with the hg19 version of the reference genome. Since the translations represented exact matches with the hg19 translations, the 8,890 epitopes consisting of 14–18 AA were searched, allowing for one mismatch with the translations used, in order to ‘surround’ the location of the TCGA mutation. According to this protocol, while the TCGA point mutation-referenced translations overlapped the position of the TCGA mutation, these translations matched hg19 exactly, thus requiring the single mismatch standard for searching the aforementioned 8,890 IEDB epitopes for an exact match. Numerous IEDB epitopes were identified using this method; however, following the exclusion of IEDB epitopes that did not match the gene of the TCGA mutation, only one IEDB peptide had a non-hg19 AA in the position of the TCGA mutant AA. This IEDB epitope mapped to integrin subunit β 3 (ITGB3), which is a known ITGB3 single nucleotide polymorphism. The data supporting this finding is presented in SOM file no. 5 of Sait et al (5). To determine whether the TCGA mutant AA positions overlapped IEDB peptides that contained a mismatch with the hg19 AA sequence, without the TCGA position equaling the precise location of the IEDB mismatched AAs, the protocol indicated in Fig. 1 was followed. The results are provided in Table I. This protocol indicated that, following the removal of mismatches attributable to closely associated family members or mismatches detected anomalously due to repeats within a protein, 3 IEDB peptides, which were a mismatch to hg19, also overlapped the position of the TCGA mutant AA. For details of the results that were obtained by pursuing this approach, including the discounted IEDB peptides that were anomalously recovered using the Fig. 1 approach, please see SOM file no. 6 in Sait et al (5). Overall, these results indicate that mutant peptides in human cancer overlap apparent mutant peptides in the IEDB, suggesting that the AAs surrounding TCGA mutants are not fundamentally a hindrance to MHC binding. Notably, two of the proteins represented by the overlap of TCGA mutations and IEDB non-hg19 peptides represent the extracellular matrix, ITGB3 and collagen type II α 1, an emerging topic in the field of cancer research (4,6,7). However, the general paucity of the overlap of the two databases strongly indicates that, from a bioinformatic perspective, there is very little information available for determining which cancer drivers or passenger mutations have the potential of significant MHC binding. This conclusion is even more striking considering the extensive MHC polymorphism and protease activities that could impact binding affinities of cancer peptides (8). In conclusion, there is a strong case to be made for the development of a more comprehensive human immuno-peptidome project, with the particular aim of determining whether cancer peptides are selected for the reduced likelihood of MHC occupancy.
  7 in total

Review 1.  Bioinformatic databases and resources in the public domain to aid HLA research.

Authors:  W Helmberg
Journal:  Tissue Antigens       Date:  2012-10

2.  Regulation of HLA-DR peptide occupancy by histone deacetylase inhibitors.

Authors:  Kevin Cronin; Hernando Escobar; Karoly Szekeres; Eduardo Reyes-Vargas; Alan L Rockwood; Mark C Lloyd; Julio C Delgado; George Blanck
Journal:  Hum Vaccin Immunother       Date:  2013-01-17       Impact factor: 3.452

Review 3.  Databases and in silico tools for vaccine design.

Authors:  Yongqun He; Zuoshuang Xiang
Journal:  Methods Mol Biol       Date:  2013

4.  Big genes are big mutagen targets: a connection to cancerous, spherical cells?

Authors:  Michele L Parry; Michael Ramsamooj; George Blanck
Journal:  Cancer Lett       Date:  2014-10-16       Impact factor: 8.679

5.  Flat cells come full sphere: Are mutant cytoskeletal-related proteins oncoprotein-monsters or useful immunogens?

Authors:  Michele L Parry; George Blanck
Journal:  Hum Vaccin Immunother       Date:  2015-07-30       Impact factor: 3.452

6.  A pan-cancer proteomic perspective on The Cancer Genome Atlas.

Authors:  Rehan Akbani; Patrick Kwok Shing Ng; Henrica M J Werner; Maria Shahmoradgoli; Fan Zhang; Zhenlin Ju; Wenbin Liu; Ji-Yeon Yang; Kosuke Yoshihara; Jun Li; Shiyun Ling; Elena G Seviour; Prahlad T Ram; John D Minna; Lixia Diao; Pan Tong; John V Heymach; Steven M Hill; Frank Dondelinger; Nicolas Städler; Lauren A Byers; Funda Meric-Bernstam; John N Weinstein; Bradley M Broom; Roeland G W Verhaak; Han Liang; Sach Mukherjee; Yiling Lu; Gordon B Mills
Journal:  Nat Commun       Date:  2014-05-29       Impact factor: 14.919

7.  Extracellular matrix signatures of human primary metastatic colon cancers and their metastases to liver.

Authors:  Alexandra Naba; Karl R Clauser; Charles A Whittaker; Steven A Carr; Kenneth K Tanabe; Richard O Hynes
Journal:  BMC Cancer       Date:  2014-07-18       Impact factor: 4.430

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.