| Literature DB >> 22986455 |
Pornpimol Charoentong1, Mihaela Angelova, Mirjana Efremova, Ralf Gallasch, Hubert Hackl, Jerome Galon, Zlatko Trajanoski.
Abstract
Recent mechanistic insights obtained from preclinical studies and the approval of the first immunotherapies has motivated increasing number of academic investigators and pharmaceutical/biotech companies to further elucidate the role of immunity in tumor pathogenesis and to reconsider the role of immunotherapy. Additionally, technological advances (e.g., next-generation sequencing) are providing unprecedented opportunities to draw a comprehensive picture of the tumor genomics landscape and ultimately enable individualized treatment. However, the increasing complexity of the generated data and the plethora of bioinformatics methods and tools pose considerable challenges to both tumor immunologists and clinical oncologists. In this review, we describe current concepts and future challenges for the management and analysis of data for cancer immunology and immunotherapy. We first highlight publicly available databases with specific focus on cancer immunology including databases for somatic mutations and epitope databases. We then give an overview of the bioinformatics methods for the analysis of next-generation sequencing data (whole-genome and exome sequencing), epitope prediction tools as well as methods for integrative data analysis and network modeling. Mathematical models are powerful tools that can predict and explain important patterns in the genetic and clinical progression of cancer. Therefore, a survey of mathematical models for tumor evolution and tumor-immune cell interaction is included. Finally, we discuss future challenges for individualized immunotherapy and suggest how a combined computational/experimental approaches can lead to new insights into the molecular mechanisms of cancer, improved diagnosis, and prognosis of the disease and pinpoint novel therapeutic targets.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22986455 PMCID: PMC3493665 DOI: 10.1007/s00262-012-1354-x
Source DB: PubMed Journal: Cancer Immunol Immunother ISSN: 0340-7004 Impact factor: 6.968
Fig. 1Data and information flow in cancer immunology research. The datasets are integrated from clinical observations, medical records, “omic” technologies, and the next-generation sequencing technology and analyzed by using bioinformatics methods. Cancer researchers are using these data to extract information for diagnosis, classification, prognosis, and therapeutic guidance. Furthermore, the multi-parametric data can lead to the improvement of the immunotherapy and can be exploited for patients benefit using individualized therapeutic cancer vaccines
Public databases for cancer genomics data
| Resource | Description | URL | Expr | CNV | Mut | Epi | Integ | Others |
|---|---|---|---|---|---|---|---|---|
| The Cancer Genome Atlas (TCGA) | Copy number, gene and microRNA expression, promoter methylation, genetic alterations association with brain, lung and ovarian cancer |
| ✓ | ✓ | ✓ | ✓ | ||
| The International Cancer Genome Consortium (ICGC) | Full range of somatic mutations in 50 different cancer type [ |
| ✓ | ✓ | ✓ | |||
| NCBI dbGaP | Store individual-level phenotype, exposure, genotype and sequence data and the associations between them [ |
| ✓ | |||||
| COSMIC | Provide mutation range and frequency statistics based upon a choice of gene and/or cancer phenotype [ |
| ✓ | ✓ | ||||
| Oncomine | Collect gene expression, pathways, networks [ |
| ✓ | ✓ | ||||
| Cancer Gene Census | Annotation of muted genes [ |
| ✓ | |||||
| Cancer Genome Anatomy Project (CGAP) | Resource of gene expression profiles of normal, pre-cancer, and cancer cells [ |
| ✓ | |||||
| Cancer Molecular Analysis Project (CMAP) | Available for analysis gene associated with oncogenesis and cancer profiles, clinical trials and therapies [ |
| ✓ | |||||
| Cancer Biomedical Informatics Grid (caBIG) | Open access for large multi-disciplinary data sets, analysis tools, and other resources [ |
| ✓ | |||||
| caArray | Accessible array data management and allow to share data across caBIG |
| ✓ | |||||
| Cancer Genome Wide Association Scan (caGWAS) | Integrate, query, report, and analyze significant associations between genetic variations and disease, drug response or other clinical outcomes |
| ✓ | |||||
| Cancer Model Database (caMOD) | Provide information about animal models for human cancer to the public research community |
| ✓ | |||||
| Database for copy number alterations of cancer genome from SNP array data (caSNP) | Collect of copy number alteration (CNA) from SNP arrays |
| ✓ | ✓ | ||||
| Database of Differentially Expressed Proteins in Human Cancers (dbDEPC) | Provide cancer proteomics data, a resource for information on protein-level expression changes, and explore protein profile differences among different cancers [ |
| ✓ | |||||
| Cancer Genetic Markers of Susceptibility (CGEMS) | Identify common inherited genetic variations associated with risk for breast and prostate cancer |
| ✓ | |||||
| Tumorscape | Provide copy number alterations across multiple cancer types |
| ✓ | ✓ | ||||
| UCSC Cancer Genome Browser | Visualize, integrate and analyze cancer genomics and its associated clinical data [ |
| ✓ | |||||
| Gene Expression Omnibus (GEO) | Store high-throughput functional genomic data, including those that examine genome copy number variations, chromatin structure, methylation status and transcription factor binding [ |
| ✓ | |||||
| Single Nucleotide Polymorphism Database (dbSNP) | dbSNP currently classifies nucleotide sequence variations with the following types of the database: (1) single-nucleotide substitutions, (2) small insertion/deletion polymorphisms, (3) invariant regions of sequence, (4) microsatellite repeats, (5) named variants, and (6) uncharacterized heterozygous assays [ |
| ✓ | |||||
| Integrative Genomics Portal (IGP) and Integrative Genomics Viewer (IGV) | The Starr Cancer Consortium developed IGP for sharing and analysis of RNAi, copy number, gene expression and sample annotation data. Also, they provide IGV, which is a high performance desktop application that supports integrated visualization of a wide range of genomic data types including aligned sequence reads, mutations, copy number, RNAi screens, gene expression, methylation, and genomic annotations [ |
| ✓ |
Publicly available cancer databases contain gene/microRNA expression data (Expr), copy number of variations (CNV), mutations (Mut), epigenetic profiling (Epi), integration analysis (Integ), and other data (i.e., proteomics, networks, mouse models)
Databases containing immunogenic and non-immunogenic peptides in human
| Database | Content | # Entries | URL | Reference |
|---|---|---|---|---|
| Bcipep | Linear B cell epitopes with descriptive immunogenicity measure | 719 |
| [ |
| CED | Conformational B cell epitopes with immunoproperty description | 293 |
| [ |
| CIG-DB | Publicly available epitopes that interact with IG (linear and conformational) and TCR | 270 |
| [ |
| CTDatabase | Cancer-Testis antigens and corresponding mRNA and protein expression, and immune response | 126 |
| [ |
| DFRMLI | HLA binding peptides packed up into ready-to-train-and-test data sets, and T cell epitopes | 718 TAAs |
| [ |
| EPIMHC | HLA ligands associated with high, low, moderate, or unknown binding level and a flag indicating immunogenic epitopes | 290 TAAs |
| [ |
| IEDB | Linear and conformational antibody and T cell epitopes cross-referenced with publications, MHC binding experiments and T cell assays | 598 Conf. 18950 Lin. |
| [ |
| Immunology DB | HIV antibody epitopes (mainly from non-human sources), HIV CTL and T helper epitopes, epitope variants and escape mutations (EVEM) | 1,493 T cell epitopes 2516 EVEM |
| |
| MHCBN | Class I and II MHC and TAP binders associated with binding affinity and T cell activity measures, as well as non-binders | 645 TAP 18,404 MHC |
| [ |
| PeptideDatabase | T cell-defined tumor antigens | 378 |
| [ |
| SYFPEITHI | MHC Class I and II binding peptides and corresponding binding motifs | 5,435 |
| [ |
| TANTIGEN | Human tumor-associated HLA ligands and T cell epitopes with detailed description for the source antigen | 1,423 |
|
Fig. 2Databases for epitopes and calculation of the total number of epitopes. Shown are available databases and the number of entries in each database (see text for abbreviations). Since there is a considerable overlap between the databases, we have analyzed the data and as of to date identified the number of unique peptide sequences to be around 35,000. The number of entries per database refers only to human peptide sources