Literature DB >> 34345532

CanImmunother: a manually curated database for identification of cancer immunotherapies associating with biomarkers, targets, and clinical effects.

Wenliang Zhang1,2,3, Binghui Zeng4, Huancai Lin4, Wen Guan3,5, Jing Mo3, Song Wu6, Yanjie Wei2,7,8, Qianshen Zhang1, Dongsheng Yu4, Weizhong Li6,9,10, Godfrey Chi-Fung Chan1,11.   

Abstract

As immunotherapy is evolving into an essential armamentarium against cancers, numerous translational studies associated with relevant biomarkers, targets, and clinical effects have been reported in recent years. However, a large amount of associated experimental data remains unexplored due to the difficulty in accessibility and utilization. Here, we established a comprehensive high-quality database for cancer immunotherapy called CanImmunother (http://www.biomedical-web.com/cancerit/) through manual curation on 4515 publications. CanImmunother contains 3267 experimentally validated associations between 218 cancer sub-types across 34 body parts and 484 immunotherapies with 642 biomarkers, 108 targets, and 121 control therapies. Each association was manually curated by professional curators, incorporated with valuable annotation and cross references, and assigned with an association score for prioritization. To help clinicians and researchers in identifying and discovering better cancer immunotherapy and their respective biomarkers and targets, CanImmunother offers user-friendly web applications including search, browse, excel table, association prioritization, and network visualization. CanImmunother presents a landscape of experimental cancer immunotherapy association data, serving as a useful resource to improve our insight and to facilitate further discovery of advanced immunotherapy options for cancer patients.
© 2021 The Author(s). Published with license by Taylor & Francis Group, LLC.

Entities:  

Keywords:  Cancer immunotherapy; biomarker; database; immune checkpoint; tumor vaccine

Year:  2021        PMID: 34345532      PMCID: PMC8288037          DOI: 10.1080/2162402X.2021.1944553

Source DB:  PubMed          Journal:  Oncoimmunology        ISSN: 2162-4011            Impact factor:   8.110


Introduction

Cancer immunotherapy has emerged and rapidly developed over the past few decades[. Its basic principle is to promote and facilitate the immune system in targeting cancer cells. The technologies include single or bispecific monoclonal antibodies, immune checkpoint inhibitors, various forms of cellular therapies, immunomodulatory cytokines and chemokines, tumor vaccines, and so on [. One of the successful strategies is immune checkpoint inhibitor, which targets at the cytotoxic T lymphocyte-associated protein 4 (CTLA-4), the programmed cell death protein 1 (PD-1), or their ligands (such as PD-L1).[ Currently, various forms of cancer immunotherapy have already been proven by clinical trials to be effective, and they become an essential part of contemporary cancer therapy [. Although applications of cancer immunotherapy cover a broad range of human cancers, their usefulness is restricted by whether particular cancer types have tumor-specific antigens or co-inhibitory molecules.[ In addition, different forms of cancer immunotherapy have their own unique toxicity profiles, depending on their mechanism of action, which are distinct from the usual therapy related to toxicity encountered in chemotherapy.[ However, useful association data about cancer immunotherapy and their predictive biomarkers, therapy combination, clinical efficacy, and adverse effects, may still be unexplored within the large amount of categorical literature, which are difficult to be accessed and analyzed. To facilitate the discovery of putative cancer immunotherapies and their respective targets, several types of databases and tools have been developed for predicting or detecting tumor-specific neoantigen and immune antigen,[ compiling cancer immune checkpoints and their modulators [, evaluating genetic variants on tumor immune infiltration [, exploring molecular mechanism of traditional Chinese medicine on cancer immunology [, and providing oncolytic virus-based cancer immunotherapy.[ For example, TSNAdb [ is a database for tumor-specific neoantigens from immunogenomics data analysis; pVAC-Seq [ is a genome-guided tool for identifying tumor neoantigens; pTuneos [ is another tool for prioritizing tumor neoantigens from next-generation sequencing data; and CancerImmunityQTL [ is a database to systematically evaluate the impact of genetic variants on tumor immune infiltration. In October 2020, Zhang et al. developed the CKTTD [ database via enhanced text-mining system with manual curation. CKTTD provides the association data between cancer immune checkpoint therapies and their targets. These databases and tools have facilitated the discovery of putative immunotherapies and targets for human cancers. However, the lack of public accessible common database for in-depth validation of cancer immunotherapy associated with their predictive biomarkers, targets, control therapies, clinical efficacy, and adverse events remains the current bottleneck. The establishment of such database can help clinicians and researchers to identify and develop novel immunotherapy options for cancer patients. To tackle these problems, we developed the CanImmunother database () through manual curation on 4515 publications. CanImmunother is the first comprehensive database to provide experimentally validated cancer immunotherapies associated with their biomarkers, targets, and control therapies, enabling the comparison and clarification of their respective clinical efficacy and the adverse events. The association data in CanImmunother were consistently annotated with standard terminology and ontology (Figure 1). Currently, CanImmunother provides 3267 experimentally validated associations between 218 cancer sub-types and 484 immunotherapies with 642 biomarkers, 108 targets, and 121 control therapies (Table 1). Each association was reviewed, manually curated by multiple professional curators and incorporated with valuable annotation and cross-references. CanImmunother offers user-friendly web interfaces and web applications such as excel table, association prioritization, and network visualization, to help clinicians and researchers in identifying and discovering advanced cancer immunotherapies and their respective biomarkers and targets. CanImmunother will be able to serve as a useful resource to improve our insight and to facilitate the identification and discovery of advanced immunotherapy options for patients with cancer.
Figure 1.

The data curation and annotation framework of canimmunother

Table 1.

Database contents and features of CanImmunother compared with CKTTDB

Content and web applicationsCanImmunotherCKTTDBCanImmunother/CKTTDB (Fold change)
Non-redundant associations264621012.60
Cancer sub-types218336.61
Immunotherapy484 (Various types of immunotherapy *)53 (Immune checkpoint therapy only)9.13
Control therapy121None--
Biomarker642None--
Target1081051.03
Clinical efficacyYesNone--
Adverse eventYesNone--
Sample information (type and size)YesNone--
Study designYesNone--
Research typeYesNone--
Gene expression profile for targetYesYes--
Association scoreYesYes--
Analysis applicationAssociation prioritization;Network visualizationNoneMore web applications
Data qualityManual curation on literatureCuration with an enhanced text-mining system and data integrationHigher quality

Note: * The various types of immunotherapy include immune checkpoint therapy, tumor vaccine, immune-related cytokine, cellular immunotherapy, oncolytic viruses, and their combination with other non-immunotherapy, such as chemotherapy, target therapy, radiotherapy, surgery, chemoradiotherapy, and hormone therapy (Figure 2c).

The data curation and annotation framework of canimmunother Database contents and features of CanImmunother compared with CKTTDB Note: * The various types of immunotherapy include immune checkpoint therapy, tumor vaccine, immune-related cytokine, cellular immunotherapy, oncolytic viruses, and their combination with other non-immunotherapy, such as chemotherapy, target therapy, radiotherapy, surgery, chemoradiotherapy, and hormone therapy (Figure 2c).

Materials and methods

Literature reviewing and manual curation

To collect literature data manually, according to our previous method [, we searched the National Center for Biotechnology Information (NCBI) PubMed database [ for candidate publications that described the studies for human cancer immunotherapy. Search terms and their combinations used in the search strategy included cancer, carcinoma, neoplasm, tumor, leukemia, lymphoma, melanoma, malignancy, immunotherapy, immune checkpoint, and specific immune checkpoint agent and nonimmune checkpoint agent names, which were described in the systematic review publications.[ 4515 candidate publications were retrieved before August 2020. We then filtered the abstracts of these candidate publications based on two criteria. First, the publications are original research literature (including clinical and basic research papers, as well as case reports). However, review and commentary papers were excluded. Second, the publications reported experimentally validated human cancer immunotherapy, such as immune checkpoint therapy, tumor vaccine, immune related cytokine, cellular immunotherapy, and so on. 1932 publications were retained after filtering abstracts. Furthermore, the full texts of 1932 publications were reviewed and manually curated by multiple professional curators to collect and annotate the association data for cancer immunotherapy. All the curators are professionals in tumor immunology and genetics. To ensure the quality in the data curating process based on our previous manual curation method [, we randomly selected 10.35% (200/1932) of the publications initially. They were curated and discussed among all curators to achieve a consensus for data manual curation. Second, each publication was reviewed and manually curated by at least two curators. Third, if the curated data from the same publication by different curators were not consistent, a third curator would review the publication and further discuss with the team to reach a consistent decision. The collected data include cancer sub-type, immunotherapy and its control therapy, biomarker, target, adverse event, sample information (patients/cell lines/animal models and size), PubMed identifier (PMID), research type, study design, and validation evidence, from the full text of the supporting publications (Figure 1). The study design information collected from publications, such as clinical trials of different phases, meta-analysis, and retrospective cohort study, enables users to clearly evaluate their reliability and methodological bias. In order to make the association data more useful, each association was annotated with a clinical significance, which is a brief summing-up description annotated by professional curators based on the information from the original publications (Figure 1).

Data annotation

To make the extracted data consistent and accessible, the cancer sub-types, immunotherapies and their control therapies, biomarkers, targets, and validating evidences in CanImmunother were manually annotated with standard terminology and ontology (Figure 1). Cancer sub-types were annotated by Experimental Factor Ontology (EFO), which provides a systematic description of many experimental variables available in the European Bioinformatics Institute databases and for many international projects [. The majority of immunotherapies and their control therapies were annotated by DrugBank [, while the rest of the therapies, of which DrugBank does not cover, were annotated by National Cancer Institute Thesaurus (NCIT) OBO Edition [. The UniProtKB [ and NCIT OBO Edition [ resources were adopted to annotate biomarkers and targets. The Evidence & Conclusion Ontology (ECO) [ was used to annotate validation evidences. The multiple terminology and ontology resources were integrated for the data annotation that is to guarantee the accuracy. Moreover, the targets in CanImmunother were annotated with the expression (mean of FPKM) profile on gene level across 15 body parts in the Genotype-Tissue Expression (GTEx) [ and The Cancer Genome Atlas (TCGA) [ resources by using the PreMedKB tool [. In addition, each association was also systematically annotated with title, abstract, published journal, and published date of the supporting publication. The information of resources used in CanImmunother is detailed in Supplemental Table 1.

Association score

In exploiting large collections of aggregated association data, one of the main problems is how to prioritize and interpret the association data [. According to our previous methods,[ we refined a scoring model to compute an association score for each association data with or without predictive biomarker in CanImmunother, respectively. It was based on two evidential metrics: the research types of the supporting publications (e.g., clinical research, basic research, and case report) and the number of supporting publications. Different research types were assigned with different weights based on their reliability. Moreover, as a larger number of publications can enhance association score for the same association, a harmonic sum function [ was employed not only to compute the score of the same association with multiple supporting publications but also to dampen the effect of data volume or quantity. Finally, association scores were normalized to limit their range from 0 to 1.0 for better interpretation. The computation steps of association score were described in detail at the “Help” web-page of the database (http://www.biomedical-web.com/cancerit/help.jsp).

Web implementation

The web database was built with Spring MVC and jQuery AJAX frameworks. All association data in CanImmunother was organized in MySQL. The data accessing and processing programs were written in Java. The web interface was implemented by JavaScript, HTML5, and CSS3. The vis.js widget (http://www.visjs.org) was implemented to display the networks on the web-pages. The widgets of excel-bootstrap-table-filter-bundle.js and dataTables.bootstrap.js were used to implement the filter tables on the web pages. CanImmunother is freely available at the website http://www.biomedical-web.com/cancerit/.

Results

Data contents

Currently, CanImmunother provides 3267 experimentally validated associations between 218 cancer sub-types across 34 body parts and 484 immunotherapies with 642 biomarkers, 108 targets, and 121 control therapies across 1661 supporting publications (Figure 1 & Table 1). Each association was incorporated with valuable annotation and cross-references and given with a unique accession number (e.g., CANIT0000001). The top six cancer sub-types with the largest numbers of associations are melanoma, non-small cell lung carcinoma, renal cell carcinoma, colorectal cancer, urothelial carcinoma, and prostate cancer (Figure 2a). The top six body parts with the largest numbers of associations are skin, lung, kidney, blood and lymph node, bladder, and intestines (Figure 2b). CanImmunother provides various types of immunotherapy, which include immune checkpoint therapy, tumor vaccine, immune-related cytokine, cellular immunotherapy, oncolytic virus, and their combination, or they combined with other non-immunotherapy, such as chemotherapy, target therapy, radiotherapy, hormone therapy, chemoradiotherapy, and surgery (Figure 2c). The top six largest numbers of associations in CanImmunother are related to immune checkpoint therapy, immune checkpoint therapy plus chemotherapy, tumor vaccine, immune checkpoint therapy plus target therapy, immune checkpoint therapy plus radiotherapy, and immune checkpoint therapy plus tumor vaccine (Figure 2c). Importantly, CanImmunother also provides 642 biomarkers and 108 targets for the cancer immunotherapy associations. The top six number of biomarker types are gene expression signature on/in tumor cell, mutational status, disease status, gene expression signature on/in immune cell, gene expression signature in serum, and cell percentage/counts in blood (Figure 2d). Remarkably, the biomarker of PD-L1 expression level makes up the most common associations and is associated with 92 cancer sub-types and 36 immunotherapies to form 223 associations in CanImmunother. In addition, approximately 74.47% (2433/3267), 3.86% (126/3267), and 21.67% (708/3267) of the associations are supported with clinical research publications, basic research publications, and case reports, respectively (Figure 2e).
Figure 2.

The landscape of association data in canimmunother. (a) A word-cloud diagram shows the association landscape of 218 cancer sub-types in canimmunother. Larger sizes and more central locations of the cancer sub-type symbols in the diagram indicate more association data in the database. (b) The numbers of associations in different types of body part. (c) The number of associations in different types of immunotherapy. Others include cell immunotherapy followed by immune checkpoint therapy, cell immunotherapy followed by target therapy, cell immunotherapy plus immune cytokine, immune checkpoint therapy followed by immune checkpoint therapy plus target therapy, immune checkpoint therapy followed by radiotherapy, immune checkpoint therapy plus anti-angiogenesis therapy, immune checkpoint therapy plus cell immunotherapy, immune checkpoint therapy plus cell immunotherapy plus radiotherapy, immune checkpoint therapy plus chemotherapy followed by surgery, immune checkpoint therapy plus chemotherapy or radiotherapy, immune checkpoint therapy plus oncolytic virus plus chemotherapy, immune checkpoint therapy plus systemic therapy, immune cytokine followed immune checkpoint therapy, immune cytokine plus chemotherapy, immune cytokine plus target therapy followed by immune checkpoint therapy, radiotherapy followed by immune checkpoint therapy plus chemotherapy, surgery plus target therapy followed by immune checkpoint therapy, target therapy followed by immune checkpoint therapy plus hormone therapy. (d) The number of immunotherapy related biomarkers for different types of biomarker. (e) The percentage and number of associations in different research types of the supporting publication

The landscape of association data in canimmunother. (a) A word-cloud diagram shows the association landscape of 218 cancer sub-types in canimmunother. Larger sizes and more central locations of the cancer sub-type symbols in the diagram indicate more association data in the database. (b) The numbers of associations in different types of body part. (c) The number of associations in different types of immunotherapy. Others include cell immunotherapy followed by immune checkpoint therapy, cell immunotherapy followed by target therapy, cell immunotherapy plus immune cytokine, immune checkpoint therapy followed by immune checkpoint therapy plus target therapy, immune checkpoint therapy followed by radiotherapy, immune checkpoint therapy plus anti-angiogenesis therapy, immune checkpoint therapy plus cell immunotherapy, immune checkpoint therapy plus cell immunotherapy plus radiotherapy, immune checkpoint therapy plus chemotherapy followed by surgery, immune checkpoint therapy plus chemotherapy or radiotherapy, immune checkpoint therapy plus oncolytic virus plus chemotherapy, immune checkpoint therapy plus systemic therapy, immune cytokine followed immune checkpoint therapy, immune cytokine plus chemotherapy, immune cytokine plus target therapy followed by immune checkpoint therapy, radiotherapy followed by immune checkpoint therapy plus chemotherapy, surgery plus target therapy followed by immune checkpoint therapy, target therapy followed by immune checkpoint therapy plus hormone therapy. (d) The number of immunotherapy related biomarkers for different types of biomarker. (e) The percentage and number of associations in different research types of the supporting publication

Browse and search

CanImmunother (http://www.biomedical-web.com/cancerit/) provides user-friendly web interfaces and web services to enable users to search, browse, and analyze the association data, as well as to download and submit new associations for further investigation and integration. The “Browse all” web-page presents all of the associations in a table to allow users to filter for the interesting associations through the “search” box. Moreover, we implemented “Browse” sub-web-pages to allow users to browse interesting associations through different body parts, cancer sub-types, biomarker types, and immunotherapy types. In addition, at the “Home” web-page, the symbols in the word-cloud diagrams can be navigated to browse their entry details. To enable users to quickly retrieve the interesting associations and their supporting publications, CanImmunother provides a search application using cancer sub-type, immunotherapy, biomarker, or target with setting filtration parameters, such as body part, immunotherapy type, biomarker type, and research type (Figure 3a). The search application also provides a smart assistance by listing the closest entries to that expectation. The resulting associations are shown in a brief table that displays key information, including cancer sub-types, immunotherapies and their control therapies, biomarkers, targets, study designs, research types, and PMIDs (Figure 3b). Moreover, the “Detail” button in the result table links to further webpages for extra information of the association (Figure 3b). The extra information includes clinical significance and adverse event of the immunotherapy compared with the control therapy, sample information, validating evidence, and other information of the supporting publication (Figure 3b). In addition, CanImmunother provides external links to the related reference resources, such as NCBI PubMed, EFO, DrugBank, NCIT OBO Edition, UniProtKB, and ECO (Figure 3b). Furthermore, for the immunotherapy associated target, CanImmunother provides its expression (mean of FPKM) profile on gene level in the GTEx and TCGA resources across 15 body parts (Figure 3c).
Figure 3.

The web interface of search and excel table application. (a) A resulting table by searching words like “non-small cell lung carcinoma” indicates non-small cell lung carcinoma associating with 97 immunotherapies, 219 biomarkers, 20 targets, and 27 control therapies to form 624 associations. (b) Each association in CanImmunother was manually curated and annotated with valuable annotation. (c) The mean expression profile of fragments per kilobase of exon model per million mapped fragments (FPKM) of CTLA-4 in the GTEx and TCGA resources across 15 body parts

The web interface of search and excel table application. (a) A resulting table by searching words like “non-small cell lung carcinoma” indicates non-small cell lung carcinoma associating with 97 immunotherapies, 219 biomarkers, 20 targets, and 27 control therapies to form 624 associations. (b) Each association in CanImmunother was manually curated and annotated with valuable annotation. (c) The mean expression profile of fragments per kilobase of exon model per million mapped fragments (FPKM) of CTLA-4 in the GTEx and TCGA resources across 15 body parts A simple search in CanImmunother with the keyword “non-small cell lung carcinoma” can obtain 624 associations between non-small cell lung carcinoma (NSCLC) and 97 immunotherapies with 219 biomarkers, 20 targets, and 27 control therapies (Figure 3a). In order to allow users to filter out interesting associations efficiently from those large numbers of associations, an excel table application was implemented for combinational filtration. For example, to compare the clinical efficacy and adverse event of atezolizumab immunotherapy with docetaxel chemotherapy in NSCLC, we used the excel table application in CanImmunother and efficiently obtained five associations, which describe atezolizumab versus docetaxel with their biomarkers and targets for NSCLC, from 624 associations (Figure 3a & 4). Moreover, we further accessed and followed the five associations in CanImmunother. The five associations suggested that (1) compared with docetaxel, atezolizumab is safer with better survival in patients with advanced NSCLC, regardless of PD-L1 expression; however, higher PD-L1 levels on tumor cells and tumor-infiltrating immune cells were likely to correlate with better outcome; (2) compared with docetaxel, atezolizumab significantly improved survival in patients with advanced NSCLC in overall and in the EGFR wild-type subgroup, but not in the EGFR mutant subgroup. So, through searching, filtering, accessing, and following the interesting association data in CanImmunother, we identified that atezolizumab immunotherapy is a better therapy for advanced NSCLC in comparison with docetaxel chemotherapy, and PD-L1 expression and EGFR mutational status are predictive biomarkers for the beneficial effect of atezolizumab. CanImmunother assists users to identify better cancer immunotherapies and their predictive biomarkers Multiple publications probably support the same association. To enable further analysis of the associations, 2646 non-redundant associations were produced from the 3267 associations in CanImmunother by removing data redundancy based on the supporting publications. Each non-redundant association was assigned with an association score for further prioritization and analysis. The assignment of association score was described in detail at the “Materials and Methods” and the “Help” webpage. To promptly evaluate the associations, an association prioritization application was established based on the non-redundant association data in CanImmunother. Association prioritization application allows users to retrieve a cancer sub-type, an immunotherapy, a biomarker, and a target alone or together for prioritizing cancer immunotherapy association data with their predictive biomarkers and targets. For instance, we searched by a keyword like “serum lactate dehydrogenase level” and promptly identified that seven cancer sub-types and ten immunotherapies are associated with the biomarker of serum lactate dehydrogenase level to form sixteen associations. The top three associated immunotherapies are ipilimumab, nivolumab, and pembrolizumab, and the top three associated cancer sub-types are melanoma, cutaneous melanoma, and non-small cell lung carcinoma (Figure 5a). The results allow sorting by association scores and filtering by specific cancer sub-types, immunotherapies, biomarkers, and targets through the “search” box (Figure 5a). In addition, each prioritized association data can be optionally visualized in an interactive network diagram to display their relationships (Figure 5b). Figure 5b displays all experimentally validated biomarkers for uveal melanoma with nivolumab therapy to target PD-1 protein.
Figure 5.

The web interface of association prioritization and network visualization in canimmunother. (a) A resulting table prioritizes cancer sub-types, immunotherapies, and targets associated with the biomarker of serum lactate dehydrogenase level. (b) A network diagram displays all experimentally validated biomarkers for uveal melanoma with nivolumab therapy to target PD-1 protein. Green lines connect biomarker with cancer sub-type and immunotherapy, while blue lines connect target with cancer sub-type and immunotherapy. The values on the green lines are association scores. The thicker green lines represent larger association scores, and the thinner green lines for smaller association scores. The association scores are ranging from 0 to 1.0

The web interface of association prioritization and network visualization in canimmunother. (a) A resulting table prioritizes cancer sub-types, immunotherapies, and targets associated with the biomarker of serum lactate dehydrogenase level. (b) A network diagram displays all experimentally validated biomarkers for uveal melanoma with nivolumab therapy to target PD-1 protein. Green lines connect biomarker with cancer sub-type and immunotherapy, while blue lines connect target with cancer sub-type and immunotherapy. The values on the green lines are association scores. The thicker green lines represent larger association scores, and the thinner green lines for smaller association scores. The association scores are ranging from 0 to 1.0 To explore the relationships of the experimentally validated association data in CanImmunother for discovering cancer immunotherapies and their predictive biomarkers and targets, a network visualization application was implemented in CanImmunother. The application allows users to input a set of cancer sub-types, immunotherapies, biomarkers, and targets and to construct interactive networks to display their relationships. For instance, we entered an input of “head and neck squamous cell carcinoma and nasopharyngeal squamous cell carcinoma” and constructed an interactive network to explore the relationships of the two cancer sub-types with immunotherapies, biomarkers, and targets (Figure 6). The interaction diagram shown that head and neck squamous cell carcinoma has nine predictive biomarkers such as PD-L1 and PD-L2 expression level, HPV infection, tumor mutational burden, and DNA mismatch repair-deficient or microsatellite instability for nivolumab, pembrolizumab, and durvalumab therapy to, respectively, target PD-1 and PD-L1 proteins, while nasopharyngeal squamous cell carcinoma has only one predictive biomarker of EBV-positive for MVA-EL vaccine therapy to target EBNA1 and LMB2 proteins (Figure 6). Moreover, the interaction diagram also implied that the nine biomarkers of head and neck squamous cell carcinoma may be potential predictive biomarkers for nasopharyngeal squamous cell carcinoma with nivolumab, pembrolizumab, and durvalumab therapy, but need to further validate and confirm by clinical trials (Figure 6). Furthermore, the network diagram indicates that for patients with head and neck squamous cell carcinoma, PD-L1 expression level on tumor cells is the common biomarker of pembrolizumab and durvalumab by targeting PD-1 and PD-L1 proteins, respectively (Figure 6). So, the network visualization application not only demonstrates relationships between different cancer sub-types and immunotherapies with their respective biomarkers and targets but also discovers and develops potential cancer immunotherapies and their predictive biomarkers and targets from the experimentally validated association data in CanImmunother. As the interactive networks may consist of many nodes and even more interactions (Figure 5b and Figure 6), CanImmunother offers users filtering function to hide those interactions that are less interesting (Figure 6). When selecting or unselecting some of the nodes, the interactive network will be changed accordingly, and a sub-network of the whole interactive network will then be displayed. In addition, all nodes in the network allow adjustment except for node legends.
Figure 6.

Network visualization explores relationships of the experimental association data in canimmunother to discover potential cancer immunotherapies and their predictive biomarkers and targets

Network visualization explores relationships of the experimental association data in canimmunother to discover potential cancer immunotherapies and their predictive biomarkers and targets

Data access

Web service application programming interfaces (APIs) were implemented for programmatic access of association data in the CanImmunother database. The accessing data by the APIs are available in the universal JSON formats. Moreover, all association data in the database can be freely downloaded for further investigation and integration. In addition, we encourage users to submit their new experimentally validated association data for cancer immunotherapy. Once checked and approved by our submission review committee, the submission data will be included in a future release. Finally, a detailed tutorial for the database is available on the ‘Help’ web-page.

Discussion and conclusion

As the publications for cancer immunotherapy exploded exponentially in recent years, a large amount of experimentally validated association data for cancer immunotherapy remains hidden in the literature and is difficult to access and utilize. Therefore, the curation and analysis of these association data from publications can economically utilize the existing data fully for expediting translational research and application of cancer immunotherapy. In this study, we designed and constructed a comprehensive database called CanImmunother through manually curating cancer immunotherapy association data from peer-review publications. We consistently correlated these association data with valuable annotation. As far as we are aware, CanImmunother is the first comprehensive and high-quality database to provide experimentally validated information of cancer immunotherapy in association with their biomarkers, targets, and control therapies. This database can help to compare and clarify on clinical efficacy and adverse events of specific immunotherapy on a particular cancer type. To accelerate the progress of cancer immunotherapy, several useful computational resources for cancer immunology have been developed recently.[ Different from these computational resources such as CKTTD, our CanImmunother database focuses on providing various cancer immunotherapy association data with experimentally validated to help clinicians and researchers for identification and discovery of advanced immunotherapy options for patients with cancer. In Case study 1, we used CanImmunother to access and follow the association data about atezolizumab immunotherapy versus docetaxel chemotherapy in advanced NSCLC and identified that atezolizumab is a better option in terms of efficacy and safety when compared with docetaxel, and PD-L1 expression and EGFR mutational status are predictive biomarkers for atezolizumab benefit. Moreover, in Case study 3, our network visualization application improved our insight on potential cancer immunotherapy and their predictive biomarkers and targets through exploring the relationships between different cancer sub-types, immunotherapies and their biomarkers and targets. Compared with CKTTD, our CanImmunother database significantly outperforms CKTTD in data coverage, data quality, and application feature (Table 1). First, our CanImmunother database covers various types of cancer immunotherapy, including immune checkpoint therapy, tumor vaccine, immune related cytokine, cellular immunotherapy, oncolytic virus, and their combination or they combined with other non-immunotherapy, such as chemotherapy, target therapy, radiotherapy, chemoradiotherapy and hormone therapy, while CKTTD provides information related to single agent of immune checkpoint therapy only (Figure 2c & Table 1). Second, each association in CanImmunother was annotated with extra valuable annotation, including predictive biomarkers, control therapies, clinical efficacy and adverse events, sample information, and the research type and study design of the supporting publications, while associations in CKTTD do not have similar function (Figure 3c & Table 1). Third, CanImmunother offers extra applications for further analysis on the association data, including excel table application, association prioritization, and network visualization (Table 1). Finally, the number of associations, cancer sub-types, and immunotherapies contained in CanImmunother are approximately 12.60-, 6.61-, and 9.13-fold of those in CKTTD (Table 1). In addition, each association in CanImmunother was designated with an association score for prioritization by assigning different weights to different research types of the supporting publications. The comparison of data contents and features between CanImmunother and CKTTD is shown in Table 1. With the rapid advancement of cancer immunotherapy, more and more experimentally validated cancer immunotherapy data are expected to be reported in the near future. To serve the research communities in fully utilizing the vast amount of data, we will update CanImmunother every six months and constantly improve it with more features and functionalities. Currently, all data in CanImmunother were manually curated from peer-review publications, thus without association data from the international collaboration projects and resources, such as TCGA [, International Cancer Genome Consortium (ICGC) [, and Gene Expression Omnibus (GEO) [. Therefore, we plan to enrich new association data through analyzing cancer immunotherapy-related datasets in those projects and resources. In addition, we will also develop and integrate more computational resources [ and tools [ in the database to annotate and analyze those association data, such as similarity prediction between new chemicals and known cancer immunotherapeutic agents. In conclusion, as a timely and helpful resource, CanImmunother will enhance our insight on cancer immunotherapy, and to facilitate the identification and discovery of advanced immunotherapy options for patients with cancer. Click here for additional data file.
  36 in total

1.  The International Cancer Genome Consortium Data Portal.

Authors:  Junjun Zhang; Rosita Bajari; Dusan Andric; Francois Gerthoffert; Alexandru Lepsa; Hardeep Nahal-Bose; Lincoln D Stein; Vincent Ferretti
Journal:  Nat Biotechnol       Date:  2019-04       Impact factor: 54.908

2.  NCBI GEO: archive for functional genomics data sets--update.

Authors:  Tanya Barrett; Stephen E Wilhite; Pierre Ledoux; Carlos Evangelista; Irene F Kim; Maxim Tomashevsky; Kimberly A Marshall; Katherine H Phillippy; Patti M Sherman; Michelle Holko; Andrey Yefanov; Hyeseung Lee; Naigong Zhang; Cynthia L Robertson; Nadezhda Serova; Sean Davis; Alexandra Soboleva
Journal:  Nucleic Acids Res       Date:  2012-11-27       Impact factor: 16.971

3.  IMGT®, the international ImMunoGeneTics information system® 25 years on.

Authors:  Marie-Paule Lefranc; Véronique Giudicelli; Patrice Duroux; Joumana Jabado-Michaloud; Géraldine Folch; Safa Aouinti; Emilie Carillon; Hugo Duvergey; Amélie Houles; Typhaine Paysan-Lafosse; Saida Hadi-Saljoqi; Souphatta Sasorith; Gérard Lefranc; Sofia Kossida
Journal:  Nucleic Acids Res       Date:  2014-11-05       Impact factor: 19.160

4.  Expitope 2.0: a tool to assess immunotherapeutic antigens for their potential cross-reactivity against naturally expressed proteins in human tissues.

Authors:  Victor Jaravine; Anja Mösch; Silke Raffegerst; Dolores J Schendel; Dmitrij Frishman
Journal:  BMC Cancer       Date:  2017-12-28       Impact factor: 4.430

5.  DrugBank 5.0: a major update to the DrugBank database for 2018.

Authors:  David S Wishart; Yannick D Feunang; An C Guo; Elvis J Lo; Ana Marcu; Jason R Grant; Tanvir Sajed; Daniel Johnson; Carin Li; Zinat Sayeeda; Nazanin Assempour; Ithayavani Iynkkaran; Yifeng Liu; Adam Maciejewski; Nicola Gale; Alex Wilson; Lucy Chin; Ryan Cummings; Diana Le; Allison Pon; Craig Knox; Michael Wilson
Journal:  Nucleic Acids Res       Date:  2018-01-04       Impact factor: 16.971

6.  ECO, the Evidence & Conclusion Ontology: community standard for evidence information.

Authors:  Michelle Giglio; Rebecca Tauber; Suvarna Nadendla; James Munro; Dustin Olley; Shoshannah Ball; Elvira Mitraka; Lynn M Schriml; Pascale Gaudet; Elizabeth T Hobbs; Ivan Erill; Deborah A Siegele; James C Hu; Chris Mungall; Marcus C Chibucos
Journal:  Nucleic Acids Res       Date:  2019-01-08       Impact factor: 16.971

7.  TCMIO: A Comprehensive Database of Traditional Chinese Medicine on Immuno-Oncology.

Authors:  Zhihong Liu; Chuipu Cai; Jiewen Du; Bingdong Liu; Lu Cui; Xiude Fan; Qihui Wu; Jiansong Fang; Liwei Xie
Journal:  Front Pharmacol       Date:  2020-04-15       Impact factor: 5.810

8.  DeepHLApan: A Deep Learning Approach for Neoantigen Prediction Considering Both HLA-Peptide Binding and Immunogenicity.

Authors:  Jingcheng Wu; Wenzhe Wang; Jiucheng Zhang; Binbin Zhou; Wenyi Zhao; Zhixi Su; Xun Gu; Jian Wu; Zhan Zhou; Shuqing Chen
Journal:  Front Immunol       Date:  2019-11-01       Impact factor: 7.561

9.  The Immune Epitope Database (IEDB): 2018 update.

Authors:  Randi Vita; Swapnil Mahajan; James A Overton; Sandeep Kumar Dhanda; Sheridan Martini; Jason R Cantrell; Daniel K Wheeler; Alessandro Sette; Bjoern Peters
Journal:  Nucleic Acids Res       Date:  2019-01-08       Impact factor: 16.971

10.  CancerImmunityQTL: a database to systematically evaluate the impact of genetic variants on immune infiltration in human cancer.

Authors:  Jianbo Tian; Yimin Cai; Yue Li; Zequn Lu; Jinyu Huang; Yao Deng; Nan Yang; Xiaoyang Wang; Pingting Ying; Shanshan Zhang; Ying Zhu; Huilan Zhang; Rong Zhong; Jiang Chang; Xiaoping Miao
Journal:  Nucleic Acids Res       Date:  2021-01-08       Impact factor: 16.971

View more
  1 in total

1.  Immune checkpoint inhibitor (ICI) genes and aging in malignant melanoma patients: a clinicogenomic TCGA study.

Authors:  Mohammed Safi; Chenxing Jin; Abdullah Aldanakh; Ping Feng; Henan Qin; Mohammed Alradhi; Lizhi Zhang; Junying Zhang; Salah Adlat; Yi Zhao; Jiwei Liu
Journal:  BMC Cancer       Date:  2022-09-13       Impact factor: 4.638

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.