Literature DB >> 29592813

Cancer Genome Interpreter annotates the biological and clinical relevance of tumor alterations.

David Tamborero1,2, Carlota Rubio-Perez1, Jordi Deu-Pons1,2, Michael P Schroeder1,3, Ana Vivancos4, Ana Rovira5,6, Ignasi Tusquets5,6,7, Joan Albanell5,6,8, Jordi Rodon4, Josep Tabernero4, Carmen de Torres9, Rodrigo Dienstmann4, Abel Gonzalez-Perez1,2, Nuria Lopez-Bigas10,11,12.   

Abstract

While tumor genome sequencing has become widely available in clinical and research settings, the interpretation of tumor somatic variants remains an important bottleneck. Here we present the Cancer Genome Interpreter, a versatile platform that automates the interpretation of newly sequenced cancer genomes, annotating the potential of alterations detected in tumors to act as drivers and their possible effect on treatment response. The results are organized in different levels of evidence according to current knowledge, which we envision can support a broad range of oncology use cases. The resource is publicly available at http://www.cancergenomeinterpreter.org .

Entities:  

Mesh:

Substances:

Year:  2018        PMID: 29592813      PMCID: PMC5875005          DOI: 10.1186/s13073-018-0531-8

Source DB:  PubMed          Journal:  Genome Med        ISSN: 1756-994X            Impact factor:   11.117


Background

The accumulation of so-called “driver” genomic alterations confers on cells tumorigenic capabilities [1]. Thousands of tumor genomes are sequenced around the world every year for both research and clinical purposes. In some cases the whole genome is sequenced while in others the focus is on the exome or a panel of selected genomic regions. It then becomes necessary to annotate which of the somatic mutations identified by the sequencing have a possible role in tumorigenesis and treatment response. This process, which we refer to as “the interpretation of cancer genomes”, is currently tedious and largely unsolved. One of its major bottlenecks is the identification of the alterations driving the tumor. A widely employed approach to solve this hurdle consists in focusing on the mutations affecting known cancer genes, i.e., tumor suppressors and oncogenes. These were initially identified through experimentation, giving rise over the past 40 years to a stable census of human cancer genes [2]. More recently, projects re-sequencing large cohorts of tumors have provided the opportunity to systematically identify genes involved in tumorigenesis through the detection of signals of positive selection in their alteration patterns across tumors of some two dozen malignancies [3-6]. However, many of the somatic variants detected in tumors, even those in cancer genes, still have uncertain significance and thus it is not clear whether or not they are relevant for tumorigenesis. Another hurdle in the interpretation of cancer genomes concerns one of its crucial aims: the identification of tumor alterations that may affect treatment options. Unstructured information on the effectiveness of therapies targeting specific cancer drivers is continuously generated by clinical trials and pre-clinical experiments, and currently several resources are dedicated to gather and curate these data, such as ClinVar [7], DoCM [8], OncoKB [9], My Cancer Genome (https://www.mycancergenome.org), PMKB [10], PCT (https://pct.mdanderson.org), CIViC [11], and JAX-CKB (https://ckb.jax.org). Nevertheless, none of these resources address the whole process of interpretation and researchers and clinicians thus face a challenging task to annotate the variants detected in a newly sequenced cancer genome with their collective information. Here, we describe the Cancer Genome Interpreter (CGI), a platform that systematizes the interpretation of cancer genomes, the main hallmark of which is the streamlining and automatization of the whole process (Additional file 1: Table S1). Specifically, the CGI addresses the two aforementioned challenges. On the one hand, it identifies all known and likely tumorigenic genomic alterations (point mutations, small insertions/deletions, copy number alterations and/or gene fusions) of a newly sequenced tumor, including the assessment of variants of unknown significance. On the other, it annotates all variants of the tumor that constitute state-of-the-art biomarkers of drug response organized using different clinical evidence. The CGI accepts several data formats and its output reports are provided in a user-friendly interactive framework that organizes the results according to distinct levels of clinical relevance, which may thus be used in a broad range of applications.

Construction and content

The CGI employs existing or newly developed resources and computational methods to annotate and analyze the alterations in a tumor according to distinct levels of evidence (Fig. 1a; details in Additional file 2: Note I). The tool is freely available through an API or a web interface at http://www.cancergenomeinterpreter.org, under an open license, with the aim of facilitating its use by cancer researchers and clinical oncologists (Fig. 1b–d). In the following sections we present the blueprint for the interpretation of cancer genomes implemented by the CGI, describe the resource, and discuss its utility.
Fig. 1

Cancer Genome Interpreter. a Outline of the CGI workflow. With a list of genomic alterations as input, the CGI automatically recognizes the format, remaps the variants as needed, and standardizes the annotation for downstream compatibility. All analyses are cancer-specific and thus the tumor type of the sample(s) to analyze is also required. Next, the CGI identifies known driver alterations and annotates and classifies the remaining variants of unknown significance. Finally, alterations that are biomarkers of drug effects are identified. b The CGI may be run via the web at http://www.cancergenomeinterpreter.org (left panel) or through an API. The web results can be stored in a private repository (right panel) for their management. The results of the CGI are provided via interactive reports. c An example of a mutation analysis report. This contains the annotations of all mutations, which empowers the user’s review, and the labels for those known or predicted to be drivers by OncodriveMUT. d An example of a biomarker match report. This contains the putative biomarkers of drug response found in the tumor organized according to distinct levels of clinical relevance. All these web reports are interactive and configurable by the user. CNA copy number alteration

Cancer Genome Interpreter. a Outline of the CGI workflow. With a list of genomic alterations as input, the CGI automatically recognizes the format, remaps the variants as needed, and standardizes the annotation for downstream compatibility. All analyses are cancer-specific and thus the tumor type of the sample(s) to analyze is also required. Next, the CGI identifies known driver alterations and annotates and classifies the remaining variants of unknown significance. Finally, alterations that are biomarkers of drug effects are identified. b The CGI may be run via the web at http://www.cancergenomeinterpreter.org (left panel) or through an API. The web results can be stored in a private repository (right panel) for their management. The results of the CGI are provided via interactive reports. c An example of a mutation analysis report. This contains the annotations of all mutations, which empowers the user’s review, and the labels for those known or predicted to be drivers by OncodriveMUT. d An example of a biomarker match report. This contains the putative biomarkers of drug response found in the tumor organized according to distinct levels of clinical relevance. All these web reports are interactive and configurable by the user. CNA copy number alteration

A comprehensive catalog of cancer genes across tumor types

One of the main aims of the interpretation of cancer genomes is to identify the alterations responsible for tumorigenic traits. In the CGI, this process begins with a focus on alterations that affect the genes capable of driving the cancer hallmarks of a particular tumor type. Therefore, we compiled a catalog of genes involved in the onset and progression of different types of cancer, obtained via different methods and from different sources (Additional file 2: Note II). First, from manually curated resources [2, 7, 8, 12, 13] and the literature we collected genes that have been experimentally or clinically verified to drive tumorigenesis. Second, we incorporated the results of bioinformatics analyses of large tumor cohorts re-sequenced by international initiatives such as The Cancer Genome Atlas (http://cancergenome.nih.gov/abouttcga) and the International Cancer Genome Consortium [14] (specifically, we identified genes whose somatic alterations exhibit signals of positive selection across 6729 tumors representing 28 types of cancer [4]). Each of these cancer genes was annotated with their mode of action in tumorigenesis (i.e., whether they function as oncogenes or tumor suppressors), on the basis of either experimentally verified sources, or in silico prediction [15]. The resulting Catalog of Cancer Genes currently comprises 837 genes with evidence of a tumorigenic role in 193 different cancer types (Fig. 2a). Each entry in the catalog thus includes, along with the name of the driver gene, (i) the malignancies it drives, organized according to available evidence; (ii) the types of alterations involved (mutations, copy number alterations, and/or gene translocations); (iii) the source(s) of this information; (iv) the context (germline or somatic) in which these alterations are tumorigenic; and (v) the gene’s mode of action in cancer as appropriate. The catalog is available for download through the CGI website (https://www.cancergenomeinterpreter.org/genes).
Fig. 2

Annotating mutations in cancer genes. a Catalog of Cancer Genes. Genes that drive tumorigenesis via mutations, copy number alterations, and/or translocations are annotated with their mode of action (MoA). b Catalog of Validated Oncogenic Mutations. Clinically or experimentally validated driver mutations were gathered from manually annotated resources and the cancer literature. c Proportion of validated mutations observed across the cancer genes of 6792 tumors. Cancer types: ALL acute lymphocytic leukemia, AML acute myeloid leukemia, BLCA bladder carcinoma, BRCA breast carcinoma, CLL chronic lymphocytic leukemia, CM cutaneous melanoma, COREAD colorectal adenocarcinoma, DLBC diffuse large B cell lymphoma, ESCA esophageal carcinoma, GBM glioblastoma multiforme, HC hepatocarcinoma, HNSC head and neck squamous cell carcinoma, LGG lower grade glioma, LUAD lung adenocarcinoma, LUSC lung squamous cell carcinoma, MB medulloblastoma, MM multiple myeloma, NB neuroblastoma, NSCLC non-small cell lung carcinoma, OV serous ovarian adenocarcinoma, PA pilocytic astrocytoma, PAAD pancreas adenocarcinoma, PRAD prostate adenocarcinoma, RCC renal clear cell carcinoma, SCLC small cell lung carcinoma, STAD stomach adenocarcinoma, THCA thyroid carcinoma, UCEC uterine corpus endometrioid carcinoma. d OncodriveMUT schema to estimate the oncogenic potential of the variants of unknown significance. A set of heuristic rules combines the annotations obtained for a given mutation with the knowledge about the genes (or regions thereof) in which it is observed, as retrieved from the computational analyses of sequenced cohorts

Annotating mutations in cancer genes. a Catalog of Cancer Genes. Genes that drive tumorigenesis via mutations, copy number alterations, and/or translocations are annotated with their mode of action (MoA). b Catalog of Validated Oncogenic Mutations. Clinically or experimentally validated driver mutations were gathered from manually annotated resources and the cancer literature. c Proportion of validated mutations observed across the cancer genes of 6792 tumors. Cancer types: ALL acute lymphocytic leukemia, AML acute myeloid leukemia, BLCA bladder carcinoma, BRCA breast carcinoma, CLL chronic lymphocytic leukemia, CM cutaneous melanoma, COREAD colorectal adenocarcinoma, DLBC diffuse large B cell lymphoma, ESCA esophageal carcinoma, GBM glioblastoma multiforme, HC hepatocarcinoma, HNSC head and neck squamous cell carcinoma, LGG lower grade glioma, LUAD lung adenocarcinoma, LUSC lung squamous cell carcinoma, MB medulloblastoma, MM multiple myeloma, NB neuroblastoma, NSCLC non-small cell lung carcinoma, OV serous ovarian adenocarcinoma, PA pilocytic astrocytoma, PAAD pancreas adenocarcinoma, PRAD prostate adenocarcinoma, RCC renal clear cell carcinoma, SCLC small cell lung carcinoma, STAD stomach adenocarcinoma, THCA thyroid carcinoma, UCEC uterine corpus endometrioid carcinoma. d OncodriveMUT schema to estimate the oncogenic potential of the variants of unknown significance. A set of heuristic rules combines the annotations obtained for a given mutation with the knowledge about the genes (or regions thereof) in which it is observed, as retrieved from the computational analyses of sequenced cohorts

Most mutations affecting cancer genes are of uncertain significance

The focus on cancer genes described above is a necessary but not sufficient to identify the tumorigenic variants in a tumor, since not all variants observed in a cancer gene are necessarily capable of driving tumorigenesis. Therefore, the CGI next focuses on annotating and analyzing protein-affecting mutations (PAMs) that occur in genes of the Catalog of Cancer Genes. First, validated tumorigenic mutations may confidently be labeled as drivers when detected in a newly sequenced tumor. We compiled an inventory that currently contains 5314 such validated mutations, including cancer-predisposing variants, from dedicated resources [7–9, 12, 13, 16] and the literature (Fig. 2b; Additional file 2: Note III). This Catalog of Validated Oncogenic Mutations is available for download through the CGI website (https://www.cancergenomeinterpreter.org/mutations). Across a pan-cancer cohort of 6792 tumors sequenced at the whole-exome level (mostly at diagnosis) [4] we observed that only 5360 (916 unique variants) of the 44,601 PAMs found in cancer genes appear in this catalog. In other words, 88% of all PAMs that affect cancer genes in this cohort are currently of uncertain significance for tumorigenesis, a proportion that varies widely per gene and tumor type (Fig. 2c; Additional file 2: Note VII). It is therefore crucial to assess the tumorigenic potential of these variants, especially when they affect genes that are—or may be—therapeutic targets. We reasoned that several features of each specific mutation as well as of the genes they affect could help address this question. Moreover, we propose that some of these features of interest can be extracted from the analyses of large sequenced cohorts of healthy and tumor tissue [4, 17]. Examples of relevant attributes include the following: i) the mode of action of the gene in the cancer type (oncogene or tumor suppressor); ii) the consequence type of the mutation (e.g., synonymous, missense, or truncating); iii) its position within the transcript; iv) whether it falls in a mutational hotspot or cluster; v) its predicted functional impact; vi) its frequency within the human population; and vii) whether it occurs in a domain of the protein that is depleted of germline variants. The CGI assesses the tumorigenic potential of the variants of unknown significance via OncodriveMUT, a newly developed rule-based approach that combines the values of these features (Fig. 2d; Additional file 2: Note IVa). We assessed the performance of OncodriveMUT in the task of classifying driver and passenger mutations, using the Catalog of Validated Oncogenic Mutations (n = 5314) and a collected set of likely neutral—i.e., non-tumorigenic—PAMs affecting cancer genes (n = 1676). We found that OncodriveMUT separated the variants of these two data sets with 86% accuracy (Matthews correlation coefficient, 0.64), out-performing other methods employed for this goal (Additional file 2: Note IVb). In addition, for several features, the variants classified as drivers by OncodriveMUT followed the trend expected for oncogenic mutations (e.g., they exhibited larger clonal fractions among all mutations in cancer genes), and OncodriveMUT’s predictions on a set of recently probed uncommon cancer mutations exhibited a high concordance with experimental evidence [18-21] (Additional file 2: Note IVb). Of note, the attributes employed by OncodriveMUT to classify each variant are detailed in the CGI output, which facilitates the user’s review of the results. In summary, the CGI annotates the mutations affecting cancer genes with features relevant to their potential role in cancer, identifying validated oncogenic events and identifying the most likely drivers among the variants of unknown significance.

A database of genomic determinants of anti-cancer drug response

The second major aim of the effort to interpret cancer genomes is to identify which tumor alterations may shape the response to anti-cancer therapies. Knowledge on the influence of genomic alterations on drug response is continuously generated and reported through publications, clinical trials, and conference communications. Nevertheless, collecting and curating relevant information into an easy-to-use resource supporting the comparison with newly sequenced tumors and organize the results according to the needs of different users is challenging. The CGI employs two resources to explore the associations between gene alterations and drug response. The first is the Cancer Biomarkers database, an extension of a previous collection of genomic biomarkers of anti-cancer drug response [12] which currently contains information on 1624 genomic biomarkers of response (sensitivity, resistance, or toxicity) to 310 drugs across 130 types of cancer. Negative results of clinical trials, e.g., the unsuccessful use of BRAF V600 inhibitors as a single therapeutic agent in colorectal cancers bearing that mutation, are also included in the database. Importantly, these biomarkers are organized according to the level of clinical evidence supporting each one, ranging from results of pre-clinical data, case reports, and clinical trials in early (I/II) and late phases (III/IV) to standard-of-care guidelines. The database is continuously updated by a board of medical oncologists and cancer genomics experts (Fig. 3a; Additional file 2: Note V). As explained in the “Introduction”, the Cancer Biomarkers database is only one of the resources currently annotating the biomarkers of tumor response to drugs (Additional file 1: Table S1). The leading institutions developing these knowledgebases were recently integrated into the Variant Interpretation for Cancer Consortium (http://cancervariants.org/) under the umbrella of the Global Alliance for Genomics & Health [22]. Besides the aggregation of the data collected by each individual resource, the aim of this project will be to establish community standards to represent and share this information.
Fig. 3

Cancer Biomarkers Database. a A board of clinical and research experts gather the genomic biomarkers of drug response to be included in the Cancer Biomarkers database through periodic updates. The upper panel displays the simplified schema of the data model. The clinical/research community is encouraged to provide feedback to edit an existing entry or add a novel one by using the comment feature available in the web service. Any suggestion is subsequently evaluated by the scientific team and incorporated as appropriate. A semi-automatic pipeline annotates any novel entry to ensure the consistency of the attributes, including variant re-mapping from protein to genomic coordinates when necessary. The lower panel displays some of the 1574 biomarkers that have been collected in the current version of the database, and the pie charts summarize the content. CNA copy number alteration. b CGI analyses detect putative driver mutations in individual tumors that are rarely observed in the corresponding cancer type. When these variants are known targets of anti-cancer therapies, they may constitute tumor type re-purposing opportunities. The graph summarizes some of these potential opportunities detected by the CGI on 6792 pan-cancer tumors with exome-sequencing data, which are currently unexplored. The barplots display the overall number of tumor samples (separated by cancer type) in which they were observed. The acronym of the cancer type in which the genomic event is demonstrated to confer sensitivity to the drug is shown in parentheses following the name of the drug, and the clinical evidence of that association is represented through color circles (note that the clinical guidelines/recommendations label refers to FDA-approved or international guidelines). Targeted drugs and chemotherapies are shown separately. Cancer acronyms that are not included in the Fig. 2 legend: RA renal angiomyolipoma, BCC basal cell carcinoma, GCA giant cell astrocytoma, G glioma, MCL mantle cell lymphoma, MRT malignant rhabdoid tumor, R renal, CH chollangiocarcinoma. c Therapeutic landscape of 6792 tumors with exome-sequencing data. Fraction of tumors with genomic alterations that are biomarkers of drug response in each cancer type. Colors in the bars denote the clinical evidence supporting the effect of biomarkers in that disease (see evidence colors in b). Note that the event with evidence closest to the clinical evidence is given priority when several biomarkers of drug response co-occur in the same tumor sample. The lower part of the graph indicates the total number of samples per cancer type, detailing the number of samples in which mutation, CNA, and/or fusion data were analyzed. Cancer acronyms as in the Fig. 2 caption. d Same as c for a cohort of 17,462 tumors sequenced by targeted panels and gathered by the GENIE project. Tumors were grouped according to the most specific disease subtype available in the patient information. Cancer acronyms that are not included in the Fig. 2 legend are detailed in Additional file 2: Supplementary content

Cancer Biomarkers Database. a A board of clinical and research experts gather the genomic biomarkers of drug response to be included in the Cancer Biomarkers database through periodic updates. The upper panel displays the simplified schema of the data model. The clinical/research community is encouraged to provide feedback to edit an existing entry or add a novel one by using the comment feature available in the web service. Any suggestion is subsequently evaluated by the scientific team and incorporated as appropriate. A semi-automatic pipeline annotates any novel entry to ensure the consistency of the attributes, including variant re-mapping from protein to genomic coordinates when necessary. The lower panel displays some of the 1574 biomarkers that have been collected in the current version of the database, and the pie charts summarize the content. CNA copy number alteration. b CGI analyses detect putative driver mutations in individual tumors that are rarely observed in the corresponding cancer type. When these variants are known targets of anti-cancer therapies, they may constitute tumor type re-purposing opportunities. The graph summarizes some of these potential opportunities detected by the CGI on 6792 pan-cancer tumors with exome-sequencing data, which are currently unexplored. The barplots display the overall number of tumor samples (separated by cancer type) in which they were observed. The acronym of the cancer type in which the genomic event is demonstrated to confer sensitivity to the drug is shown in parentheses following the name of the drug, and the clinical evidence of that association is represented through color circles (note that the clinical guidelines/recommendations label refers to FDA-approved or international guidelines). Targeted drugs and chemotherapies are shown separately. Cancer acronyms that are not included in the Fig. 2 legend: RA renal angiomyolipoma, BCC basal cell carcinoma, GCA giant cell astrocytoma, G glioma, MCL mantle cell lymphoma, MRT malignant rhabdoid tumor, R renal, CH chollangiocarcinoma. c Therapeutic landscape of 6792 tumors with exome-sequencing data. Fraction of tumors with genomic alterations that are biomarkers of drug response in each cancer type. Colors in the bars denote the clinical evidence supporting the effect of biomarkers in that disease (see evidence colors in b). Note that the event with evidence closest to the clinical evidence is given priority when several biomarkers of drug response co-occur in the same tumor sample. The lower part of the graph indicates the total number of samples per cancer type, detailing the number of samples in which mutation, CNA, and/or fusion data were analyzed. Cancer acronyms as in the Fig. 2 caption. d Same as c for a cohort of 17,462 tumors sequenced by targeted panels and gathered by the GENIE project. Tumors were grouped according to the most specific disease subtype available in the patient information. Cancer acronyms that are not included in the Fig. 2 legend are detailed in Additional file 2: Supplementary content The second resource is the Cancer Bioactivities database, which currently contains information on 20,243 chemical compound–protein product interactions that may support novel research applications. We built this database by compiling a catalog of available results from bioactivity assays of small molecules interacting with cancer proteins. This information was obtained by querying several external databases (Additional file 2: Note VI). The CGI matches the alterations observed in newly sequenced tumors to the biomarkers or target genes in these two databases. This process supports the identification of biomarkers at different levels of gene resolution, ranged from variants affecting a gene region to specific amino acid changes. Of note, the CGI also reports co-occurring alterations that affect the response to a given treatment as appropriate. This includes the co-existence of biomarkers of resistance and sensitivity to the same drug, and biomarkers of drug sensitivity that depend upon simultaneous genomic events. In summary, these two databases constitute comprehensive repositories of genome-guided therapeutic actionability in cancer according to current supporting evidence. Both resources are available for download through the CGI website (https://www.cancergenomeinterpreter.org/biomarkers, https://www.cancergenomeinterpreter.org/bioactivities).

Utility and discussion

The CGI (and the databases created to support its implementation) are distributed under an open license, and the resource can be accessed via its web site at https://www.cancergenomeinterpreter.org and through an Application Programming Interface (API; Additional file 2: Notes Ic and Id). The use of the CGI to automatically interpret cancer genomes has broad potential applications, ranging from basic cancer genomics to the translational research setting. One feature of the CGI that makes it particularly suitable for different types of applications is its usability. The user can input the tumor alterations to be analyzed by uploading files following different standards and/or by typing them in a free-text box. The system is prepared to automatically recognize and re-map as necessary [23] different formats, such as genomic, transcript, or protein-based coordinates for mutations [23] (Additional file 2: Note Ib). The use of the CGI can help addressing questions raised in different oncology research settings. A newly sequenced group of tumors may be readily interpreted, and systematic analyses of large sample sets are supported as exemplified with the 6729 pan-cancer cohort presented in this article. The application of the CGI to the mutations profiled across the whole exomes of these tumors delivered a catalog of putative driver alterations across its 28 cancer types (made available through http://www.intogen.org; Additional file 2: Note VII). The potential of a comprehensive analysis of individual alterations is illustrated by the identification of uncommon events in a tumor cohort that may be exploited by drug re-purposing opportunities (Fig. 3b; Additional file 2: Note VII). Overall, the CGI identified 5.2 and 3.5% of the tumors with genomic alterations that are biomarkers of drug response used in the clinical practice (FDA-approved or international guidelines) or reported in late phase (III–IV) clinical trials, respectively. When considering biomarkers supported by lower levels of clinical relevance, a total of 62% of the tumors exhibited at least one biomarker with increased response to an anti-cancer drug according to findings in early clinical trials, case reports, or pre-clinical assays. These numbers varied greatly across tumor types, partially explained by the relevance of cancer-recurrent alterations in shaping the response to drugs, such as inhibitors of the BRAF V600 mutated form in cutaneous melanoma (clinical guidelines), certain chemotherapies administered for DNMT3A or NPM1 mutant acute myeloid leukemias (clinical guidelines), PIK3CA mutation inhibitors in breast cancer (early clinical trial results [24]), and WEE1 inhibitors in TP53 mutated ovary tumors (early clinical trial results [25]) (Fig. 3c; Additional file 2: Note VII). However, this cohort mostly includes samples sequenced at diagnosis and thus they may not reflect the type of tumors that are evaluated by molecular oncology boards at present. We therefore also applied the CGI to the sequencing data of 17,642 tumors recently released by the GENIE project, which profiled more clinically advanced cancers using targeted panels [26]. The CGI identified a larger percentage of tumors bearing potential actionable genomic alterations in that cohort. Specifically, 8 and 6% of GENIE tumors exhibited biomarkers of drug response used in clinical practice or reported in late clinical trials, and overall 72% of these tumors exhibited at least one alteration reported as a biomarker of drug response supported by any level of clinical evidence (including pre-clinical data; Fig. 3d; Additional file 2: Note VII). These percentages do not include cases in which a tumor exhibits co-occurring alterations that confer resistance to a given drug, in which the therapy was not in silico prescribed accordingly. Of note, the GENIE cohort exhibited a larger number of genomic biomarkers of drug resistance (to both targeted therapies and immune checkpoint blockade agents), as expected of tumors with a higher proportion of recurrence/relapse patients (Additional file 2: Note VII). These analyses provide a comprehensive state-of-the-art snapshot of the putative genomic drivers of cancer and the landscape of genomic-guided therapies according to our current knowledge. In addition, the application of the CGI to analyze the results of drug responses observed in tumors with different genomic architecture can facilitate the discovery of novel genomic biomarkers of drug sensitivity or resistance. The distinction between driver and passenger events recently contributed to the development of better predictive models to identify novel genomic markers of drug response in cancer cell lines [27]. In previous examples, the systematic analysis of large datasets was facilitated by the automatic classification of cancer variants that CGI provides. However, the detailed review of these results is empowered by the inclusion in the output reports of all the annotations employed by the CGI. The ability to review these data is especially critical in the clinical research setting. In this case, the use of the CGI to analyze the list of alterations detected in a patient’s tumor could support decision-making in multiple scenarios, assessing variants of unknown significance that may have implications for response to therapy. Early clinical adopters of the CGI have used the resource to support final decisions about the most appropriate genomic-guided clinical trial to enroll cancer patients or explore potential drug re-purposing opportunities for pediatric tumors unresponsive to standard-of-care therapy (see these use cases in Additional file 2: Note VIII). Crucial to the performance of the CGI are the maintenance and further development of its two distinct types of resources: the repositories of accumulated knowledge, which are continuously generated, and the bioinformatics methods to estimate the relevance of those events that are not yet validated. As new tumor cohorts are re-sequenced and analyzed, more comprehensive catalogs of cancer genes and oncogenic mutations will be obtained, including both new malignancies and new genomic elements. In particular, the possibility to identify non-coding cancer drivers [28] from currently generated whole-genome sequencing data will open up the opportunity to explore the actionability of non-coding genomic alterations (https://dcc.icgc.org/pcawg). With respect to the aggregation, curation, and interpretation of the relevance of cancer variants, our team follows the standard operating procedures developed under the umbrella of the H2020 MedBioinformatics (http://www.medbioinformatics.eu/) project, thus ensuring the mid-term maintenance of these resources. Feedback from the community is also facilitated through the CGI web interface. Access to this type of cancer data is crucial for the advance of precision medicine, but is highly complex for a single institution to comprehensively manage and update. We envision that individual databases will continue to be maintained to fulfill specific needs [11], but our long-term impact will largely rely, first, on the establishment of international standards for the collection of data relevant to associations between cancer variants and clinical outcomes and, second, on our collective success in encouraging the community to share and harmonize such knowledge.

Conclusions

The CGI is a versatile platform that automates the steps proposed here for the interpretation of cancer genomes. It annotates the alterations detected in human tumors with features that may inform about their involvement in tumorigenesis. It also highlights the alterations of the tumor that constitute biomarkers of response to anti-cancer drugs, according to current levels of evidence. The CGI is easy to use, and will improve with new knowledge extracted from the study of thousands of tumors. We envision that it will become established as a useful tool in both the basic and translational cancer research settings. Table S1. The features provided by different resources/methods and details about which of them are employed by the CGI. (XLS 21 kb) Supplementary methods, use cases description, Table S2. and Figures S1–S3. (PDF 6267 kb)
  26 in total

1.  Identification of Variant-Specific Functions of PIK3CA by Rapid Phenotyping of Rare Mutations.

Authors:  Turgut Dogruluk; Yiu Huen Tsang; Maribel Espitia; Fengju Chen; Tenghui Chen; Zechen Chong; Vivek Appadurai; Armel Dogruluk; Agna Karina Eterovic; Penelope E Bonnen; Chad J Creighton; Ken Chen; Gordon B Mills; Kenneth L Scott
Journal:  Cancer Res       Date:  2015-12-01       Impact factor: 12.701

2.  Phase I Dose-Escalation Study of Taselisib, an Oral PI3K Inhibitor, in Patients with Advanced Solid Tumors.

Authors:  Dejan Juric; Ian Krop; Ramesh K Ramanathan; Timothy R Wilson; Joseph A Ware; Sandra M Sanabria Bohorquez; Heidi M Savage; Deepak Sampath; Laurent Salphati; Ray S Lin; Huan Jin; Hema Parmar; Jerry Y Hsu; Daniel D Von Hoff; José Baselga
Journal:  Cancer Discov       Date:  2017-03-22       Impact factor: 39.397

3.  Phase II Study of WEE1 Inhibitor AZD1775 Plus Carboplatin in Patients With TP53-Mutated Ovarian Cancer Refractory or Resistant to First-Line Therapy Within 3 Months.

Authors:  Suzanne Leijen; Robin M J M van Geel; Gabe S Sonke; Daphne de Jong; Efraim H Rosenberg; Serena Marchetti; Dick Pluim; Erik van Werkhoven; Shelonitda Rose; Mark A Lee; Tomoko Freshwater; Jos H Beijnen; Jan H M Schellens
Journal:  J Clin Oncol       Date:  2016-10-28       Impact factor: 44.544

4.  Database of genomic biomarkers for cancer drugs and clinical targetability in solid tumors.

Authors:  Rodrigo Dienstmann; In Sock Jang; Brian Bot; Stephen Friend; Justin Guinney
Journal:  Cancer Discov       Date:  2015-02       Impact factor: 39.397

5.  International network of cancer genome projects.

Authors:  Thomas J Hudson; Warwick Anderson; Axel Artez; Anna D Barker; Cindy Bell; Rosa R Bernabé; M K Bhan; Fabien Calvo; Iiro Eerola; Daniela S Gerhard; Alan Guttmacher; Mark Guyer; Fiona M Hemsley; Jennifer L Jennings; David Kerr; Peter Klatt; Patrik Kolar; Jun Kusada; David P Lane; Frank Laplace; Lu Youyong; Gerd Nettekoven; Brad Ozenberger; Jane Peterson; T S Rao; Jacques Remacle; Alan J Schafer; Tatsuhiro Shibata; Michael R Stratton; Joseph G Vockley; Koichi Watanabe; Huanming Yang; Matthew M F Yuen; Bartha M Knoppers; Martin Bobrow; Anne Cambon-Thomsen; Lynn G Dressler; Stephanie O M Dyke; Yann Joly; Kazuto Kato; Karen L Kennedy; Pilar Nicolás; Michael J Parker; Emmanuelle Rial-Sebbag; Carlos M Romeo-Casabona; Kenna M Shaw; Susan Wallace; Georgia L Wiesner; Nikolajs Zeps; Peter Lichter; Andrew V Biankin; Christian Chabannon; Lynda Chin; Bruno Clément; Enrique de Alava; Françoise Degos; Martin L Ferguson; Peter Geary; D Neil Hayes; Thomas J Hudson; Amber L Johns; Arek Kasprzyk; Hidewaki Nakagawa; Robert Penny; Miguel A Piris; Rajiv Sarin; Aldo Scarpa; Tatsuhiro Shibata; Marc van de Vijver; P Andrew Futreal; Hiroyuki Aburatani; Mónica Bayés; David D L Botwell; Peter J Campbell; Xavier Estivill; Daniela S Gerhard; Sean M Grimmond; Ivo Gut; Martin Hirst; Carlos López-Otín; Partha Majumder; Marco Marra; John D McPherson; Hidewaki Nakagawa; Zemin Ning; Xose S Puente; Yijun Ruan; Tatsuhiro Shibata; Michael R Stratton; Hendrik G Stunnenberg; Harold Swerdlow; Victor E Velculescu; Richard K Wilson; Hong H Xue; Liu Yang; Paul T Spellman; Gary D Bader; Paul C Boutros; Peter J Campbell; Paul Flicek; Gad Getz; Roderic Guigó; Guangwu Guo; David Haussler; Simon Heath; Tim J Hubbard; Tao Jiang; Steven M Jones; Qibin Li; Nuria López-Bigas; Ruibang Luo; Lakshmi Muthuswamy; B F Francis Ouellette; John V Pearson; Xose S Puente; Victor Quesada; Benjamin J Raphael; Chris Sander; Tatsuhiro Shibata; Terence P Speed; Lincoln D Stein; Joshua M Stuart; Jon W Teague; Yasushi Totoki; Tatsuhiko Tsunoda; Alfonso Valencia; David A Wheeler; Honglong Wu; Shancen Zhao; Guangyu Zhou; Lincoln D Stein; Roderic Guigó; Tim J Hubbard; Yann Joly; Steven M Jones; Arek Kasprzyk; Mark Lathrop; Nuria López-Bigas; B F Francis Ouellette; Paul T Spellman; Jon W Teague; Gilles Thomas; Alfonso Valencia; Teruhiko Yoshida; Karen L Kennedy; Myles Axton; Stephanie O M Dyke; P Andrew Futreal; Daniela S Gerhard; Chris Gunter; Mark Guyer; Thomas J Hudson; John D McPherson; Linda J Miller; Brad Ozenberger; Kenna M Shaw; Arek Kasprzyk; Lincoln D Stein; Junjun Zhang; Syed A Haider; Jianxin Wang; Christina K Yung; Anthony Cros; Anthony Cross; Yong Liang; Saravanamuttu Gnaneshan; Jonathan Guberman; Jack Hsu; Martin Bobrow; Don R C Chalmers; Karl W Hasel; Yann Joly; Terry S H Kaan; Karen L Kennedy; Bartha M Knoppers; William W Lowrance; Tohru Masui; Pilar Nicolás; Emmanuelle Rial-Sebbag; Laura Lyman Rodriguez; Catherine Vergely; Teruhiko Yoshida; Sean M Grimmond; Andrew V Biankin; David D L Bowtell; Nicole Cloonan; Anna deFazio; James R Eshleman; Dariush Etemadmoghadam; Brooke B Gardiner; Brooke A Gardiner; James G Kench; Aldo Scarpa; Robert L Sutherland; Margaret A Tempero; Nicola J Waddell; Peter J Wilson; John D McPherson; Steve Gallinger; Ming-Sound Tsao; Patricia A Shaw; Gloria M Petersen; Debabrata Mukhopadhyay; Lynda Chin; Ronald A DePinho; Sarah Thayer; Lakshmi Muthuswamy; Kamran Shazand; Timothy Beck; Michelle Sam; Lee Timms; Vanessa Ballin; Youyong Lu; Jiafu Ji; Xiuqing Zhang; Feng Chen; Xueda Hu; Guangyu Zhou; Qi Yang; Geng Tian; Lianhai Zhang; Xiaofang Xing; Xianghong Li; Zhenggang Zhu; Yingyan Yu; Jun Yu; Huanming Yang; Mark Lathrop; Jörg Tost; Paul Brennan; Ivana Holcatova; David Zaridze; Alvis Brazma; Lars Egevard; Egor Prokhortchouk; Rosamonde Elizabeth Banks; Mathias Uhlén; Anne Cambon-Thomsen; Juris Viksna; Fredrik Ponten; Konstantin Skryabin; Michael R Stratton; P Andrew Futreal; Ewan Birney; Ake Borg; Anne-Lise Børresen-Dale; Carlos Caldas; John A Foekens; Sancha Martin; Jorge S Reis-Filho; Andrea L Richardson; Christos Sotiriou; Hendrik G Stunnenberg; Giles Thoms; Marc van de Vijver; Laura van't Veer; Fabien Calvo; Daniel Birnbaum; Hélène Blanche; Pascal Boucher; Sandrine Boyault; Christian Chabannon; Ivo Gut; Jocelyne D Masson-Jacquemier; Mark Lathrop; Iris Pauporté; Xavier Pivot; Anne Vincent-Salomon; Eric Tabone; Charles Theillet; Gilles Thomas; Jörg Tost; Isabelle Treilleux; Fabien Calvo; Paulette Bioulac-Sage; Bruno Clément; Thomas Decaens; Françoise Degos; Dominique Franco; Ivo Gut; Marta Gut; Simon Heath; Mark Lathrop; Didier Samuel; Gilles Thomas; Jessica Zucman-Rossi; Peter Lichter; Roland Eils; Benedikt Brors; Jan O Korbel; Andrey Korshunov; Pablo Landgraf; Hans Lehrach; Stefan Pfister; Bernhard Radlwimmer; Guido Reifenberger; Michael D Taylor; Christof von Kalle; Partha P Majumder; Rajiv Sarin; T S Rao; M K Bhan; Aldo Scarpa; Paolo Pederzoli; Rita A Lawlor; Massimo Delledonne; Alberto Bardelli; Andrew V Biankin; Sean M Grimmond; Thomas Gress; David Klimstra; Giuseppe Zamboni; Tatsuhiro Shibata; Yusuke Nakamura; Hidewaki Nakagawa; Jun Kusada; Tatsuhiko Tsunoda; Satoru Miyano; Hiroyuki Aburatani; Kazuto Kato; Akihiro Fujimoto; Teruhiko Yoshida; Elias Campo; Carlos López-Otín; Xavier Estivill; Roderic Guigó; Silvia de Sanjosé; Miguel A Piris; Emili Montserrat; Marcos González-Díaz; Xose S Puente; Pedro Jares; Alfonso Valencia; Heinz Himmelbauer; Heinz Himmelbaue; Victor Quesada; Silvia Bea; Michael R Stratton; P Andrew Futreal; Peter J Campbell; Anne Vincent-Salomon; Andrea L Richardson; Jorge S Reis-Filho; Marc van de Vijver; Gilles Thomas; Jocelyne D Masson-Jacquemier; Samuel Aparicio; Ake Borg; Anne-Lise Børresen-Dale; Carlos Caldas; John A Foekens; Hendrik G Stunnenberg; Laura van't Veer; Douglas F Easton; Paul T Spellman; Sancha Martin; Anna D Barker; Lynda Chin; Francis S Collins; Carolyn C Compton; Martin L Ferguson; Daniela S Gerhard; Gad Getz; Chris Gunter; Alan Guttmacher; Mark Guyer; D Neil Hayes; Eric S Lander; Brad Ozenberger; Robert Penny; Jane Peterson; Chris Sander; Kenna M Shaw; Terence P Speed; Paul T Spellman; Joseph G Vockley; David A Wheeler; Richard K Wilson; Thomas J Hudson; Lynda Chin; Bartha M Knoppers; Eric S Lander; Peter Lichter; Lincoln D Stein; Michael R Stratton; Warwick Anderson; Anna D Barker; Cindy Bell; Martin Bobrow; Wylie Burke; Francis S Collins; Carolyn C Compton; Ronald A DePinho; Douglas F Easton; P Andrew Futreal; Daniela S Gerhard; Anthony R Green; Mark Guyer; Stanley R Hamilton; Tim J Hubbard; Olli P Kallioniemi; Karen L Kennedy; Timothy J Ley; Edison T Liu; Youyong Lu; Partha Majumder; Marco Marra; Brad Ozenberger; Jane Peterson; Alan J Schafer; Paul T Spellman; Hendrik G Stunnenberg; Brandon J Wainwright; Richard K Wilson; Huanming Yang
Journal:  Nature       Date:  2010-04-15       Impact factor: 49.962

6.  DoCM: a database of curated mutations in cancer.

Authors:  Benjamin J Ainscough; Malachi Griffith; Adam C Coffman; Alex H Wagner; Jason Kunisaki; Mayank Nk Choudhary; Joshua F McMichael; Robert S Fulton; Richard K Wilson; Obi L Griffith; Elaine R Mardis
Journal:  Nat Methods       Date:  2016-09-29       Impact factor: 28.547

7.  Understanding the function-structure and function-mutation relationships of p53 tumor suppressor protein by high-resolution missense mutation analysis.

Authors:  Shunsuke Kato; Shuang-Yin Han; Wen Liu; Kazunori Otsuka; Hiroyuki Shibata; Ryunosuke Kanamaru; Chikashi Ishioka
Journal:  Proc Natl Acad Sci U S A       Date:  2003-06-25       Impact factor: 11.205

8.  The cancer precision medicine knowledge base for structured clinical-grade mutations and interpretations.

Authors:  Linda Huang; Helen Fernandes; Hamid Zia; Peyman Tavassoli; Hanna Rennert; David Pisapia; Marcin Imielinski; Andrea Sboner; Mark A Rubin; Michael Kluk; Olivier Elemento
Journal:  J Am Med Inform Assoc       Date:  2017-05-01       Impact factor: 4.497

9.  Mutational landscape and significance across 12 major cancer types.

Authors:  Cyriac Kandoth; Michael D McLellan; Fabio Vandin; Kai Ye; Beifang Niu; Charles Lu; Mingchao Xie; Qunyuan Zhang; Joshua F McMichael; Matthew A Wyczalkowski; Mark D M Leiserson; Christopher A Miller; John S Welch; Matthew J Walter; Michael C Wendl; Timothy J Ley; Richard K Wilson; Benjamin J Raphael; Li Ding
Journal:  Nature       Date:  2013-10-17       Impact factor: 49.962

10.  OncodriveROLE classifies cancer driver genes in loss of function and activating mode of action.

Authors:  Michael P Schroeder; Carlota Rubio-Perez; David Tamborero; Abel Gonzalez-Perez; Nuria Lopez-Bigas
Journal:  Bioinformatics       Date:  2014-09-01       Impact factor: 6.937

View more
  158 in total

1.  Molecular Evolution of Early-Onset Prostate Cancer Identifies Molecular Risk Markers and Clinical Trajectories.

Authors:  Clarissa Gerhauser; Francesco Favero; Thomas Risch; Ronald Simon; Lars Feuerbach; Yassen Assenov; Doreen Heckmann; Nikos Sidiropoulos; Sebastian M Waszak; Daniel Hübschmann; Alfonso Urbanucci; Etsehiwot G Girma; Vladimir Kuryshev; Leszek J Klimczak; Natalie Saini; Adrian M Stütz; Dieter Weichenhan; Lisa-Marie Böttcher; Reka Toth; Josephine D Hendriksen; Christina Koop; Pavlo Lutsik; Sören Matzk; Hans-Jörg Warnatz; Vyacheslav Amstislavskiy; Clarissa Feuerstein; Benjamin Raeder; Olga Bogatyrova; Eva-Maria Schmitz; Claudia Hube-Magg; Martina Kluth; Hartwig Huland; Markus Graefen; Chris Lawerenz; Gervaise H Henry; Takafumi N Yamaguchi; Alicia Malewska; Jan Meiners; Daniela Schilling; Eva Reisinger; Roland Eils; Matthias Schlesner; Douglas W Strand; Robert G Bristow; Paul C Boutros; Christof von Kalle; Dmitry Gordenin; Holger Sültmann; Benedikt Brors; Guido Sauter; Christoph Plass; Marie-Laure Yaspo; Jan O Korbel; Thorsten Schlomm; Joachim Weischenfeldt
Journal:  Cancer Cell       Date:  2018-12-10       Impact factor: 31.743

2.  Database of evidence for precision oncology portal.

Authors:  Sam Q Sun; R Jay Mashl; Sohini Sengupta; Adam D Scott; Weihua Wang; Prag Batra; Liang-Bo Wang; Matthew A Wyczalkowski; Li Ding
Journal:  Bioinformatics       Date:  2018-12-15       Impact factor: 6.937

3.  Personalised mapping of tumour development in synchronous colorectal cancer patients.

Authors:  Valentina Thomas; Maura B Cotter; Miriam Tosetto; Yi Ling Khaw; Robert Geraghty; Desmond C Winter; Elizabeth J Ryan; Kieran Sheahan; Simon J Furney
Journal:  NPJ Genom Med       Date:  2020-07-03       Impact factor: 8.617

4.  A framework to rank genomic alterations as targets for cancer precision medicine: the ESMO Scale for Clinical Actionability of molecular Targets (ESCAT).

Authors:  J Mateo; D Chakravarty; R Dienstmann; S Jezdic; A Gonzalez-Perez; N Lopez-Bigas; C K Y Ng; P L Bedard; G Tortora; J-Y Douillard; E M Van Allen; N Schultz; C Swanton; F André; L Pusztai
Journal:  Ann Oncol       Date:  2018-09-01       Impact factor: 32.976

5.  Estimating growth patterns and driver effects in tumor evolution from individual samples.

Authors:  Leonidas Salichos; William Meyerson; Jonathan Warrell; Mark Gerstein
Journal:  Nat Commun       Date:  2020-02-05       Impact factor: 14.919

6.  Pancreatic cancer organoids recapitulate disease and allow personalized drug screening.

Authors:  Else Driehuis; Arne van Hoeck; Kat Moore; Sigrid Kolders; Hayley E Francies; M Can Gulersonmez; Edwin C A Stigter; Boudewijn Burgering; Veerle Geurts; Ana Gracanin; Gergana Bounova; Folkert H Morsink; Robert Vries; Sylvia Boj; Johan van Es; G Johan A Offerhaus; Onno Kranenburg; Mathew J Garnett; Lodewyk Wessels; Edwin Cuppen; Lodewijk A A Brosens; Hans Clevers
Journal:  Proc Natl Acad Sci U S A       Date:  2019-12-09       Impact factor: 11.205

7.  Data-Independent Acquisition Mass Spectrometry To Quantify Protein Levels in FFPE Tumor Biopsies for Molecular Diagnostics.

Authors:  Yeoun Jin Kim; Steve M M Sweet; Jarrett D Egertson; Andrew J Sedgewick; Sunghee Woo; Wei-Li Liao; Gennifer E Merrihew; Brian C Searle; Charlie Vaske; Robert Heaton; Michael J MacCoss; Todd Hembrough
Journal:  J Proteome Res       Date:  2018-12-12       Impact factor: 4.466

8.  Discover novel disease-associated genes based on regulatory networks of long-range chromatin interactions.

Authors:  Hao Wang; Jiaxin Yang; Yu Zhang; Jianrong Wang
Journal:  Methods       Date:  2020-10-21       Impact factor: 3.608

9.  Comparison of Annotation Services for Next-Generation Sequencing in a Large-Scale Precision Oncology Program.

Authors:  Evangelia Katsoulakis; Jill E Duffy; Bradley Hintze; Neil L Spector; Michael J Kelley
Journal:  JCO Precis Oncol       Date:  2020-03-24

10.  Constitutive Activation of RAS/MAPK Pathway Cooperates with Trisomy 21 and Is Therapeutically Exploitable in Down Syndrome B-cell Leukemia.

Authors:  Anouchka P Laurent; Aurélie Siret; Cathy Ignacimouttou; Kunjal Panchal; M'Boyba Diop; Silvia Jenni; Yi-Chien Tsai; Damien Roos-Weil; Zakia Aid; Nais Prade; Stephanie Lagarde; Damien Plassard; Gaelle Pierron; Estelle Daudigeos; Yann Lecluse; Nathalie Droin; Beat C Bornhauser; Laurence C Cheung; John D Crispino; Muriel Gaudry; Olivier A Bernard; Elizabeth Macintyre; Carole Barin Bonnigal; Rishi S Kotecha; Birgit Geoerger; Paola Ballerini; Jean-Pierre Bourquin; Eric Delabesse; Thomas Mercher; Sebastien Malinge
Journal:  Clin Cancer Res       Date:  2020-03-27       Impact factor: 12.531

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.