Literature DB >> 29272339

Personal Cancer Genome Reporter: variant interpretation report for precision oncology.

Sigve Nakken1, Ghislain Fournous1, Daniel Vodák1, Lars Birger Aasheim1,2, Ola Myklebost1,3, Eivind Hovig1,4,5.   

Abstract

Summary: Individual tumor genomes pose a major challenge for clinical interpretation due to their unique sets of acquired mutations. There is a general scarcity of tools that can (i) systematically interrogate cancer genomes in the context of diagnostic, prognostic, and therapeutic biomarkers, (ii) prioritize and highlight the most important findings and (iii) present the results in a format accessible to clinical experts. We have developed a stand-alone, open-source software package for somatic variant annotation that integrates a comprehensive set of knowledge resources related to tumor biology and therapeutic biomarkers, both at the gene and variant level. Our application generates a tiered report that will aid the interpretation of individual cancer genomes in a clinical setting. Availability and implementation: The software is implemented in Python/R, and is freely available through Docker technology. Documentation, example reports, and installation instructions are accessible via the project GitHub page: https://github.com/sigven/pcgr. Contact: sigven@ifi.uio.no. Supplementary information: Supplementary data are available at Bioinformatics online.

Entities:  

Mesh:

Year:  2018        PMID: 29272339      PMCID: PMC5946881          DOI: 10.1093/bioinformatics/btx817

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


1 Introduction

A range of different tools have been developed for functional annotation of genomic variants (McLaren ; Ramos ; Wang ). Although some have a specific focus on the oncology domain (e.g. Oncotator), they are all largely targeted towards the research community. Existing solutions offer limited support for summaries and reports at the level of individual cancer genomes, particularly when it comes to clinical relevance. Moreover, the degree of quality control and update frequency of annotation resources vary considerably. Recently, we have seen significant development of databases that harvest reports from the scientific literature about cancer genome variants and their particular relationships to tumorigenesis, druggability and clinical outcomes. These include the Database of Curated Mutations (DoCM), The Drug Gene Interaction Database (DGIdb) and most importantly the community-driven database of Clinical Interpretations of Variants in Cancer (CIViC) (Ainscough ; Griffith ; Wagner ). Other resources covering known cancer mutation hotspots, mutational signatures and predicted driver mutations have also emerged, which collectively indicate potential underlying mechanisms of tumor development and relevance for different treatment regimes (Chang ; Gonzalez-Perez ; Secrier ). We have developed the Personal Cancer Genome Reporter (PCGR), a software package for the generation of clinically interpretable reports of individual cancer genomes. The software extends basic variant annotations from Variant Effect Predictor (VEP) with oncology-relevant annotations retrieved flexibly with vcfanno (Pedersen ), and produces interactive HTML reports that are intended for clinical translation.

2 PCGR workflow

The pipeline for generation of personal cancer genome reports comprises four major steps: (i) basic variant consequence annotation using VEP, (ii) allele-specific annotation for precision oncology using vcfanno, (iii) functional and cancer-focused gene annotation and (iv) prediction of MSI status, estimation of mutational signature contributions, summary, prioritization and reporting with the R language and R markdown templates (Supplementary Fig. S1). All software components are integrated and provided by means of the Docker technology, implying that all underlying dependencies are packaged into a standardized software container. The Docker solution was chosen to offer individual labs a simple installation of PCGR in their in-house workflow for high-throughput analysis of tumor genomes. The application comes with an annotation data bundle, which we plan to update on a quarterly basis. The human genome assembly GRCh37 is currently supported. The PCGR workflow accepts two types of input files: (i) a single-sample VCF file encoding the genomic coordinates of somatic SNVs/Indels and (ii) a basic somatic copy number segment file that encodes chromosomal segment locations and their log-2 ratios (specific requirements are given in the GitHub documentation). For SNVs and Indels encoded in a VCF file, VEP is utilized to determine variant consequence information. For the sake of simplicity and ease of use, the VEP annotation is run with a fixed set of parameters that includes all gene cross-references, protein domain annotations, and overlap with regulatory regions, using GENCODE as the underlying gene transcript model. All transcript-dependent consequences per variant are retained in the VEP-annotated VCF file, and the consequence block of highest functional relevance (as provided by VEP’s –flag_pick option) is flagged for further downstream analysis. Next, vcfanno is applied on multiple variant databases in parallel in order to enrich each somatic call with allele-specific annotations that are directly or indirectly relevant for clinical interpretation. These allele-specific annotations include, (i) pathogenicity predictions for splice-site and missense variants by multiple algorithms, (ii) overlap with known mutational cancer hotspots and previously predicted driver mutations, (iii) tissue/tumor type frequency in case of previously detected somatic variants, (iv) known disease/cancer associations and (v) clinical evidence items of relevance for diagnosis, prognosis, predisposition, or drug sensitivity/resistance. In the third step of the workflow, gene-level annotations are aggregated. Known proto-oncogenes and tumor suppressors are marked, as are other genes implicated (either by prediction or curation) in cancer (Piñero ). Antineoplastic agents and their molecular targets are also annotated. Each gene is finally assigned a score that reflects its relative strength of association to cancer in the biomedical literature, enabling ranking of novel variants according to functional relevance (Rocco ). The annotation of copy number aberrations is limited to this third step, in which gene transcripts that intersect gained or depleted segments are identified, and clinical and etiologic cancer associations are retrieved. The final and fourth step of the PCGR workflow summarises and prioritizes the annotated variants in a structured and interactive report, adopting recently proposed recommendations (Dienstmann ; Ritter ). Specifically, a tiered report is constructed, starting from actionable markers in Tier 1, toward aberrations relevant for tumorigenesis in Tier 2 and 3, and ending with variants of unknown functional relevance in Tier 4 and 5. In addition to the tier structure, mutated genes in Tier 3-5 are prioritized by means of the above-mentioned literature-derived score for oncogenic potential, which draws attention to the most relevant findings. Finally, the report offers optional prediction of microsatellite instability, in addition to estimates of known mutational signatures present in the tumor, and their associated underlying etiologies (Rosenthal ). An example report for a breast cancer genome is found within the Supplementary Material.

3 Discussion

We have utilized the Docker technology to develop a report engine for clinical interpretation of cancer genomes. The tool has a particular focus on actionable, coding variants, i.e. variants found through exome sequencing. The Cancer Genome Interpreter (CGI, https://cancergenomeinterpreter.org) can identify tumor alterations that are therapeutically actionable, similar to the functionality that is implemented in PCGR. CGI is a web-based solution, while PCGR is a stand-alone annotation engine intended for integration in tumor sequencing pipelines. Moreover, PCGR exploits the total spectrum of tumor variants to compute additional measures that can inform precision therapy, i.e. MSI status, mutational signatures, and mutational burden. Matching biomarker results more stringently to the cancer type of the query will be prioritized in future extensions of PCGR. Furthermore, we foresee that the report can be significantly strengthened through the addition of other molecular profiling datasets, such as gene expression. Expression will not only add an important layer on top of results found at the DNA level, it can also add significant value towards assessment of therapeutic potential by other analyses, such as inference of cell type composition in the tumor tissue, and in silico predictions of drug response. Click here for additional data file.
  15 in total

1.  DoCM: a database of curated mutations in cancer.

Authors:  Benjamin J Ainscough; Malachi Griffith; Adam C Coffman; Alex H Wagner; Jason Kunisaki; Mayank Nk Choudhary; Joshua F McMichael; Robert S Fulton; Richard K Wilson; Obi L Griffith; Elaine R Mardis
Journal:  Nat Methods       Date:  2016-09-29       Impact factor: 28.547

2.  OncoScore: a novel, Internet-based tool to assess the oncogenic potential of genes.

Authors:  Rocco Piazza; Daniele Ramazzotti; Roberta Spinelli; Alessandra Pirola; Luca De Sano; Pierangelo Ferrari; Vera Magistroni; Nicoletta Cordani; Nitesh Sharma; Carlo Gambacorti-Passerini
Journal:  Sci Rep       Date:  2017-04-07       Impact factor: 4.379

3.  CIViC is a community knowledgebase for expert crowdsourcing the clinical interpretation of variants in cancer.

Authors:  Malachi Griffith; Nicholas C Spies; Kilannin Krysiak; Joshua F McMichael; Adam C Coffman; Arpad M Danos; Benjamin J Ainscough; Cody A Ramirez; Damian T Rieke; Lynzey Kujan; Erica K Barnell; Alex H Wagner; Zachary L Skidmore; Amber Wollam; Connor J Liu; Martin R Jones; Rachel L Bilski; Robert Lesurf; Yan-Yang Feng; Nakul M Shah; Melika Bonakdar; Lee Trani; Matthew Matlock; Avinash Ramu; Katie M Campbell; Gregory C Spies; Aaron P Graubert; Karthik Gangavarapu; James M Eldred; David E Larson; Jason R Walker; Benjamin M Good; Chunlei Wu; Andrew I Su; Rodrigo Dienstmann; Adam A Margolin; David Tamborero; Nuria Lopez-Bigas; Steven J M Jones; Ron Bose; David H Spencer; Lukas D Wartman; Richard K Wilson; Elaine R Mardis; Obi L Griffith
Journal:  Nat Genet       Date:  2017-01-31       Impact factor: 38.330

4.  DisGeNET: a comprehensive platform integrating information on human disease-associated genes and variants.

Authors:  Janet Piñero; Àlex Bravo; Núria Queralt-Rosinach; Alba Gutiérrez-Sacristán; Jordi Deu-Pons; Emilio Centeno; Javier García-García; Ferran Sanz; Laura I Furlong
Journal:  Nucleic Acids Res       Date:  2016-10-19       Impact factor: 16.971

5.  Vcfanno: fast, flexible annotation of genetic variants.

Authors:  Brent S Pedersen; Ryan M Layer; Aaron R Quinlan
Journal:  Genome Biol       Date:  2016-06-01       Impact factor: 13.583

6.  The Ensembl Variant Effect Predictor.

Authors:  William McLaren; Laurent Gil; Sarah E Hunt; Harpreet Singh Riat; Graham R S Ritchie; Anja Thormann; Paul Flicek; Fiona Cunningham
Journal:  Genome Biol       Date:  2016-06-06       Impact factor: 13.583

7.  DGIdb 2.0: mining clinically relevant drug-gene interactions.

Authors:  Alex H Wagner; Adam C Coffman; Benjamin J Ainscough; Nicholas C Spies; Zachary L Skidmore; Katie M Campbell; Kilannin Krysiak; Deng Pan; Joshua F McMichael; James M Eldred; Jason R Walker; Richard K Wilson; Elaine R Mardis; Malachi Griffith; Obi L Griffith
Journal:  Nucleic Acids Res       Date:  2015-11-03       Impact factor: 16.971

8.  Identifying recurrent mutations in cancer reveals widespread lineage diversity and mutational specificity.

Authors:  Matthew T Chang; Saurabh Asthana; Sizhi Paul Gao; Byron H Lee; Jocelyn S Chapman; Cyriac Kandoth; JianJiong Gao; Nicholas D Socci; David B Solit; Adam B Olshen; Nikolaus Schultz; Barry S Taylor
Journal:  Nat Biotechnol       Date:  2015-11-30       Impact factor: 54.908

9.  IntOGen-mutations identifies cancer drivers across tumor types.

Authors:  Abel Gonzalez-Perez; Christian Perez-Llamas; Jordi Deu-Pons; David Tamborero; Michael P Schroeder; Alba Jene-Sanz; Alberto Santos; Nuria Lopez-Bigas
Journal:  Nat Methods       Date:  2013-09-15       Impact factor: 28.547

10.  Mutational signatures in esophageal adenocarcinoma define etiologically distinct subgroups with therapeutic relevance.

Authors:  Maria Secrier; Xiaodun Li; Nadeera de Silva; Matthew D Eldridge; Gianmarco Contino; Jan Bornschein; Shona MacRae; Nicola Grehan; Maria O'Donovan; Ahmad Miremadi; Tsun-Po Yang; Lawrence Bower; Hamza Chettouh; Jason Crawte; Núria Galeano-Dalmau; Anna Grabowska; John Saunders; Tim Underwood; Nicola Waddell; Andrew P Barbour; Barbara Nutzinger; Achilleas Achilleos; Paul A W Edwards; Andy G Lynch; Simon Tavaré; Rebecca C Fitzgerald
Journal:  Nat Genet       Date:  2016-09-05       Impact factor: 38.330

View more
  10 in total

1.  Strategies to inhibit FGFR4 V550L-driven rhabdomyosarcoma.

Authors:  Elisa Fiorito; Patrycja Szybowska; Ellen M Haugsten; Michal Kostas; Geir F Øy; Antoni Wiedlocha; Sachin Singh; Sigve Nakken; Gunhild M Mælandsmo; Jonathan A Fletcher; Leonardo A Meza-Zepeda; Jørgen Wesche
Journal:  Br J Cancer       Date:  2022-09-12       Impact factor: 9.075

2.  The Quandary of DNA-Based Treatment Assessment in De Novo Metastatic Prostate Cancer in the Era of Precision Oncology.

Authors:  Sigve Nakken; Wolfgang Lilleby; Marta D Switlyk; Karen E Knudsen; Oscar Lilleby; Sen Zhao; Fatemeh Kaveh; Per O Ekstrøm; Alfonso Urbanucci; Eivind Hovig
Journal:  J Pers Med       Date:  2021-04-22

3.  OncoGEMINI: software for investigating tumor variants from multiple biopsies with integrated cancer annotations.

Authors:  Thomas J Nicholas; Michael J Cormier; Xiaomeng Huang; Yi Qiao; Gabor T Marth; Aaron R Quinlan
Journal:  Genome Med       Date:  2021-03-26       Impact factor: 11.117

4.  Integrative genomic analysis of peritoneal malignant mesothelioma: understanding a case with extraordinary chemotherapy response.

Authors:  Christin Lund-Andersen; Sigve Nakken; Ståle Nygård; Bastian Fromm; Lars B Aasheim; Ben Davidson; Lars Julsrud; Torveig W Abrahamsen; Annette T Kristensen; Brit Dybdahl; Stein G Larsen; Eivind Hovig; Kjersti Flatmark
Journal:  Cold Spring Harb Mol Case Stud       Date:  2019-04-01

5.  Low-viscosity matrix suspension culture enables scalable analysis of patient-derived organoids and tumoroids from the large intestine.

Authors:  Yumiko Hirokawa; Jordan Clarke; Michelle Palmieri; Tao Tan; Dmitri Mouradov; Shan Li; Cong Lin; Fuqiang Li; Huijuan Luo; Kui Wu; Maree Faux; Chin Wee Tan; Margaret Lee; Grace Gard; Peter Gibbs; Antony W Burgess; Oliver M Sieber
Journal:  Commun Biol       Date:  2021-09-13

6.  A platform for oncogenomic reporting and interpretation.

Authors:  Caralyn Reisle; Laura M Williamson; Erin Pleasance; Anna Davies; Brayden Pellegrini; Dustin W Bleile; Karen L Mungall; Eric Chuah; Martin R Jones; Yussanne Ma; Eleanor Lewis; Isaac Beckie; David Pham; Raphael Matiello Pletz; Amir Muhammadzadeh; Brandon M Pierce; Jacky Li; Ross Stevenson; Hansen Wong; Lance Bailey; Abbey Reisle; Matthew Douglas; Melika Bonakdar; Jessica M T Nelson; Cameron J Grisdale; Martin Krzywinski; Ana Fisic; Teresa Mitchell; Daniel J Renouf; Stephen Yip; Janessa Laskin; Marco A Marra; Steven J M Jones
Journal:  Nat Commun       Date:  2022-02-09       Impact factor: 14.919

7.  Whole exome sequencing of high-risk neuroblastoma identifies novel non-synonymous variants.

Authors:  Weronika Przybyła; Kirsti Marie Gjersvoll Paulsen; Charitra Kumar Mishra; Ståle Nygård; Solveig Engebretsen; Ellen Ruud; Gunhild Trøen; Klaus Beiske; Lars Oliver Baumbusch
Journal:  PLoS One       Date:  2022-08-29       Impact factor: 3.752

8.  CCAS: One-stop and comprehensive annotation system for individual cancer genome at multi-omics level.

Authors:  Xinchang Zheng; Wenting Zong; Zhaohua Li; Yingke Ma; Yanling Sun; Zhuang Xiong; Song Wu; Fei Yang; Wei Zhao; Congfan Bu; Zhenglin Du; Jingfa Xiao; Yiming Bao
Journal:  Front Genet       Date:  2022-08-11       Impact factor: 4.772

9.  Arteria: An automation system for a sequencing core facility.

Authors:  Johan Dahlberg; Johan Hermansson; Steinar Sturlaugsson; Mariya Lysenkova; Patrik Smeds; Claes Ladenvall; Roman Valls Guimera; Florian Reisinger; Oliver Hofmann; Pontus Larsson
Journal:  Gigascience       Date:  2019-12-01       Impact factor: 6.524

10.  GenomeChronicler: The Personal Genome Project UK Genomic Report Generator Pipeline.

Authors:  José Afonso Guerra-Assunção; Lucia Conde; Ismail Moghul; Amy P Webster; Simone Ecker; Olga Chervova; Christina Chatzipantsiou; Pablo P Prieto; Stephan Beck; Javier Herrero
Journal:  Front Genet       Date:  2020-09-24       Impact factor: 4.599

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.