Anna Pedrola1, Sebastià Franch-Expósito2, Sara Lahoz3, Roger Esteban-Fabró4, Rodrigo Dienstmann1, Laia Bassaganyas5, Jordi Camps3,6. 1. Oncology Data Science (ODysSey) Group, Vall d'Hebron Institute of Oncology, Barcelona, 08035, Spain. 2. Marie-Josée and Henry R. Kravis Center for Molecular Oncology, Memorial Sloan Kettering Cancer Center, New York, NY, 10065, USA. 3. Translational Colorectal Cancer Genomics, Gastrointestinal and Pancreatic Oncology Team, IDIBAPS, Hospital Clínic de Barcelona, CIBEREHD, Barcelona, 08036, Spain. 4. Liver Cancer Translational Research Group, Liver Unit, IDIBAPS, Hospital Clínic de Barcelona, CIBEREHD, Barcelona, 08036, Spain. 5. Department of Medical Genetics, University of Cambridge, Cambridge Biomedical Campus, Cambridge, CB2 0QQ, UK. 6. Department of Cell Biology, Physiology and Immunology, Universitat Autònoma de Barcelona, Bellaterra, 08193, Spain.
Abstract
MOTIVATION: Genomic alterations can modulate the tumor immunophenotype depending on their nature and tissue of origin. While this immune-genomic interaction may shape disease progression and response to immunotherapy, the factors governing such dynamics and the influence of each tissue-specific context remain poorly understood. RESULTS: Here, we have developed the PanCancer ImmunoGenomics (PCIG) tool, a web-based resource that provides researchers with the opportunity to mine immunome-genome relationships across several cancer types using data from the Pan-Cancer Analysis of Whole-Genomes (PCAWG) study, which comprises >2,600 samples spanning across 20 different cancer primary sites. PCIG yields an integrative analysis of the crosstalk between somatic genomic alterations and different immune features, thus helping to understand immune response-related processes. AVAILABILITY: PCIG is freely available at https://pcig.vhio.net, and is supported by all major web browsers. PCIG was developed with Django, which is a Python-based free and open-source framework, and it uses SQL Server as a relational database management system. The code is freely available for download at GitHub https://github.com/AnnaPG/PCIG. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
MOTIVATION: Genomic alterations can modulate the tumor immunophenotype depending on their nature and tissue of origin. While this immune-genomic interaction may shape disease progression and response to immunotherapy, the factors governing such dynamics and the influence of each tissue-specific context remain poorly understood. RESULTS: Here, we have developed the PanCancer ImmunoGenomics (PCIG) tool, a web-based resource that provides researchers with the opportunity to mine immunome-genome relationships across several cancer types using data from the Pan-Cancer Analysis of Whole-Genomes (PCAWG) study, which comprises >2,600 samples spanning across 20 different cancer primary sites. PCIG yields an integrative analysis of the crosstalk between somatic genomic alterations and different immune features, thus helping to understand immune response-related processes. AVAILABILITY: PCIG is freely available at https://pcig.vhio.net, and is supported by all major web browsers. PCIG was developed with Django, which is a Python-based free and open-source framework, and it uses SQL Server as a relational database management system. The code is freely available for download at GitHub https://github.com/AnnaPG/PCIG. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Cancer genomes play key roles in determining tumor immune features, hence having important implications on disease progression and response to immunotherapy (Thorsson ). Somatic genomic alterations may enhance anti-tumor immune activity by enabling the differentiation between self and non-self (tumor) through neoantigen presentation, but they promote immune evasion in later stages of the disease (Litchfield ; Mizuno ). In fact, the nature of genomic events can affect the genome–immune interaction. Overall, the mutational burden is generally associated with an activated tumor immunome environment and better responses to immunotherapy, whereas high burdens of copy-number alterations (CNAs) often correlate with immune depletion and immunotherapy resistance (Davoli ; Tamborero ). Moreover, the cancer type and the tissue of origin may also influence the pattern of immune infiltrates (Varn ). This complex and dynamic interplay between the cancer genomic landscape and the tumor immune infiltration still remains poorly understood.To overcome the limited analysis of whole-exome sequencing data, which can hinder the complete view of genomic alterations and complexity, the recently published Pan-Cancer Analysis of Whole Genomes (PCAWG) project (https://dcc.icgc.org/pcawg) includes whole-genome sequencing (WGS) data for more than 2,600 samples from The Cancer Genome Atlas (TCGA) and the International Cancer Genome Consortium (ICGC), spanning up to 20 different cancer types. Importantly, for a subset of 1,300 samples, PCAWG also incorporates whole-transcriptomic data analyzed by RNAseq, providing an exceptional opportunity to comprehensively investigate relationships between genomic alterations and the immune system in primary tumors from different origins (ICGC/TCGA Pan-Cancer Analysis of Whole Genomes Consortium, 2020). Nevertheless, WGS data analysis can be costly and time-consuming, limiting its feasibility for rapid and comprehensive exploration.Here, to examine genome–immunome interactions across a wide spectrum of cancer types, we exploited PCAWG data (https://pcawg.xenahubs.net) to create the PanCancer ImmunoGenomics (PCIG) tool, a web-accessible resource that can be used as a launching platform for clinical and translational research studies to deepen in the exploration of cancer immunogenomics. To this end, we surveyed somatic non-synonymous mutations, CNAs, complex structural variations (SVs), as well as gene expression and clinical classification. Moreover, we created an array of integrative analyses with additional estimated variables, such as the tumor immune composition, the expression of a chemokine-gene signature, the presence of chromothripsis, deletions encompassing the human leukocyte antigen locus, and the levels of broad and focal CNAs. The goal of PCIG is to provide researchers with a fast and easy-to-use tool to visualize the relationships between cancer genomes and immune-related phenotypes to better understand tumor immunogenicity.
2 Implementation
PCIG is a web-based interface to explore and visualize the integration of multiple genomic, transcriptomic and immunological features in different cancer types. This tool was created by using the Python-based open-source Django framework. An extended explanation of the PCIG pipeline for data mining and analysis, the genomic and transcriptomic datasets from PCAWG used here, and a description of additional estimated variables are detailed in Supplementary Material. PCIG also provides an extensive User Guide, explaining details for each analysis and different plots presented on the website.A diagram showing the main parameters and flowchart of the analysis performed by PCIG is depicted in Figure 1A. Briefly, PCIG explores relationships between numerous immune-genomic parameters (Fig. 1A) from 2,658 samples across 40 different cancer types classified based on the primary site (Fig. 1B). Specifically, PCIG employs WGS data to quantify CNA scores and the tumor mutational load per sample, and considers transcriptomic profiles associated with stromal and immune-related genes to perform correlation and integrative analyses to assess the dynamics between these tumor features, and with tumor baseline clinical characteristics (Supplementary Material).
Fig. 1.
Schematic diagram of PCIG and examples of its performance. (A) Flowchart presenting the data sources, types, and analytical tools used by PCIG. (B) Detailed summary of the analyses PCIG performs under each tab on the website. (C) Correlation plots between ImmuneScore (source: ESTIMATE) and BCS (top) or FCS (bottom) (source: CNApp) using the COAD-US cohort. (D) Correlation plot between levels of MHC expression (source: Immunophenoscore) and BCS (top) or FCS (bottom) (source: CNApp) using the OV-US cohort. (E) Correlation plot between ImmuneScore (source: ESTIMATE) and the mutational load (source: https://pcawg.xenahubs.net) using the SKCM-US cohort.
Schematic diagram of PCIG and examples of its performance. (A) Flowchart presenting the data sources, types, and analytical tools used by PCIG. (B) Detailed summary of the analyses PCIG performs under each tab on the website. (C) Correlation plots between ImmuneScore (source: ESTIMATE) and BCS (top) or FCS (bottom) (source: CNApp) using the COAD-US cohort. (D) Correlation plot between levels of MHC expression (source: Immunophenoscore) and BCS (top) or FCS (bottom) (source: CNApp) using the OV-US cohort. (E) Correlation plot between ImmuneScore (source: ESTIMATE) and the mutational load (source: https://pcawg.xenahubs.net) using the SKCM-US cohort.Three main sections are deployed upon selection of primary site and cancer type: Summary, Genomics and Immuno-Genomics. In summary, the main clinical and molecular characteristics are detailed for the selected subset of tumor samples, along with the genomic profiling at the subcytoband level and corresponding parameters analyzed using PCIG’s pipeline. Genomics section presents results from the correlative analysis between different genomic variables obtained by WGS data (∼2,600 samples), including the number of non-synonymous mutations, broad and focal CNA scores (BCS and FCS, respectively; Franch-Expósito ), and the presence of chromothripsis events, indicative of complex SVs (Cortés-Ciriano ). Finally, established correlation analyses between genomic variables and different tumor immune metrics computationally derived from transcriptomic data (∼1,300 samples) are depicted in the Immuno-Genomics section, including (i) global level of immune and stromal cell infiltrates by ESTIMATE (i.e. ImmuneScore and StromalScore; Yoshihara ), (ii) quantification of the four main determinants of tumor immunogenicity by Immunophenoscore (i.e. major histocompatibility complex [MHC]-related antigen processing genes; checkpoints; effector cells; suppressor cells [SC]; Charoentong ) and (iii) tumor inflammation assessment through a 12-chemokine gene signature, also associated with the presence of tertiary lymphoid structures, suggestive of a good prognosis in several cancers (Sautès-Fridman ).PCIG provides high comprehensive plots that can be downloaded together with their associated processed data for further analysis. Because of differences in data sources for each cancer type and the limited number of cases in some cohorts, some analyses may require the use of validation datasets.
3 Results and discussion
To exemplify the analytical applicability of PCIG, we explored five datasets: colon adenocarcinoma (COAD-US, n = 44), head and neck squamous-cell carcinoma (HNSC-US, n = 44), lung adenocarcinoma (n = 38), ovarian cancer (OV-US, n = 42) and skin cancer (SKCM-US, n = 37). Analysis of genomic imbalances showed that ovarian tumors displayed the highest BCS and FCS values (Supplementary Fig. S1A and B), suggesting gross and chromosome-specific aneuploidies. In contrast, the highest values of mutational load were observed in colon cancer, probably due to the presence of POLE-mutated or mismatch repair deficient tumors in the COAD-US dataset (Supplementary Fig. S2). In agreement with previous reports of depleted CD8+ lymphocytic activity in highly aneuploid tumors (Bassaganyas ; Davoli ), we observed a significant negative correlation between ImmuneScore and BCS or FCS in the majority of cancer types, especially affecting the COAD-US, HNSC-US and OV-US datasets (Fig. 1C and Supplementary Fig. S3A and B). Likewise, tumors with high BCS or FCS such as OV-US exhibited decreased expression of antigen-presenting MHC-related machinery (Fig. 1D and Supplementary Fig. S4), confirming that highly complex genomic tumors bear cold immunophenotypes. Conversely, the presence of a high mutational load observed in skin melanoma and colon adenocarcinoma appeared to be significantly associated with more active immunophenotype profiles (Fig. 1E and Supplementary Fig. S5).In summary, PCIG correlates cancer genomic traits and immune-related phenotypes, thus helping the interpretation of tumor immunogenicity. In this sense, our analysis of five different cancer types included in PCAWG further suggests that the tissue of origin and the genomic landscape have an impact on the tumor immune infiltrate. Altogether, PCIG assists in the processing and visualization of large datasets, facilitates exhaustive immune-genomic analyses for hypotheses generation, and displays very complex data in an easy and comprehensible manner.Click here for additional data file.
Authors: Vésteinn Thorsson; David L Gibbs; Scott D Brown; Denise Wolf; Dante S Bortone; Tai-Hsien Ou Yang; Eduard Porta-Pardo; Galen F Gao; Christopher L Plaisier; James A Eddy; Elad Ziv; Aedin C Culhane; Evan O Paull; I K Ashok Sivakumar; Andrew J Gentles; Raunaq Malhotra; Farshad Farshidfar; Antonio Colaprico; Joel S Parker; Lisle E Mose; Nam Sy Vo; Jianfang Liu; Yuexin Liu; Janet Rader; Varsha Dhankani; Sheila M Reynolds; Reanne Bowlby; Andrea Califano; Andrew D Cherniack; Dimitris Anastassiou; Davide Bedognetti; Younes Mokrab; Aaron M Newman; Arvind Rao; Ken Chen; Alexander Krasnitz; Hai Hu; Tathiane M Malta; Houtan Noushmehr; Chandra Sekhar Pedamallu; Susan Bullman; Akinyemi I Ojesina; Andrew Lamb; Wanding Zhou; Hui Shen; Toni K Choueiri; John N Weinstein; Justin Guinney; Joel Saltz; Robert A Holt; Charles S Rabkin; Alexander J Lazar; Jonathan S Serody; Elizabeth G Demicco; Mary L Disis; Benjamin G Vincent; Ilya Shmulevich Journal: Immunity Date: 2018-04-05 Impact factor: 43.474
Authors: Kosuke Yoshihara; Maria Shahmoradgoli; Emmanuel Martínez; Rahulsimham Vegesna; Hoon Kim; Wandaliz Torres-Garcia; Victor Treviño; Hui Shen; Peter W Laird; Douglas A Levine; Scott L Carter; Gad Getz; Katherine Stemke-Hale; Gordon B Mills; Roel G W Verhaak Journal: Nat Commun Date: 2013 Impact factor: 14.919
Authors: Laia Bassaganyas; Roser Pinyol; Roger Esteban-Fabró; Laura Torrens; Sara Torrecilla; Catherine E Willoughby; Sebastià Franch-Expósito; Maria Vila-Casadesús; Itziar Salaverria; Robert Montal; Vincenzo Mazzaferro; Jordi Camps; Daniela Sia; Josep M Llovet Journal: Clin Cancer Res Date: 2020-09-01 Impact factor: 12.531
Authors: Stephanie T Schmidt; Neal Akhave; Ryan E Knightly; Alexandre Reuben; Natalie Vokes; Jianhua Zhang; Jun Li; Junya Fujimoto; Lauren A Byers; Beatriz Sanchez-Espiridion; Lixia Diao; Jing Wang; Lorenzo Federico; Marie-Andree Forget; Daniel J McGrail; Annikka Weissferdt; Shiaw-Yih Lin; Younghee Lee; Erika Suzuki; Jeffrey J Kovacs; Carmen Behrens; Ignacio I Wistuba; Andrew Futreal; Ara Vaporciyan; Boris Sepesi; John V Heymach; Chantale Bernatchez; Cara Haymaker; Tina Cascone; Jianjun Zhang; Christopher A Bristow; Timothy P Heffernan; Marcelo V Negrao; Don L Gibbons Journal: JCO Clin Cancer Inform Date: 2022-07