| Literature DB >> 34900135 |
Lindsay M Wong1,2, Wei Tse Li1,2, Neil Shende1,2, Joseph C Tsai1,2, Jiayan Ma1,2, Jaideep Chakladar1,2, Aditi Gnanasekar1,2, Yuanhao Qu1,2, Kypros Dereschuk1,2, Jessica Wang-Rodriguez3,4, Weg M Ongkeko1,2.
Abstract
BACKGROUND: The mechanisms of carcinogenesis from viral infections are extraordinarily complex and not well understood. Traditional methods of analyzing RNA-sequencing data may not be sufficient for unraveling complicated interactions between viruses and host cells. Using RNA and DNA-sequencing data from The Cancer Genome Atlas (TCGA), we aim to explore whether virus-induced tumors exhibit similar immune-associated (IA) dysregulations using a new algorithm we developed that focuses on the most important biological mechanisms involved in virus-induced cancers. Differential expression, survival correlation, and clinical variable correlations were used to identify the most clinically relevant IA genes dysregulated in 5 virus-induced cancers (HPV-induced head and neck squamous cell carcinoma, HPV-induced cervical cancer, EBV-induced stomach cancer, HBV-induced liver cancer, and HCV-induced liver cancer) after which a mechanistic approach was adopted to identify pathways implicated in IA gene dysregulation.Entities:
Keywords: Algorithm; C2, Canonical pathway; C6, Oncogenic signature; C7, Immunological signature; CA, Cancer-associated; CESC, Cervical Squamous Cell Carcinoma and Endocervical Adenocarcinoma; CNA, Copy number alteration; Cervical squamous cell carcinoma and endocervical adenocarcinoma; EBV, Epstein-Barr virus; Epstein-Barr virus; FDR, False discovery rate; GSEA, Gene set enrichment analysis; HBV, Hepatitis B virus; HCV, Hepatitis C virus; HNSCC, Head and Neck Squamous Cell Carcinoma; HPV, Human papillomavirus; Head and neck squamous cell carcinoma; Hepatitis B; Hepatitis C; Human papillomavirus; IA, Immune-associated; LIHC, Liver Hepatocellular Carcinoma; Liver hepatocellular carcinoma; MSigDB, Molecular Signature Database; STAD, Stomach Adenocarcinoma; Stomach adenocarcinoma; TCGA; TCGA, The Cancer Genome Atlas
Year: 2021 PMID: 34900135 PMCID: PMC8636736 DOI: 10.1016/j.csbj.2021.11.013
Source DB: PubMed Journal: Comput Struct Biotechnol J ISSN: 2001-0370 Impact factor: 7.271
Fig. 1Summary of study procedures, differential expression results, and survival correlation comparisons. (A) Schematic of computational analyses and data processing procedures used. The direction of workflow is always downwards and is sometimes indicated by converging colored lines. Left of the circle involves pathways-level analysis while right of the circle indicates the gene-level analysis. Orange, pink, and blue boxes indicate genes or gene sets, analyses, and results, respectively. Number of patients in each cohort are indicated within the parentheses. (B) Heatmap of significantly dysregulated IA genes visualized in pathway annotations (FDR < 1x1010) from ReactomeFIViz when comparing virus samples to normal samples in each cohort. Green, pink, and gold circles indicate that they are found in HPV, LIHC, and in all five comparisons, respectively. (C) Venn diagram demonstrating unique IA genes and overlapping IA genes between cohorts after filtering differentially expressed IA genes for patient survival. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
Fig. 2IA gene correlations with patient survival and clinical variables in virus versus normal samples for all five comparisons. (A) Hazard ratio plots of significantly dysregulated differentially expressed IA genes for each of the five comparisons. Only genes with two or more significant clinical variable correlations were displayed. Individual IA genes with center lines greater or less than the hazard ratio cutoff of one indicate that downregulation or upregulation of the gene corresponds to worse patient survival, respectively. Whiskers extending from the center lines denote the confidence interval. (B) Kaplan Meier plots of select significantly dysregulated differentially expressed IA genes for each of the five comparisons. (C) Pie charts demonstrating the proportion of significantly dysregulated IA genes that are correlated with their respective cohort across all five cohorts for the clinical variables neoplasm histologic grade, cancer neoplasm status, perineural invasion presence, residual tumor, clinical/pathologic stage, clinical/pathologic T, clinical/pathologic N, and clinical/pathologic M (Kruskal-Wallis, p < 0.05). (D) Boxplot examples of the most significant clinical variable-IA gene correlations for each of the eight clinical variables shown in Fig. 2C.
Fig. 3Canonical (C2) IA and CA pathway enrichment using GSEA. (A) Stacked bar plot demonstrating the proportions of pathways that fall into nine categories (Antigen Presentation and Processing (B cells), Cytokines & Interleukins (includes interferons), Viral Infection Response, General Innate Response, General Adaptive Response (T cells), Extracellular Matrix, Tumor Suppressor, Oncogenes, Tumor Suppressor & Oncogenes) for each cohort (FDR < 0.01). (B) Select GSEA plot examples of pathways categorized in Fig. 3A (FDR < 0.01). The green peak or valley in the GSEA plot corresponds to the upregulation or downregulation of the pathway listed in the plot title, respectively. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
Fig. 4Canonical IA pathway and immunologic signatures most implicated in each cohort following integration with gene-level analysis. (A) Horizontal bar graphs comparing enrichment scores of all five cohorts for canonical pathways (C2) with the greatest number of cohort-specific IA genes and neighboring genes, including Kegg small cell lung cancer, Reactome A1 Rhodopsin-like receptors, PID IL12 pathway, PID p73 pathway, PID E2F pathway, Reactome metabolism of lipids and lipoproteins, and PID FOXM1 pathway. (B) Vertical bar graphs demonstrating the top ten immunologic signatures (C7) with the greatest number of cohort-specific IA genes and neighboring genes.
Fig. 5Oncogenic signature comparisons. (A) Five-way Venn Diagram comparing oncogenic signatures (C6) most implicated in each cohort following integration with gene-level analysis. (B) Superimposed GSEA plots of six oncogenic signatures that are most similarly enriched across all 5 cohorts.
Fig. 6Genomic alterations correlated with IA gene dysregulation and inferred mechanistic explanations. (A) Bar graph comparison of R2 and CIC values of genomic alterations associated with cohort-specific IA genes. Only the best R2 value is displayed if the genomic alteration locus contains multiple genes. (B) Interaction map depicting possible mechanisms of the effects of genomic alterations on the respective IA gene. Green circles indicate the genes related to the cohort-specific IA genes identified by our algorithm, dark cyan circles indicate linker genes, blue circles indicate the starting gene (genomic alteration), yellow circles indicate genes that fall under the same categories as both green and blue circles, and purple circles indicate the unique IA gene. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)