| Literature DB >> 30837539 |
Hyojin Song1,2, Yoojoo Lim3, Hogune Im4, Jeong Mo Bae5, Gyeong Hoon Kang5, Junhak Ahn6,7, Daehyun Baek6,7, Tae-You Kim3, Sung-Soo Yoon8,9, Youngil Koh10,11.
Abstract
We report a novel transcriptomic analysis workflow called LiEB (Life cycle of Epstein-Barr virus) to characterize distributions of oncogenic virus, Epstein-Barr virus (EBV) infection in human tumors. We analyzed 851 The Cancer Genome Atlas whole-transcriptome sequencing (WTS) data to investigate EBV infection by life cycle information using three-step LiEB workflow: 1) characterize virus infection generally; 2) align transcriptome sequences against a hybrid human-EBV genome, and 3) quantify EBV gene expression. Our results agreed with EBV infection status of public cell line data. Analysis in stomach adenocarcinoma identified EBV-positive cases involving PIK3CA mutations and/or CDKN2A silencing with biologically more determination, compared to previous reports. In this study, we found that a small number of colorectal adenocarcinoma cases involved with EBV lytic gene expression. Expression of EBV lytic genes was also observed in 3% of external colon cancer cohort upon WTS analysis. Gene set enrichment analysis showed elevated expression of genes related to E2F targeting and interferon-gamma responses in EBV-associated tumors. Finally, we suggest that interpretation of EBV life cycle is essential when analyzing its infection in tumors, and LiEB provides high capability of detecting EBV-positive tumors. Observation of EBV lytic gene expression in a subset of colon cancers warrants further research.Entities:
Mesh:
Year: 2019 PMID: 30837539 PMCID: PMC6401378 DOI: 10.1038/s41598-019-39706-0
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Workflow of the three-step EBV in silico detection algorithm. We examined 851 WTS samples covering 23 cancer types from TCGA database. Our workflow (LiEB) involves three steps to detect EBV-positive samples: (1) detection of viral infection; (2) alignment against a hybrid genome; (3) quantification of EBV gene expression.
Figure 2(a) Proportion of EBV-positive samples among 23 cancer types examined. Each bar represents the percentage of samples in which one or more mapped read was detected for each of the four EBV strains (HHV4_wt, HHV4_GD1, HHV4_AG876, and EBV_artifactual_join) examined. Note that we did not include READ in the further analysis set due to an insufficient number of sample sets (N = 7), although this cancer type shows a high proportion of EBV infection in this figure. (b,c) Proportion of samples expressing EBV latent and lytic genes among 23 cancer types examined. Each stacked bar indicates the percentage of samples of each cancer type expressing EBV latent (b) and lytic (c) genes. Colors represent each TCGA cancer project.
Significant mutations related to STAD (TCGA)*.
| STAD_id | EBV (TCGA) | EBV_Lytic (Hsong) | EBV (Li Ding) | TPM (CDKN2A) | CDKN2A | PIK3CA | ARID1A | BCOR | RHOA | TPM (PDL1) | TPM (PDL2) |
|---|---|---|---|---|---|---|---|---|---|---|---|
| STAD_13 | 1 | 1 | 1 | 3.97 | 0 | Missense | Nonsense | Splice_Site | 0 | 309.51 | 14 |
| STAD_28 | 1 | 1 | 1 | 4.59 | Missense | Missense | fsDEL, ifDEL, Nonsense | 0 | 0 | 125.1 | 9.86 |
| STAD_24 | 1 | 1 | 1 | 18.99 | 0 | Missense | 3′UTR | 0 | Missense | 29.19 | 2.84 |
| STAD_11 | 1 | 1 | 1 | 26.12 | Missense | Missense | fsDEL | 0 | 0 | 19.25 | 2.71 |
| STAD_34 | 1 | 1 | 1 | 33.4 | 0 | 0 | ifDEL | 0 | 0 | 7.58 | 3.28 |
| STAD_27 | 1 | 1 | 1 | 6.48 | 0 | Missense | Nonsense | 0 | 0 | 3.74 | 3.06 |
| STAD_31 | 1 | 1 | 1 | 0.55 | 0 | fsDEL | 0 | 0 | 1 | 0.19 | |
| STAD_8 | 1 | 1 | 1 | 2.23 | 0 | 0 | 0 | 0 | 6.37 | 3.96 | |
| STAD_21 | 0 | 1 | 1 | 6.25 | 0 | Missense | fsDEL, Splice_Region | Missense | 0 | 4.06 | 2.74 |
| STAD_19 | 0 | 1 | 1 | 7.83 | 0 | Missense, ifDEL | Missense | 4.78 | 1.28 | ||
| STAD_17 | 0 | 1 | 1 | 1.45 | 0 | Missense | ifDEL | 0 | 3.74 | 1.37 | |
| STAD_3 | 0 | 1 | 1 | 17.4 | 0 | 0 | 0 | 0 | 0 | 2.94 | 2.8 |
| STAD_25 | 0 | 1 | 1 | 13.73 | 0 | 0 | 0 | 0 | 0.95 | 0.57 | |
| STAD_10 | 0 | 0 | 1 | 669.8 | fsDEL, ifDEL | 0 | ifDEL | 0 | 0 | 3.81 | 1.59 |
| STAD_1 | 0 | 0 | 1 | 349.74 | Missense | 0 | 0 | Missense | 0 | 6.1 | 1.25 |
| STAD_32 | 0 | 0 | 0 | 124.75 | 0 | 0 | ifDEL | 0 | 0 | 2.14 | 2.19 |
| STAD_4 | 0 | 0 | 0 | 12 | 0 | 0 | 0 | 0 | 0 | 3.65 | 2.09 |
| STAD_5 | 0 | 0 | 0 | 12.49 | Missense | Missense, fsDEL, ifDEL | Missense | 0 | 2.82 | 2.19 | |
| STAD_22 | 0 | 0 | 0 | 2.89 | 0 | 0 | ifDEL | 0 | 0 | 3.59 | 0.38 |
| STAD_16 | 0 | 0 | 0 | 1.09 | 0 | 0 | fsINS | 0 | Missense | 1.21 | 0.25 |
| STAD_2 | 0 | 0 | 0 | 3.61 | 0 | Missense | fsDEL, ifINS | 0 | 2.03 | 1.21 | |
| STAD_6 | 0 | 0 | 0 | 41.64 | 0 | 0 | 0 | 0 | 0 | 3.14 | 2.47 |
| STAD_7 | 0 | 0 | 0 | 44.47 | 0 | Missense | Missense, ifDEL | 0 | 0 | 2.8 | 2.67 |
| STAD_9 | 0 | 0 | 0 | 26.12 | 0 | 0 | ifDEL | 0 | 0 | 1.48 | 1.4 |
| STAD_33 | 0 | 0 | 0 | 183.79 | 0 | 0 | ifDEL | 0 | 0 | 5.51 | 0.99 |
| STAD_14 | 0 | 0 | 0 | 19.54 | 0 | fsINS, 3′UTR, Splice_Site, Splice_Region | fsDEL, ifDEL | 0 | 3.13 | 1.31 | |
| STAD_18 | 0 | 0 | 0 | 160.75 | Missense | 0 | fsDEL, ifDEL | Missense | 0 | 2.83 | 2.27 |
| STAD_36 | 0 | 0 | 0 | 207.21 | Missense, fsDEL, 3′UTR | 11.24 | 3.48 | ||||
| STAD_20 | 0 | 0 | 0 | 41.25 | 0 | 0 | 3′UTR, ifDEL | 0 | 0 | 0.58 | 0.26 |
| STAD_23 | 0 | 0 | 0 | 29.9 | 0 | 0 | ifDEL | 0 | 0 | 0.78 | 1.11 |
| STAD_15 | 0 | 0 | 0 | 234.54 | Missense | 0 | 0 | 0 | 0.39 | 0.35 | |
| STAD_35 | 0 | 0 | 0 | 42.19 | 0 | 0 | 0 | 0 | 0 | 1.54 | 1.04 |
| STAD_30 | 0 | 0 | 0 | 11.78 | 0 | 0 | ifDEL | 0 | 0 | 0.17 | 0.41 |
| STAD_29 | 0 | 0 | 0 | 11.96 | 0 | 0 | 0 | 0 | 0 | 0.19 | 0.41 |
| STAD_26 | 0 | 0 | 0 | 2.36 | 0 | 0 | ifDEL | 0 | 0 | 0.83 | 0.41 |
| STAD_12 | 0 | 0 | 0 | 2.99 | 0 | 0 | Missense | Nonsense | 0 | 3.36 | 1.41 |
*fsINS, Frame_Shift_Ins; fsDEL, Frame_Shift_Del; ifDEL, In_Frame_Del; ifINS, In_Frame_Ins.
Figure 3(a) EBV lytic gene expression and its partial genomic structure. (Left) Thirteen EBV lytic genes were classified into three categories according to the sequential processes of the lytic stage: early transcription factors, initiation complex genes, and lytic antigens (c.f., the list of lytic genes refers to Draborg, A. H. et al. Clinical and Developmental Immunology [2012]). (Right) Heatmap illustrating the expression of EBV lytic genes in three types of gastrointestinal cancer types: colon adenocarcinoma (COAD, red), rectal adenocarcinoma (READ, blue), and stomach adenocarcinoma (STAD, green). The gradation of blue color indicates the row-scaled expression value of each EBV gene. Three annotation bars on STAD samples indicate those that were EBV positive as detected by our LiEB workflow and in published preliminary reports: TCGA Network group[20] and Li Ding group[18]. (b) Comparison of EBV lytic gene expression in TCGA COAD, READ, and SNUH_COAD. Heatmap illustrating the expression of EBV lytic genes in three gastrointestinal cancer cohorts: SNUH colon cancer (COAD-SNUH, red), TCGA colon adenocarcinoma (COAD-TCGA, blue), and rectal adenocarcinoma (READ-TCGA, green). The gradation of blue color indicates the column-scaled expression value of each EBV gene.
Figure 4Dot plot of enriched pathways determined from GSEA results. Each dot plot demonstrates enriched pathways in TCGA (5a) and LiEB (5b) comparison of GSEA results. The size of the dot represents gene count, and the color represents the adjusted p-value.