| Literature DB >> 31991047 |
Daichi Sadato1,2,3, Mina Ogawa2,3,4, Chizuko Hirama1,2,3, Tsunekazu Hishima5, Shin-Ichiro Horiguchi5, Yuka Harada3, Tatsu Shimoyama4, Masanari Itokawa2, Kazuteru Ohashi1, Keisuke Oboki2.
Abstract
Epstein-Barr virus (EBV)-associated gastric cancer (EBVaGC), whose prognosis remains controversial, is diagnosed by in situ hybridization of EBV-derived EBER1/2 small RNAs. In The Cancer Genome Atlas (TCGA) Stomach Adenocarcinoma (STAD) project, the EBV molecular subtype was determined through a combination of multiple next-generation sequencing methods, but not by the gold standard in situ hybridization method. This leaves unanswered questions regarding the discordance of EBV positivity detected by different approaches and the threshold of sequencing reads. Therefore, we reanalyzed the TCGA-STAD RNA sequencing (RNA-seq) dataset including 375 tumor and 32 normal samples, using our analysis pipeline. We defined a reliable threshold for EBV-derived next-generation sequencing reads by mapping them to the EBV genome with three different random arbitrary alignments. We analyzed the prognostic impact of EBV status on the histopathological subtypes of gastric cancer. EBV-positive cases identified by reanalysis comprised nearly half of the cases (49.6%) independent from infiltrating lymphocyte signatures, and showed significantly longer overall survival for adenocarcinomas of the 'not-otherwise-specified' type [P = 0.016 (log-rank test); hazard ratios (HR): 0.476; 95% CI: 0.260-0.870, P = 0.016 (Cox univariate analysis)], but shorter overall survival for the tubular adenocarcinoma type [P = 0.005 (log-rank test); HR: 3.329; 95% CI: 1.406-7.885, P = 0.006 (Cox univariate analysis)]. These results demonstrate that the EBV positivity rates were higher when determined by RNA-seq than when determined by EBER1/2 in situ hybridization. The RNA-seq-based EBV positivity demonstrated distinct results for gastric cancer prognosis depending on the histopathological subtype, suggesting its potential to be used in clinical prognoses.Entities:
Keywords: EBV; RNA-seq; bioinformatics; gastric cancer; histopathological subtype; survival analysis
Mesh:
Substances:
Year: 2020 PMID: 31991047 PMCID: PMC7050242 DOI: 10.1002/2211-5463.12803
Source DB: PubMed Journal: FEBS Open Bio ISSN: 2211-5463 Impact factor: 2.693
Figure 1Scheme for detection of viral sequences in TCGA‐STAD RNA‐seq data. Analysis pipeline for detecting viral sequence reads. Each mapped read of EBV was normalized by the total reads mapped to the human genome.
Figure 2Optimizing the EBV threshold value and scores of tumor‐infiltrating lymphocytes. (A) EBV reads detected in this study (left) and the original report (right). (B) The ambiguity of EBV read counts. Three independent bowtie2 mappings to the EBV genome were implemented for all samples with an option (‐‐non‐deterministic) that led to different results due to the arbitrary choice of alignments (http://bowtie-bio.sourceforge.net/bowtie2/manual.shtml). X‐axis values show the detected maximum values of EBV reads. (C) Normalized EBV expression levels in all cases and in cases where EBV expression level ranges from 0 to 3. The dotted line indicates a threshold corresponding to the maximum value of cases in which only one EBV read was detected. (D) Dot plots of normalized EBV reads in normal and tumor samples. A newly defined threshold was applied. (E) Comparison of EBV reads in normal and tumor samples. (F) Lymphocyte infiltration scores in cases grouped by EBV status and each corrected EBV read range. To estimate the statistical difference, the Kruskal–Wallis test with a post hoc Dunn's test was used. Scatter plots of EBV reads, and scores of intratumoral naïve B cells (G) or intratumoral memory B cells (H) are shown. Box plots show the median (central line), first and third quartiles (box), and minimum and maximum values (whiskers above and below the boxes, respectively). The Spearman correlation coefficients were calculated for testing correlation between EBV read and intratumoral immune cell scores.
Figure 3Characteristics of EBV‐positive and EBV‐negative cases classified according to provisional histopathological categories. (A–E) Normalized EBV expression levels in histopathological categories. Lymphocyte infiltration scores in patients with EBV‐negative or EBV‐positive GC, including diffuse (B), intestinal (C), NOS (D), or mixed (E) types. Box plots show the median (central line), first and third quartiles (box), and minimum and maximum values (whiskers above and below the boxes). (F–I) The Kaplan–Meier plots of patients distinguished by RNA‐seq‐based EBV positivity in cases showing four differentiated types. Covariates were adjusted by IPW. (J, K) The Kaplan–Meier plots of patients classified according to the number of EBV reads.
Multivariate Cox survival analysis in patient groups distinguished by the amount of EBV reads.
| EBV reads | HR | 95% CI |
|
|---|---|---|---|
| Intestinal | |||
| Negative | 1.000 | ||
| 7.1–100 | 1.634 | 0.923–2.900 | 0.092 |
| > 100 | 2.103 | 0.956–4.627 | 0.065 |
| NOS | |||
| Negative | 1.000 | ||
| 7.1–100 | 0.528 | 0.283–0.983 | 0.044 |
| > 100 | 0.489 | 0.161–1.485 | 0.207 |
P < 0.05.
Figure 4Prognostic effects of EBV‐positive and EBV‐negative cases classified according to tumor grade and the provisional histopathological categories. The Kaplan–Meier plots of patients distinguished by EBV positivity in cases showing the provisional histopathological categories (diffuse, intestinal, NOS, and mixed) and tumor grades (G1/G2, upper; and G3, lower). Covariates were adjusted by IPW. Differences between survival curves were tested for significance by the log‐rank test.
Figure 5Survival analysis of EBV‐positive and EBV‐negative cases stratified by histopathological features (ICD‐O‐3‐harmonized) and the tumor grade. The Kaplan–Meier plots of patients distinguished by RNA‐seq‐based EBV positivity in cases stratified by ICD‐O‐3‐harmonized histopathology [adenocarcinoma intestinal type (A, D, G), tubular adenocarcinoma (B, E, H), and adenocarcinoma NOS (C, F, I)]. Groups stratified by tumor grade are shown in the middle row (G1/G2) and the lower row (G3). Covariates were adjusted by IPW. P values were determined with the log‐rank test.