| Literature DB >> 25526364 |
Yuichi Shiraishi1, Akihiro Fujimoto2, Mayuko Furuta3, Hiroko Tanaka4, Ken-ichi Chiba1, Keith A Boroevich5, Tetsuo Abe5, Yoshiiku Kawakami6, Masaki Ueno7, Kunihito Gotoh8, Shun-ichi Ariizumi9, Tetsuo Shibuya4, Kaoru Nakano3, Aya Sasaki3, Kazuhiro Maejima3, Rina Kitada3, Shinya Hayami7, Yoshinobu Shigekawa7, Shigeru Marubashi8, Terumasa Yamada8, Michiaki Kubo10, Osamu Ishikawa8, Hiroshi Aikata6, Koji Arihiro11, Hideki Ohdan12, Masakazu Yamamoto9, Hiroki Yamaue7, Kazuaki Chayama6, Tatsuhiko Tsunoda5, Satoru Miyano13, Hidewaki Nakagawa3.
Abstract
Recent studies applying high-throughput sequencing technologies have identified several recurrently mutated genes and pathways in multiple cancer genomes. However, transcriptional consequences from these genomic alterations in cancer genome remain unclear. In this study, we performed integrated and comparative analyses of whole genomes and transcriptomes of 22 hepatitis B virus (HBV)-related hepatocellular carcinomas (HCCs) and their matched controls. Comparison of whole genome sequence (WGS) and RNA-Seq revealed much evidence that various types of genomic mutations triggered diverse transcriptional changes. Not only splice-site mutations, but also silent mutations in coding regions, deep intronic mutations and structural changes caused splicing aberrations. HBV integrations generated diverse patterns of virus-human fusion transcripts depending on affected gene, such as TERT, CDK15, FN1 and MLL4. Structural variations could drive over-expression of genes such as WNT ligands, with/without creating gene fusions. Furthermore, by taking account of genomic mutations causing transcriptional aberrations, we could improve the sensitivity of deleterious mutation detection in known cancer driver genes (TP53, AXIN1, ARID2, RPS6KA3), and identified recurrent disruptions in putative cancer driver genes such as HNF4A, CPS1, TSC1 and THRAP3 in HCCs. These findings indicate genomic alterations in cancer genome have diverse transcriptomic effects, and integrated analysis of WGS and RNA-Seq can facilitate the interpretation of a large number of genomic alterations detected in cancer genome.Entities:
Mesh:
Year: 2014 PMID: 25526364 PMCID: PMC4272259 DOI: 10.1371/journal.pone.0114263
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1The outline of theRNA-Seq study integrated with whole genome sequencing.
(A) First, we detected various types of genomic and transcriptomic changes from RNA-Seq data of 22 HCCs. The characterized changes detected by each analysis were compared to reveal the effects of somatic genomic changes on transcriptomic aberrations. (B) The four types of splicing aberrations defined in this study. Green lines and arrows indicate normal transcription whereas red lines and arrows indicate aberrant transcriptions.
Figure 2Several examples of genomic changes other than essential splice-site mutations causing splicing aberrations obtained from our comparative whole genome and transcriptome sequencing analyses.
Exonic and intronic sequences are designated by capital and small letters, respectively. Red sequences are somatic mutations in HCCs. Blue and green numbers on the side of sequences are edit distances from splicing donor motif (AG|GTRAGT, [38]) and splicing acceptor motif (YYYYNCAG|G), respectively. Most somatic mutations changed the edit distance to splicing donor motifs so that the corresponding alteration can be enhanced.
Figure 3The expression profiles of 22 HCCs and non-cancerous liver samples for eight over-expressed genes.
Blue and red bars show the FKPMs for HCCs and the corresponding non-cancerous liver, respectively, which is calculated by RNA-Seq data. Red circles indicate samples with HBV integrations on the loci of the overexpressed genes. Green circles indicate those with gene fusions and/or SVs that can drive gene over-expression.
Figure 4HBV integrations and fusion events in 22 HCCs.
(A) Seven HBV-TERT fusion transcripts were detected in RK010. One transcript was an un-spliced transcript having the same breakpoint as the genomic integration breakpoint. The others existed in spliced forms and GT-AG splicing motifs were observed at the breakpoints of all but one. In addition to HBV fusion splicing hotspot (458 bp), 3 fusion transcripts were spliced at the coordinate of 1634 bp coordinates in HBV sequences. One fusion transcript included a newly generated 87 bp pseudo-exon sequence as well as subsequence exonic sequences. (B) HBV integrations in the MLL4 loci and their resultant fusion transcripts in five samples. Green triangles on the genome sequence show the HBV integration sites. Most fusion transcripts shared breakpoints with those of genomic HBV integration coordinates for both sides, and thus, they appear to exist in un-spliced forms. The fusion transcripts for RK141 and RK159 were validated to be concatenated (Figure S11). (C) HBV-FN1 fusion transcripts for 7 adjacent non-cancerous liver samples. Almost all the fusion transcripts had the breakpoint at the HBV fusion splicing hotspot. The other fusion transcripts which had breakpoints at intronic regions appear to be un-spliced transcripts around the integration sites.
Figure 5RNA editing candidates in 22 HCCs.
(A) The number of cancer-specific RNA mutation events (RNA editing candidates) and their substitution patterns for each sample. (B) Scatter plot between the number of A:T>G:C RNA-editing events and ADAR expression value (FKPM) calculated by whole transcriptome sequence data. There is a significant correlation (P-value = 2.38×10−7 by Wilcoxon rank sum test) between the number of A:T>G:C events and ADAR expression levels.
Figure 6The status of genomic and transcriptomic alterations of representative genes, detected by WGS and RNA-Seq of 22 HBV-related HCCs.
The list of genes were extracted by (1) significantly mutated genes in WGS analysis, (2) having no less than 3 mutations (point mutations or indels in coding regions, or GMTAs), (3) having no less than 2 GMTAs and registered in cancer gene census [17]. (4) involved in WNT signaling pathway, (5) TERT or MLL4.