| Literature DB >> 28412734 |
Ruifang Mao1,2, Jie Liu2, Guanfeng Liu2, Shanshan Jin2, Qingzhong Xue3, Liang Ma2, Yan Fu4, Na Zhao2, Jinliang Xing5, Lanjuan Li1, Yunqing Qiu6, Biaoyang Lin1,2,7.
Abstract
Hepatocellular carcinoma (HCC) has become the third most deadly disease worldwide and HBV is the major factor in Asia and Africa. We conducted 9 WGS (whole genome sequencing) analyses for matched samples of tumor, adjacent non-tumor tissues and normal blood samples of HCC patients from three HBV positive patients. We then validated the mutations identified in a larger cohort of 177 HCC patients. We found that the number of the unique somatic mutations (average of 59,136) in tumor samples is significantly less than that in adjacent non-tumor tissues (average 83, 633). We discovered that the TP53 R249S mutation occurred in 7.7% of the HCC patients, and it was significantly associated with poor diagnosis. In addition, we found that the L104P mutation in the VCX gene (Variable charge, X-linked) was absent in white blood cell samples, but present at 11.1% frequency in the adjacent tissues and increased to 14.6% in HCC tissues, suggesting that this mutation might be a tumor driver gene driving HCC carcinogenesis. Finally, we identified a TK1-RNU7 fusion, which would result in a deletion of 103 amino acids from its C-terminal. The frequencies of this fusion event decreased from the adjacent tissues (29.2%) to the tumors (16.7%), suggesting that a truncated thymidine Kinase1 (TK1) caused by the fusion event might be deleterious and be selected against during tumor progression. The three-way comparisons allow the identification of potential driver mutations of carcinogenesis. Furthermore, our dataset provides the research community a valuable dataset for identifying dynamic changes of mutation profiles and driver mutations for HCC.Entities:
Keywords: hepatitis B virus (HBV); hepatocellular carcinoma (HCC); mutations; next-generation sequencing; whole genome sequencing
Mesh:
Year: 2017 PMID: 28412734 PMCID: PMC5432249 DOI: 10.18632/oncotarget.15428
Source DB: PubMed Journal: Oncotarget ISSN: 1949-2553
A summary statistics of whole-genome sequencing results
| Sample ID | Gross mapping yield (Gb) | Read Depth | Fully called genome fraction | SNP total count | INS total count | DEL total count | SUB total count | Total CNV segment count |
|---|---|---|---|---|---|---|---|---|
| 193.538 | 66.44 | 0.971 | 3492002 | 295890 | 299955 | 95653 | 246 | |
| 185.055 | 64.19 | 0.961 | 3486488 | 272954 | 274526 | 98051 | 330 | |
| 189.624 | 66.12 | 0.956 | 3457971 | 277212 | 280823 | 101310 | 360 | |
| 194.519 | 66.57 | 0.974 | 3469773 | 303858 | 302061 | 94328 | 251 | |
| 184.075 | 63.32 | 0.969 | 3342011 | 280658 | 279064 | 92470 | 409 | |
| 290.237 | 100.78 | 0.96 | 3475951 | 305251 | 300437 | 107964 | 362 | |
| 187.147 | 64.78 | 0.963 | 3460780 | 279997 | 286541 | 99381 | 243 | |
| 296.613 | 103.31 | 0.957 | 3473247 | 303717 | 291929 | 111119 | 390 | |
| 173.431 | 59.97 | 0.964 | 3470317 | 270147 | 270543 | 95311 | 529 |
Abbreviations: N = normal blood cells, TT = tumor cells, TA = adjacent non-tumor cells.
Figure 1A Circos plot showing the summary of somatic genomic alterations in the three HCC patients
Various types of somatic alterations in three HCC tumor genomes were shown in different colored lines or bars. High confidence somatic inter chromosomal SVs are shown as lines in red. Somatic copy number alterations are shown as bar plots with copy number gain in red and loss in blue. Each circus from outer to inner circles represents different kinds of genomic alterations in the study. a, chromosome number. b, line plots of union of somatic SNVs in 3 patients. c, copy number alterations in patient A368. d, copy number alterations in patient A355. e, copy number alterations in patient A357.
Figure 2The number of single nucleotide variations throughout the genomes of the tumor and adjacent non-tumor tissues and the matched normal blood cells
(A) Venn diagram illustrates overlapped SNVs among the union of the tumors, the adjacent tissues and the blood samples. B, C and D show overlapped SNVs for each patient (B) patient A355; (C) patient A357; (D) patient A368, respectively.
Figure 3Characteristics of mutations in HCC patients
(A) Distributiong of nonsilent mutations in tumor of each HCC patient. Bar charts of the scores of SIFT (B), Polyphen2 (C) and Mutation Assessor (D) Of unique nonsynonymous point mutations between tumor and adjacent tissues (all with Mann-Whitney test P < 0.001). (E) Mutation spectrum of transition and transversion categories in tumor and adjacent tissues.
Figure 4Kaplan-Meier survival plot for TP53 (R249S) wild-type and mutant HCC patients
HCC patients with the TP53 (R249S) mutations have significant poor survival.
The cancer driver and supressor genes found in the tumor tissues of the HCC patients
| Gene | Classification | Core pathway | Process | mutation | dbSNP or not | chr | start | end | ref | obs | A355 | A357 | A368 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| ARID1B | TSG | Chromatin Modification | Cell Fate | nonsynonymous SNV | novel | 6 | 157150542 | 157150542 | A | T | 1 | 0 | 0 |
| AXIN1 | TSG | APC | Cell Fate | stopgain | novel | 16 | 338223 | 338223 | C | A | 1 | 0 | 0 |
| stopgain | novel | 16 | 396701 | 396701 | C | A | 0 | 1 | 0 | ||||
| CREBBP | TSG | Chromatin Modification; Transcriptional Regulation | Cell Fate | stopgain | novel | 16 | 3778795 | 3778795 | G | A | 0 | 1 | 0 |
| GATA3 | TSG | Transcriptional Regulation | Cell Fate | nonframeshift substitution | novel | 10 | 8097738 | 8097739 | GT | CG | 1 | 0 | 0 |
| MLL3 | TSG | Chromatin Modification | Cell Fate | frameshift insertion | novel | 7 | 151874147 | 151874147 | - | T | 0 | 0 | 1 |
| nonsynonymous SNV | yes | 7 | 151919751 | 151919751 | A | G | 1 | 0 | 0 | ||||
| stopgain | yes | 7 | 151945071 | 151945071 | - | T | 0 | 0 | 1 | ||||
| CNV deletion | novel | 7 | 151900000 | 152200000 | 1 | 0 | 0 | ||||||
| MPL | Oncogene | STAT | Cell Survival | nonsynonymous SNV | novel | 1 | 43803860 | 43803860 | C | A | 0 | 1 | 0 |
| NOTCH1 | TSG | NOTCH | Cell Fate | frameshift insertion | novel | 9 | 139399408 | 139399408 | - | C | 0 | 1 | 0 |
| NOTCH2 | TSG | NOTCH | Cell Fate | nonsynonymous SNV | yes | 1 | 120611960 | 120611960 | C | T | 1 | 1 | 0 |
| TP53 | TSG | Cell Cycle/Apoptosis; DNA Damage Control | Cell Survival | nonsynonymous SNV | yes | 17 | 7577534 | 7577534 | C | A | 0 | 1 | 0 |
The cancer driver and supressor genes found in the adjacent tissues of the HCC patients
| Gene | Classification | Core pathway | Process | mutation | dbSNP or not | chr | start | end | ref | obs | A355 | A357 | A368 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| ARID1B | TSG | Chromatin Modification | Cell Fate | frameshift deletion | novel | 6 | 157222580 | 157222580 | A | - | 1 | 0 | 0 |
| CIC | TSG | RAS | Cell Survival | nonframeshift substitution | novel | 19 | 42796589 | 42796590 | GC+A12:I21 | AG | 0 | 0 | 1 |
| EP300 | TSG | Chromatin Modification; APC; TGF-β; NOTCH | Cell Survival/Fate | nonframeshift substitution | novel | 22 | 41545942 | 41545945 | CAGC | GCAA | 0 | 1 | 0 |
| FLT3 | Oncogene | RAS; PI3K; STAT | Cell Survival | nonsynonymous SNV | yes | 13 | 28674628 | 28674628 | T | C | 0 | 1 | 0 |
| MSH6 | TSG | DNA Damage Control | Genome Maintenance | nonframeshift substitution | novel | 2 | 48028228 | 48028229 | CT | AG | 0 | 0 | 1 |
| NCOR1 | TSG | Chromatin Modification | Cell Fate | stopgain | novel | 17 | 15983770 | 15983770 | G | A | 0 | 0 | 1 |
| RB1 | TSG | Cell Cycle/Apoptosis | Cell Survival | nonsynonymous SNV | novel | 13 | 49051518 | 49051518 | T | G | 1 | 0 | 0 |
| MLL3 | TSG | Chromatin Modification | Cell Fate | nonsynonymous SNV | yes | 7 | 151919690 | 151919690 | C | T | 0 | 0 | 1 |
| MLL2 | TSG | Chromatin Modification | Cell Fate | nonsynonymous SNV | novel | 12 | 49427278 | 49427278 | T | G | 1 | 0 | 0 |
Pathway impact factor analysis for significantly (significance at the Bonferroni corrected P value level of 5%) enriched KEGG pathways
| Comparison | Pathway Name | Impact Factor | %Pathway Genes in Input | Corrected | Classified Group |
|---|---|---|---|---|---|
| Tumor unique | ECM-receptor interaction | 25.54 | 90.48 | 7.54E-10 | Tumor environmental information processing |
| Cell adhesion molecules (CAMs) | 24.38 | 78.36 | 2.41E-09 | ||
| Adjacent non-tumor unique | Olfactory transduction | 25.50 | 78.27 | 7.90E-10 | Organismal systems |
| Intersection of tumor and adjacent non-tumor | Neuroactive ligand-receptor interaction | 20.19 | 66.41 | 1.58E-07 | Tumor environmental information processing |
| ABC transporters | 12.47 | 81.82 | 3.59E-04 | ||
| Calcium signaling pathway | 14.89 | 66.48 | 3.17E-05 | ||
| Phosphatidylinositol signaling system | 14.69 | 76.32 | 3.88E-05 | ||
| Complement and coagulation cascades | 12.65 | 75.36 | 2.98E-04 | Organismal systems | |
| Taste transduction | 8.92 | 73.59 | 0.0125 | ||
| GnRH signaling pathway | 8.70 | 66.02 | 0.015 | ||
| Hematopoietic cell lineage | 8.01 | 66.67 | 0.031 | ||
| Gap junction | 9.522 | 67.71 | 0.0068 | Cellular process | |
| Focal adhesion | 16.46 | 66.50 | 6.63E-06 | ||
| Pathways in cancer | 8.61 | 57.88 | 0.017 | Human disease |
Figure 5Structural variations of HCCs
(A) The number of unique structure variations (SV) predicted in the tumors and the adjacent non-tumor tissues. The number of SVs in the adjacent tissues is more than 5 times the SVs in the tumor tissues. (B) Sequencing validation of the PCR product harboring the TK1-RNU7 fusion. The TK1 gene on chromosome 17 was broken after the fifth exon and fused with the upstream regulating region of the RNA gene RNU7-177P on chromosome 8. The breakpoint was amplified by PCR from the tumor tissues and subjected to Sanger Sequencing to verify the fusion. The sequencing chromatographs covering the breakpoint are shown. Red box=exons, purple box=Y-RNA gene, green box=RNU7-177P gene. (C) PCR analysis of TK1-RNU7 fusion in a cohort of 48 tumors. (D). PCR analysis of TK1-RNU7 fusion in the cohort of matched 48 adjacent tissues. The PCR product with the fusion is 556bp. The figure shows fusion was detected both in tumors and adjacent tissues, but more in adjacent. Plus sign (+) means fusion was detected.