| Literature DB >> 23236287 |
Dong Ding1, Xiaoyan Lou, Dasong Hua, Wei Yu, Lisha Li, Jun Wang, Feng Gao, Na Zhao, Guoping Ren, Lanjuan Li, Biaoyang Lin.
Abstract
Integration of the viral DNA into host chromosomes was found in most of the hepatitis B virus (HBV)-related hepatocellular carcinomas (HCCs). Here we devised a massive anchored parallel sequencing (MAPS) method using next-generation sequencing to isolate and sequence HBV integrants. Applying MAPS to 40 pairs of HBV-related HCC tissues (cancer and adjacent tissues), we identified 296 HBV integration events corresponding to 286 unique integration sites (UISs) with precise HBV-Human DNA junctions. HBV integration favored chromosome 17 and preferentially integrated into human transcript units. HBV targeted genes were enriched in GO terms: cAMP metabolic processes, T cell differentiation and activation, TGF beta receptor pathway, ncRNA catabolic process, and dsRNA fragmentation and cellular response to dsRNA. The HBV targeted genes include 7 genes (PTPRJ, CNTN6, IL12B, MYOM1, FNDC3B, LRFN2, FN1) containing IPR003961 (Fibronectin, type III domain), 7 genes (NRG3, MASP2, NELL1, LRP1B, ADAM21, NRXN1, FN1) containing IPR013032 (EGF-like region, conserved site), and three genes (PDE7A, PDE4B, PDE11A) containing IPR002073 (3', 5'-cyclic-nucleotide phosphodiesterase). Enriched pathways include hsa04512 (ECM-receptor interaction), hsa04510 (Focal adhesion), and hsa04012 (ErbB signaling pathway). Fewer integration events were found in cancers compared to cancer-adjacent tissues, suggesting a clonal expansion model in HCC development. Finally, we identified 8 genes that were recurrent target genes by HBV integration including fibronectin 1 (FN1) and telomerase reverse transcriptase (TERT1), two known recurrent target genes, and additional novel target genes such as SMAD family member 5 (SMAD5), phosphatase and actin regulator 4 (PHACTR4), and RNA binding protein fox-1 homolog (C. elegans) 1 (RBFOX1). Integrating analysis with recently published whole-genome sequencing analysis, we identified 14 additional recurrent HBV target genes, greatly expanding the HBV recurrent target list. This global survey of HBV integration events, together with recently published whole-genome sequencing analyses, furthered our understanding of the HBV-related HCC.Entities:
Mesh:
Substances:
Year: 2012 PMID: 23236287 PMCID: PMC3516541 DOI: 10.1371/journal.pgen.1003065
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
Figure 1An overview of MAPS.
(A) Detailed illustration of the ‘Y’ shaped PE 2 Walking Adapter and the barcoded HBX nested primer. (B) Schematic of the MAPS approach. Adapters and primers are colored in accordance with (A). Detailed primer sequence information is provided in the Table S1, S2 and S3.
Figure 2Interpretation of the results of MAPS.
(A). A representative mapping of the paired reads from MAPS at integration sites. Read 1 sequences are anchored, since they extend from the fixed HBX nested primer; while the positions of Read 2 sequences vary due to the random fragmentation. (B). Insertion sites illustration in UCSC genome browser. (i) Insertion in ERBB4. (ii) Insertion in GRM8. (C). Integrant sequences of ERBB4 and GRM8. Breakpoint is indicated by arrow. The numbers are HBV genome coordinates, based on the reference of GQ205441. The human gene sequences are underlined. Orange bases indicate the common base between HBV and Human at the junction. (D). Distribution of integrant lengths in the sequenced library of ERBB4 and GRM8 integrations.
Summary of HBV targeted genes identified.
| Tissue disease state | chr | Insertion orientation | Nubmer of unique end fragments | Number of total reads | Description | Gene symbol (10 kb annotation)* |
| Adjacent tissue | chrX | + | 25 | 15550 | Septin 6 | SEPT6 |
| Adjacent tissue | chr16 | − | 6 | 4565 | RNA binding protein, fox-1 homolog (C. elegans) 1 | A2BP1 (RBFOX1) |
| Adjacent tissue | chr16 | + | 3 | 358 | RNA binding protein, fox-1 homolog (C. elegans) 1 | A2BP1 (RBFOX1) |
| Adjacent tissue | chr2 | − | 3 | 122 | Abl interactor 2 | ABI2 |
| Adjacent tissue | chr16 | + | 4 | 1449 | acyl-CoA synthetase medium-chain family member 2A | ACSM2A |
| Adjacent tissue | chr12 | − | 23 | 2062 | Acyl-CoA synthetase short-chain family member 3 | ACSS3 |
| Adjacent tissue | chr14 | − | 41 | 4606 | chromosome 14 open reading frame 55 | ADAM21 |
| Adjacent tissue | chr4 | − | 4 | 811 | Alcohol dehydrogenase 1B (class I), beta polypeptide | ADH1B |
| Adjacent tissue | chr10 | − | 5 | 1453 | Aldo-keto reductase family 1, member C4 (chlordecone reductase; 3-alpha hydroxysteroid dehydrogenase, type I; dihydrodiol dehydrogenase 4) | AKR1C4 |
| Adjacent tissue | chr4 | − | 125 | 61810 | Albumin | ALB |
| Cancer tissue | chr12 | − | 9 | 30554 | Asparagine-linked glycosylation 10 homolog B (yeast, alpha-1,2-glucosyltransferase) | ALG10B |
| Adjacent tissue | chrX | − | 9 | 131 | Asparagine-linked glycosylation 13 homolog (S. cerevisiae) | ALG13 |
| Adjacent tissue | chr2 | − | 31 | 15568 | Anaplastic lymphoma kinase (Ki-1) | ALK |
| Adjacent tissue | chr2 | − | 18 | 11626 | Alkaline phosphatase, intestinal | ALPI |
| Adjacent tissue | chr7 | − | 52 | 44022 | Amphiphysin | AMPH |
| Adjacent tissue | chr22 | + | 4 | 536 | Apolipoprotein L, 1 | APOL1 |
| Adjacent tissue | chr11 | − | 16 | 862 | Rho guanine nucleotide exchange factor (GEF) 12 | ARHGEF12 |
| Adjacent tissue | chr11 | − | 6 | 3607 | Rho guanine nucleotide exchange factor (GEF) 12 | ARHGEF12 |
| Adjacent tissue | chr8 | − | 7 | 6932 | ATPase, H+ transporting, lysosomal 38 kDa, V0 subunit d2 | ATP6V0D2 |
| Adjacent tissue | chr22 | + | 18 | 9882 | BCL2-interacting killer (apoptosis-inducing) | BIK |
| Adjacent tissue | chr7 | + | 6 | 658 | V-raf murine sarcoma viral oncogene homolog B1 | BRAF |
| Adjacent tissue | chr12 | − | 20 | 615 | Chromosome 12 open reading frame 40 | C12orf40 |
| Adjacent tissue | chr15 | + | 6 | 326 | DNM1 pseudogene 46 | C15orf51 |
| Adjacent tissue | chr4 | + | 37 | 2363 | C1q and tumor necrosis factor related protein 7 | C1QTNF7 |
| Adjacent tissue | chr9 | − | 9 | 250 | Chromosome 9 open reading frame 93 | C9orf93 |
| Adjacent tissue | chr4 | + | 9 | 835 | coiled-coil domain containing 158 | CCDC158 |
| Cancer tissue | chr4 | − | 130 | 73841 | Cyclin A2 | CCNA2 |
| Adjacent tissue | chr4 | − | 14 | 9198 | Cyclin A2 | CCNA2 |
| Adjacent tissue | chr6 | − | 6 | 878 | Cyclin C | CCNC |
| Adjacent tissue | chr17 | − | 11 | 2684 | Cell division cycle 27 homolog (S. cerevisiae) | CDC27 |
| Adjacent tissue | chr7 | − | 4 | 2255 | cyclin-dependent kinase 13 | CDC2L5 |
| Adjacent tissue | chr5 | + | 18 | 470 | Cadherin 10, type 2 (T2-cadherin) | CDH10 |
| Adjacent tissue | chr16 | + | 14 | 6882 | Cadherin 8, type 2 | CDH8 |
| Adjacent tissue | chr3 | + | 5 | 187 | Centrosomal protein 63 kDa | CEP63 |
| Adjacent tissue | chr1 | + | 3 | 174 | Chitinase 3-like 2 | CHI3L2 |
| Adjacent tissue | chr12 | + | 38 | 693 | Citron (rho-interacting, serine/threonine kinase 21) | CIT |
| Cancer | chr3 | − | 236 | 66086 | Contactin 6 | CNTN6 |
| Adjacent tissue | chr21 | − | 42 | 2632 | Collagen, type XVIII, alpha 1 | COL18A1 |
| Adjacent tissue | chrX | − | 45 | 1849 | Collagen, type IV, alpha 6 | COL4A6 |
| Adjacent tissue | chr21 | − | 4 | 322 | Collagen, type VI, alpha 2 | COL6A2 |
| Adjacent tissue | chr2 | + | 95 | 91537 | Carbamoyl-phosphate synthetase 1, mitochondrial | CPS1 |
| Adjacent tissue | chr15 | − | 28 | 2082 | CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) small phosphatase like 2 | CTDSPL2 |
| Adjacent tissue | chr11 | + | 9 | 7279 | CUGBP, Elav-like family member 1 | CUGBP1 |
| Adjacent tissue | chr10 | − | 5 | 92 | CUGBP, Elav-like family member 2 | CUGBP2 |
| Cancer tissue | chr10 | − | 298 | 261343 | Cytochrome P450, family 2, subfamily C, polypeptide 8 | CYP2C8 |
| Adjacent tissue | chr10 | + | 29 | 6742 | Cytochrome P450, family 2, subfamily C, polypeptide 8 | CYP2C8 |
| Adjacent tissue | chr11 | + | 5 | 168 | Discs, large homolog 2, chapsyn-110 (Drosophila) | DLG2 |
| Adjacent tissue | chr8 | − | 3 | 46 | Dihydropyrimidinase | DPYS |
| Adjacent tissue | chr6 | − | 61 | 5013 | Ectonucleotide pyrophosphatase/phosphodiesterase 1 | ENPP1 |
| Adjacent tissue | chr1 | + | 4 | 60 | Epidermal growth factor receptor pathway substrate 15 | EPS15 |
| Adjacent tissue | chr2 | + | 301 | 792607 | V-erb-a erythroblastic leukemia viral oncogene homolog 4 (avian) | ERBB4 |
| Cancer | chr16 | + | 16 | 6616 | Endoplasmic reticulum to nucleus signaling 2 | ERN2 |
| Adjacent tissue | chr1 | + | 58 | 7369 | Estrogen-related receptor gamma | ESRRG |
| Adjacent tissue | chr4 | − | 15 | 8951 | Ellis van Creveld syndrome | EVC |
| Adjacent tissue | chr17 | + | 14 | 10665 | Family with sequence similarity 117, member A | FAM117A |
| Adjacent tissue | chr3 | − | 25 | 1844 | family with sequence similarity 162, member A | FAM162A |
| Adjacent tissue | chr5 | + | 4 | 957 | F-box and leucine-rich repeat protein 17 | FBXL17 |
| Adjacent tissue | chr5 | + | 9 | 196 | F-box and leucine-rich repeat protein 7 | FBXL7 |
| Adjacent tissue | chr4 | − | 23 | 318 | KIAA1727 protein | FHDC1 |
| Adjacent tissue | chr18 | + | 6 | 5412 | Formin homology 2 domain containing 3 | FHOD3 |
| Adjacent tissue | chr2 | − | 169 | 36268 | Fibronectin 1 | FN1 |
| Adjacent tissue | chr2 | − | 9 | 1054 | Fibronectin 1 | FN1 |
| Adjacent tissue | chr2 | − | 47 | 631 | Fibronectin 1 | FN1 |
| Adjacent tissue | chr2 | − | 8 | 301 | Fibronectin 1 | FN1 |
| Cancer tissue | chr2 | − | 58 | 33209 | Fibronectin 1 | FN1 |
| Adjacent tissue | chr17 | + | 9 | 30405 | Fructosamine-3-kinase-related protein | FN3KRP |
| Adjacent tissue | chr3 | + | 89 | 15775 | Fibronectin type III domain containing 3B | FNDC3B |
| Adjacent tissue | chr4 | + | 47 | 30826 | Fraser syndrome 1 | FRAS1 |
| Adjacent tissue | chr4 | + | 32 | 1331 | Follistatin-like 5 | FSTL5 |
| Adjacent tissue | chr14 | − | 3 | 35 | exonuclease 3′-5′ domain containing 2 | GALNTL1 |
| Adjacent tissue | chr7 | − | 3 | 2771 | UDP-N-acetyl-alpha-D-galactosamine:polypeptide N-acetylgalactosaminyltransferase-like 5 | GALNTL5 |
| Cancer tissue | chr5 | − | 30 | 2690 | Glucosamine-6-phosphate deaminase 1 | GNPDA1 |
| Adjacent tissue | chr13 | − | 4 | 1057 | Glypican 6 | GPC6 |
| Adjacent tissue | chr4 | − | 5 | 125 | Glutamate receptor, ionotropic, delta 2 | GRID2 |
| Adjacent tissue | chr7 | + | 156 | 127769 | Glutamate receptor, metabotropic 8 | GRM8 |
| Adjacent tissue | chr12 | + | 22 | 453 | Glycogen synthase 2 (liver) | GYS2 |
| Adjacent tissue | chr11 | + | 6 | 22742 | Mps one binder kinase activator-like 2 | HCCA2 |
| Adjacent tissue | chr6 | − | 6 | 3624 | HCG27 | |
| Adjacent tissue | chr6 | + | 6 | 3221 | Histone cluster 1, H2bd | HIST1H2BD |
| Adjacent tissue | chr19 | − | 6 | 256 | Histocompatibility (minor) HA-1 | HMHA1 |
| Adjacent tissue | chr10 | + | 25 | 1214 | Heparanase 2 | HPSE2 |
| Adjacent tissue | chr3 | + | 18 | 504 | Histidine-rich glycoprotein | HRG |
| Adjacent tissue | chr5 | + | 5 | 568 | Interleukin 12B (natural killer cell stimulatory factor 2, cytotoxic lymphocyte maturation factor 2, p40) | IL12B |
| Adjacent tissue | chr3 | + | 5 | 2776 | Interleukin 1 receptor accessory protein | IL1RAP |
| Adjacent tissue | chr10 | − | 4 | 2968 | Interleukin 2 receptor, alpha | IL2RA |
| Adjacent tissue | chr6 | + | 5 | 431 | Interphotoreceptor matrix proteoglycan 1 | IMPG1 |
| Adjacent tissue | chr19 | − | 10 | 3440 | Inositol 1,4,5-trisphosphate 3-kinase C | ITPKC |
| Adjacent tissue | chr17 | + | 124 | 1293 | K(lysine) acetyltransferase 2A | KAT2A |
| Adjacent tissue | chr14 | + | 8 | 291 | KIAA0317 | KIAA0317 |
| Adjacent tissue | chr3 | + | 8 | 636 | Kelch-like 6 (Drosophila) | KLHL6 |
| Adjacent tissue | chr6 | − | 9 | 304 | Lin-28 homolog B (C. elegans) | LIN28B |
| Cancer tissue | chr9 | − | 53 | 4292 | Leucine rich repeat and Ig domain containing 2 | LINGO2 |
| Adjacent tissue | chr7 | + | 45 | 1917 | Limb region 1 homolog (mouse) | LMBR1 |
| Adjacent tissue | chr12 | − | 4 | 119 | WAS protein family homolog 1 pseudogene | LOC100288778 |
| Adjacent tissue | chr5 | + | 7 | 10522 | Hypothetical protein LOC285627 | LOC285627 |
| Adjacent tissue | chr17 | − | 5 | 246 | LOC644172 | |
| Adjacent tissue | chr6 | + | 62 | 1101 | Leucine rich repeat and fibronectin type III domain containing 2 | LRFN2 |
| Adjacent tissue | chr2 | + | 45 | 703 | Low density lipoprotein-related protein 1B (deleted in tumors) | LRP1B |
| Adjacent tissue | chr15 | − | 5 | 582 | Leucine-rich repeat kinase 1 | LRRK1 |
| Adjacent tissue | chr4 | + | 14 | 14654 | Mastermind-like 3 (Drosophila) | MAML3 |
| Adjacent tissue | chr1 | + | 6 | 2023 | Mannan-binding lectin serine peptidase 2 | MASP2 |
| Adjacent tissue | chr3 | − | 3 | 48 | Muscleblind-like (Drosophila) | MBNL1 |
| Adjacent tissue | chr6 | − | 43 | 3090 | Mediator complex subunit 23 | MED23 |
| Adjacent tissue | chr19 | + | 12 | 9041 | Meis homeobox 3 | MEIS3 |
| Adjacent tissue | chr7 | + | 5 | 68 | microRNA 182 | MIR182 |
| Adjacent tissue | chr6 | + | 4 | 90 | microRNA 206 | MIR206 |
| Adjacent tissue | chr7 | + | 11 | 631 | Myeloid/lymphoid or mixed-lineage leukemia 3 | MLL3 |
| Cancer tissue | chr19 | − | 80 | 4468 | Myeloid/lymphoid or mixed-lineage leukemia 4 | MLL4 |
| Cancer | chr2 | − | 26 | 840 | Melanoregulin | MREG |
| Adjacent tissue | chr5 | + | 16 | 650 | Myosin X | MYO10 |
| Adjacent tissue | chr18 | + | 31 | 17222 | Myomesin 1, 185 kDa | MYOM1 |
| Cancer tissue | chr7 | + | 192 | 76342 | long intergenic non-protein coding RNA 174 | NCRNA00174 |
| Adjacent tissue | chr11 | + | 3 | 129 | NEL-like 1 (chicken) | NELL1 |
| Adjacent tissue | chr6 | − | 127 | 123237 | Na+/K+ transporting ATPase interacting 2 | NKAIN2 |
| Adjacent tissue | chr10 | − | 88 | 73961 | Neuregulin 3 | NRG3 |
| Adjacent tissue | chr2 | − | 43 | 31154 | Neurexin 1 | NRXN1 |
| Adjacent tissue | chr3 | − | 6 | 7428 | Poly (ADP-ribose) polymerase family, member 9 | PARP9 |
| Adjacent tissue | chr2 | + | 4 | 84 | Phosphodiesterase 11A | PDE11A |
| Adjacent tissue | chr1 | − | 4 | 278 | Phosphodiesterase 4B, cAMP-specific (phosphodiesterase E4 dunce homolog, Drosophila) | PDE4B |
| Adjacent tissue | chr8 | + | 8 | 10926 | Phosphodiesterase 7A | PDE7A |
| Adjacent tissue | chr6 | + | 14 | 442 | enoyl-CoA delta isomerase 2 | PECI |
| Adjacent tissue | chr14 | + | 4 | 1864 | Placental growth factor, vascular endothelial growth factor-related protein | PGF |
| Cancer | chr5 | − | 289 | 3293 | Protein geranylgeranyltransferase type I, beta subunit | PGGT1B |
| Adjacent tissue | chr1 | − | 14 | 216 | Phosphatase and actin regulator 4 | PHACTR4 |
| Adjacent tissue | chr1 | − | 11 | 491 | Phosphatase and actin regulator 4 | PHACTR4 |
| Cancer tissue | chr12 | − | 54 | 6030 | Polyhomeotic homolog 1 (Drosophila) | PHC1 |
| Cancer | chr3 | − | 107 | 7765 | Phosphoinositide-3-kinase, catalytic, beta polypeptide | PIK3CB |
| Adjacent tissue | chr20 | − | 7 | 1858 | Phospholipase C, beta 1 (phosphoinositide-specific) | PLCB1 |
| Adjacent tissue | chr1 | − | 12 | 5993 | Phospholipase D family, member 5 | PLD5 |
| Adjacent tissue | chr7 | − | 29 | 10098 | Plexin A4 | PLXNA4 |
| Adjacent tissue | chr12 | + | 6 | 256 | Protein phosphatase 1H (PP2C domain containing) | PPM1H |
| Adjacent tissue | chr8 | + | 20 | 700 | Protein kinase, DNA-activated, catalytic polypeptide | PRKDC |
| Adjacent tissue | chr17 | + | 7 | 2875 | Proteasome (prosome, macropain) 26S subunit, non-ATPase, 12 | PSMD12 |
| Adjacent tissue | chr11 | + | 43 | 9668 | Protein tyrosine phosphatase, receptor type, J | PTPRJ |
| Adjacent tissue | chr17 | + | 18 | 714 | RAB5C, member RAS oncogene family | RAB5C |
| Adjacent tissue | chr17 | − | 6 | 320 | Retinoic acid induced 1 | RAI1 |
| Adjacent tissue | chr17 | − | 77 | 9045 | Retinoic acid receptor, alpha | RARA |
| Cancer | chr10 | − | 306 | 96735 | Hypothetical protein LOC220980 | RASSF4 |
| Adjacent tissue | chr10 | − | 4 | 127 | Retinol binding protein 4, plasma | RBP4 |
| Adjacent tissue | chr6 | + | 39 | 2877 | RALBP1 associated Eps domain containing 1 | REPS1 |
| Adjacent tissue | chrX | + | 4 | 1297 | Regucalcin (senescence marker protein-30) | RGN |
| Adjacent tissue | chr17 | + | 11 | 4735 | Rho GTPase activating protein 44 | RICH2 |
| Adjacent tissue | chr15 | + | 3 | 120 | S phase cyclin A-associated protein in the ER | SCAPER |
| Adjacent tissue | chr6 | + | 138 | 23599 | Sex comb on midleg-like 4 (Drosophila) | SCML4 |
| Adjacent tissue | chr2 | + | 4 | 1532 | SEC14 and spectrin domains 1 | SESTD1 |
| Adjacent tissue | chr8 | + | 3 | 554 | Secreted frizzled-related protein 1 | SFRP1 |
| Adjacent tissue | chr14 | + | 77 | 795 | surfactant associated 3 | SFTA3 |
| Adjacent tissue | chr5 | + | 6 | 579 | Superkiller viralicidic activity 2-like 2 (S. cerevisiae) | SKIV2L2 |
| Adjacent tissue | chr12 | − | 27 | 22758 | Solute carrier family 2 (facilitated glucose transporter), member 13 | SLC2A13 |
| Adjacent tissue | chr3 | + | 26 | 14482 | Sarcolemma associated protein | SLMAP |
| Adjacent tissue | chr4 | − | 6 | 8441 | SMAD family member 1 | SMAD1 |
| Cancer | chr5 | + | 570 | 20953 | SMAD family member 5 | SMAD5 |
| Adjacent tissue | chr5 | + | 31 | 310 | SMAD family member 5 | SMAD5 |
| Cancer | chr5 | + | 30 | 1124 | SMAD family member 5 | SMAD5 |
| Adjacent tissue | chr2 | − | 14 | 2397 | Son of sevenless homolog 1 (Drosophila) | SOS1 |
| Adjacent tissue | chr12 | + | 16 | 7109 | SRY (sex determining region Y)-box 5 | SOX5 |
| Adjacent tissue | chr11 | + | 4 | 56 | Spondin 1, extracellular matrix protein | SPON1 |
| Adjacent tissue | chr1 | + | 7 | 8119 | Single stranded DNA binding protein 3 | SSBP3 |
| Adjacent tissue | chr8 | + | 11 | 4638 | Suppression of tumorigenicity 18 (breast carcinoma) (zinc finger protein) | ST18 |
| Adjacent tissue | chr1 | − | 17 | 1196 | Syntaxin 6 | STX6 |
| Adjacent tissue | chr5 | + | 7 | 469 | Synaptic vesicle glycoprotein 2C | SV2C |
| Adjacent tissue | chr8 | + | 42 | 8798 | Transforming, acidic coiled-coil containing protein 1 | TACC1 |
| Adjacent tissue | chr8 | + | 16 | 4216 | TAF2 RNA polymerase II, TATA box binding protein (TBP)-associated factor, 150 kDa | TAF2 |
| Cancer tissue | chr5 | − | 382 | 46659 | Telomerase reverse transcriptase | TERT |
| Cancer | chr5 | − | 86 | 2724 | Telomerase reverse transcriptase | TERT |
| Cancer tissue | chr5 | − | 63 | 3789 | Telomerase reverse transcriptase | TERT |
| Cancer | chr5 | − | 230 | 2853 | Telomerase reverse transcriptase | TERT |
| Cancer tissue | chr5 | − | 270 | 7445 | Telomerase reverse transcriptase | TERT |
| Adjacent tissue | chr5 | − | 5 | 259 | Telomerase reverse transcriptase | TERT |
| Cancer tissue | chr5 | − | 392 | 47491 | Telomerase reverse transcriptase | TERT |
| Adjacent tissue | chr5 | − | 55 | 2941 | Telomerase reverse transcriptase | TERT |
| Adjacent tissue | chr12 | + | 47 | 3611 | Transmembrane protein 117 | TMEM117 |
| Adjacent tissue | chr12 | + | 3 | 105 | Transmembrane protein 132B | TMEM132B |
| Adjacent tissue | chr15 | − | 11 | 6194 | Transmembrane protein 87A | TMEM87A |
| Adjacent tissue | chr1 | + | 7 | 412 | Ubiquitin specific peptidase 33 | USP33 |
| Adjacent tissue | chr2 | + | 4 | 225 | Xin actin-binding repeat containing 2 | XIRP2 |
| Adjacent tissue | chr11 | − | 3 | 244 | Zinc finger and BTB domain containing 16 | ZBTB16 |
| Adjacent tissue | chr16 | − | 81 | 13492 | Zinc finger protein 1 homolog (mouse) | ZFP1 |
| Adjacent tissue | chr9 | + | 12 | 5969 | Zinc finger protein 618 | ZNF618 |
| Adjacent tissue | chr7 | − | 9 | 370 | Zinc finger protein 804B | ZNF804B |
Common HBV targeted genes among the three studies.
| Our study | Sung et al. | Jiang et al. | Description |
| ADH1B | ADH1B | Alcohol dehydrogenase 1B (class I), beta polypeptide | |
| CPS1 | CPS1 | Carbamoyl-phosphate synthetase 1, mitochondrial | |
| ESRRG | ESRRG | Estrogen-related receptor gamma | |
| FN1 | FN1 | FN1 | Fibronectin 1 |
| LRFN2 | LRFN2 | Leucine rich repeat and fibronectin type III domain containing 2 | |
| MLL4 | MLL4 | MLL4 | Myeloid/lymphoid or mixed-lineage leukemia 4 |
| MYOM1 | MYOM1 | Myomesin 1, 185 kDa | |
| RAI1 | RAI1 | Retinoic acid induced 1 | |
| TERT | TERT | Telomerase reverse transcriptase | |
| CTDSPL2 | CTDSPL2 | CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) small phosphatase like 2 | |
| LRP1B | LRP1B | Low density lipoprotein-related protein 1B (deleted in tumors) |
Figure 3Distribution of the identified UISs.
Locations of UISs in the human genome. Circles indicate the UISs found only in cancer tissue, triangles represent UISs only found in the adjacent tissue, and rectangles mean UISs found in both. The nearest gene in the vicinity of UIS was annotated, and the filled shapes marks UISs that integrate in the transcript unit.
Figure 4Number of UISs per chromosome.
A frequency of 1 (represented by the dotted line) stands for the same level of integration as the random data, i.e., this chromosome is neither favored nor disfavored for HBV integration.
Figure 5Quantitative feature of MAPS.
(A) Semi-quantitative PCR results in HCC DNA sample N13. Each panel represents different integration frequencies, judged by the number of unique tags identified by MAPS, as indicated by the number followed by the sample name on the left column labels. The amounts of genomic DNA used for PCR were indicated at the top of the lanes. (B) Semi-quantitative PCR results in multiplexed DNA samples. The left column indicates the HCC DNA sample identifications and the number of unique tags identified. (C) Correlation co-efficiency (R2 = 0.9278) of the numbers of unique tags covering the common insertions retrieved between two technical replicate runs. (D) Rarefaction analysis of integration sites recovered in MAPS.
Figure 6More clonal expansion but fewer insertions were found in cancer than in the adjacent tissue.
(A) We counted the number of unique tags to quantify the relative abundance of each UIS. The bar is the median of the number of unique tags. (B) Number of HBV integrations identified in cancer and adjacent tissue. The bar represents the median of the number of integrations.
Recurrent HBV targeted genes identified by the MAPS approach.
| Sample ID | Tissue disease state | chr | insertion orientation | number of unique end fragments | number of total reads | Gene symbol | Description | transcript direction | junction nucleotide position | read covering the junction point (Underlined for HBV) |
| N23 | Adjacent tissue | chr11 | − | 16 | 862 | ARHGEF12 | Rho guanine nucleotide exchange factor (GEF) 12 | + | 120259620 |
|
| N22 | Adjacent tissue | chr11 | − | 6 | 3607 | ARHGEF12 | Rho guanine nucleotide exchange factor (GEF) 12 | + | 120345441 |
|
| C23 | Cancer tissue | chr10 | − | 298 | 261343 | CYP2C8 | Cytochrome P450, family 2, subfamily C, polypeptide 8 | − | 96804432 |
|
| N102 | Adjacent tissue | chr10 | + | 29 | 6742 | CYP2C8 | Cytochrome P450, family 2, subfamily C, polypeptide 8 | − | 96817072 |
|
| N113 | Adjacent tissue | chr2 | − | 169 | 36268 | FN1 | Fibronectin 1 | − | 216249015 |
|
| N19 | Adjacent tissue | chr2 | − | 9 | 1054 | FN1 | Fibronectin 1 | − | 216252251 |
|
| N118 | Adjacent tissue | chr2 | − | 47 | 631 | FN1 | Fibronectin 1 | − | 216254297 |
|
| N106 | Adjacent tissue | chr2 | − | 8 | 301 | FN1 | Fibronectin 1 | − | 216274425 |
|
| C25 | Cancer tissue | chr2 | − | 58 | 33209 | FN1 | Fibronectin 1 | − | 216293206 |
|
| N125 | Adjacent tissue | chr1 | − | 14 | 216 | PHACTR4 | Phosphatase and actin regulator 4 | + | 28785619 |
|
| N23 | Adjacent tissue | chr1 | − | 11 | 491 | PHACTR4 | Phosphatase and actin regulator 4 | + | 28819591 |
|
| N9 | Adjacent tissue | chr7 | − | 29 | 10098 | PLXNA4 | Plexin A4 | − | 132158848 |
|
| C116 | Cancer tissue | chr7 | − | 4 | 521 | PLXNA4 upstream | within 150 kb of Plexin A4 | − | 132358062 |
|
| N13 | Adjacent tissue | chr16 | − | 6 | 4565 | RBFOX1 | RNA binding protein, fox-1 homolog (C. elegans) 1 | + | 6075152 |
|
| N104 | Adjacent tissue | chr16 | + | 3 | 358 | RBFOX1 | RNA binding protein, fox-1 homolog (C. elegans) 1 | + | 7464068 |
|
| C109 | Cancer | chr5 | + | 570 | 20953 | SMAD5 | SMAD family member 5 | + | 135491847 |
|
| N109 | Adjacent tissue | chr5 | + | 31 | 310 | SMAD5 | SMAD family member 5 | + | 135491847 |
|
| C110 | Cancer | chr5 | + | 30 | 1124 | SMAD5 | SMAD family member 5 | − | 135491851 |
|
| C18 | Cancer tissue | chr5 | − | 382 | 46659 | TERT | Telomerase reverse transcriptase | − | 1295135 |
|
| C100 | Cancer | chr5 | − | 86 | 2724 | TERT | Telomerase reverse transcriptase | − | 1295396 |
|
| C116 | Cancer tissue | chr5 | − | 63 | 3789 | TERT | Telomerase reverse transcriptase | − | 1295441 |
|
| C105 | Cancer | chr5 | − | 230 | 2853 | TERT | Telomerase reverse transcriptase | − | 1295563 |
|
| C117 | Cancer tissue | chr5 | − | 270 | 7445 | TERT | Telomerase reverse transcriptase | − | 1295715 |
|
| N117 | Adjacent tissue | chr5 | − | 5 | 259 | TERT | Telomerase reverse transcriptase | − | 1295715 |
|
| C124 | Cancer tissue | chr5 | − | 392 | 47491 | TERT | Telomerase reverse transcriptase | − | 1298846 |
|
| N124 | Adjacent tissue | chr5 | − | 55 | 2941 | TERT | Telomerase reverse transcriptase | − | 1298846 |
|
Figure 7Illustration of the integrations targeted TERT.
The sequences were representative Read 2 tags that cover the HBV integration sites to the TERT gene. HBV DNA was shown in lowercase, and the Human DNA was in uppercase and underlined. The numbered boxes represent nearby exons. The nucleotide position at the human and HBV were also indicated (based on GenBank AY800389.1 and Human hg19 assemble).
Figure 8Illustration of the integrations in FN1.
The sequences were representative Read 2 tags that cover the junction points of the HBV integration to the FN1 gene. HBV DNA was shown in lowercase, and the Human DNA was in uppercase and underlined. The numbered boxes represent nearby exons. The nucleotide position at the human and HBV were also indicated (based on GenBank AY800389.1 and Human hg19 assemble).
Recurrent targeted combining the integrative analysis.
| Gene Symbol | Studies identified | Descriptions |
| ARHGEF12 | our study | Rho guanine nucleotide exchange factor (GEF) 12 |
| CYP2C8 | our study | Cytochrome P450, family 2, subfamily C, polypeptide 8 |
| PHACTR4 | our study | Phosphatase and actin regulator 4 |
| PLXNA4 | our study | Plexin A4 |
| RBFOX1 | our study | |
| SMAD5 | our study | SMAD family member 5 |
| ADH1B | integrated analysis | Alcohol dehydrogenase 1B (class I), beta polypeptide |
| CPS1 | integrated analysis | Carbamoyl-phosphate synthetase 1, mitochondrial |
| ESRRG | integrated analysis | Estrogen-related receptor gamma |
| LRFN2 | integrated analysis | Leucine rich repeat and fibronectin type III domain containing 2 |
| MYOM1 | integrated analysis | Myomesin 1, 185 kDa |
| RAI1 | integrated analysis | Retinoic acid induced 1 |
| CTDSPL2 | integrated analysis | CTD (carboxy-terminal domain, RNA polymerase II, polypeptide A) small phosphatase like 2 |
| LRP1B | integrated analysis | Low density lipoprotein-related protein 1B (deleted in tumors) |
| FN1 | our study, Sung | Fibronectin 1 |
| TERT | our study, Sung | Telomerase reverse transcriptase |
| MLL4 | Sung | Myeloid/lymphoid or mixed-lineage leukemia 4 |
| CCNE1 | Sung | Cyclin E1 |
| SENP5 | Sung | SUMO1/sentrin specific peptidase 5 |
| ROCK1 | Sung | Rho-associated, coiled-coil containing protein kinase 1 |
Integrated analysis of our HBV target list with the recently published whole genome sequencing studies.