| Literature DB >> 26087185 |
Daniel W H Ho1,2, Karen M F Sze1,2, Irene O L Ng1,2.
Abstract
Viral integration into the human genome upon infection is an important risk factor for various human malignancies. We developed viral integration site detection tool called Virus-Clip, which makes use of information extracted from soft-clipped sequencing reads to identify exact positions of human and virus breakpoints of integration events. With initial read alignment to virus reference genome and streamlined procedures, Virus-Clip delivers a simple, fast and memory-efficient solution to viral integration site detection. Moreover, it can also automatically annotate the integration events with the corresponding affected human genes. Virus-Clip has been verified using whole-transcriptome sequencing data and its detection was validated to have satisfactory sensitivity and specificity. Marked advancement in performance was detected, compared to existing tools. It is applicable to versatile types of data including whole-genome sequencing, whole-transcriptome sequencing, and targeted sequencing. Virus-Clip is available at http://web.hku.hk/~dwhho/Virus-Clip.zip.Entities:
Keywords: breakpoint detection; next-generation sequencing; viral integration; viral integration site detection
Mesh:
Year: 2015 PMID: 26087185 PMCID: PMC4673242 DOI: 10.18632/oncotarget.4187
Source DB: PubMed Journal: Oncotarget ISSN: 1949-2553
Figure 1Workflow of Virus-Clip
Benchmark result for viral integration site detection
| Sample | Tool | # of reads (M) | Execution time (min) | # of CPU | Memory used (GB) | # of viral integration events | Key affected human gene |
|---|---|---|---|---|---|---|---|
| HBV-associated HCC 1 | Virus-Clip | 63.8 | 35.9 | 1 | 2.1 | 8 | |
| VirusFinder | 244.6 | 8 | 15.9 | 1 | |||
| ViralFusionSeq | Execution failure | ||||||
| HBV-associated HCC 2 | Virus-Clip | 70.0 | 36.4 | 1 | 2.4 | 14 | |
| VirusFinder | 259.3 | 8 | 15.8 | 3 | |||
| ViralFusionSeq | Execution failure | ||||||
Note: Error encountered during ViralFusionSeq execution
Validation experiment on 17 selected HBV integration events
| Sample | Lane in Figure | Integrated genic region | Affected human gene | Supporting read count | Forward primer | Reverse primer | PCR product size (bp) | Validated |
|---|---|---|---|---|---|---|---|---|
| HBV-associated HCC 1 | 1 | intronic | 17 | GGAGGAGTTGGGGGAGGAG | CTGGAAAGTGTCCAAGGAGG | 162 | Yes | |
| 2 | exonic | 7 | CTCAAGAGAGCCAAAGTGCAGC | ACACAGAATAGCTTGCCTGAG | 170 | Yes | ||
| 3 | splicing | 6 | AATTTGTCCTGGTTATCGCTGG | CTCCGGCCACCTCCTCCATCTGC | 141 | Yes | ||
| 4 | UTR3 | 5 | GCAACCTGCTCAACTAGGGCCCCTGCTG | GATTACATATCCCATGAAGTTAAGG | 147 | No | ||
| 5 | intronic | 2 | AGCAGAAGGTGGCAGCTTCCATG | CGGGTCAATGTCCATGCCCCAAAGC | 116 | Yes | ||
| 6 | UTR5 | 1 | TCTCGCAGCGCCGCCGCTGCCATC | AGACGGGGAGTCCGCGTAAAGAGAG | 101 | No | ||
| 7 | intergenic | - | 1 | CTTTAATTAGTATCTTCTAC | GGCCATTGATCCGTGTTGG | 101 | No | |
| 8 | exonic | 1 | TGGACTTTCAGCAATGTCAACG | GATCTGCTTGACATCCCCGGCCAC | 101 | No | ||
| HBV-associated HCC 2 | 1 | upstream | 37 | ATCCCAGTAGAGTAGGAG | CAAATACTCAAGAACAGTTTC | 148 | Yes | |
| 2 | upstream | 15 | GGCGAGAAACTTCTGGGTCTC | GCATTTGGTGGTCTGTAAGC | 154 | Yes | ||
| 3 | exonic | 12 | GCTGGATGGGTCGGCGGCG | GCAGGAACTTGGCCAGGATC | 150 | Yes | ||
| 4 | intergenic | - | 9 | ACCAACATTTGAACAGTCACC | TACGGGTCAATGTCCATGCC | 134 | Yes | |
| 5 | exonic | 7 | GCGGCGTTTTATCATCTTCC | GCACAGCCTCTGCAGCACTC | 112 | Yes | ||
| 6 | intergenic | - | 4 | GAGTTGGGGGAGGAGATTAG | GTTTCTGAGCTCTGTCAAAACGG | 154 | Yes | |
| 7 | UTR5 | 2 | GTTTTATCATCTTCCTCTTCATCCTG | CTTGAATTGTAACAGTGGCTGC | 132 | No | ||
| 8 | intergenic | - | 2 | GGAGATTAGGTTAAAGGTC | GCCAAAGTTAAGGACACTCTTGTGAC | 113 | No | |
| 9 | exonic | 2 | CTTTGGAAGAGAAACTGTTCTTGAG | GGACTGTGCTGCCTTCATAATGCC | 109 | No |
It summarizes the details of the events and corresponds to the PCR amplification in Figure 2. Integration events are sorted according to supporting read count, with those successfully amplified by PCR and confirmed by Sanger sequencing remarked as validated in the rightmost column.
Figure 2PCR amplification of selected HBV integration events
The lanes correspond to the integration events listed in Table 2. Order of integration events is sorted according to supporting read count with the leftmost one supported by the most.