Literature DB >> 28802713

Detecting exact breakpoints of deletions with diversity in hepatitis B viral genomic DNA from next-generation sequencing data.

Ji-Hong Cheng1, Wen-Chun Liu2, Ting-Tsung Chang2, Sun-Yuan Hsieh1, Vincent S Tseng3.   

Abstract

Many studies have suggested that deletions of Hepatitis B Viral (HBV) are associated with the development of progressive liver diseases, even ultimately resulting in hepatocellular carcinoma (HCC). Among the methods for detecting deletions from next-generation sequencing (NGS) data, few methods considered the characteristics of virus, such as high evolution rates and high divergence among the different HBV genomes. Sequencing high divergence HBV genome sequences using the NGS technology outputs millions of reads. Thus, detecting exact breakpoints of deletions from these big and complex data incurs very high computational cost. We proposed a novel analytical method named VirDelect (Virus Deletion Detect), which uses split read alignment base to detect exact breakpoint and diversity variable to consider high divergence in single-end reads data, such that the computational cost can be reduced without losing accuracy. We use four simulated reads datasets and two real pair-end reads datasets of HBV genome sequence to verify VirDelect accuracy by score functions. The experimental results show that VirDelect outperforms the state-of-the-art method Pindel in terms of accuracy score for all simulated datasets and VirDelect had only two base errors even in real datasets. VirDelect is also shown to deliver high accuracy in analyzing the single-end read data as well as pair-end data. VirDelect can serve as an effective and efficient bioinformatics tool for physiologists with high accuracy and efficient performance and applicable to further analysis with characteristics similar to HBV on genome length and high divergence. The software program of VirDelect can be downloaded at https://sourceforge.net/projects/virdelect/.
Copyright © 2017. Published by Elsevier Inc.

Entities:  

Keywords:  Breakpoint; Data mining; Deletion detection; Hepatitis B virus; Machine learning; Next-generation sequencing

Mesh:

Year:  2017        PMID: 28802713     DOI: 10.1016/j.ymeth.2017.08.005

Source DB:  PubMed          Journal:  Methods        ISSN: 1046-2023            Impact factor:   3.608


  2 in total

Review 1.  Applications of next-generation sequencing analysis for the detection of hepatocellular carcinoma-associated hepatitis B virus mutations.

Authors:  I-Chin Wu; Wen-Chun Liu; Ting-Tsung Chang
Journal:  J Biomed Sci       Date:  2018-06-02       Impact factor: 8.410

2.  Machine learning methods and systems for data-driven discovery in biomedical informatics.

Authors:  Sungroh Yoon; Seunghak Lee; Wei Wang
Journal:  Methods       Date:  2017-10-01       Impact factor: 3.608

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.