Xin Li1, Samaneh Saadat1, Haiyan Hu1, Xiaoman Li2. 1. Department of Computer Science, College of Medicine, University of Central Florida, Orlando, FL 32816, USA. 2. Burnett School of Biomedical Science, College of Medicine, University of Central Florida, Orlando, FL 32816, USA.
Abstract
MOTIVATION: The bacterial haplotype reconstruction is critical for selecting proper treatments for diseases caused by unknown haplotypes. Existing methods and tools do not work well on this task, because they are usually developed for viral instead of bacterial populations. RESULTS: In this study, we developed BHap, a novel algorithm based on fuzzy flow networks, for reconstructing bacterial haplotypes from next generation sequencing data. Tested on simulated and experimental datasets, we showed that BHap was capable of reconstructing haplotypes of bacterial populations with an average F1 score of 0.87, an average precision of 0.87 and an average recall of 0.88. We also demonstrated that BHap had a low susceptibility to sequencing errors, was capable of reconstructing haplotypes with low coverage and could handle a wide range of mutation rates. Compared with existing approaches, BHap outperformed them in terms of higher F1 scores, better precision, better recall and more accurate estimation of the number of haplotypes. AVAILABILITY AND IMPLEMENTATION: The BHap tool is available at http://www.cs.ucf.edu/∼xiaoman/BHap/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
MOTIVATION: The bacterial haplotype reconstruction is critical for selecting proper treatments for diseases caused by unknown haplotypes. Existing methods and tools do not work well on this task, because they are usually developed for viral instead of bacterial populations. RESULTS: In this study, we developed BHap, a novel algorithm based on fuzzy flow networks, for reconstructing bacterial haplotypes from next generation sequencing data. Tested on simulated and experimental datasets, we showed that BHap was capable of reconstructing haplotypes of bacterial populations with an average F1 score of 0.87, an average precision of 0.87 and an average recall of 0.88. We also demonstrated that BHap had a low susceptibility to sequencing errors, was capable of reconstructing haplotypes with low coverage and could handle a wide range of mutation rates. Compared with existing approaches, BHap outperformed them in terms of higher F1 scores, better precision, better recall and more accurate estimation of the number of haplotypes. AVAILABILITY AND IMPLEMENTATION: The BHap tool is available at http://www.cs.ucf.edu/∼xiaoman/BHap/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Authors: Sergio Pulido-Tamayo; Aminael Sánchez-Rodríguez; Toon Swings; Bram Van den Bergh; Akanksha Dubey; Hans Steenackers; Jan Michiels; Jan Fostier; Kathleen Marchal Journal: Nucleic Acids Res Date: 2015-05-18 Impact factor: 16.971
Authors: David W Eyre; Madeleine L Cule; David Griffiths; Derrick W Crook; Tim E A Peto; A Sarah Walker; Daniel J Wilson Journal: PLoS Comput Biol Date: 2013-05-02 Impact factor: 4.475
Authors: Chen Cao; Jingni He; Lauren Mak; Deshan Perera; Devin Kwok; Jia Wang; Minghao Li; Tobias Mourier; Stefan Gavriliuc; Matthew Greenberg; A Sorana Morrissy; Laura K Sycuro; Guang Yang; Daniel C Jeffares; Quan Long Journal: Mol Biol Evol Date: 2021-05-19 Impact factor: 16.240
Authors: Marija Dmitrijeva; Christian R Kahlert; Rounak Feigelman; Rebekka L Kleiner; Oliver Nolte; Werner C Albrich; Florent Baty; Christian von Mering Journal: mBio Date: 2021-03-09 Impact factor: 7.867