Literature DB >> 23718149

Mutant-bin: unsupervised haplotype estimation of viral population diversity without reference genome.

Shruthi Prabhakara1, Raunaq Malhotra, Raj Acharya, Mary Poss.   

Abstract

High genetic variability in viral populations plays an important role in disease progression, pathogenesis, and drug resistance. The last few years has seen significant progress in the development of methods for reconstruction of viral populations using data from next-generation sequencing technologies. These methods identify the differences between individual haplotypes by mapping the short reads to a reference genome. Much less has been published about resolving the population structure when a reference genome is lacking or is not well-defined, which severely limits the application of these new technologies to resolve virus population structure. We describe a computational framework, called Mutant-Bin, for clustering individual haplotypes in a viral population and determining their prevalence based on a set of deep sequencing reads. The main advantages of our method are that: (i) it enables determination of the population structure and haplotype frequencies when a reference genome is lacking; (ii) the method is unsupervised-the number of haplotypes does not have to be specified in advance; and (iii) it identifies the polymorphic sites that co-occur in a subset of haplotypes and the frequency with which they appear in the viral population. The method was evaluated on simulated reads with sequencing errors and 454 pyrosequencing reads from HIV samples. Our method clustered a high percentage of haplotypes with low false-positive rates, even at low genetic diversity.

Mesh:

Year:  2013        PMID: 23718149     DOI: 10.1089/cmb.2012.0174

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  3 in total

1.  Evaluation of haplotype callers for next-generation sequencing of viruses.

Authors:  Anton Eliseev; Keylie M Gibson; Pavel Avdeyev; Dmitry Novik; Matthew L Bendall; Marcos Pérez-Losada; Nikita Alexeev; Keith A Crandall
Journal:  Infect Genet Evol       Date:  2020-03-06       Impact factor: 3.342

2.  Error correction and statistical analyses for intra-host comparisons of feline immunodeficiency virus diversity from high-throughput sequencing data.

Authors:  Yang Liu; Francesca Chiaromonte; Howard Ross; Raunaq Malhotra; Daniel Elleder; Mary Poss
Journal:  BMC Bioinformatics       Date:  2015-06-30       Impact factor: 3.169

3.  VTBuilder: a tool for the assembly of multi isoform transcriptomes.

Authors:  John Archer; Gareth Whiteley; Nicholas R Casewell; Robert A Harrison; Simon C Wagstaff
Journal:  BMC Bioinformatics       Date:  2014-12-03       Impact factor: 3.169

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.