Literature DB >> 23257116

Benchmarking of viral haplotype reconstruction programmes: an overview of the capacities and limitations of currently available programmes.

Melanie Schirmer1, William T Sloan, Christopher Quince.   

Abstract

Viral haplotype reconstruction from a set of observed reads is one of the most challenging problems in bioinformatics today. Next-generation sequencing technologies enable us to detect single-nucleotide polymorphisms (SNPs) of haplotypes-even if the haplotypes appear at low frequencies. However, there are two major problems. First, we need to distinguish real SNPs from sequencing errors. Second, we need to determine which SNPs occur on the same haplotype, which cannot be inferred from the reads if the distance between SNPs on a haplotype exceeds the read length. We conducted an independent benchmarking study that directly compares the currently available viral haplotype reconstruction programmes. We also present nine in silico data sets that we generated to reflect biologically plausible populations. For these data sets, we simulated 454 and Illumina reads and applied the programmes to test their capacity to reconstruct whole genomes and individual genes. We developed a novel statistical framework to demonstrate the strengths and limitations of the programmes. Our benchmarking demonstrated that all the programmes we tested performed poorly when sequence divergence was low and failed to recover haplotype populations with rare haplotypes.

Keywords:  benchmarking; in silico data sets; quasispecies; statistics for validation; viral haplotype reconstruction programmes

Mesh:

Year:  2012        PMID: 23257116     DOI: 10.1093/bib/bbs081

Source DB:  PubMed          Journal:  Brief Bioinform        ISSN: 1467-5463            Impact factor:   11.622


  33 in total

1.  Characterization of Hepatitis C Virus (HCV) Envelope Diversification from Acute to Chronic Infection within a Sexually Transmitted HCV Cluster by Using Single-Molecule, Real-Time Sequencing.

Authors:  Cynthia K Y Ho; Jayna Raghwani; Sylvie Koekkoek; Richard H Liang; Jan T M Van der Meer; Marc Van Der Valk; Menno De Jong; Oliver G Pybus; Janke Schinkel; Richard Molenkamp
Journal:  J Virol       Date:  2017-02-28       Impact factor: 5.103

2.  Multiple Sources of Genetic Diversity of Influenza A Viruses during the Hajj.

Authors:  Joanna C A Cobbin; Mohammad Alfelali; Osamah Barasheed; Janette Taylor; Dominic E Dwyer; Jen Kok; Robert Booy; Edward C Holmes; Harunor Rashid
Journal:  J Virol       Date:  2017-05-12       Impact factor: 5.103

3.  Evaluation of haplotype callers for next-generation sequencing of viruses.

Authors:  Anton Eliseev; Keylie M Gibson; Pavel Avdeyev; Dmitry Novik; Matthew L Bendall; Marcos Pérez-Losada; Nikita Alexeev; Keith A Crandall
Journal:  Infect Genet Evol       Date:  2020-03-06       Impact factor: 3.342

4.  Separation and assembly of deep sequencing data into discrete sub-population genomes.

Authors:  Konstantinos Karagiannis; Vahan Simonyan; Konstantin Chumakov; Raja Mazumder
Journal:  Nucleic Acids Res       Date:  2017-11-02       Impact factor: 16.971

5.  Evaluating assembly and variant calling software for strain-resolved analysis of large DNA viruses.

Authors:  Zhi-Luo Deng; Akshay Dhingra; Adrian Fritz; Jasper Götting; Philipp C Münch; Lars Steinbrück; Thomas F Schulz; Tina Ganzenmüller; Alice C McHardy
Journal:  Brief Bioinform       Date:  2021-05-20       Impact factor: 11.622

6.  Insight into biases and sequencing errors for amplicon sequencing with the Illumina MiSeq platform.

Authors:  Melanie Schirmer; Umer Z Ijaz; Rosalinda D'Amore; Neil Hall; William T Sloan; Christopher Quince
Journal:  Nucleic Acids Res       Date:  2015-01-13       Impact factor: 16.971

7.  Deconvoluting the composition of low-frequency hepatitis C viral quasispecies: comparison of genotypes and NS3 resistance-associated variants between HCV/HIV coinfected hemophiliacs and HCV monoinfected patients in Japan.

Authors:  Masato Ogishi; Hiroshi Yotsuyanagi; Takeya Tsutsumi; Hiroyuki Gatanaga; Hirotaka Ode; Wataru Sugiura; Kyoji Moriya; Shinichi Oka; Satoshi Kimura; Kazuhiko Koike
Journal:  PLoS One       Date:  2015-03-06       Impact factor: 3.240

8.  ViVaMBC: estimating viral sequence variation in complex populations from illumina deep-sequencing data using model-based clustering.

Authors:  Bie Verbist; Lieven Clement; Joke Reumers; Kim Thys; Alexander Vapirev; Willem Talloen; Yves Wetzels; Joris Meys; Jeroen Aerssens; Luc Bijnens; Olivier Thas
Journal:  BMC Bioinformatics       Date:  2015-02-22       Impact factor: 3.169

9.  A penalized regression approach to haplotype reconstruction of viral populations arising in early HIV/SIV infection.

Authors:  Sivan Leviyang; Igor Griva; Sergio Ita; Welkin E Johnson
Journal:  Bioinformatics       Date:  2017-08-15       Impact factor: 6.937

10.  Intra-Population Competition during Adaptation to Increased Temperature in an RNA Bacteriophage.

Authors:  María Arribas; Ester Lázaro
Journal:  Int J Mol Sci       Date:  2021-06-24       Impact factor: 5.923

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.