PBSIM: PacBio reads simulator--toward accurate genome assembly.

Yukiteru Ono, Kiyoshi Asai, Michiaki Hamada.

Abstract

MOTIVATION: PacBio sequencers produce two characteristic types of reads: continuous long reads (long, with a high error rate) and circular consensus sequencing reads (short, with a low error rate). Both could be useful for de novo assembly of genomes. Currently, no available simulator specifically targets the generation of PacBio libraries.
RESULTS: Our analysis of 13 PacBio datasets showed characteristic features of PacBio reads (e.g. the read length of PacBio reads follows a log-normal distribution). We have developed a read simulator, PBSIM, that captures these features using either a model-based or sampling-based method. Using PBSIM, we conducted several hybrid error correction and assembly tests for PacBio reads, suggesting that a continuous long reads coverage depth of at least 15 in combination with a circular consensus sequencing coverage depth of at least 30 achieved extensive assembly results.
AVAILABILITY: PBSIM is freely available from the web under the GNU GPL v2 license (http://code.google.com/p/pbsim/).
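
The log-normal read-length observation in the abstract can be illustrated with a minimal sketch. This is not PBSIM's code or interface: the function names, the length bounds, and the mean and standard deviation used below are hypothetical values chosen for illustration. It draws read lengths from a log-normal distribution until a requested coverage depth is reached.

```python
import math
import random


def lognormal_params(mean, sd):
    """Convert a desired mean and standard deviation of read length into the
    mu and sigma of the underlying normal distribution."""
    sigma2 = math.log(1.0 + (sd / mean) ** 2)
    mu = math.log(mean) - sigma2 / 2.0
    return mu, math.sqrt(sigma2)


def simulate_read_lengths(genome_length, depth, mean, sd,
                          min_len=100, max_len=25000, seed=0):
    """Draw log-normal read lengths until the summed length reaches
    genome_length * depth. All parameter values are illustrative."""
    rng = random.Random(seed)
    mu, sigma = lognormal_params(mean, sd)
    target_bases = genome_length * depth
    lengths = []
    total = 0
    while total < target_bases:
        length = int(round(rng.lognormvariate(mu, sigma)))
        # Reject draws outside the allowed length range and redraw.
        if length < min_len or length > max_len:
            continue
        # A read cannot be longer than the genome it is drawn from.
        length = min(length, genome_length)
        lengths.append(length)
        total += length
    return lengths


if __name__ == "__main__":
    reads = simulate_read_lengths(genome_length=100_000, depth=15,
                                  mean=3000, sd=2300)
    print(len(reads), "reads, achieved depth:", sum(reads) / 100_000)
```

Rejecting out-of-range draws truncates the distribution, so the realized mean length will differ somewhat from the requested mean; a sampling-based approach would instead reuse lengths observed in real data.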

Year:  2012        PMID: 23129296     DOI: 10.1093/bioinformatics/bts649

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  97 in total

1.  Long Single-Molecule Reads Can Resolve the Complexity of the Influenza Virus Composed of Rare, Closely Related Mutant Variants.

Authors:  Alexander Artyomenko; Nicholas C Wu; Serghei Mangul; Eleazar Eskin; Ren Sun; Alex Zelikovsky
Journal:  J Comput Biol       Date:  2016-11-30       Impact factor: 1.479

2.  Minimap2: pairwise alignment for nucleotide sequences.

Authors:  Heng Li
Journal:  Bioinformatics       Date:  2018-09-15       Impact factor: 6.937

3.  lordFAST: sensitive and Fast Alignment Search Tool for LOng noisy Read sequencing Data.

Authors:  Ehsan Haghshenas; S Cenk Sahinalp; Faraz Hach
Journal:  Bioinformatics       Date:  2019-01-01       Impact factor: 6.937

4.  The design and construction of reference pangenome graphs with minigraph.

Authors:  Heng Li; Xiaowen Feng; Chong Chu
Journal:  Genome Biol       Date:  2020-10-16       Impact factor: 13.583

5.  proovread: large-scale high-accuracy PacBio correction through iterative short read consensus.

Authors:  Thomas Hackl; Rainer Hedrich; Jörg Schultz; Frank Förster
Journal:  Bioinformatics       Date:  2014-07-10       Impact factor: 6.937

6.  Sim3C: simulation of Hi-C and Meta3C proximity ligation sequencing technologies.

Authors:  Matthew Z DeMaere; Aaron E Darling
Journal:  Gigascience       Date:  2018-02-01       Impact factor: 6.524

7.  LSCplus: a fast solution for improving long read accuracy by short read alignment.

Authors:  Ruifeng Hu; Guibo Sun; Xiaobo Sun
Journal:  BMC Bioinformatics       Date:  2016-11-09       Impact factor: 3.169

8.  Weighted minimizer sampling improves long read mapping.

Authors:  Chirag Jain; Arang Rhie; Haowen Zhang; Claudia Chu; Brian P Walenz; Sergey Koren; Adam M Phillippy
Journal:  Bioinformatics       Date:  2020-07-01       Impact factor: 6.937

9.  Fundamental Bounds for Sequence Reconstruction from Nanopore Sequencers.

Authors:  Abram Magner; Jarosław Duda; Wojciech Szpankowski; Ananth Grama
Journal:  IEEE Trans Mol Biol Multiscale Commun       Date:  2016-06

10.  High-resolution characterization of the structural features and genetic variation of six feline leukocyte antigen class I loci via single molecule, real-time (SMRT) sequencing.

Authors:  Jennifer C Holmes; Elizabeth H Scholl; Allison N Dickey; Paul R Hess
Journal:  Immunogenetics       Date:  2021-06-27       Impact factor: 2.846
