proovread: large-scale high-accuracy PacBio correction through iterative short read consensus.

Thomas Hackl, Rainer Hedrich, Jörg Schultz, Frank Förster.

Abstract

MOTIVATION: Today, DNA sequences are mostly determined through sequencing by synthesis, as provided by Illumina sequencers. Although highly accurate, the resulting reads are short, which makes their analysis challenging. Recently, a new technology, single molecule real-time (SMRT) sequencing, was developed that could address these challenges, as it generates reads of several thousand bases. However, its broad application has been hampered by a high error rate. Therefore, hybrid approaches that use high-quality short reads to correct erroneous SMRT long reads have been developed. Still, current implementations place great demands on hardware, work only in well-defined computing infrastructures and reject a substantial number of reads. This limits their usability considerably, especially in large sequencing projects.
RESULTS: Here we present proovread, a hybrid correction pipeline for SMRT reads that can be flexibly adapted to existing hardware and infrastructure, from a laptop to a high-performance computing cluster. On genomic and transcriptomic test cases covering Escherichia coli, Arabidopsis thaliana and human, proovread achieved accuracies of up to 99.9% and outperformed the existing hybrid correction programs. Furthermore, proovread-corrected sequences were longer and the throughput was higher. Thus, proovread combines the most accurate correction results with excellent adaptability to the available hardware. It will therefore increase the applicability and value of SMRT sequencing.
AVAILABILITY AND IMPLEMENTATION: proovread is available at the following URL: http://proovread.bioapps.biozentrum.uni-wuerzburg.de.
© The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
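
The idea of correcting a long read through a short read consensus can be illustrated with a minimal Python sketch. This is not proovread's implementation: it assumes the short reads have already been placed on the long read at known offsets, it handles substitutions only (no insertions or deletions), and the names `consensus_correct`, `placements` and `min_cov` are hypothetical. In an iterative scheme, the corrected read would be fed back into a new round of mapping and consensus calling.

```python
from collections import Counter


def consensus_correct(long_read, placements, min_cov=2):
    """Return long_read with each covered base replaced by the majority base.

    placements: list of (offset, short_read) pairs, assumed to be gapless
    alignments of short reads onto long_read (an illustrative simplification).
    min_cov: minimum number of short-read bases required to call a consensus;
    positions with lower coverage keep the original long-read base.
    """
    # One vote counter per long-read position.
    columns = [Counter() for _ in long_read]
    for offset, short in placements:
        for i, base in enumerate(short):
            pos = offset + i
            if 0 <= pos < len(long_read):
                columns[pos][base] += 1

    corrected = []
    for original, votes in zip(long_read, columns):
        if sum(votes.values()) >= min_cov:
            # Majority vote among the short-read bases at this position.
            corrected.append(votes.most_common(1)[0][0])
        else:
            corrected.append(original)
    return "".join(corrected)


# Toy example: the long read carries two substitution errors.
noisy_long_read = "ACGAACGTTC"
short_read_placements = [
    (0, "ACGTAC"),
    (1, "CGTACG"),
    (2, "GTACGT"),
    (4, "ACGTAC"),
    (5, "CGTAC"),
]
corrected_read = consensus_correct(noisy_long_read, short_read_placements)
```

Positions covered by fewer than `min_cov` short reads are left unchanged, so sparsely covered regions are not altered on weak evidence.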

Year:  2014        PMID: 25015988      PMCID: PMC4609002          DOI: 10.1093/bioinformatics/btu392

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  190 in total

1.  Adaptation by Loss of Heterozygosity in Saccharomyces cerevisiae Clones Under Divergent Selection.

Authors:  Timothy Y James; Lucas A Michelotti; Alexander D Glasco; Rebecca A Clemons; Robert A Powers; Ellen S James; D Rabern Simmons; Fengyan Bai; Shuhua Ge
Journal:  Genetics       Date:  2019-08-01       Impact factor: 4.562

2.  Host genome integration and giant virus-induced reactivation of the virophage mavirus.

Authors:  Matthias G Fischer; Thomas Hackl
Journal:  Nature       Date:  2016-12-07       Impact factor: 49.962

3.  Single molecule RNA sequencing uncovers trans-splicing and improves annotations in Anopheles stephensi.

Authors:  X Jiang; A B Hall; J K Biedler; Z Tu
Journal:  Insect Mol Biol       Date:  2017-02-09       Impact factor: 3.585

4.  LSCplus: a fast solution for improving long read accuracy by short read alignment.

Authors:  Ruifeng Hu; Guibo Sun; Xiaobo Sun
Journal:  BMC Bioinformatics       Date:  2016-11-09       Impact factor: 3.169

5.  Genome-wide study of saprotrophy-related genes in the basal fungus Conidiobolus heterosporus.

Authors:  Yulong Wang; Yong Nie; Deshui Yu; Xiangyun Xie; Li Qin; Yang Yang; Bo Huang
Journal:  Appl Microbiol Biotechnol       Date:  2020-05-22       Impact factor: 4.813

6.  Nanopore direct RNA sequencing maps the complexity of Arabidopsis mRNA processing and m6A modification.

Authors:  Matthew T Parker; Katarzyna Knop; Anna V Sherwood; Nicholas J Schurch; Katarzyna Mackinnon; Peter D Gould; Anthony Jw Hall; Geoffrey J Barton; Gordon G Simpson
Journal:  Elife       Date:  2020-01-14       Impact factor: 8.140

7.  Human Migration and the Spread of the Nematode Parasite Wuchereria bancrofti.

Authors:  Scott T Small; Frédéric Labbé; Yaya I Coulibaly; Thomas B Nutman; Christopher L King; David Serre; Peter A Zimmerman
Journal:  Mol Biol Evol       Date:  2019-09-01       Impact factor: 16.240

8.  Hercules: a profile HMM-based hybrid error correction algorithm for long reads.

Authors:  Can Firtina; Ziv Bar-Joseph; Can Alkan; A Ercument Cicek
Journal:  Nucleic Acids Res       Date:  2018-11-30       Impact factor: 16.971

9.  Genome and Plasmid Analysis of blaIMP-4-Carrying Citrobacter freundii B38.

Authors:  Jianhui Xiong; Maxime Déraspe; Naeem Iqbal; Jennifer Ma; Frances B Jamieson; Jessica Wasserscheid; Ken Dewar; Peter M Hawkey; Paul H Roy
Journal:  Antimicrob Agents Chemother       Date:  2016-10-21       Impact factor: 5.191

10.  The Amaryllidaceae alkaloids: biosynthesis and methods for enzyme discovery.

Authors:  Matthew B Kilgore; Toni M Kutchan
Journal:  Phytochem Rev       Date:  2015-12-17       Impact factor: 5.374
