proovread: large-scale high-accuracy PacBio correction through iterative short read consensus.

Thomas Hackl¹, Rainer Hedrich², Jörg Schultz², Frank Förster².

Abstract

MOTIVATION: Today, the base sequence of DNA is mostly determined through sequencing by synthesis, as provided by Illumina sequencers. Although highly accurate, the resulting reads are short, making their analysis challenging. Recently, a new technology, single-molecule real-time (SMRT) sequencing, was developed that could address these challenges, as it generates reads of several thousand bases. However, its broad application has been hampered by a high error rate. Therefore, hybrid approaches that use high-quality short reads to correct erroneous SMRT long reads have been developed. Still, current implementations place great demands on hardware, work only in well-defined computing infrastructures and reject a substantial number of reads. This limits their usability considerably, especially in large sequencing projects.
RESULTS: Here we present proovread, a hybrid correction pipeline for SMRT reads, which can be flexibly adapted to existing hardware and infrastructure, from a laptop to a high-performance computing cluster. On genomic and transcriptomic test cases covering Escherichia coli, Arabidopsis thaliana and human, proovread achieved accuracies of up to 99.9% and outperformed the existing hybrid correction programs. Furthermore, proovread-corrected sequences were longer and the throughput was higher. Thus, proovread combines the most accurate correction results with excellent adaptability to the available hardware. It will therefore increase the applicability and value of SMRT sequencing.
AVAILABILITY AND IMPLEMENTATION: proovread is available at the following URL: http://proovread.bioapps.biozentrum.uni-wuerzburg.de.

Year: 2014 PMID: 25015988 PMCID: PMC4609002 DOI: 10.1093/bioinformatics/btu392

Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
