Literature DB >> 26177965

Karect: accurate correction of substitution, insertion and deletion errors for next-generation sequencing data.

Amin Allam1, Panos Kalnis1, Victor Solovyev1.   

Abstract

MOTIVATION: Next-generation sequencing generates large amounts of data affected by errors in the form of substitutions, insertions or deletions of bases. Error correction based on the high-coverage information, typically improves de novo assembly. Most existing tools can correct substitution errors only; some support insertions and deletions, but accuracy in many cases is low.
RESULTS: We present Karect, a novel error correction technique based on multiple alignment. Our approach supports substitution, insertion and deletion errors. It can handle non-uniform coverage as well as moderately covered areas of the sequenced genome. Experiments with data from Illumina, 454 FLX and Ion Torrent sequencing machines demonstrate that Karect is more accurate than previous methods, both in terms of correcting individual-bases errors (up to 10% increase in accuracy gain) and post de novo assembly quality (up to 10% increase in NGA50). We also introduce an improved framework for evaluating the quality of error correction.
AVAILABILITY AND IMPLEMENTATION: Karect is available at: http://aminallam.github.io/karect. CONTACT: amin.allam@kaust.edu.sa SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

Mesh:

Year:  2015        PMID: 26177965     DOI: 10.1093/bioinformatics/btv415

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  22 in total

1.  Comparison of error correction algorithms for Ion Torrent PGM data: application to hepatitis B virus.

Authors:  Liting Song; Wenxun Huang; Juan Kang; Yuan Huang; Hong Ren; Keyue Ding
Journal:  Sci Rep       Date:  2017-08-14       Impact factor: 4.379

Review 2.  Genome sequence assembly algorithms and misassembly identification methods.

Authors:  Yue Meng; Yu Lei; Jianlong Gao; Yuxuan Liu; Enze Ma; Yunhong Ding; Yixin Bian; Hongquan Zu; Yucui Dong; Xiao Zhu
Journal:  Mol Biol Rep       Date:  2022-09-23       Impact factor: 2.742

3.  Aberration-corrected ultrafine analysis of miRNA reads at single-base resolution: a k-mer lattice approach.

Authors:  Xuan Zhang; Pengyao Ping; Gyorgy Hutvagner; Michael Blumenstein; Jinyan Li
Journal:  Nucleic Acids Res       Date:  2021-10-11       Impact factor: 16.971

4.  Myoglobin primary structure reveals multiple convergent transitions to semi-aquatic life in the world's smallest mammalian divers.

Authors:  Kai He; Triston G Eastman; Hannah Czolacz; Shuhao Li; Akio Shinohara; Shin-Ichiro Kawada; Mark S Springer; Michael Berenbrink; Kevin L Campbell
Journal:  Elife       Date:  2021-04-29       Impact factor: 8.140

5.  An efficient error correction algorithm using FM-index.

Authors:  Yao-Ting Huang; Yu-Wen Huang
Journal:  BMC Bioinformatics       Date:  2017-11-28       Impact factor: 3.169

6.  Evaluation of the impact of Illumina error correction tools on de novo genome assembly.

Authors:  Mahdi Heydari; Giles Miclotte; Piet Demeester; Yves Van de Peer; Jan Fostier
Journal:  BMC Bioinformatics       Date:  2017-08-18       Impact factor: 3.169

7.  Iterative error correction of long sequencing reads maximizes accuracy and improves contig assembly.

Authors:  Katrin Sameith; Juliana G Roscito; Michael Hiller
Journal:  Brief Bioinform       Date:  2016-02-10       Impact factor: 11.622

8.  Jabba: hybrid error correction for long sequencing reads.

Authors:  Giles Miclotte; Mahdi Heydari; Piet Demeester; Stephane Rombauts; Yves Van de Peer; Pieter Audenaert; Jan Fostier
Journal:  Algorithms Mol Biol       Date:  2016-05-03       Impact factor: 1.405

9.  Assembly of chloroplast genomes with long- and short-read data: a comparison of approaches using Eucalyptus pauciflora as a test case.

Authors:  Weiwen Wang; Miriam Schalamun; Alejandro Morales-Suarez; David Kainer; Benjamin Schwessinger; Robert Lanfear
Journal:  BMC Genomics       Date:  2018-12-29       Impact factor: 3.969

10.  Single-Nucleotide Polymorphism-Based Genetic Diversity Analysis of Clinical Pseudomonas aeruginosa Isolates.

Authors:  Uthayakumar Muthukumarasamy; Matthias Preusse; Adrian Kordes; Michal Koska; Monika Schniederjans; Ariane Khaledi; Susanne Häussler
Journal:  Genome Biol Evol       Date:  2020-04-01       Impact factor: 3.416

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.