Literature DB >> 27587673

CoLoRMap: Correcting Long Reads by Mapping short reads.

Ehsan Haghshenas1, Faraz Hach2, S Cenk Sahinalp3, Cedric Chauve4.   

Abstract

MOTIVATION: Second generation sequencing technologies paved the way to an exceptional increase in the number of sequenced genomes, both prokaryotic and eukaryotic. However, short reads are difficult to assemble and often lead to highly fragmented assemblies. The recent developments in long reads sequencing methods offer a promising way to address this issue. However, so far long reads are characterized by a high error rate, and assembling from long reads require a high depth of coverage. This motivates the development of hybrid approaches that leverage the high quality of short reads to correct errors in long reads.
RESULTS: We introduce CoLoRMap, a hybrid method for correcting noisy long reads, such as the ones produced by PacBio sequencing technology, using high-quality Illumina paired-end reads mapped onto the long reads. Our algorithm is based on two novel ideas: using a classical shortest path algorithm to find a sequence of overlapping short reads that minimizes the edit score to a long read and extending corrected regions by local assembly of unmapped mates of mapped short reads. Our results on bacterial, fungal and insect data sets show that CoLoRMap compares well with existing hybrid correction methods.
AVAILABILITY AND IMPLEMENTATION: The source code of CoLoRMap is freely available for non-commercial use at https://github.com/sfu-compbio/colormap CONTACT: ehaghshe@sfu.ca or cedric.chauve@sfu.ca SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

Mesh:

Year:  2016        PMID: 27587673     DOI: 10.1093/bioinformatics/btw463

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  16 in total

1.  Hercules: a profile HMM-based hybrid error correction algorithm for long reads.

Authors:  Can Firtina; Ziv Bar-Joseph; Can Alkan; A Ercument Cicek
Journal:  Nucleic Acids Res       Date:  2018-11-30       Impact factor: 16.971

Review 2.  Nanopore sequencing technology, bioinformatics and applications.

Authors:  Yunhao Wang; Yue Zhao; Audrey Bollas; Yuru Wang; Kin Fai Au
Journal:  Nat Biotechnol       Date:  2021-11-08       Impact factor: 54.908

Review 3.  Genome sequence assembly algorithms and misassembly identification methods.

Authors:  Yue Meng; Yu Lei; Jianlong Gao; Yuxuan Liu; Enze Ma; Yunhong Ding; Yixin Bian; Hongquan Zu; Yucui Dong; Xiao Zhu
Journal:  Mol Biol Rep       Date:  2022-09-23       Impact factor: 2.742

4.  Integrated RNA and miRNA sequencing analysis reveals a complex regulatory network of Magnolia sieboldii seed germination.

Authors:  Mei Mei; Jun Wei; Wanfeng Ai; Lijie Zhang; Xiu-Jun Lu
Journal:  Sci Rep       Date:  2021-05-25       Impact factor: 4.379

5.  FMLRC: Hybrid long read error correction using an FM-index.

Authors:  Jeremy R Wang; James Holt; Leonard McMillan; Corbin D Jones
Journal:  BMC Bioinformatics       Date:  2018-02-09       Impact factor: 3.169

6.  HALC: High throughput algorithm for long read error correction.

Authors:  Ergude Bao; Lingxiao Lan
Journal:  BMC Bioinformatics       Date:  2017-04-05       Impact factor: 3.169

7.  A comparative evaluation of hybrid error correction methods for error-prone long reads.

Authors:  Shuhua Fu; Anqi Wang; Kin Fai Au
Journal:  Genome Biol       Date:  2019-02-04       Impact factor: 13.583

8.  A study of the extraordinarily strong and tough silk produced by bagworms.

Authors:  Taiyo Yoshioka; Takuya Tsubota; Kohji Tashiro; Akiya Jouraku; Tsunenori Kameda
Journal:  Nat Commun       Date:  2019-04-01       Impact factor: 14.919

9.  Context-aware seeds for read mapping.

Authors:  Hongyi Xin; Mingfu Shao; Carl Kingsford
Journal:  Algorithms Mol Biol       Date:  2020-05-23       Impact factor: 1.405

10.  HECIL: A Hybrid Error Correction Algorithm for Long Reads with Iterative Learning.

Authors:  Olivia Choudhury; Ankush Chakrabarty; Scott J Emrich
Journal:  Sci Rep       Date:  2018-07-02       Impact factor: 4.379

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.