Literature DB >> 28545961

PECC: Correcting contigs based on paired-end read distribution.

Min Li1, Binbin Wu2, Xiaodong Yan2, Junwei Luo2, Yi Pan3, Fang-Xiang Wu4, Jianxin Wang2.   

Abstract

MOTIVATION: Cheap and fast next generation sequencing (NGS) technologies facilitate research of de novo assembly greatly. The reliability of contigs is critical to construct reliable scaffolding. However, contigs generated from most assemblers contain errors because of the limitation of assembly strategy and computation complexity. Among all these errors, the misassembly error is one of the most harmful types.
RESULTS: In this paper, we propose a new method named "PECC" to identify and correct misassembly errors in contigs based on the paired-end read distribution. PECC extracts sequence regions with lower paired-end reads supports and verifies them based on the distribution of paired-end supports. To validate the effectiveness of PECC, we applied PECC to the contigs produced by five popular assemblers on four real datasets, and we also carried out experiments to analyze the influences of PECC on scaffolding. The results show that PECC can reduce misassembly errors and improve the performance of scaffolding results, which demonstrate the promising applications of PECC in de novo assembly.
Copyright © 2017 Elsevier Ltd. All rights reserved.

Keywords:  Contigs; De novo assembly; Next generation sequencing; Paired-end reads

Year:  2017        PMID: 28545961     DOI: 10.1016/j.compbiolchem.2017.03.012

Source DB:  PubMed          Journal:  Comput Biol Chem        ISSN: 1476-9271            Impact factor:   2.877


  3 in total

Review 1.  Genome sequence assembly algorithms and misassembly identification methods.

Authors:  Yue Meng; Yu Lei; Jianlong Gao; Yuxuan Liu; Enze Ma; Yunhong Ding; Yixin Bian; Hongquan Zu; Yucui Dong; Xiao Zhu
Journal:  Mol Biol Rep       Date:  2022-09-23       Impact factor: 2.742

2.  VAliBS: a visual aligner for bisulfite sequences.

Authors:  Min Li; Ping Huang; Xiaodong Yan; Jianxin Wang; Yi Pan; Fang-Xiang Wu
Journal:  BMC Bioinformatics       Date:  2017-10-16       Impact factor: 3.169

3.  A Sequence-Based Novel Approach for Quality Evaluation of Third-Generation Sequencing Reads.

Authors:  Wenjing Zhang; Neng Huang; Jiantao Zheng; Xingyu Liao; Jianxin Wang; Hong-Dong Li
Journal:  Genes (Basel)       Date:  2019-01-14       Impact factor: 4.096

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.