Literature DB >> 31603794

EPGA-SC : A Framework for de novo Assembly of Single-Cell Sequencing Reads.

Xingyu Liao, Min Li, Junwei Luo, You Zou, Fang-Xiang Wu, Feng Luo, Jianxin Wang.   

Abstract

Assembling genomes from single-cell sequencing data is essential for single-cell studies. However, single-cell assemblies are challenging due to (i) the highly non-uniform read coverage and (ii) the elevated levels of sequencing errors and chimeric reads. Although several assemblers for single-cell data have been proposed in recent years, most of them fail to construct correct long contigs. In this study, we present a new framework called EPGA-SC for de novo assembly of single-cell sequencing reads. The EPGA assembler has designed strategies to solve the problems caused by sequencing errors, sequencing biases, and repetitive regions. However, the extremely unbalanced and richer error types prevent EPGA to achieve high performance in single-cell sequencing data. In this study, we designed EPGA-SC based on EPGA. The main innovations of EPGA-SC are as follows: (i) classifying reads to reduce the proportion of false reads; (ii) using multiple sets of high precision paired-end reads generated from the high precision assemblies produced by other assembler such as SPAdes to overcome the impact of sequencing biases and repetitive regions; and (iii) developing novel algorithms for removing chimeric errors and extending contigs. We test EPGA-SC with seven datasets. The experimental results show that EPGA-SC can generate better assemblies than most current tools in most time in term of MAX contig, N50, NG50, NA50, and NGA50.

Entities:  

Mesh:

Year:  2021        PMID: 31603794     DOI: 10.1109/TCBB.2019.2945761

Source DB:  PubMed          Journal:  IEEE/ACM Trans Comput Biol Bioinform        ISSN: 1545-5963            Impact factor:   3.710


  2 in total

1.  RepAHR: an improved approach for de novo repeat identification by assembly of the high-frequency reads.

Authors:  Xingyu Liao; Xin Gao; Xiankai Zhang; Fang-Xiang Wu; Jianxin Wang
Journal:  BMC Bioinformatics       Date:  2020-10-19       Impact factor: 3.169

2.  MAC: Merging Assemblies by Using Adjacency Algebraic Model and Classification.

Authors:  Li Tang; Min Li; Fang-Xiang Wu; Yi Pan; Jianxin Wang
Journal:  Front Genet       Date:  2020-01-31       Impact factor: 4.599

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.