Literature DB >> 23044551

COPE: an accurate k-mer-based pair-end reads connection tool to facilitate genome assembly.

Binghang Liu1, Jianying Yuan, Siu-Ming Yiu, Zhenyu Li, Yinlong Xie, Yanxiang Chen, Yujian Shi, Hao Zhang, Yingrui Li, Tak-Wah Lam, Ruibang Luo.   

Abstract

MOTIVATION: The boost of next-generation sequencing technologies provides us with an unprecedented opportunity for elucidating genetic mysteries, yet the short-read length hinders us from better assembling the genome from scratch. New protocols now exist that can generate overlapping pair-end reads. By joining the 3' ends of each read pair, one is able to construct longer reads for assembling. However, effectively joining two overlapped pair-end reads remains a challenging task. RESULT: In this article, we present an efficient tool called Connecting Overlapped Pair-End (COPE) reads, to connect overlapping pair-end reads using k-mer frequencies. We evaluated our tool on 30× simulated pair-end reads from Arabidopsis thaliana with 1% base error. COPE connected over 99% of reads with 98.8% accuracy, which is, respectively, 10 and 2% higher than the recently published tool FLASH. When COPE is applied to real reads for genome assembly, the resulting contigs are found to have fewer errors and give a 14-fold improvement in the N50 measurement when compared with the contigs produced using unconnected reads.
AVAILABILITY AND IMPLEMENTATION: COPE is implemented in C++ and is freely available as open-source code at ftp://ftp.genomics.org.cn/pub/cope. CONTACT: twlam@cs.hku.hk or luoruibang@genomics.org.cn

Entities:  

Mesh:

Year:  2012        PMID: 23044551     DOI: 10.1093/bioinformatics/bts563

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  69 in total

1.  IMonitor: A Robust Pipeline for TCR and BCR Repertoire Analysis.

Authors:  Wei Zhang; Yuanping Du; Zheng Su; Changxi Wang; Xiaojing Zeng; Ruifang Zhang; Xueyu Hong; Chao Nie; Jinghua Wu; Hongzhi Cao; Xun Xu; Xiao Liu
Journal:  Genetics       Date:  2015-08-21       Impact factor: 4.562

2.  Genomic analysis of MHC-based mate choice in the monogamous California mouse.

Authors:  Jesyka Meléndez-Rosa; Ke Bi; Eileen A Lacey
Journal:  Behav Ecol       Date:  2018-07-12       Impact factor: 2.671

3.  The genome sequences of Arachis duranensis and Arachis ipaensis, the diploid ancestors of cultivated peanut.

Authors:  David John Bertioli; Steven B Cannon; Lutz Froenicke; Guodong Huang; Andrew D Farmer; Ethalinda K S Cannon; Xin Liu; Dongying Gao; Josh Clevenger; Sudhansu Dash; Longhui Ren; Márcio C Moretzsohn; Kenta Shirasawa; Wei Huang; Bruna Vidigal; Brian Abernathy; Ye Chu; Chad E Niederhuth; Pooja Umale; Ana Cláudia G Araújo; Alexander Kozik; Kyung Do Kim; Mark D Burow; Rajeev K Varshney; Xingjun Wang; Xinyou Zhang; Noelle Barkley; Patrícia M Guimarães; Sachiko Isobe; Baozhu Guo; Boshou Liao; H Thomas Stalker; Robert J Schmitz; Brian E Scheffler; Soraya C M Leal-Bertioli; Xu Xun; Scott A Jackson; Richard Michelmore; Peggy Ozias-Akins
Journal:  Nat Genet       Date:  2016-02-22       Impact factor: 38.330

4.  Konnector v2.0: pseudo-long reads from paired-end sequencing data.

Authors:  Benjamin P Vandervalk; Chen Yang; Zhuyi Xue; Karthika Raghavan; Justin Chu; Hamid Mohamadi; Shaun D Jackman; Readman Chiu; René L Warren; Inanç Birol
Journal:  BMC Med Genomics       Date:  2015-09-23       Impact factor: 3.063

5.  Fecal bacterial community of finishing beef steers fed ruminally protected and non-protected active dried yeast.

Authors:  Tao Ran; Peixin Jiao; Ousama AlZahal; Xiaolai Xie; Karen A Beauchemin; Dongyan Niu; Wenzhu Yang
Journal:  J Anim Sci       Date:  2020-04-01       Impact factor: 3.159

6.  Identification of characteristic TRB V usage in HBV-associated HCC by using differential expression profiling analysis.

Authors:  Yingxin Han; Xing Liu; Yuqi Wang; Xiaolei Wu; Yanfang Guan; Hongmei Li; Xinchun Chen; Boping Zhou; Qing Yuan; Ying Ou; Renhua Wu; Wanqiu Huang; Yun Wang; Ming Zhang; Yinxin Zhang; Dongxing Zhu; Hongmei Zhu; Ling Yang; Xin Yi; Chen Huang; Jian Huang
Journal:  Oncoimmunology       Date:  2015-04-02       Impact factor: 8.110

7.  CMash: fast, multi-resolution estimation of k-mer-based Jaccard and containment indices.

Authors:  Shaopeng Liu; David Koslicki
Journal:  Bioinformatics       Date:  2022-06-24       Impact factor: 6.931

8.  Taxonomic Identification of Ruminal Epithelial Bacterial Diversity during Rumen Development in Goats.

Authors:  Jinzhen Jiao; Jinyu Huang; Chuanshe Zhou; Zhiliang Tan
Journal:  Appl Environ Microbiol       Date:  2015-03-13       Impact factor: 4.792

9.  leeHom: adaptor trimming and merging for Illumina sequencing reads.

Authors:  Gabriel Renaud; Udo Stenzel; Janet Kelso
Journal:  Nucleic Acids Res       Date:  2014-08-06       Impact factor: 16.971

10.  Using RNA-Seq to assemble a rose transcriptome with more than 13,000 full-length expressed genes and to develop the WagRhSNP 68k Axiom SNP array for rose (Rosa L.).

Authors:  Carole F S Koning-Boucoiran; G Danny Esselink; Mirjana Vukosavljev; Wendy P C van 't Westende; Virginia W Gitonga; Frans A Krens; Roeland E Voorrips; W Eric van de Weg; Dietmar Schulz; Thomas Debener; Chris Maliepaard; Paul Arens; Marinus J M Smulders
Journal:  Front Plant Sci       Date:  2015-04-21       Impact factor: 5.753

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.