Literature DB >> 26399504

Konnector v2.0: pseudo-long reads from paired-end sequencing data.

Benjamin P Vandervalk, Chen Yang, Zhuyi Xue, Karthika Raghavan, Justin Chu, Hamid Mohamadi, Shaun D Jackman, Readman Chiu, René L Warren, Inanç Birol.   

Abstract

BACKGROUND: Reading the nucleotides from two ends of a DNA fragment is called paired-end tag (PET) sequencing. When the fragment length is longer than the combined read length, there remains a gap of unsequenced nucleotides between read pairs. If the target in such experiments is sequenced at a level to provide redundant coverage, it may be possible to bridge these gaps using bioinformatics methods. Konnector is a local de novo assembly tool that addresses this problem. Here we report on version 2.0 of our tool.
RESULTS: Konnector uses a probabilistic and memory-efficient data structure called Bloom filter to represent a k-mer spectrum - all possible sequences of length k in an input file, such as the collection of reads in a PET sequencing experiment. It performs look-ups to this data structure to construct an implicit de Bruijn graph, which describes (k-1) base pair overlaps between adjacent k-mers. It traverses this graph to bridge the gap between a given pair of flanking sequences.
CONCLUSIONS: Here we report the performance of Konnector v2.0 on simulated and experimental datasets, and compare it against other tools with similar functionality. We note that, representing k-mers with 1.5 bytes of memory on average, Konnector can scale to very large genomes. With our parallel implementation, it can also process over a billion bases on commodity hardware.

Entities:  

Mesh:

Substances:

Year:  2015        PMID: 26399504      PMCID: PMC4582294          DOI: 10.1186/1755-8794-8-S3-S1

Source DB:  PubMed          Journal:  BMC Med Genomics        ISSN: 1755-8794            Impact factor:   3.063


  25 in total

1.  pIRS: Profile-based Illumina pair-end reads simulator.

Authors:  Xuesong Hu; Jianying Yuan; Yujian Shi; Jianliang Lu; Binghang Liu; Zhenyu Li; Yanxiang Chen; Desheng Mu; Hao Zhang; Nan Li; Zhen Yue; Fan Bai; Heng Li; Wei Fan
Journal:  Bioinformatics       Date:  2012-04-15       Impact factor: 6.937

2.  Short read fragment assembly of bacterial genomes.

Authors:  Mark J Chaisson; Pavel A Pevzner
Journal:  Genome Res       Date:  2007-12-14       Impact factor: 9.043

3.  The MaSuRCA genome assembler.

Authors:  Aleksey V Zimin; Guillaume Marçais; Daniela Puiu; Michael Roberts; Steven L Salzberg; James A Yorke
Journal:  Bioinformatics       Date:  2013-08-29       Impact factor: 6.937

4.  VarScan: variant detection in massively parallel sequencing of individual and pooled samples.

Authors:  Daniel C Koboldt; Ken Chen; Todd Wylie; David E Larson; Michael D McLellan; Elaine R Mardis; George M Weinstock; Richard K Wilson; Li Ding
Journal:  Bioinformatics       Date:  2009-06-19       Impact factor: 6.937

5.  ELOPER: elongation of paired-end reads as a pre-processing tool for improved de novo genome assembly.

Authors:  David H Silver; Shay Ben-Elazar; Alexei Bogoslavsky; Itai Yanai
Journal:  Bioinformatics       Date:  2013-04-19       Impact factor: 6.937

6.  RSVSim: an R/Bioconductor package for the simulation of structural variations.

Authors:  Christoph Bartenhagen; Martin Dugas
Journal:  Bioinformatics       Date:  2013-04-25       Impact factor: 6.937

7.  Efficient de novo assembly of large genomes using compressed data structures.

Authors:  Jared T Simpson; Richard Durbin
Journal:  Genome Res       Date:  2011-12-07       Impact factor: 9.043

8.  GapFiller: a de novo assembly approach to fill the gap within paired reads.

Authors:  Francesca Nadalin; Francesco Vezzi; Alberto Policriti
Journal:  BMC Bioinformatics       Date:  2012-09-07       Impact factor: 3.169

9.  SOAPdenovo2: an empirically improved memory-efficient short-read de novo assembler.

Authors:  Ruibang Luo; Binghang Liu; Yinlong Xie; Zhenyu Li; Weihua Huang; Jianying Yuan; Guangzhu He; Yanxiang Chen; Qi Pan; Yunjie Liu; Jingbo Tang; Gengxiong Wu; Hao Zhang; Yujian Shi; Yong Liu; Chang Yu; Bo Wang; Yao Lu; Changlei Han; David W Cheung; Siu-Ming Yiu; Shaoliang Peng; Zhu Xiaoqian; Guangming Liu; Xiangke Liao; Yingrui Li; Huanming Yang; Jian Wang; Tak-Wah Lam; Jun Wang
Journal:  Gigascience       Date:  2012-12-27       Impact factor: 6.524

10.  Aggressive assembly of pyrosequencing reads with mates.

Authors:  Jason R Miller; Arthur L Delcher; Sergey Koren; Eli Venter; Brian P Walenz; Anushka Brownley; Justin Johnson; Kelvin Li; Clark Mobarry; Granger Sutton
Journal:  Bioinformatics       Date:  2008-10-24       Impact factor: 6.937

View more
  11 in total

1.  Generation and application of pseudo-long reads for metagenome assembly.

Authors:  Mikang Sim; Jongin Lee; Suyeon Wy; Nayoung Park; Daehwan Lee; Daehong Kwon; Jaebum Kim
Journal:  Gigascience       Date:  2022-05-17       Impact factor: 7.658

2.  RResolver: efficient short-read repeat resolution within ABySS.

Authors:  Vladimir Nikolić; Amirhossein Afshinfard; Justin Chu; Johnathan Wong; Lauren Coombe; Ka Ming Nip; René L Warren; Inanç Birol
Journal:  BMC Bioinformatics       Date:  2022-06-21       Impact factor: 3.307

3.  riboSeed: leveraging prokaryotic genomic architecture to assemble across ribosomal regions.

Authors:  Nicholas R Waters; Florence Abram; Fiona Brennan; Ashleigh Holmes; Leighton Pritchard
Journal:  Nucleic Acids Res       Date:  2018-06-20       Impact factor: 16.971

4.  Assembly of the Complete Sitka Spruce Chloroplast Genome Using 10X Genomics' GemCode Sequencing Data.

Authors:  Lauren Coombe; René L Warren; Shaun D Jackman; Chen Yang; Benjamin P Vandervalk; Richard A Moore; Stephen Pleasance; Robin J Coope; Joerg Bohlmann; Robert A Holt; Steven J M Jones; Inanc Birol
Journal:  PLoS One       Date:  2016-09-15       Impact factor: 3.240

5.  The North American bullfrog draft genome provides insight into hormonal regulation of long noncoding RNA.

Authors:  S Austin Hammond; René L Warren; Benjamin P Vandervalk; Erdi Kucuk; Hamza Khan; Ewan A Gibb; Pawan Pandoh; Heather Kirk; Yongjun Zhao; Martin Jones; Andrew J Mungall; Robin Coope; Stephen Pleasance; Richard A Moore; Robert A Holt; Jessica M Round; Sara Ohora; Branden V Walle; Nik Veldhoen; Caren C Helbing; Inanc Birol
Journal:  Nat Commun       Date:  2017-11-10       Impact factor: 14.919

6.  ABySS 2.0: resource-efficient assembly of large genomes using a Bloom filter.

Authors:  Shaun D Jackman; Benjamin P Vandervalk; Hamid Mohamadi; Justin Chu; Sarah Yeo; S Austin Hammond; Golnaz Jahesh; Hamza Khan; Lauren Coombe; Rene L Warren; Inanc Birol
Journal:  Genome Res       Date:  2017-02-23       Impact factor: 9.043

7.  The Genome of the Beluga Whale (Delphinapterus leucas).

Authors:  Steven J M Jones; Gregory A Taylor; Simon Chan; René L Warren; S Austin Hammond; Steven Bilobram; Gideon Mordecai; Curtis A Suttle; Kristina M Miller; Angela Schulze; Amy M Chan; Samantha J Jones; Kane Tse; Irene Li; Dorothy Cheung; Karen L Mungall; Caleb Choo; Adrian Ally; Noreen Dhalla; Angela K Y Tam; Armelle Troussard; Heather Kirk; Pawan Pandoh; Daniel Paulino; Robin J N Coope; Andrew J Mungall; Richard Moore; Yongjun Zhao; Inanc Birol; Yussanne Ma; Marco Marra; Martin Haulena
Journal:  Genes (Basel)       Date:  2017-12-11       Impact factor: 4.096

8.  The Genome of the Northern Sea Otter (Enhydra lutris kenyoni).

Authors:  Samantha J Jones; Martin Haulena; Gregory A Taylor; Simon Chan; Steven Bilobram; René L Warren; S Austin Hammond; Karen L Mungall; Caleb Choo; Heather Kirk; Pawan Pandoh; Adrian Ally; Noreen Dhalla; Angela K Y Tam; Armelle Troussard; Daniel Paulino; Robin J N Coope; Andrew J Mungall; Richard Moore; Yongjun Zhao; Inanc Birol; Yussanne Ma; Marco Marra; Steven J M Jones
Journal:  Genes (Basel)       Date:  2017-12-11       Impact factor: 4.096

9.  ChopStitch: exon annotation and splice graph construction using transcriptome assembly and whole genome sequencing data.

Authors:  Hamza Khan; Hamid Mohamadi; Benjamin P Vandervalk; Rene L Warren; Justin Chu; Inanc Birol
Journal:  Bioinformatics       Date:  2018-05-15       Impact factor: 6.937

10.  Organellar Genomes of White Spruce (Picea glauca): Assembly and Annotation.

Authors:  Shaun D Jackman; René L Warren; Ewan A Gibb; Benjamin P Vandervalk; Hamid Mohamadi; Justin Chu; Anthony Raymond; Stephen Pleasance; Robin Coope; Mark R Wildung; Carol E Ritland; Jean Bousquet; Steven J M Jones; Joerg Bohlmann; Inanç Birol
Journal:  Genome Biol Evol       Date:  2015-12-08       Impact factor: 3.416

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.