Literature DB >> 25107872

Fast construction of FM-index for long sequence reads.

Heng Li1.   

Abstract

SUMMARY: We present a new method to incrementally construct the FM-index for both short and long sequence reads, up to the size of a genome. It is the first algorithm that can build the index while implicitly sorting the sequences in the reverse (complement) lexicographical order without a separate sorting step. The implementation is among the fastest for indexing short reads and the only one that practically works for reads of averaged kilobases in length.
AVAILABILITY AND IMPLEMENTATION: https://github.com/lh3/ropebwt2 CONTACT: hengli@broadinstitute.org.
© The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

Mesh:

Year:  2014        PMID: 25107872      PMCID: PMC4221129          DOI: 10.1093/bioinformatics/btu541

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  4 in total

1.  Large-scale compression of genomic sequence databases with the Burrows-Wheeler transform.

Authors:  Anthony J Cox; Markus J Bauer; Tobias Jakobi; Giovanna Rosone
Journal:  Bioinformatics       Date:  2012-05-03       Impact factor: 6.937

2.  Efficient de novo assembly of large genomes using compressed data structures.

Authors:  Jared T Simpson; Richard Durbin
Journal:  Genome Res       Date:  2011-12-07       Impact factor: 9.043

3.  A framework for variation discovery and genotyping using next-generation DNA sequencing data.

Authors:  Mark A DePristo; Eric Banks; Ryan Poplin; Kiran V Garimella; Jared R Maguire; Christopher Hartl; Anthony A Philippakis; Guillermo del Angel; Manuel A Rivas; Matt Hanna; Aaron McKenna; Tim J Fennell; Andrew M Kernytsky; Andrey Y Sivachenko; Kristian Cibulskis; Stacey B Gabriel; David Altshuler; Mark J Daly
Journal:  Nat Genet       Date:  2011-04-10       Impact factor: 38.330

4.  The diploid genome sequence of an individual human.

Authors:  Samuel Levy; Granger Sutton; Pauline C Ng; Lars Feuk; Aaron L Halpern; Brian P Walenz; Nelson Axelrod; Jiaqi Huang; Ewen F Kirkness; Gennady Denisov; Yuan Lin; Jeffrey R MacDonald; Andy Wing Chun Pang; Mary Shago; Timothy B Stockwell; Alexia Tsiamouri; Vineet Bafna; Vikas Bansal; Saul A Kravitz; Dana A Busam; Karen Y Beeson; Tina C McIntosh; Karin A Remington; Josep F Abril; John Gill; Jon Borman; Yu-Hui Rogers; Marvin E Frazier; Stephen W Scherer; Robert L Strausberg; J Craig Venter
Journal:  PLoS Biol       Date:  2007-09-04       Impact factor: 8.029

  4 in total
  15 in total

1.  FermiKit: assembly-based variant calling for Illumina resequencing data.

Authors:  Heng Li
Journal:  Bioinformatics       Date:  2015-07-27       Impact factor: 6.937

2.  Somatic genetic aberrations in gallbladder cancer: comparison between Chinese and US patients.

Authors:  Pingzhou Yang; Milind Javle; Fei Pang; Wei Zhao; Reham Abdel-Wahab; Xiaofeng Chen; Funda Meric-Bernstam; Huanwei Chen; Mitesh J Borad; Yu Liu; Chuntao Zou; Shuo Mu; Yutong Xing; Kai Wang; Chuang Peng; Xu Che
Journal:  Hepatobiliary Surg Nutr       Date:  2019-12       Impact factor: 7.293

3.  deBWT: parallel construction of Burrows-Wheeler Transform for large collection of genomes with de Bruijn-branch encoding.

Authors:  Bo Liu; Dixian Zhu; Yadong Wang
Journal:  Bioinformatics       Date:  2016-06-15       Impact factor: 6.937

4.  Whole Genome Sequence of Two Wild-Derived Mus musculus domesticus Inbred Strains, LEWES/EiJ and ZALENDE/EiJ, with Different Diploid Numbers.

Authors:  Andrew P Morgan; John P Didion; Anthony G Doran; James M Holt; Leonard McMillan; Thomas M Keane; Fernando Pardo-Manuel de Villena
Journal:  G3 (Bethesda)       Date:  2016-12-07       Impact factor: 3.154

5.  A framework for space-efficient read clustering in metagenomic samples.

Authors:  Jarno Alanko; Fabio Cunial; Djamal Belazzougui; Veli Mäkinen
Journal:  BMC Bioinformatics       Date:  2017-03-14       Impact factor: 3.169

6.  FMLRC: Hybrid long read error correction using an FM-index.

Authors:  Jeremy R Wang; James Holt; Leonard McMillan; Corbin D Jones
Journal:  BMC Bioinformatics       Date:  2018-02-09       Impact factor: 3.169

7.  Desiccation tolerance in bryophytes: The dehydration and rehydration transcriptomes in the desiccation-tolerant bryophyte Bryum argenteum.

Authors:  Bei Gao; Xiaoshuang Li; Daoyuan Zhang; Yuqing Liang; Honglan Yang; Moxian Chen; Yuanming Zhang; Jianhua Zhang; Andrew J Wood
Journal:  Sci Rep       Date:  2017-08-08       Impact factor: 4.379

8.  Genomes of the Mouse Collaborative Cross.

Authors:  Anuj Srivastava; Andrew P Morgan; Maya L Najarian; Vishal Kumar Sarsani; J Sebastian Sigmon; John R Shorter; Anwica Kashfeen; Rachel C McMullan; Lucy H Williams; Paola Giusti-Rodríguez; Martin T Ferris; Patrick Sullivan; Pablo Hock; Darla R Miller; Timothy A Bell; Leonard McMillan; Gary A Churchill; Fernando Pardo-Manuel de Villena
Journal:  Genetics       Date:  2017-06       Impact factor: 4.562

9.  Whole Genome Sequencing and Progress Toward Full Inbreeding of the Mouse Collaborative Cross Population.

Authors:  John R Shorter; Maya L Najarian; Timothy A Bell; Matthew Blanchard; Martin T Ferris; Pablo Hock; Anwica Kashfeen; Kathryn E Kirchoff; Colton L Linnertz; J Sebastian Sigmon; Darla R Miller; Leonard McMillan; Fernando Pardo-Manuel de Villena
Journal:  G3 (Bethesda)       Date:  2019-05-07       Impact factor: 3.154

10.  Integrating long-range connectivity information into de Bruijn graphs.

Authors:  Isaac Turner; Kiran V Garimella; Zamin Iqbal; Gil McVean
Journal:  Bioinformatics       Date:  2018-08-01       Impact factor: 6.937

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.