Literature DB >> 24597675

Parallel continuous flow: a parallel suffix tree construction tool for whole genomes.

Matteo Comin1, Montse Farreras.   

Abstract

The construction of suffix trees for very long sequences is essential for many applications, and it plays a central role in the bioinformatic domain. With the advent of modern sequencing technologies, biological sequence databases have grown dramatically. Also the methodologies required to analyze these data have become more complex everyday, requiring fast queries to multiple genomes. In this article, we present parallel continuous flow (PCF), a parallel suffix tree construction method that is suitable for very long genomes. We tested our method for the suffix tree construction of the entire human genome, about 3GB. We showed that PCF can scale gracefully as the size of the input genome grows. Our method can work with an efficiency of 90% with 36 processors and 55% with 172 processors. We can index the human genome in 7 minutes using 172 processes.

Entities:  

Mesh:

Year:  2014        PMID: 24597675      PMCID: PMC3962650          DOI: 10.1089/cmb.2012.0256

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  5 in total

1.  REPuter: the manifold applications of repeat analysis on a genomic scale.

Authors:  S Kurtz; J V Choudhuri; E Ohlebusch; C Schleiermacher; J Stoye; R Giegerich
Journal:  Nucleic Acids Res       Date:  2001-11-15       Impact factor: 16.971

2.  VARUN: discovering extensible motifs under saturation constraints.

Authors:  Alberto Apostolico; Matteo Comin; Laxmi Parida
Journal:  IEEE/ACM Trans Comput Biol Bioinform       Date:  2010 Oct-Dec       Impact factor: 3.710

Review 3.  A space-efficient construction of the Burrows-Wheeler transform for genomic data.

Authors:  Ross A Lippert; Clark M Mobarry; Brian P Walenz
Journal:  J Comput Biol       Date:  2005-09       Impact factor: 1.479

4.  The irredundant class method for remote homology detection of protein sequences.

Authors:  Matteo Comin; Davide Verzotto
Journal:  J Comput Biol       Date:  2011-05-06       Impact factor: 1.479

Review 5.  Prospects and limitations of full-text index structures in genome analysis.

Authors:  Michaël Vyverman; Bernard De Baets; Veerle Fack; Peter Dawyndt
Journal:  Nucleic Acids Res       Date:  2012-05-13       Impact factor: 16.971

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.