MOTIVATION: Comparative genomics heavily relies on alignments of large and often complex DNA sequences. From an engineering perspective, the problem here is to provide maximum sensitivity (to find all there is to find), specificity (to only find real homology) and speed (to accommodate the billions of base pairs of vertebrate genomes). RESULTS: Satsuma addresses all three issues through novel strategies: (i) cross-correlation, implemented via fast Fourier transform; (ii) a match scoring scheme that eliminates almost all false hits; and (iii) an asynchronous 'battleship'-like search that allows for aligning two entire fish genomes (470 and 217 Mb) in 120 CPU hours using 15 processors on a single machine. AVAILABILITY: Satsuma is part of the Spines software package, implemented in C++ on Linux. The latest version of Spines can be freely downloaded under the LGPL license from http://www.broadinstitute.org/science/programs/genome-biology/spines/.
MOTIVATION: Comparative genomics heavily relies on alignments of large and often complex DNA sequences. From an engineering perspective, the problem here is to provide maximum sensitivity (to find all there is to find), specificity (to only find real homology) and speed (to accommodate the billions of base pairs of vertebrate genomes). RESULTS: Satsuma addresses all three issues through novel strategies: (i) cross-correlation, implemented via fast Fourier transform; (ii) a match scoring scheme that eliminates almost all false hits; and (iii) an asynchronous 'battleship'-like search that allows for aligning two entire fish genomes (470 and 217 Mb) in 120 CPU hours using 15 processors on a single machine. AVAILABILITY: Satsuma is part of the Spines software package, implemented in C++ on Linux. The latest version of Spines can be freely downloaded under the LGPL license from http://www.broadinstitute.org/science/programs/genome-biology/spines/.
Authors: W James Kent; Robert Baertsch; Angie Hinrichs; Webb Miller; David Haussler Journal: Proc Natl Acad Sci U S A Date: 2003-09-19 Impact factor: 11.205
Authors: Luca Fontanesi; Federica Di Palma; Paul Flicek; Andrew T Smith; Carl-Gustaf Thulin; Paulo C Alves Journal: J Hered Date: 2016-02-26 Impact factor: 2.645
Authors: James J Lewis; Rachel C Geltman; Patrick C Pollak; Kathleen E Rondem; Steven M Van Belleghem; Melissa J Hubisz; Paul R Munn; Linlin Zhang; Caleb Benson; Anyi Mazo-Vargas; Charles G Danko; Brian A Counterman; Riccardo Papa; Robert D Reed Journal: Proc Natl Acad Sci U S A Date: 2019-11-11 Impact factor: 11.205
Authors: Bork A Berghoff; Torgny Karlsson; Thomas Källman; E Gerhart H Wagner; Manfred G Grabherr Journal: BioData Min Date: 2017-09-05 Impact factor: 2.522
Authors: Kira Delmore; Juan Carlos Illera; Javier Pérez-Tris; Gernot Segelbacher; Juan S Lugo Ramos; Gillian Durieux; Jun Ishigohoka; Miriam Liedvogel Journal: Elife Date: 2020-04-21 Impact factor: 8.140
Authors: Liuqi Gu; Patrick F Reilly; James J Lewis; Robert D Reed; Peter Andolfatto; James R Walters Journal: Curr Biol Date: 2019-11-14 Impact factor: 10.834
Authors: Abou Yobi; Karen A Schlauch; Richard L Tillett; Won C Yim; Catherine Espinoza; Bernard W M Wone; John C Cushman; Melvin J Oliver Journal: BMC Plant Biol Date: 2017-03-28 Impact factor: 4.215