Literature DB >> 26568624

Optimal seed solver: optimizing seed selection in read mapping.

Hongyi Xin1, Sunny Nahar1, Richard Zhu1, John Emmons2, Gennady Pekhimenko1, Carl Kingsford3, Can Alkan4, Onur Mutlu5.   

Abstract

MOTIVATION: Optimizing seed selection is an important problem in read mapping. The number of non-overlapping seeds a mapper selects determines the sensitivity of the mapper while the total frequency of all selected seeds determines the speed of the mapper. Modern seed-and-extend mappers usually select seeds with either an equal and fixed-length scheme or with an inflexible placement scheme, both of which limit the ability of the mapper in selecting less frequent seeds to speed up the mapping process. Therefore, it is crucial to develop a new algorithm that can adjust both the individual seed length and the seed placement, as well as derive less frequent seeds.
RESULTS: We present the Optimal Seed Solver (OSS), a dynamic programming algorithm that discovers the least frequently-occurring set of x seeds in an L-base-pair read in [Formula: see text] operations on average and in [Formula: see text] operations in the worst case, while generating a maximum of [Formula: see text] seed frequency database lookups. We compare OSS against four state-of-the-art seed selection schemes and observe that OSS provides a 3-fold reduction in average seed frequency over the best previous seed selection optimizations.
AVAILABILITY AND IMPLEMENTATION: We provide an implementation of the Optimal Seed Solver in C++ at: https://github.com/CMU-SAFARI/Optimal-Seed-Solver CONTACT: hxin@cmu.edu, calkan@cs.bilkent.edu.tr or onur@cmu.edu SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

Entities:  

Mesh:

Year:  2015        PMID: 26568624      PMCID: PMC6363230          DOI: 10.1093/bioinformatics/btv670

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  6 in total

1.  Fast and efficient short read mapping based on a succinct hash index.

Authors:  Haowen Zhang; Yuandong Chan; Kaichao Fan; Bertil Schmidt; Weiguo Liu
Journal:  BMC Bioinformatics       Date:  2018-03-09       Impact factor: 3.169

Review 2.  Technology dictates algorithms: recent developments in read alignment.

Authors:  Mohammed Alser; Jeremy Rotman; Onur Mutlu; Serghei Mangul; Dhrithi Deshpande; Kodi Taraszka; Huwenbo Shi; Pelin Icer Baykal; Harry Taegyun Yang; Victor Xue; Sergey Knyazev; Benjamin D Singer; Brunilda Balliu; David Koslicki; Pavel Skums; Alex Zelikovsky; Can Alkan
Journal:  Genome Biol       Date:  2021-08-26       Impact factor: 13.583

Review 3.  Multiple genome alignment in the telomere-to-telomere assembly era.

Authors:  Bryce Kille; Advait Balaji; Fritz J Sedlazeck; Michael Nute; Todd J Treangen
Journal:  Genome Biol       Date:  2022-08-29       Impact factor: 17.906

Review 4.  From molecules to genomic variations: Accelerating genome analysis via intelligent algorithms and architectures.

Authors:  Mohammed Alser; Joel Lindegger; Can Firtina; Nour Almadhoun; Haiyu Mao; Gagandeep Singh; Juan Gomez-Luna; Onur Mutlu
Journal:  Comput Struct Biotechnol J       Date:  2022-08-18       Impact factor: 6.155

5.  Context-aware seeds for read mapping.

Authors:  Hongyi Xin; Mingfu Shao; Carl Kingsford
Journal:  Algorithms Mol Biol       Date:  2020-05-23       Impact factor: 1.405

6.  GRIM-Filter: Fast seed location filtering in DNA read mapping using processing-in-memory technologies.

Authors:  Jeremie S Kim; Damla Senol Cali; Hongyi Xin; Donghyuk Lee; Saugata Ghose; Mohammed Alser; Hasan Hassan; Oguz Ergin; Can Alkan; Onur Mutlu
Journal:  BMC Genomics       Date:  2018-05-09       Impact factor: 3.969

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.