Literature DB >> 27667793

LAMSA: fast split read alignment with long approximate matches.

Bo Liu1, Yan Gao1, Yadong Wang1.   

Abstract

MOTIVATION: Read length is continuously increasing with the development of novel high-throughput sequencing technologies, which has enormous potentials on cutting-edge genomic studies. However, longer reads could more frequently span the breakpoints of structural variants (SVs) than that of shorter reads. This may greatly influence read alignment, since most state-of-the-art aligners are designed for handling relatively small variants in a co-linear alignment framework. Meanwhile, long read alignment is still not as efficient as that of short reads, which could be also a bottleneck for the upcoming wide application.
RESULTS: We propose long approximate matches-based split aligner (LAMSA), a novel split read alignment approach. It takes the advantage of the rareness of SVs to implement a specifically designed two-step strategy. That is, LAMSA initially splits the read into relatively long fragments and co-linearly align them to solve the small variations or sequencing errors, and mitigate the effect of repeats. The alignments of the fragments are then used for implementing a sparse dynamic programming-based split alignment approach to handle the large or non-co-linear variants. We benchmarked LAMSA with simulated and real datasets having various read lengths and sequencing error rates, the results demonstrate that it is substantially faster than the state-of-the-art long read aligners; meanwhile, it also has good ability to handle various categories of SVs.
AVAILABILITY AND IMPLEMENTATION: LAMSA is available at https://github.com/hitbc/LAMSA CONTACT: Ydwang@hit.edu.cnSupplementary information: Supplementary data are available at Bioinformatics online.
© The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

Mesh:

Year:  2016        PMID: 27667793     DOI: 10.1093/bioinformatics/btw594

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  9 in total

1.  Minimap2: pairwise alignment for nucleotide sequences.

Authors:  Heng Li
Journal:  Bioinformatics       Date:  2018-09-15       Impact factor: 6.937

2.  lordFAST: sensitive and Fast Alignment Search Tool for LOng noisy Read sequencing Data.

Authors:  Ehsan Haghshenas; S Cenk Sahinalp; Faraz Hach
Journal:  Bioinformatics       Date:  2019-01-01       Impact factor: 6.937

3.  kngMap: Sensitive and Fast Mapping Algorithm for Noisy Long Reads Based on the K-Mer Neighborhood Graph.

Authors:  Ze-Gang Wei; Xing-Guo Fan; Hao Zhang; Xiao-Dan Zhang; Fei Liu; Yu Qian; Shao-Wu Zhang
Journal:  Front Genet       Date:  2022-05-05       Impact factor: 4.772

4.  Featherweight long read alignment using partitioned reference indexes.

Authors:  Hasindu Gamaarachchi; Sri Parameswaran; Martin A Smith
Journal:  Sci Rep       Date:  2019-03-13       Impact factor: 4.379

5.  TideHunter: efficient and sensitive tandem repeat detection from noisy long-reads using seed-and-chain.

Authors:  Yan Gao; Bo Liu; Yadong Wang; Yi Xing
Journal:  Bioinformatics       Date:  2019-07-15       Impact factor: 6.937

Review 6.  Mobile genomics: tools and techniques for tackling transposons.

Authors:  Kathryn O'Neill; David Brocks; Molly Gale Hammell
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2020-02-10       Impact factor: 6.237

7.  deSALT: fast and accurate long transcriptomic read alignment with de Bruijn graph-based index.

Authors:  Bo Liu; Yadong Liu; Junyi Li; Hongzhe Guo; Tianyi Zang; Yadong Wang
Journal:  Genome Biol       Date:  2019-12-16       Impact factor: 13.583

Review 8.  Technology dictates algorithms: recent developments in read alignment.

Authors:  Mohammed Alser; Jeremy Rotman; Onur Mutlu; Serghei Mangul; Dhrithi Deshpande; Kodi Taraszka; Huwenbo Shi; Pelin Icer Baykal; Harry Taegyun Yang; Victor Xue; Sergey Knyazev; Benjamin D Singer; Brunilda Balliu; David Koslicki; Pavel Skums; Alex Zelikovsky; Can Alkan
Journal:  Genome Biol       Date:  2021-08-26       Impact factor: 13.583

Review 9.  The third generation sequencing: the advanced approach to genetic diseases.

Authors:  Tiantian Xiao; Wenhao Zhou
Journal:  Transl Pediatr       Date:  2020-04
  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.