Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 LAMSA: fast split read alignment with long approximate matches.

Literature DB >> 27667793

LAMSA: fast split read alignment with long approximate matches.

Abstract

MOTIVATION: Read length is continuously increasing with the development of novel high-throughput sequencing technologies, which has enormous potentials on cutting-edge genomic studies. However, longer reads could more frequently span the breakpoints of structural variants (SVs) than that of shorter reads. This may greatly influence read alignment, since most state-of-the-art aligners are designed for handling relatively small variants in a co-linear alignment framework. Meanwhile, long read alignment is still not as efficient as that of short reads, which could be also a bottleneck for the upcoming wide application.
RESULTS: We propose long approximate matches-based split aligner (LAMSA), a novel split read alignment approach. It takes the advantage of the rareness of SVs to implement a specifically designed two-step strategy. That is, LAMSA initially splits the read into relatively long fragments and co-linearly align them to solve the small variations or sequencing errors, and mitigate the effect of repeats. The alignments of the fragments are then used for implementing a sparse dynamic programming-based split alignment approach to handle the large or non-co-linear variants. We benchmarked LAMSA with simulated and real datasets having various read lengths and sequencing error rates, the results demonstrate that it is substantially faster than the state-of-the-art long read aligners; meanwhile, it also has good ability to handle various categories of SVs.
AVAILABILITY AND IMPLEMENTATION: LAMSA is available at https://github.com/hitbc/LAMSA CONTACT: Ydwang@hit.edu.cnSupplementary information: Supplementary data are available at Bioinformatics online.

Mesh：

Year: 2016 PMID： 27667793 DOI： 10.1093/bioinformatics/btw594

Source DB: PubMed Journal: Bioinformatics ISSN： 1367-4803 Impact factor: 6.937

Keyword Cloud
Cited

9 in total

1. Minimap2: pairwise alignment for nucleotide sequences.

Authors: Heng Li
Journal: Bioinformatics Date: 2018-09-15 Impact factor: 6.937

2. lordFAST: sensitive and Fast Alignment Search Tool for LOng noisy Read sequencing Data.

Authors: Ehsan Haghshenas; S Cenk Sahinalp; Faraz Hach
Journal: Bioinformatics Date: 2019-01-01 Impact factor: 6.937

3. kngMap: Sensitive and Fast Mapping Algorithm for Noisy Long Reads Based on the K-Mer Neighborhood Graph.

Authors: Ze-Gang Wei; Xing-Guo Fan; Hao Zhang; Xiao-Dan Zhang; Fei Liu; Yu Qian; Shao-Wu Zhang
Journal: Front Genet Date: 2022-05-05 Impact factor: 4.772

4. Featherweight long read alignment using partitioned reference indexes.

Authors: Hasindu Gamaarachchi; Sri Parameswaran; Martin A Smith
Journal: Sci Rep Date: 2019-03-13 Impact factor: 4.379

5. TideHunter: efficient and sensitive tandem repeat detection from noisy long-reads using seed-and-chain.

Authors: Yan Gao; Bo Liu; Yadong Wang; Yi Xing
Journal: Bioinformatics Date: 2019-07-15 Impact factor: 6.937

Review 6. Mobile genomics: tools and techniques for tackling transposons.

Authors: Kathryn O'Neill; David Brocks; Molly Gale Hammell
Journal: Philos Trans R Soc Lond B Biol Sci Date: 2020-02-10 Impact factor: 6.237

7. deSALT: fast and accurate long transcriptomic read alignment with de Bruijn graph-based index.

Authors: Bo Liu; Yadong Liu; Junyi Li; Hongzhe Guo; Tianyi Zang; Yadong Wang
Journal: Genome Biol Date: 2019-12-16 Impact factor: 13.583

Review 8. Technology dictates algorithms: recent developments in read alignment.

Authors: Mohammed Alser; Jeremy Rotman; Onur Mutlu; Serghei Mangul; Dhrithi Deshpande; Kodi Taraszka; Huwenbo Shi; Pelin Icer Baykal; Harry Taegyun Yang; Victor Xue; Sergey Knyazev; Benjamin D Singer; Brunilda Balliu; David Koslicki; Pavel Skums; Alex Zelikovsky; Can Alkan
Journal: Genome Biol Date: 2021-08-26 Impact factor: 13.583

Review 9. The third generation sequencing: the advanced approach to genetic diseases.

Authors: Tiantian Xiao; Wenhao Zhou
Journal: Transl Pediatr Date: 2020-04

9 in total