Literature DB >> 30561550

lordFAST: sensitive and Fast Alignment Search Tool for LOng noisy Read sequencing Data.

Ehsan Haghshenas1, S Cenk Sahinalp1,2, Faraz Hach3,4.   

Abstract

Motivation: Recent advances in genomics and precision medicine have been made possible through the application of high throughput sequencing (HTS) to large collections of human genomes. Although HTS technologies have proven their use in cataloging human genome variation, computational analysis of the data they generate is still far from being perfect. The main limitation of Illumina and other popular sequencing technologies is their short read length relative to the lengths of (common) genomic repeats. Newer (single molecule sequencing - SMS) technologies such as Pacific Biosciences and Oxford Nanopore are producing longer reads, making it theoretically possible to overcome the difficulties imposed by repeat regions. Unfortunately, because of their high sequencing error rate, reads generated by these technologies are very difficult to work with and cannot be used in many of the standard downstream analysis pipelines. Note that it is not only difficult to find the correct mapping locations of such reads in a reference genome, but also to establish their correct alignment so as to differentiate sequencing errors from real genomic variants. Furthermore, especially since newer SMS instruments provide higher throughput, mapping and alignment need to be performed much faster than before, maintaining high sensitivity.
Results: We introduce lordFAST, a novel long-read mapper that is specifically designed to align reads generated by PacBio and potentially other SMS technologies to a reference. lordFAST not only has higher sensitivity than the available alternatives, it is also among the fastest and has a very low memory footprint. Availability and implementation: lordFAST is implemented in C++ and supports multi-threading. The source code of lordFAST is available at https://github.com/vpc-ccg/lordfast. Supplementary information: Supplementary data are available at Bioinformatics online.

Entities:  

Mesh:

Year:  2019        PMID: 30561550      PMCID: PMC6298053          DOI: 10.1093/bioinformatics/bty544

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  60 in total

1.  Reducing storage requirements for biological sequence comparison.

Authors:  Michael Roberts; Wayne Hayes; Brian R Hunt; Stephen M Mount; James A Yorke
Journal:  Bioinformatics       Date:  2004-07-15       Impact factor: 6.937

2.  A hybrid approach for the automated finishing of bacterial genomes.

Authors:  Ali Bashir; Aaron Klammer; William P Robins; Chen-Shan Chin; Dale Webster; Ellen Paxinos; David Hsu; Meredith Ashby; Susana Wang; Paul Peluso; Robert Sebra; Jon Sorenson; James Bullard; Jackie Yen; Marie Valdovino; Emilia Mollova; Khai Luong; Steven Lin; Brianna LaMay; Amruta Joshi; Lori Rowe; Michael Frace; Cheryl L Tarr; Maryann Turnsek; Brigid M Davis; Andrew Kasarskis; John J Mekalanos; Matthew K Waldor; Eric E Schadt
Journal:  Nat Biotechnol       Date:  2012-07-01       Impact factor: 54.908

3.  A flexible and efficient template format for circular consensus sequencing and SNP detection.

Authors:  Kevin J Travers; Chen-Shan Chin; David R Rank; John S Eid; Stephen W Turner
Journal:  Nucleic Acids Res       Date:  2010-06-22       Impact factor: 16.971

4.  Real-time DNA sequencing from single polymerase molecules.

Authors:  John Eid; Adrian Fehr; Jeremy Gray; Khai Luong; John Lyle; Geoff Otto; Paul Peluso; David Rank; Primo Baybayan; Brad Bettman; Arkadiusz Bibillo; Keith Bjornson; Bidhan Chaudhuri; Frederick Christians; Ronald Cicero; Sonya Clark; Ravindra Dalal; Alex Dewinter; John Dixon; Mathieu Foquet; Alfred Gaertner; Paul Hardenbol; Cheryl Heiner; Kevin Hester; David Holden; Gregory Kearns; Xiangxu Kong; Ronald Kuse; Yves Lacroix; Steven Lin; Paul Lundquist; Congcong Ma; Patrick Marks; Mark Maxham; Devon Murphy; Insil Park; Thang Pham; Michael Phillips; Joy Roy; Robert Sebra; Gene Shen; Jon Sorenson; Austin Tomaney; Kevin Travers; Mark Trulson; John Vieceli; Jeffrey Wegener; Dawn Wu; Alicia Yang; Denis Zaccarin; Peter Zhao; Frank Zhong; Jonas Korlach; Stephen Turner
Journal:  Science       Date:  2008-11-20       Impact factor: 47.728

5.  Resolving multicopy duplications de novo using polyploid phasing.

Authors:  Mark J Chaisson; Sudipto Mukherjee; Sreeram Kannan; Evan E Eichler
Journal:  Res Comput Mol Biol       Date:  2017-04-12

6.  Reconstructing complex regions of genomes using long-read sequencing technology.

Authors:  John Huddleston; Swati Ranade; Maika Malig; Francesca Antonacci; Mark Chaisson; Lawrence Hon; Peter H Sudmant; Tina A Graves; Can Alkan; Megan Y Dennis; Richard K Wilson; Stephen W Turner; Jonas Korlach; Evan E Eichler
Journal:  Genome Res       Date:  2014-01-13       Impact factor: 9.043

7.  Mapping DNA methylation with high-throughput nanopore sequencing.

Authors:  Arthur C Rand; Miten Jain; Jordan M Eizenga; Audrey Musselman-Brown; Hugh E Olsen; Mark Akeson; Benedict Paten
Journal:  Nat Methods       Date:  2017-02-20       Impact factor: 28.547

8.  HySA: a Hybrid Structural variant Assembly approach using next-generation and single-molecule sequencing technologies.

Authors:  Xian Fan; Mark Chaisson; Luay Nakhleh; Ken Chen
Journal:  Genome Res       Date:  2017-01-19       Impact factor: 9.043

9.  Edlib: a C/C ++ library for fast, exact sequence alignment using edit distance.

Authors:  Martin Šošic; Mile Šikic
Journal:  Bioinformatics       Date:  2017-05-01       Impact factor: 6.937

10.  Fast and accurate short read alignment with Burrows-Wheeler transform.

Authors:  Heng Li; Richard Durbin
Journal:  Bioinformatics       Date:  2009-05-18       Impact factor: 6.937

View more
  8 in total

1.  Long-read mapping to repetitive reference sequences using Winnowmap2.

Authors:  Chirag Jain; Arang Rhie; Nancy F Hansen; Sergey Koren; Adam M Phillippy
Journal:  Nat Methods       Date:  2022-04-01       Impact factor: 28.547

Review 2.  Nanopore sequencing technology, bioinformatics and applications.

Authors:  Yunhao Wang; Yue Zhao; Audrey Bollas; Yuru Wang; Kin Fai Au
Journal:  Nat Biotechnol       Date:  2021-11-08       Impact factor: 54.908

3.  kngMap: Sensitive and Fast Mapping Algorithm for Noisy Long Reads Based on the K-Mer Neighborhood Graph.

Authors:  Ze-Gang Wei; Xing-Guo Fan; Hao Zhang; Xiao-Dan Zhang; Fei Liu; Yu Qian; Shao-Wu Zhang
Journal:  Front Genet       Date:  2022-05-05       Impact factor: 4.772

Review 4.  Mobile genomics: tools and techniques for tackling transposons.

Authors:  Kathryn O'Neill; David Brocks; Molly Gale Hammell
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2020-02-10       Impact factor: 6.237

5.  Genomic and transcriptomic analyses reveal a tandem amplification unit of 11 genes and mutations in mismatch repair genes in methotrexate-resistant HT-29 cells.

Authors:  Ahreum Kim; Jong-Yeon Shin; Jeong-Sun Seo
Journal:  Exp Mol Med       Date:  2021-09-14       Impact factor: 12.153

Review 6.  Technology dictates algorithms: recent developments in read alignment.

Authors:  Mohammed Alser; Jeremy Rotman; Onur Mutlu; Serghei Mangul; Dhrithi Deshpande; Kodi Taraszka; Huwenbo Shi; Pelin Icer Baykal; Harry Taegyun Yang; Victor Xue; Sergey Knyazev; Benjamin D Singer; Brunilda Balliu; David Koslicki; Pavel Skums; Alex Zelikovsky; Can Alkan
Journal:  Genome Biol       Date:  2021-08-26       Impact factor: 13.583

7.  Context-aware seeds for read mapping.

Authors:  Hongyi Xin; Mingfu Shao; Carl Kingsford
Journal:  Algorithms Mol Biol       Date:  2020-05-23       Impact factor: 1.405

8.  HASLR: Fast Hybrid Assembly of Long Reads.

Authors:  Ehsan Haghshenas; Hossein Asghari; Jens Stoye; Cedric Chauve; Faraz Hach
Journal:  iScience       Date:  2020-07-25
  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.