Literature DB >> 28502990

Short Read Mapping: An Algorithmic Tour.

Stefan Canzar, Steven L Salzberg.   

Abstract

Ultra-high-throughput next-generation sequencing (NGS) technology allows us to determine the sequence of nucleotides of many millions of DNA molecules in parallel. Accompanied by a dramatic reduction in cost since its introduction in 2004, NGS technology has provided a new way of addressing a wide range of biological and biomedical questions, from the study of human genetic disease to the analysis of gene expression, protein-DNA interactions, and patterns of DNA methylation. The data generated by NGS instruments comprise huge numbers of very short DNA sequences, or 'reads', that carry little information by themselves. These reads therefore have to be pieced together by well-engineered algorithms to reconstruct biologically meaningful measurments, such as the level of expression of a gene. To solve this complex, high-dimensional puzzle, reads must be mapped back to a reference genome to determine their origin Due to sequencing errors and to genuine differences between the reference genome and the individual being sequenced, this mapping process must be tolerant of mismatches, insertions, and deletions. Although optimal alignment algorithms to solve this problem have long been available, the practical requirements of aligning hundreds of millions of short reads to the 3 billion base pair long human genome have stimulated the development of new, more efficient methods, which today are used routinely throughout the world for the analysis of NGS data.

Entities:  

Keywords:  Burrows-Wheeler transform; DNA sequencing; sequence alignment; string matching; suffix trees

Year:  2015        PMID: 28502990      PMCID: PMC5425171          DOI: 10.1109/JPROC.2015.2455551

Source DB:  PubMed          Journal:  Proc IEEE Inst Electr Electron Eng        ISSN: 0018-9219            Impact factor:   10.961


  85 in total

1.  Anatomy of a hash-based long read sequence mapping algorithm for next generation DNA sequencing.

Authors:  Sanchit Misra; Ankit Agrawal; Wei-keng Liao; Alok Choudhary
Journal:  Bioinformatics       Date:  2010-11-18       Impact factor: 6.937

2.  SOAP2: an improved ultrafast tool for short read alignment.

Authors:  Ruiqiang Li; Chang Yu; Yingrui Li; Tak-Wah Lam; Siu-Ming Yiu; Karsten Kristiansen; Jun Wang
Journal:  Bioinformatics       Date:  2009-06-03       Impact factor: 6.937

3.  Mapping short DNA sequencing reads and calling variants using mapping quality scores.

Authors:  Heng Li; Jue Ruan; Richard Durbin
Journal:  Genome Res       Date:  2008-08-19       Impact factor: 9.043

4.  The common colorectal cancer predisposition SNP rs6983267 at chromosome 8q24 confers potential to enhanced Wnt signaling.

Authors:  Sari Tuupanen; Mikko Turunen; Rainer Lehtonen; Outi Hallikas; Sakari Vanharanta; Teemu Kivioja; Mikael Björklund; Gonghong Wei; Jian Yan; Iina Niittymäki; Jukka-Pekka Mecklin; Heikki Järvinen; Ari Ristimäki; Mariachiara Di-Bernardo; Phil East; Luis Carvajal-Carmona; Richard S Houlston; Ian Tomlinson; Kimmo Palin; Esko Ukkonen; Auli Karhu; Jussi Taipale; Lauri A Aaltonen
Journal:  Nat Genet       Date:  2009-06-28       Impact factor: 38.330

5.  Genome-wide mapping of in vivo protein-DNA interactions.

Authors:  David S Johnson; Ali Mortazavi; Richard M Myers; Barbara Wold
Journal:  Science       Date:  2007-05-31       Impact factor: 47.728

6.  SOAP3-dp: fast, accurate and sensitive GPU-based short read aligner.

Authors:  Ruibang Luo; Thomas Wong; Jianqiao Zhu; Chi-Man Liu; Xiaoqian Zhu; Edward Wu; Lap-Kei Lee; Haoxiang Lin; Wenjuan Zhu; David W Cheung; Hing-Fung Ting; Siu-Ming Yiu; Shaoliang Peng; Chang Yu; Yingrui Li; Ruiqiang Li; Tak-Wah Lam
Journal:  PLoS One       Date:  2013-05-31       Impact factor: 3.240

7.  Searching for SNPs with cloud computing.

Authors:  Ben Langmead; Michael C Schatz; Jimmy Lin; Mihai Pop; Steven L Salzberg
Journal:  Genome Biol       Date:  2009-11-20       Impact factor: 13.583

8.  Short read DNA fragment anchoring algorithm.

Authors:  Wendi Wang; Peiheng Zhang; Xinchun Liu
Journal:  BMC Bioinformatics       Date:  2009-01-30       Impact factor: 3.169

9.  MOSAIK: a hash-based algorithm for accurate next-generation sequencing short-read mapping.

Authors:  Wan-Ping Lee; Michael P Stromberg; Alistair Ward; Chip Stewart; Erik P Garrison; Gabor T Marth
Journal:  PLoS One       Date:  2014-03-05       Impact factor: 3.240

10.  Improving read mapping using additional prefix grams.

Authors:  Jongik Kim; Chen Li; Xiaohui Xie
Journal:  BMC Bioinformatics       Date:  2014-02-05       Impact factor: 3.169

View more
  10 in total

1.  GateKeeper: a new hardware architecture for accelerating pre-alignment in DNA short read mapping.

Authors:  Mohammed Alser; Hasan Hassan; Hongyi Xin; Oguz Ergin; Onur Mutlu; Can Alkan
Journal:  Bioinformatics       Date:  2017-11-01       Impact factor: 6.937

Review 2.  The Current State and Future of CRISPR-Cas9 gRNA Design Tools.

Authors:  Laurence O W Wilson; Aidan R O'Brien; Denis C Bauer
Journal:  Front Pharmacol       Date:  2018-07-12       Impact factor: 5.810

3.  Genomic diversity affects the accuracy of bacterial single-nucleotide polymorphism-calling pipelines.

Authors:  Stephen J Bush; Dona Foster; David W Eyre; Emily L Clark; Nicola De Maio; Liam P Shaw; Nicole Stoesser; Tim E A Peto; Derrick W Crook; A Sarah Walker
Journal:  Gigascience       Date:  2020-02-01       Impact factor: 6.524

4.  Boosting the power of transcriptomics by developing an efficient gene expression profiling approach.

Authors:  Jing Wang; Jun Xu; Xiaohan Yang; Song Xu; Ming Zhang; Fei Lu
Journal:  Plant Biotechnol J       Date:  2021-09-23       Impact factor: 9.803

Review 5.  Technology dictates algorithms: recent developments in read alignment.

Authors:  Mohammed Alser; Jeremy Rotman; Onur Mutlu; Serghei Mangul; Dhrithi Deshpande; Kodi Taraszka; Huwenbo Shi; Pelin Icer Baykal; Harry Taegyun Yang; Victor Xue; Sergey Knyazev; Benjamin D Singer; Brunilda Balliu; David Koslicki; Pavel Skums; Alex Zelikovsky; Can Alkan
Journal:  Genome Biol       Date:  2021-08-26       Impact factor: 13.583

Review 6.  Metagenomics: a path to understanding the gut microbiome.

Authors:  Sandi Yen; Jethro S Johnson
Journal:  Mamm Genome       Date:  2021-07-14       Impact factor: 2.957

Review 7.  Mycobacterium bovis: From Genotyping to Genome Sequencing.

Authors:  Ana M S Guimaraes; Cristina K Zimpel
Journal:  Microorganisms       Date:  2020-05-03

Review 8.  Genetic Renal Diseases: The Emerging Role of Zebrafish Models.

Authors:  Mohamed A Elmonem; Sante Princiero Berlingerio; Lambertus P van den Heuvel; Peter A de Witte; Martin Lowe; Elena N Levtchenko
Journal:  Cells       Date:  2018-09-01       Impact factor: 6.600

Review 9.  The Many Faces of Gene Regulation in Cancer: A Computational Oncogenomics Outlook.

Authors:  Enrique Hernández-Lemus; Helena Reyes-Gopar; Jesús Espinal-Enríquez; Soledad Ochoa
Journal:  Genes (Basel)       Date:  2019-10-30       Impact factor: 4.096

10.  ARAMIS: From systematic errors of NGS long reads to accurate assemblies.

Authors:  E Sacristán-Horcajada; S González-de la Fuente; R Peiró-Pastor; F Carrasco-Ramiro; R Amils; J M Requena; J Berenguer; B Aguado
Journal:  Brief Bioinform       Date:  2021-11-05       Impact factor: 11.622

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.