Literature DB >> 26055432

Review of alignment and SNP calling algorithms for next-generation sequencing data.

M Mielczarek1, J Szyda2.   

Abstract

Application of the massive parallel sequencing technology has become one of the most important issues in life sciences. Therefore, it was crucial to develop bioinformatics tools for next-generation sequencing (NGS) data processing. Currently, two of the most significant tasks include alignment to a reference genome and detection of single nucleotide polymorphisms (SNPs). In many types of genomic analyses, great numbers of reads need to be mapped to the reference genome; therefore, selection of the aligner is an essential step in NGS pipelines. Two main algorithms-suffix tries and hash tables-have been introduced for this purpose. Suffix array-based aligners are memory-efficient and work faster than hash-based aligners, but they are less accurate. In contrast, hash table algorithms tend to be slower, but more sensitive. SNP and genotype callers may also be divided into two main different approaches: heuristic and probabilistic methods. A variety of software has been subsequently developed over the past several years. In this paper, we briefly review the current development of NGS data processing algorithms and present the available software.

Keywords:  Alignment; Genotype calling; NGS; Review; SNP calling; Software

Mesh:

Year:  2015        PMID: 26055432     DOI: 10.1007/s13353-015-0292-7

Source DB:  PubMed          Journal:  J Appl Genet        ISSN: 1234-1983            Impact factor:   3.240


  49 in total

1.  Accurate prediction of genetic values for complex traits by whole-genome resequencing.

Authors:  Theo Meuwissen; Mike Goddard
Journal:  Genetics       Date:  2010-03-22       Impact factor: 4.562

2.  SOAP2: an improved ultrafast tool for short read alignment.

Authors:  Ruiqiang Li; Chang Yu; Yingrui Li; Tak-Wah Lam; Siu-Ming Yiu; Karsten Kristiansen; Jun Wang
Journal:  Bioinformatics       Date:  2009-06-03       Impact factor: 6.937

3.  Mapping short DNA sequencing reads and calling variants using mapping quality scores.

Authors:  Heng Li; Jue Ruan; Richard Durbin
Journal:  Genome Res       Date:  2008-08-19       Impact factor: 9.043

4.  A SNP discovery method to assess variant allele probability from next-generation resequencing data.

Authors:  Yufeng Shen; Zhengzheng Wan; Cristian Coarfa; Rafal Drabek; Lei Chen; Elizabeth A Ostrowski; Yue Liu; George M Weinstock; David A Wheeler; Richard A Gibbs; Fuli Yu
Journal:  Genome Res       Date:  2009-12-17       Impact factor: 9.043

5.  A global view of gene activity and alternative splicing by deep sequencing of the human transcriptome.

Authors:  Marc Sultan; Marcel H Schulz; Hugues Richard; Alon Magen; Andreas Klingenhoff; Matthias Scherf; Martin Seifert; Tatjana Borodina; Aleksey Soldatov; Dmitri Parkhomchuk; Dominic Schmidt; Sean O'Keeffe; Stefan Haas; Martin Vingron; Hans Lehrach; Marie-Laure Yaspo
Journal:  Science       Date:  2008-07-03       Impact factor: 47.728

Review 6.  Computational methods for discovering structural variation with next-generation sequencing.

Authors:  Paul Medvedev; Monica Stanciu; Michael Brudno
Journal:  Nat Methods       Date:  2009-11       Impact factor: 28.547

Review 7.  Sequencing technologies - the next generation.

Authors:  Michael L Metzker
Journal:  Nat Rev Genet       Date:  2009-12-08       Impact factor: 53.242

8.  Fast gapped-read alignment with Bowtie 2.

Authors:  Ben Langmead; Steven L Salzberg
Journal:  Nat Methods       Date:  2012-03-04       Impact factor: 28.547

9.  SOAP3-dp: fast, accurate and sensitive GPU-based short read aligner.

Authors:  Ruibang Luo; Thomas Wong; Jianqiao Zhu; Chi-Man Liu; Xiaoqian Zhu; Edward Wu; Lap-Kei Lee; Haoxiang Lin; Wenjuan Zhu; David W Cheung; Hing-Fung Ting; Siu-Ming Yiu; Shaoliang Peng; Chang Yu; Yingrui Li; Ruiqiang Li; Tak-Wah Lam
Journal:  PLoS One       Date:  2013-05-31       Impact factor: 3.240

10.  SOAPdenovo2: an empirically improved memory-efficient short-read de novo assembler.

Authors:  Ruibang Luo; Binghang Liu; Yinlong Xie; Zhenyu Li; Weihua Huang; Jianying Yuan; Guangzhu He; Yanxiang Chen; Qi Pan; Yunjie Liu; Jingbo Tang; Gengxiong Wu; Hao Zhang; Yujian Shi; Yong Liu; Chang Yu; Bo Wang; Yao Lu; Changlei Han; David W Cheung; Siu-Ming Yiu; Shaoliang Peng; Zhu Xiaoqian; Guangming Liu; Xiangke Liao; Yingrui Li; Huanming Yang; Jian Wang; Tak-Wah Lam; Jun Wang
Journal:  Gigascience       Date:  2012-12-27       Impact factor: 6.524

View more
  19 in total

1.  High-Throughput Genotyping Technologies in Plant Taxonomy.

Authors:  Monica F Danilevicz; Cassandria G Tay Fernandez; Jacob I Marsh; Philipp E Bayer; David Edwards
Journal:  Methods Mol Biol       Date:  2021

Review 2.  Clinical Genomics: Challenges and Opportunities.

Authors:  Priyanka Vijay; Alexa B R McIntyre; Christopher E Mason; Jeffrey P Greenfield; Sheng Li
Journal:  Crit Rev Eukaryot Gene Expr       Date:  2016       Impact factor: 1.807

3.  Dealing with Pseudogenes in Molecular Diagnostics in the Next Generation Sequencing Era.

Authors:  Kathleen B M Claes; Toon Rosseel; Kim De Leeneer
Journal:  Methods Mol Biol       Date:  2021

Review 4.  Next Generation Sequencing and Bioinformatics Analysis of Family Genetic Inheritance.

Authors:  Aquillah M Kanzi; James Emmanuel San; Benjamin Chimukangara; Eduan Wilkinson; Maryam Fish; Veron Ramsuran; Tulio de Oliveira
Journal:  Front Genet       Date:  2020-10-23       Impact factor: 4.599

5.  Simulation of African and non-African low and high coverage whole genome sequence data to assess variant calling approaches.

Authors:  Shatha Alosaimi; Noëlle van Biljon; Denis Awany; Prisca K Thami; Joel Defo; Jacquiline W Mugo; Christian D Bope; Gaston K Mazandu; Nicola J Mulder; Emile R Chimusa
Journal:  Brief Bioinform       Date:  2021-07-20       Impact factor: 11.622

6.  MEGARes: an antimicrobial resistance database for high throughput sequencing.

Authors:  Steven M Lakin; Chris Dean; Noelle R Noyes; Adam Dettenwanger; Anne Spencer Ross; Enrique Doster; Pablo Rovira; Zaid Abdo; Kenneth L Jones; Jaime Ruiz; Keith E Belk; Paul S Morley; Christina Boucher
Journal:  Nucleic Acids Res       Date:  2016-11-28       Impact factor: 16.971

7.  microTaboo: a general and practical solution to the k-disjoint problem.

Authors:  Mohammed Al-Jaff; Eric Sandström; Manfred Grabherr
Journal:  BMC Bioinformatics       Date:  2017-05-02       Impact factor: 3.169

Review 8.  Food Safety in the Age of Next Generation Sequencing, Bioinformatics, and Open Data Access.

Authors:  Eduardo N Taboada; Morag R Graham; João A Carriço; Gary Van Domselaar
Journal:  Front Microbiol       Date:  2017-05-23       Impact factor: 5.640

9.  The Site Frequency/Dosage Spectrum of Autopolyploid Populations.

Authors:  Luca Ferretti; Paolo Ribeca; Sebastian E Ramos-Onsins
Journal:  Front Genet       Date:  2018-10-23       Impact factor: 4.599

10.  Analysis of single nucleotide polymorphisms based on RNA sequencing data of diverse bio-geographical accessions in barley.

Authors:  Kotaro Takahagi; Yukiko Uehara-Yamaguchi; Takuhiro Yoshida; Tetsuya Sakurai; Kazuo Shinozaki; Keiichi Mochida; Daisuke Saisho
Journal:  Sci Rep       Date:  2016-09-12       Impact factor: 4.379

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.