Literature DB >> 22506536

Mapping reads on a genomic sequence: an algorithmic overview and a practical comparative analysis.

Sophie Schbath1, Véronique Martin, Matthias Zytnicki, Julien Fayolle, Valentin Loux, Jean-François Gibrat.   

Abstract

Mapping short reads against a reference genome is classically the first step of many next-generation sequencing data analyses, and it should be as accurate as possible. Because of the large number of reads to handle, numerous sophisticated algorithms have been developped in the last 3 years to tackle this problem. In this article, we first review the underlying algorithms used in most of the existing mapping tools, and then we compare the performance of nine of these tools on a well controled benchmark built for this purpose. We built a set of reads that exist in single or multiple copies in a reference genome and for which there is no mismatch, and a set of reads with three mismatches. We considered as reference genome both the human genome and a concatenation of all complete bacterial genomes. On each dataset, we quantified the capacity of the different tools to retrieve all the occurrences of the reads in the reference genome. Special attention was paid to reads uniquely reported and to reads with multiple hits.

Entities:  

Mesh:

Year:  2012        PMID: 22506536      PMCID: PMC3375638          DOI: 10.1089/cmb.2012.0022

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  16 in total

1.  SSAHA: a fast search method for large DNA databases.

Authors:  Z Ning; A J Cox; J C Mullikin
Journal:  Genome Res       Date:  2001-10       Impact factor: 9.043

2.  SOAP2: an improved ultrafast tool for short read alignment.

Authors:  Ruiqiang Li; Chang Yu; Yingrui Li; Tak-Wah Lam; Siu-Ming Yiu; Karsten Kristiansen; Jun Wang
Journal:  Bioinformatics       Date:  2009-06-03       Impact factor: 6.937

3.  Mapping short DNA sequencing reads and calling variants using mapping quality scores.

Authors:  Heng Li; Jue Ruan; Richard Durbin
Journal:  Genome Res       Date:  2008-08-19       Impact factor: 9.043

4.  ZOOM! Zillions of oligos mapped.

Authors:  Hao Lin; Zefeng Zhang; Michael Q Zhang; Bin Ma; Ming Li
Journal:  Bioinformatics       Date:  2008-08-06       Impact factor: 6.937

5.  SeqMap: mapping massive amount of oligonucleotides to the genome.

Authors:  Hui Jiang; Wing Hung Wong
Journal:  Bioinformatics       Date:  2008-08-12       Impact factor: 6.937

6.  Comparative analysis of algorithms for next-generation sequencing read alignment.

Authors:  Matthew Ruffalo; Thomas LaFramboise; Mehmet Koyutürk
Journal:  Bioinformatics       Date:  2011-08-19       Impact factor: 6.937

7.  RazerS--fast read mapping with sensitivity control.

Authors:  David Weese; Anne-Katrin Emde; Tobias Rausch; Andreas Döring; Knut Reinert
Journal:  Genome Res       Date:  2009-07-10       Impact factor: 9.043

8.  Ultrafast and memory-efficient alignment of short DNA sequences to the human genome.

Authors:  Ben Langmead; Cole Trapnell; Mihai Pop; Steven L Salzberg
Journal:  Genome Biol       Date:  2009-03-04       Impact factor: 13.583

9.  SOAP: short oligonucleotide alignment program.

Authors:  Ruiqiang Li; Yingrui Li; Karsten Kristiansen; Jun Wang
Journal:  Bioinformatics       Date:  2008-01-28       Impact factor: 6.937

10.  Fast and accurate short read alignment with Burrows-Wheeler transform.

Authors:  Heng Li; Richard Durbin
Journal:  Bioinformatics       Date:  2009-05-18       Impact factor: 6.937

View more
  25 in total

Review 1.  From next-generation resequencing reads to a high-quality variant data set.

Authors:  S P Pfeifer
Journal:  Heredity (Edinb)       Date:  2016-10-19       Impact factor: 3.821

2.  Comparative assessment of methods for the computational inference of transcript isoform abundance from RNA-seq data.

Authors:  Alexander Kanitz; Foivos Gypas; Andreas J Gruber; Andreas R Gruber; Georges Martin; Mihaela Zavolan
Journal:  Genome Biol       Date:  2015-07-23       Impact factor: 13.583

3.  Another lesson from unmapped reads: in-depth analysis of RNA-Seq reads from various horse tissues.

Authors:  Artur Gurgul; Tomasz Szmatoła; Ewa Ocłoń; Igor Jasielczuk; Ewelina Semik-Gurgul; Carrie J Finno; Jessica L Petersen; Rebecca Bellone; Erin N Hales; Tomasz Ząbek; Zbigniew Arent; Małgorzata Kotula-Balak; Monika Bugno-Poniewierska
Journal:  J Appl Genet       Date:  2022-06-07       Impact factor: 2.653

4.  An Enumerative Combinatorics Model for Fragmentation Patterns in RNA Sequencing Provides Insights into Nonuniformity of the Expected Fragment Starting-Point and Coverage Profile.

Authors:  Celine Prakash; Arndt Von Haeseler
Journal:  J Comput Biol       Date:  2016-09-23       Impact factor: 1.479

5.  Accurate read-based metagenome characterization using a hierarchical suite of unique signatures.

Authors:  Tracey Allen K Freitas; Po-E Li; Matthew B Scholz; Patrick S G Chain
Journal:  Nucleic Acids Res       Date:  2015-03-12       Impact factor: 16.971

6.  Identification of large-scale genomic variation in cancer genomes using in silico reference models.

Authors:  Sarah Killcoyne; Antonio Del Sol
Journal:  Nucleic Acids Res       Date:  2015-08-11       Impact factor: 16.971

7.  Unbiased Taxonomic Annotation of Metagenomic Samples.

Authors:  Bruno Fosso; Graziano Pesole; Francesc Rosselló; Gabriel Valiente
Journal:  J Comput Biol       Date:  2017-10-13       Impact factor: 1.479

Review 8.  Analysis and Performance Assessment of the Whole Genome Bisulfite Sequencing Data Workflow: Currently Available Tools and a Practical Guide to Advance DNA Methylation Studies.

Authors:  Ting Gong; Heather Borgard; Zao Zhang; Shaoqiu Chen; Zitong Gao; Youping Deng
Journal:  Small Methods       Date:  2022-01-22

9.  Benchmarking short sequence mapping tools.

Authors:  Ayat Hatem; Doruk Bozdağ; Amanda E Toland; Ümit V Çatalyürek
Journal:  BMC Bioinformatics       Date:  2013-06-07       Impact factor: 3.169

Review 10.  Best practices for evaluating single nucleotide variant calling methods for microbial genomics.

Authors:  Nathan D Olson; Steven P Lund; Rebecca E Colman; Jeffrey T Foster; Jason W Sahl; James M Schupp; Paul Keim; Jayne B Morrow; Marc L Salit; Justin M Zook
Journal:  Front Genet       Date:  2015-07-07       Impact factor: 4.599

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.