Literature DB >> 27759079

From next-generation resequencing reads to a high-quality variant data set.

S P Pfeifer1,2,3.   

Abstract

Sequencing has revolutionized biology by permitting the analysis of genomic variation at an unprecedented resolution. High-throughput sequencing is fast and inexpensive, making it accessible for a wide range of research topics. However, the produced data contain subtle but complex types of errors, biases and uncertainties that impose several statistical and computational challenges to the reliable detection of variants. To tap the full potential of high-throughput sequencing, a thorough understanding of the data produced as well as the available methodologies is required. Here, I review several commonly used methods for generating and processing next-generation resequencing data, discuss the influence of errors and biases together with their resulting implications for downstream analyses and provide general guidelines and recommendations for producing high-quality single-nucleotide polymorphism data sets from raw reads by highlighting several sophisticated reference-based methods representing the current state of the art.

Mesh:

Year:  2016        PMID: 27759079      PMCID: PMC5234474          DOI: 10.1038/hdy.2016.102

Source DB:  PubMed          Journal:  Heredity (Edinb)        ISSN: 0018-067X            Impact factor:   3.821


  154 in total

1.  SNP detection and genotyping from low-coverage sequencing data on multiple diploid samples.

Authors:  Si Quang Le; Richard Durbin
Journal:  Genome Res       Date:  2010-10-27       Impact factor: 9.043

2.  SOAP2: an improved ultrafast tool for short read alignment.

Authors:  Ruiqiang Li; Chang Yu; Yingrui Li; Tak-Wah Lam; Siu-Ming Yiu; Karsten Kristiansen; Jun Wang
Journal:  Bioinformatics       Date:  2009-06-03       Impact factor: 6.937

3.  Mapping short DNA sequencing reads and calling variants using mapping quality scores.

Authors:  Heng Li; Jue Ruan; Richard Durbin
Journal:  Genome Res       Date:  2008-08-19       Impact factor: 9.043

4.  LookSeq: a browser-based viewer for deep sequencing data.

Authors:  Heinrich Magnus Manske; Dominic P Kwiatkowski
Journal:  Genome Res       Date:  2009-08-13       Impact factor: 9.043

5.  High quality SNP calling using Illumina data at shallow coverage.

Authors:  Nawar Malhis; Steven J M Jones
Journal:  Bioinformatics       Date:  2010-02-26       Impact factor: 6.937

6.  Tablet--next generation sequence assembly visualization.

Authors:  Iain Milne; Micha Bayer; Linda Cardle; Paul Shaw; Gordon Stephen; Frank Wright; David Marshall
Journal:  Bioinformatics       Date:  2009-12-04       Impact factor: 6.937

7.  Quake: quality-aware detection and correction of sequencing errors.

Authors:  David R Kelley; Michael C Schatz; Steven L Salzberg
Journal:  Genome Biol       Date:  2010-11-29       Impact factor: 13.583

8.  Assessing the accuracy and power of population genetic inference from low-pass next-generation sequencing data.

Authors:  Jacob E Crawford; Brian P Lazzaro
Journal:  Front Genet       Date:  2012-04-24       Impact factor: 4.599

9.  TagGD: fast and accurate software for DNA Tag generation and demultiplexing.

Authors:  Paul Igor Costea; Joakim Lundeberg; Pelin Akan
Journal:  PLoS One       Date:  2013-03-04       Impact factor: 3.240

10.  Substantial biases in ultra-short read data sets from high-throughput DNA sequencing.

Authors:  Juliane C Dohm; Claudio Lottaz; Tatiana Borodina; Heinz Himmelbauer
Journal:  Nucleic Acids Res       Date:  2008-07-26       Impact factor: 16.971

View more
  17 in total

1.  A Fine-Scale Genetic Map for Vervet Monkeys.

Authors:  Susanne P Pfeifer
Journal:  Mol Biol Evol       Date:  2020-07-01       Impact factor: 16.240

2.  Recommendations for improving statistical inference in population genomics.

Authors:  Parul Johri; Charles F Aquadro; Mark Beaumont; Brian Charlesworth; Laurent Excoffier; Adam Eyre-Walker; Peter D Keightley; Michael Lynch; Gil McVean; Bret A Payseur; Susanne P Pfeifer; Wolfgang Stephan; Jeffrey D Jensen
Journal:  PLoS Biol       Date:  2022-05-31       Impact factor: 9.593

Review 3.  Next Generation Sequencing and Bioinformatics Analysis of Family Genetic Inheritance.

Authors:  Aquillah M Kanzi; James Emmanuel San; Benjamin Chimukangara; Eduan Wilkinson; Maryam Fish; Veron Ramsuran; Tulio de Oliveira
Journal:  Front Genet       Date:  2020-10-23       Impact factor: 4.599

Review 4.  The population genomics of within-host Mycobacterium tuberculosis.

Authors:  Ana Y Morales-Arce; Susanna J Sabin; Anne C Stone; Jeffrey D Jensen
Journal:  Heredity (Edinb)       Date:  2020-10-15       Impact factor: 3.821

5.  A Perfect Match Genomic Landscape Provides a Unified Framework for the Precise Detection of Variation in Natural and Synthetic Haploid Genomes.

Authors:  Kim Palacios-Flores; Jair García-Sotelo; Alejandra Castillo; Carina Uribe; Luis Aguilar; Lucía Morales; Laura Gómez-Romero; José Reyes; Alejandro Garciarubio; Margareta Boege; Guillermo Dávila
Journal:  Genetics       Date:  2018-01-24       Impact factor: 4.562

6.  Comparison of Read Mapping and Variant Calling Tools for the Analysis of Plant NGS Data.

Authors:  Hanna Marie Schilbert; Andreas Rempel; Boas Pucker
Journal:  Plants (Basel)       Date:  2020-04-02

7.  De novo mutation rate estimation in wolves of known pedigree.

Authors:  Evan Koch; Rena M Schweizer; Teia M Schweizer; Daniel R Stahler; Douglas W Smith; Robert K Wayne; John Novembre
Journal:  Mol Biol Evol       Date:  2019-07-12       Impact factor: 16.240

8.  Genotyping-by-sequencing reveals three QTL for clubroot resistance to six pathotypes of Plasmodiophora brassicae in Brassica rapa.

Authors:  Fengqun Yu; Xingguo Zhang; Gary Peng; Kevin C Falk; Stephen E Strelkov; Bruce D Gossen
Journal:  Sci Rep       Date:  2017-07-03       Impact factor: 4.379

9.  The Evolutionary History of Nebraska Deer Mice: Local Adaptation in the Face of Strong Gene Flow.

Authors:  Susanne P Pfeifer; Stefan Laurent; Vitor C Sousa; Catherine R Linnen; Matthieu Foll; Laurent Excoffier; Hopi E Hoekstra; Jeffrey D Jensen
Journal:  Mol Biol Evol       Date:  2018-04-01       Impact factor: 16.240

10.  Identification of Key Genes in Gastric Cancer by Bioinformatics Analysis.

Authors:  Xinyu Chong; Rui Peng; Yan Sun; Luyu Zhang; Zheng Zhang
Journal:  Biomed Res Int       Date:  2020-09-21       Impact factor: 3.411

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.