A beginners guide to SNP calling from high-throughput DNA-sequencing data.

André Altmann, Peter Weber, Daniel Bader, Michael Preuss, Elisabeth B Binder, Bertram Müller-Myhsok.

Abstract

High-throughput DNA sequencing (HTS) is of increasing importance in the life sciences. One of its most prominent applications is the sequencing of whole genomes or targeted regions of the genome such as all exonic regions (i.e., the exome). Here, the objective is the identification of genetic variants such as single nucleotide polymorphisms (SNPs). The extraction of SNPs from the raw genetic sequences involves many processing steps and the application of a diverse set of tools. We review the essential building blocks for a pipeline that calls SNPs from raw HTS data. The pipeline includes quality control, mapping of short reads to the reference genome, visualization and post-processing of the alignment including base quality recalibration. The final steps of the pipeline include the SNP calling procedure along with filtering of SNP candidates. The steps of this pipeline are accompanied by an analysis of a publicly available whole-exome sequencing dataset. To this end, we employ several alignment programs and SNP calling routines for highlighting the fact that the choice of the tools significantly affects the final results.
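As a rough illustration of the last two pipeline stages named in the abstract (SNP calling and filtering of SNP candidates), the following Python sketch calls a variant at a single reference position from a list of observed bases. It is not the method of the reviewed tools: the function name `call_snp`, the input layout, and all thresholds (`min_qual`, `min_depth`, `min_alt_fraction`) are assumptions chosen for illustration. Real callers use probabilistic genotype models rather than fixed cutoffs.

```python
from collections import Counter
from typing import Optional


def call_snp(
    ref_base: str,
    pileup: list[tuple[str, int]],
    min_qual: int = 20,              # hypothetical cutoff; Phred 20 = 1% error probability
    min_depth: int = 4,              # hypothetical minimum usable coverage
    min_alt_fraction: float = 0.2,   # hypothetical minimum share of non-reference bases
) -> Optional[dict]:
    """Toy SNP call at one position.

    `pileup` holds (base, phred_quality) pairs, one per read covering the position.
    Returns a dict describing the candidate SNP, or None if no call is made.
    """
    # Quality filter: ignore bases whose Phred score is below the cutoff.
    kept = [base for base, qual in pileup if qual >= min_qual]
    depth = len(kept)
    if depth < min_depth:
        return None  # too little reliable coverage to make a call

    # Count the non-reference bases among the retained observations.
    alt_counts = Counter(base for base in kept if base != ref_base)
    if not alt_counts:
        return None  # every retained base matches the reference

    # Candidate filter: the most frequent alternative allele must be common enough.
    alt_base, alt_count = alt_counts.most_common(1)[0]
    alt_fraction = alt_count / depth
    if alt_fraction < min_alt_fraction:
        return None

    return {
        "ref": ref_base,
        "alt": alt_base,
        "depth": depth,
        "alt_fraction": alt_fraction,
    }


# Example: five reference reads, five high-quality G reads, one low-quality T read.
example = [("A", 30)] * 5 + [("G", 35)] * 5 + [("T", 5)]
result = call_snp("A", example)
```

In this example the low-quality T observation is discarded before counting, so the call rests on ten retained bases, half of which support G. Changing any of the assumed thresholds changes which candidates survive, which is a small-scale view of the abstract's point that tool and parameter choices affect the final results.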

Year: 2012; PMID: 22886560; DOI: 10.1007/s00439-012-1213-z

Source DB: PubMed; Journal: Hum Genet; ISSN: 0340-6717; Impact factor: 4.132


