
A SNP discovery method to assess variant allele probability from next-generation resequencing data.

Yufeng Shen, Zhengzheng Wan, Cristian Coarfa, Rafal Drabek, Lei Chen, Elizabeth A Ostrowski, Yue Liu, George M Weinstock, David A Wheeler, Richard A Gibbs, Fuli Yu.

Abstract

Accurate identification of genetic variants from next-generation sequencing (NGS) data is essential for current large-scale genomic efforts such as the 1000 Genomes Project, and for the downstream genetic analyses that build on those discoveries. The key challenge in single nucleotide polymorphism (SNP) discovery is to distinguish true individual variants (occurring at a low frequency) from sequencing errors (often occurring at frequencies orders of magnitude higher). Knowledge of the error probabilities of base calls is therefore essential. We have developed Atlas-SNP2, a computational tool that detects and accounts for systematic sequencing errors caused by context-related variables, using a logistic regression model learned from training data sets. It then estimates the posterior error probability for each substitution through a Bayesian formula that integrates prior knowledge of the overall sequencing error probability and the estimated SNP rate with the results from the logistic regression model for the given substitutions. The estimated posterior SNP probability can be used to distinguish true SNPs from sequencing errors. Validation results show that Atlas-SNP2 achieves a false-positive rate below 10%, with a false-negative rate of approximately 5% or lower.
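
The two-stage idea in the abstract (a logistic regression model for context-dependent errors, followed by a Bayesian combination with prior error and SNP rates) can be illustrated with a minimal sketch. This is not the Atlas-SNP2 implementation or its published formula: the feature names, weights, bias, prior values, and the `train_error_fraction` parameter are all hypothetical, and the combination step is a generic odds-based prior adjustment assumed here for illustration.

```python
import math


def model_error_probability(features, weights, bias):
    """Logistic model: P(error | context features) under the training class balance.

    The features, weights, and bias are hypothetical placeholders.
    """
    z = bias + sum(w * x for w, x in zip(weights, features))
    return 1.0 / (1.0 + math.exp(-z))


def posterior_error_probability(p_error_model, prior_error, prior_snp,
                                train_error_fraction=0.5):
    """Combine the model output with prior error and SNP rates via Bayes' rule.

    Assumption: the substitution is either a sequencing error or a true SNP,
    and the model was trained with the given fraction of error examples.
    """
    eps = 1e-12
    p = min(max(p_error_model, eps), 1.0 - eps)  # keep the odds finite
    model_odds = p / (1.0 - p)
    train_odds = train_error_fraction / (1.0 - train_error_fraction)
    # Likelihood ratio P(features | error) / P(features | SNP) implied by the model.
    likelihood_ratio = model_odds / train_odds
    posterior_odds = likelihood_ratio * (prior_error / prior_snp)
    return posterior_odds / (1.0 + posterior_odds)


# Hypothetical substitution: base quality 30, located near the end of the read.
features = [30.0, 1.0]
weights = [-0.2, 1.5]   # illustrative values, not fitted to any data
bias = 2.0

p_model = model_error_probability(features, weights, bias)
p_err = posterior_error_probability(p_model, prior_error=0.01, prior_snp=0.001)
print(f"posterior error probability: {p_err:.3f}")
print(f"posterior SNP probability:   {1.0 - p_err:.3f}")
```

Because sequencing errors are assumed to be far more common a priori than true variants, a substitution needs a low model error probability before its posterior SNP probability becomes high, which mirrors the challenge the abstract describes.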


Year:  2009        PMID: 20019143      PMCID: PMC2813483          DOI: 10.1101/gr.096388.109

Source DB:  PubMed          Journal:  Genome Res        ISSN: 1088-9051            Impact factor:   9.043


  97 in total

1.  SNP calling using genotype model selection on high-throughput sequencing data.

Authors:  Na You; Gabriel Murillo; Xiaoquan Su; Xiaowei Zeng; Jian Xu; Kang Ning; Shoudong Zhang; Jiankang Zhu; Xinping Cui
Journal:  Bioinformatics       Date:  2012-01-16       Impact factor: 6.937

2.  Replication strategies for rare variant complex trait association studies via next-generation sequencing.

Authors:  Dajiang J Liu; Suzanne M Leal
Journal:  Am J Hum Genet       Date:  2010-12-10       Impact factor: 11.025

3.  PyroHMMvar: a sensitive and accurate method to call short indels and SNPs for Ion Torrent and 454 data.

Authors:  Feng Zeng; Rui Jiang; Ting Chen
Journal:  Bioinformatics       Date:  2013-08-31       Impact factor: 6.937

4.  Mutations in VRK1 associated with complex motor and sensory axonal neuropathy plus microcephaly.

Authors:  Claudia Gonzaga-Jauregui; Timothy Lotze; Leila Jamal; Samantha Penney; Ian M Campbell; Davut Pehlivan; Jill V Hunter; Suzanne L Woodbury; Gerald Raymond; Adekunle M Adesina; Shalini N Jhangiani; Jeffrey G Reid; Donna M Muzny; Eric Boerwinkle; James R Lupski; Richard A Gibbs; Wojciech Wiszniewski
Journal:  JAMA Neurol       Date:  2013-12       Impact factor: 18.302

5.  Review of alignment and SNP calling algorithms for next-generation sequencing data.

Authors:  M Mielczarek; J Szyda
Journal:  J Appl Genet       Date:  2015-06-09       Impact factor: 3.240

6.  Detection of ultra-rare mutations by next-generation sequencing.

Authors:  Michael W Schmitt; Scott R Kennedy; Jesse J Salk; Edward J Fox; Joseph B Hiatt; Lawrence A Loeb
Journal:  Proc Natl Acad Sci U S A       Date:  2012-08-01       Impact factor: 11.205

7.  MetaSeq: privacy preserving meta-analysis of sequencing-based association studies.

Authors:  Angad Pal Singh; Samreen Zafer; Itsik Pe'er
Journal:  Pac Symp Biocomput       Date:  2013

8.  Mutations in NMNAT1 cause Leber congenital amaurosis and identify a new disease pathway for retinal degeneration.

Authors:  Robert K Koenekoop; Hui Wang; Jacek Majewski; Xia Wang; Irma Lopez; Huanan Ren; Yiyun Chen; Yumei Li; Gerald A Fishman; Mohammed Genead; Jeremy Schwartzentruber; Naimesh Solanki; Elias I Traboulsi; Jingliang Cheng; Clare V Logan; Martin McKibbin; Bruce E Hayward; David A Parry; Colin A Johnson; Mohammed Nageeb; James A Poulter; Moin D Mohamed; Hussain Jafri; Yasmin Rashid; Graham R Taylor; Vafa Keser; Graeme Mardon; Huidan Xu; Chris F Inglehearn; Qing Fu; Carmel Toomes; Rui Chen
Journal:  Nat Genet       Date:  2012-07-29       Impact factor: 38.330

9.  NR2F1 mutations cause optic atrophy with intellectual disability.

Authors:  Daniëlle G M Bosch; F Nienke Boonstra; Claudia Gonzaga-Jauregui; Mafei Xu; Joep de Ligt; Shalini Jhangiani; Wojciech Wiszniewski; Donna M Muzny; Helger G Yntema; Rolph Pfundt; Lisenka E L M Vissers; Liesbeth Spruijt; Ellen A W Blokland; Chun-An Chen; Richard A Lewis; Sophia Y Tsai; Richard A Gibbs; Ming-Jer Tsai; James R Lupski; Huda Y Zoghbi; Frans P M Cremers; Bert B A de Vries; Christian P Schaaf
Journal:  Am J Hum Genet       Date:  2014-01-23       Impact factor: 11.025

10.  Exome Sequence Analysis Suggests that Genetic Burden Contributes to Phenotypic Variability and Complex Neuropathy.

Authors:  Claudia Gonzaga-Jauregui; Tamar Harel; Tomasz Gambin; Maria Kousi; Laurie B Griffin; Ludmila Francescatto; Burcak Ozes; Ender Karaca; Shalini N Jhangiani; Matthew N Bainbridge; Kim S Lawson; Davut Pehlivan; Yuji Okamoto; Marjorie Withers; Pedro Mancias; Anne Slavotinek; Pamela J Reitnauer; Meryem T Goksungur; Michael Shy; Thomas O Crawford; Michel Koenig; Jason Willer; Brittany N Flores; Igor Pediaditrakis; Onder Us; Wojciech Wiszniewski; Yesim Parman; Anthony Antonellis; Donna M Muzny; Nicholas Katsanis; Esra Battaloglu; Eric Boerwinkle; Richard A Gibbs; James R Lupski
Journal:  Cell Rep       Date:  2015-08-06       Impact factor: 9.423

