Literature DB >> 21903627

A statistical framework for SNP calling, mutation discovery, association mapping and population genetical parameter estimation from sequencing data.

Heng Li1.   

Abstract

MOTIVATION: Most existing methods for DNA sequence analysis rely on accurate sequences or genotypes. However, in applications of the next-generation sequencing (NGS), accurate genotypes may not be easily obtained (e.g. multi-sample low-coverage sequencing or somatic mutation discovery). These applications press for the development of new methods for analyzing sequence data with uncertainty.
RESULTS: We present a statistical framework for calling SNPs, discovering somatic mutations, inferring population genetical parameters and performing association tests directly based on sequencing data without explicit genotyping or linkage-based imputation. On real data, we demonstrate that our method achieves comparable accuracy to alternative methods for estimating site allele count, for inferring allele frequency spectrum and for association mapping. We also highlight the necessity of using symmetric datasets for finding somatic mutations and confirm that for discovering rare events, mismapping is frequently the leading source of errors. AVAILABILITY: http://samtools.sourceforge.net. CONTACT: hengli@broadinstitute.org.

Mesh:

Year:  2011        PMID: 21903627      PMCID: PMC3198575          DOI: 10.1093/bioinformatics/btr509

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  36 in total

1.  Enredo and Pecan: genome-wide mammalian consistency-based multiple alignment with paralogs.

Authors:  Benedict Paten; Javier Herrero; Kathryn Beal; Stephen Fitzgerald; Ewan Birney
Journal:  Genome Res       Date:  2008-10-10       Impact factor: 9.043

2.  Simultaneous genotype calling and haplotype phasing improves genotype accuracy and reduces false-positive associations for genome-wide association studies.

Authors:  Brian L Browning; Zhaoxia Yu
Journal:  Am J Hum Genet       Date:  2009-12       Impact factor: 11.025

3.  Maximum-likelihood estimation of molecular haplotype frequencies in a diploid population.

Authors:  L Excoffier; M Slatkin
Journal:  Mol Biol Evol       Date:  1995-09       Impact factor: 16.240

4.  The Sequence Alignment/Map format and SAMtools.

Authors:  Heng Li; Bob Handsaker; Alec Wysoker; Tim Fennell; Jue Ruan; Nils Homer; Gabor Marth; Goncalo Abecasis; Richard Durbin
Journal:  Bioinformatics       Date:  2009-06-08       Impact factor: 6.937

Review 5.  Genotype imputation.

Authors:  Yun Li; Cristen Willer; Serena Sanna; Gonçalo Abecasis
Journal:  Annu Rev Genomics Hum Genet       Date:  2009       Impact factor: 8.929

6.  Recurring mutations found by sequencing an acute myeloid leukemia genome.

Authors:  Elaine R Mardis; Li Ding; David J Dooling; David E Larson; Michael D McLellan; Ken Chen; Daniel C Koboldt; Robert S Fulton; Kim D Delehaunty; Sean D McGrath; Lucinda A Fulton; Devin P Locke; Vincent J Magrini; Rachel M Abbott; Tammi L Vickery; Jerry S Reed; Jody S Robinson; Todd Wylie; Scott M Smith; Lynn Carmichael; James M Eldred; Christopher C Harris; Jason Walker; Joshua B Peck; Feiyu Du; Adam F Dukes; Gabriel E Sanderson; Anthony M Brummett; Eric Clark; Joshua F McMichael; Rick J Meyer; Jonathan K Schindler; Craig S Pohl; John W Wallis; Xiaoqi Shi; Ling Lin; Heather Schmidt; Yuzhu Tang; Carrie Haipek; Madeline E Wiechert; Jolynda V Ivy; Joelle Kalicki; Glendoria Elliott; Rhonda E Ries; Jacqueline E Payton; Peter Westervelt; Michael H Tomasson; Mark A Watson; Jack Baty; Sharon Heath; William D Shannon; Rakesh Nagarajan; Daniel C Link; Matthew J Walter; Timothy A Graubert; John F DiPersio; Richard K Wilson; Timothy J Ley
Journal:  N Engl J Med       Date:  2009-08-05       Impact factor: 91.245

7.  Mutational evolution in a lobular breast tumour profiled at single nucleotide resolution.

Authors:  Sohrab P Shah; Ryan D Morin; Jaswinder Khattra; Leah Prentice; Trevor Pugh; Angela Burleigh; Allen Delaney; Karen Gelmon; Ryan Guliany; Janine Senz; Christian Steidl; Robert A Holt; Steven Jones; Mark Sun; Gillian Leung; Richard Moore; Tesa Severson; Greg A Taylor; Andrew E Teschendorff; Kane Tse; Gulisa Turashvili; Richard Varhol; René L Warren; Peter Watson; Yongjun Zhao; Carlos Caldas; David Huntsman; Martin Hirst; Marco A Marra; Samuel Aparicio
Journal:  Nature       Date:  2009-10-08       Impact factor: 49.962

8.  Fast and accurate short read alignment with Burrows-Wheeler transform.

Authors:  Heng Li; Richard Durbin
Journal:  Bioinformatics       Date:  2009-05-18       Impact factor: 6.937

9.  DNA sequencing of a cytogenetically normal acute myeloid leukaemia genome.

Authors:  Timothy J Ley; Elaine R Mardis; Li Ding; Bob Fulton; Michael D McLellan; Ken Chen; David Dooling; Brian H Dunford-Shore; Sean McGrath; Matthew Hickenbotham; Lisa Cook; Rachel Abbott; David E Larson; Dan C Koboldt; Craig Pohl; Scott Smith; Amy Hawkins; Scott Abbott; Devin Locke; Ladeana W Hillier; Tracie Miner; Lucinda Fulton; Vincent Magrini; Todd Wylie; Jarret Glasscock; Joshua Conyers; Nathan Sander; Xiaoqi Shi; John R Osborne; Patrick Minx; David Gordon; Asif Chinwalla; Yu Zhao; Rhonda E Ries; Jacqueline E Payton; Peter Westervelt; Michael H Tomasson; Mark Watson; Jack Baty; Jennifer Ivanovich; Sharon Heath; William D Shannon; Rakesh Nagarajan; Matthew J Walter; Daniel C Link; Timothy A Graubert; John F DiPersio; Richard K Wilson
Journal:  Nature       Date:  2008-11-06       Impact factor: 49.962

10.  A flexible and accurate genotype imputation method for the next generation of genome-wide association studies.

Authors:  Bryan N Howie; Peter Donnelly; Jonathan Marchini
Journal:  PLoS Genet       Date:  2009-06-19       Impact factor: 5.917

View more
  1956 in total

1.  Genotype-Frequency Estimation from High-Throughput Sequencing Data.

Authors:  Takahiro Maruki; Michael Lynch
Journal:  Genetics       Date:  2015-07-29       Impact factor: 4.562

2.  RAD Capture (Rapture): Flexible and Efficient Sequence-Based Genotyping.

Authors:  Omar A Ali; Sean M O'Rourke; Stephen J Amish; Mariah H Meek; Gordon Luikart; Carson Jeffres; Michael R Miller
Journal:  Genetics       Date:  2015-12-29       Impact factor: 4.562

3.  The thick aleurone1 Gene Encodes a NOT1 Subunit of the CCR4-NOT Complex and Regulates Cell Patterning in Endosperm.

Authors:  Hao Wu; Bryan C Gontarek; Gibum Yi; Brandon D Beall; Anjanasree K Neelakandan; Bibechana Adhikari; Rumei Chen; Donald R McCarty; Andrew J Severin; Philip W Becraft
Journal:  Plant Physiol       Date:  2020-07-31       Impact factor: 8.340

4.  TypeTE: a tool to genotype mobile element insertions from whole genome resequencing data.

Authors:  Clément Goubert; Jainy Thomas; Lindsay M Payer; Jeffrey M Kidd; Julie Feusier; W Scott Watkins; Kathleen H Burns; Lynn B Jorde; Cédric Feschotte
Journal:  Nucleic Acids Res       Date:  2020-04-06       Impact factor: 16.971

5.  Whole genome genotyping reveals discrete genetic diversity in north-east Atlantic maerl beds.

Authors:  Tom L Jenkins; Marie-Laure Guillemin; Cornelia Simon-Nutbrown; Heidi L Burdett; Jamie R Stevens; Viviana Peña
Journal:  Evol Appl       Date:  2021-03-30       Impact factor: 5.183

6.  Small molecule inhibition of the CBFβ/RUNX interaction decreases ovarian cancer growth and migration through alterations in genes related to epithelial-to-mesenchymal transition.

Authors:  Anne L Carlton; Anuradha Illendula; Yan Gao; Danielle C Llaneza; Adam Boulton; Anant Shah; Roger A Rajewski; Charles N Landen; David Wotton; John H Bushweller
Journal:  Gynecol Oncol       Date:  2018-03-16       Impact factor: 5.482

7.  PhredEM: a phred-score-informed genotype-calling approach for next-generation sequencing studies.

Authors:  Peizhou Liao; Glen A Satten; Yi-Juan Hu
Journal:  Genet Epidemiol       Date:  2017-05-31       Impact factor: 2.135

8.  Genetic diagnosis of a Chinese multiple endocrine neoplasia type 2A family through whole genome sequencing.

Authors:  Zhen-Fang DU; Peng-Fei Li; Jian-Qiang Zhao; Zhi-Lie Cao; Feng Li; Ju-Ming Ma; Xiao-Ping Qi
Journal:  J Biosci       Date:  2017-06       Impact factor: 1.826

9.  Homozygous and compound-heterozygous mutations in TGDS cause Catel-Manzke syndrome.

Authors:  Nadja Ehmke; Almuth Caliebe; Rainer Koenig; Sarina G Kant; Zornitza Stark; Valérie Cormier-Daire; Dagmar Wieczorek; Gabriele Gillessen-Kaesbach; Kirstin Hoff; Amit Kawalia; Holger Thiele; Janine Altmüller; Björn Fischer-Zirnsak; Alexej Knaus; Na Zhu; Verena Heinrich; Celine Huber; Izabela Harabula; Malte Spielmann; Denise Horn; Uwe Kornak; Jochen Hecht; Peter M Krawitz; Peter Nürnberg; Reiner Siebert; Hermann Manzke; Stefan Mundlos
Journal:  Am J Hum Genet       Date:  2014-12-04       Impact factor: 11.025

10.  Identification of Transcribed Enhancers by Genome-Wide Chromatin Immunoprecipitation Sequencing.

Authors:  Steven Blinka; Michael H Reimer; Kirthi Pulakanti; Luca Pinello; Guo-Cheng Yuan; Sridhar Rao
Journal:  Methods Mol Biol       Date:  2017
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.