A sample selection strategy for next-generation sequencing.

Chul Joo Kang, Paul Marjoram.

Abstract

Next-generation sequencing technology provides vast amounts of sequence data. It is more efficient and cheaper than previous sequencing technologies, but deep resequencing of entire samples is still expensive. Sensible strategies for choosing subsets of samples to sequence are therefore required. Here we describe an algorithm for selecting a sub-sample of an existing sample with either of two goals in mind: maximizing the number of new polymorphic sites detected, or improving the efficiency with which the types of the remaining unsequenced individuals can be imputed at newly discovered polymorphisms. We then describe a variation on our algorithm that focuses more on detecting rarer variants. We demonstrate the performance of our algorithm using simulated data and data from the 1000 Genomes Project.
© 2012 Wiley Periodicals, Inc.

Year: 2012; PMID: 22865643; PMCID: PMC4272568; DOI: 10.1002/gepi.21664

Source DB: PubMed; Journal: Genet Epidemiol; ISSN: 0741-0395; Impact factor: 2.135


  4 in total

1.  Choosing Subsamples for Sequencing Studies by Minimizing the Average Distance to the Closest Leaf.

Authors:  Jonathan T L Kang; Peng Zhang; Sebastian Zöllner; Noah A Rosenberg
Journal:  Genetics       Date:  2015-08-24       Impact factor: 4.562

2.  Imputation of missing genotypes within LD-blocks relying on the basic coalescent and beyond: consideration of population growth and structure.

Authors:  Maria Kabisch; Ute Hamann; Justo Lorenzo Bermejo
Journal:  BMC Genomics       Date:  2017-10-17       Impact factor: 3.969

3.  Genotype imputation reference panel selection using maximal phylogenetic diversity.

Authors:  Peng Zhang; Xiaowei Zhan; Noah A Rosenberg; Sebastian Zöllner
Journal:  Genetics       Date:  2013-08-09       Impact factor: 4.562

4.  Sequencing and imputation in GWAS: Cost-effective strategies to increase power and genomic coverage across diverse populations.

Authors:  Corbin Quick; Pramod Anugu; Solomon Musani; Scott T Weiss; Esteban G Burchard; Marquitta J White; Kevin L Keys; Francesco Cucca; Carlo Sidore; Michael Boehnke; Christian Fuchsberger
Journal:  Genet Epidemiol       Date:  2020-06-09       Impact factor: 2.135
