Literature DB >> 21840859

Inference of population mutation rate and detection of segregating sites from next-generation sequence data.

Chul Joo Kang1, Paul Marjoram.   

Abstract

We live in an age in which our ability to collect large amounts of genome-wide genetic variation data offers the promise of providing the key to the understanding and treatment of genetic diseases. Over the next few years this effort will be spearheaded by so-called next-generation sequencing technologies, which provide vast amounts of short-read sequence data at relatively low cost. This technology is often used to detect unknown variation in regions that have been linked with a given disease or phenotype. However, error rates are significant, leading to some nontrivial issues when it comes to interpreting the data. In this article, we present a method with which to address questions of widespread interest: calling variants and estimating the population mutation rate. We show performance of the method using simulation studies before applying our approach to an analysis of data from the 1000 Genomes project.

Entities:  

Mesh:

Year:  2011        PMID: 21840859      PMCID: PMC3189800          DOI: 10.1534/genetics.111.130898

Source DB:  PubMed          Journal:  Genetics        ISSN: 0016-6731            Impact factor:   4.562


  21 in total

1.  Two-locus sampling distributions and their application.

Authors:  R R Hudson
Journal:  Genetics       Date:  2001-12       Impact factor: 4.562

2.  Generating samples under a Wright-Fisher neutral model of genetic variation.

Authors:  Richard R Hudson
Journal:  Bioinformatics       Date:  2002-02       Impact factor: 6.937

3.  The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data.

Authors:  Aaron McKenna; Matthew Hanna; Eric Banks; Andrey Sivachenko; Kristian Cibulskis; Andrew Kernytsky; Kiran Garimella; David Altshuler; Stacey Gabriel; Mark Daly; Mark A DePristo
Journal:  Genome Res       Date:  2010-07-19       Impact factor: 9.043

4.  Accurate detection and genotyping of SNPs utilizing population sequencing data.

Authors:  Vikas Bansal; Olivier Harismendy; Ryan Tewhey; Sarah S Murray; Nicholas J Schork; Eric J Topol; Kelly A Frazer
Journal:  Genome Res       Date:  2010-02-11       Impact factor: 9.043

5.  Low-coverage sequencing: implications for design of complex trait association studies.

Authors:  Yun Li; Carlo Sidore; Hyun Min Kang; Michael Boehnke; Gonçalo R Abecasis
Journal:  Genome Res       Date:  2011-04-01       Impact factor: 9.043

6.  A map of human genome variation from population-scale sequencing.

Authors:  Gonçalo R Abecasis; David Altshuler; Adam Auton; Lisa D Brooks; Richard M Durbin; Richard A Gibbs; Matt E Hurles; Gil A McVean
Journal:  Nature       Date:  2010-10-28       Impact factor: 49.962

Review 7.  The Sanger FASTQ file format for sequences with quality scores, and the Solexa/Illumina FASTQ variants.

Authors:  Peter J A Cock; Christopher J Fields; Naohisa Goto; Michael L Heuer; Peter M Rice
Journal:  Nucleic Acids Res       Date:  2009-12-16       Impact factor: 16.971

Review 8.  Finding the missing heritability of complex diseases.

Authors:  Teri A Manolio; Francis S Collins; Nancy J Cox; David B Goldstein; Lucia A Hindorff; David J Hunter; Mark I McCarthy; Erin M Ramos; Lon R Cardon; Aravinda Chakravarti; Judy H Cho; Alan E Guttmacher; Augustine Kong; Leonid Kruglyak; Elaine Mardis; Charles N Rotimi; Montgomery Slatkin; David Valle; Alice S Whittemore; Michael Boehnke; Andrew G Clark; Evan E Eichler; Greg Gibson; Jonathan L Haines; Trudy F C Mackay; Steven A McCarroll; Peter M Visscher
Journal:  Nature       Date:  2009-10-08       Impact factor: 49.962

9.  Discovery of rare variants via sequencing: implications for the design of complex trait association studies.

Authors:  Bingshan Li; Suzanne M Leal
Journal:  PLoS Genet       Date:  2009-05-15       Impact factor: 5.917

10.  Fast and accurate short read alignment with Burrows-Wheeler transform.

Authors:  Heng Li; Richard Durbin
Journal:  Bioinformatics       Date:  2009-05-18       Impact factor: 6.937

View more
  6 in total

1.  Exact coalescent simulation of new haplotype data from existing reference haplotypes.

Authors:  Chul Joo Kang; Paul Marjoram
Journal:  Bioinformatics       Date:  2012-01-17       Impact factor: 6.937

2.  Neutrality tests for sequences with missing data.

Authors:  Luca Ferretti; Emanuele Raineri; Sebastian Ramos-Onsins
Journal:  Genetics       Date:  2012-06-01       Impact factor: 4.562

3.  Quantifying population genetic differentiation from next-generation sequencing data.

Authors:  Matteo Fumagalli; Filipe G Vieira; Thorfinn Sand Korneliussen; Tyler Linderoth; Emilia Huerta-Sánchez; Anders Albrechtsen; Rasmus Nielsen
Journal:  Genetics       Date:  2013-08-26       Impact factor: 4.562

4.  Characterizing bias in population genetic inferences from low-coverage sequencing data.

Authors:  Eunjung Han; Janet S Sinsheimer; John Novembre
Journal:  Mol Biol Evol       Date:  2013-11-27       Impact factor: 16.240

Review 5.  From next-generation resequencing reads to a high-quality variant data set.

Authors:  S P Pfeifer
Journal:  Heredity (Edinb)       Date:  2016-10-19       Impact factor: 3.821

6.  Assessing the effect of sequencing depth and sample size in population genetics inferences.

Authors:  Matteo Fumagalli
Journal:  PLoS One       Date:  2013-11-18       Impact factor: 3.240

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.