Inference of population genetic parameters in metagenomics: a clean look at messy data.

Philip L F Johnson, Montgomery Slatkin.

Abstract

Metagenomic projects generate short, overlapping fragments of DNA sequence, each deriving from a different individual. We report a new method for inferring the scaled mutation rate, $\theta = 2N_e u$, and the scaled exponential growth rate, $R = N_e r$, from the site-frequency spectrum of these data while accounting for sequencing error via Phred quality scores. After obtaining maximum likelihood parameter estimates for $\theta$ and $R$, we calculate empirical Bayes quality scores reflecting the posterior probability that each apparently polymorphic site is truly polymorphic; these scores can then be used for other applications such as SNP discovery. For realistic parameter ranges, analytic and simulation results show our estimates to be essentially unbiased with tight confidence intervals. In contrast, choosing an arbitrary quality score cutoff (e.g., trimming reads) and ignoring further quality information during inference yields biased estimates with greater variance. We illustrate the use of our technique on a new project analyzing activated sludge from a lab-scale bioreactor seeded by a wastewater treatment plant.
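
Two standard ingredients mentioned in the abstract can be illustrated with a minimal Python sketch. It is not the authors' inference method: it only shows the usual Phred convention (error probability 10^(-Q/10)) and the textbook expected site-frequency spectrum under a constant-size neutral model (theta / i for a derived-allele count i), which corresponds to no growth. The function names are hypothetical and chosen for illustration only.

```python
# Illustrative sketch only; function names are hypothetical and this is
# NOT the likelihood method described in the abstract.

def phred_to_error_prob(q: float) -> float:
    """Standard Phred convention: Q = -10 * log10(p_error)."""
    return 10.0 ** (-q / 10.0)


def expected_errors(quals: list[float]) -> float:
    """Expected number of miscalled bases among calls with these qualities."""
    return sum(phred_to_error_prob(q) for q in quals)


def expected_neutral_sfs(theta: float, n: int) -> list[float]:
    """Expected count of sites with derived-allele count i = 1..n-1 in a
    sample of n sequences, assuming the standard neutral model with
    constant population size (an assumption; no growth is modelled)."""
    return [theta / i for i in range(1, n)]


if __name__ == "__main__":
    # A Q20 base call has a 1% chance of being wrong under this convention.
    print(phred_to_error_prob(20))
    # Low-quality calls dominate the expected error count in a column.
    print(expected_errors([10, 20, 30]))
    # Singletons are the most common class under neutrality, which is also
    # the class that sequencing errors tend to inflate.
    print(expected_neutral_sfs(2.0, 4))
```

Because sequencing errors mostly appear as low-frequency variants, they distort the same part of the spectrum that carries information about theta and growth, which is why the abstract argues for using the quality scores in the inference rather than discarding reads below a cutoff.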

Year:  2006        PMID: 16954540      PMCID: PMC1581441          DOI: 10.1101/gr.5431206

Source DB:  PubMed          Journal:  Genome Res        ISSN: 1088-9051            Impact factor:   9.043


