Literature DB >> 19952140

Estimating population genetic parameters and comparing model goodness-of-fit using DNA sequences with error.

Xiaoming Liu1, Yun-Xin Fu, Taylor J Maxwell, Eric Boerwinkle.   

Abstract

It is known that sequencing error can bias estimation of evolutionary or population genetic parameters. This problem is more prominent in deep resequencing studies because of their large sample size n, and a higher probability of error at each nucleotide site. We propose a new method based on the composite likelihood of the observed SNP configurations to infer population mutation rate theta = 4N(e)micro, population exponential growth rate R, and error rate epsilon, simultaneously. Using simulation, we show the combined effects of the parameters, theta, n, epsilon, and R on the accuracy of parameter estimation. We compared our maximum composite likelihood estimator (MCLE) of theta with other theta estimators that take into account the error. The results show the MCLE performs well when the sample size is large or the error rate is high. Using parametric bootstrap, composite likelihood can also be used as a statistic for testing the model goodness-of-fit of the observed DNA sequences. The MCLE method is applied to sequence data on the ANGPTL4 gene in 1832 African American and 1045 European American individuals.

Entities:  

Mesh:

Substances:

Year:  2009        PMID: 19952140      PMCID: PMC2798822          DOI: 10.1101/gr.097543.109

Source DB:  PubMed          Journal:  Genome Res        ISSN: 1088-9051            Impact factor:   9.043


  28 in total

1.  Estimation of population parameters and recombination rates from single nucleotide polymorphisms.

Authors:  R Nielsen
Journal:  Genetics       Date:  2000-02       Impact factor: 4.562

2.  Generating samples under a Wright-Fisher neutral model of genetic variation.

Authors:  Richard R Hudson
Journal:  Bioinformatics       Date:  2002-02       Impact factor: 6.937

3.  On the number of segregating sites in genetical models without recombination.

Authors:  G A Watterson
Journal:  Theor Popul Biol       Date:  1975-04       Impact factor: 1.570

4.  A note on distributions of times to coalescence, under time-dependent population size.

Authors:  A Polanski; A Bobrowski; M Kimmel
Journal:  Theor Popul Biol       Date:  2003-02       Impact factor: 1.570

5.  New explicit expressions for relative frequencies of single-nucleotide polymorphisms with application to statistical inference on population growth.

Authors:  A Polanski; M Kimmel
Journal:  Genetics       Date:  2003-09       Impact factor: 4.562

6.  Sequencing errors and molecular evolutionary analysis.

Authors:  A G Clark; T S Whittam
Journal:  Mol Biol Evol       Date:  1992-07       Impact factor: 16.240

7.  Statistical method for testing the neutral mutation hypothesis by DNA polymorphism.

Authors:  F Tajima
Journal:  Genetics       Date:  1989-11       Impact factor: 4.562

8.  Statistical tests of neutrality of mutations.

Authors:  Y X Fu; W H Li
Journal:  Genetics       Date:  1993-03       Impact factor: 4.562

9.  Evolutionary relationship of DNA sequences in finite populations.

Authors:  F Tajima
Journal:  Genetics       Date:  1983-10       Impact factor: 4.562

10.  Accurate and fast methods to estimate the population mutation rate from error prone sequences.

Authors:  Bjarne Knudsen; Michael M Miyamoto
Journal:  BMC Bioinformatics       Date:  2009-08-11       Impact factor: 3.169

View more
  11 in total

Review 1.  Population genetic studies in the genomic sequencing era.

Authors:  Hua Chen
Journal:  Dongwuxue Yanjiu       Date:  2015-07-18

2.  Characterizing bias in population genetic inferences from low-coverage sequencing data.

Authors:  Eunjung Han; Janet S Sinsheimer; John Novembre
Journal:  Mol Biol Evol       Date:  2013-11-27       Impact factor: 16.240

3.  jPopGen Suite: population genetic analysis of DNA polymorphism from nucleotide sequences with errors.

Authors:  Xiaoming Liu
Journal:  Methods Ecol Evol       Date:  2012-03-02       Impact factor: 7.781

4.  Theoretical framework of population genetics with somatic mutations taken into account: application to copy number variations in humans.

Authors:  K Ezawa; H Innan
Journal:  Heredity (Edinb)       Date:  2013-08-28       Impact factor: 3.821

5.  Inference of site frequency spectra from high-throughput sequence data: quantification of selection on nonsynonymous and synonymous sites in humans.

Authors:  Peter D Keightley; Daniel L Halligan
Journal:  Genetics       Date:  2011-05-19       Impact factor: 4.562

6.  Exploring population size changes using SNP frequency spectra.

Authors:  Xiaoming Liu; Yun-Xin Fu
Journal:  Nat Genet       Date:  2015-04-06       Impact factor: 38.330

7.  Efficiency of ddRAD target enriched sequencing across spiny rock lobster species (Palinuridae: Jasus).

Authors:  Carla A Souza; Nicholas Murphy; Cecilia Villacorta-Rath; Laura N Woodings; Irina Ilyushkina; Cristian E Hernandez; Bridget S Green; James J Bell; Jan M Strugnell
Journal:  Sci Rep       Date:  2017-07-28       Impact factor: 4.379

8.  Properties of Samples With Segregating Polymerase Chain Reaction (PCR) Dropout Mutations Within a Species.

Authors:  Cortland K Griswold
Journal:  Evol Bioinform Online       Date:  2019-11-05       Impact factor: 1.625

9.  Empirical validation of pooled whole genome population re-sequencing in Drosophila melanogaster.

Authors:  Yuan Zhu; Alan O Bergland; Josefa González; Dmitri A Petrov
Journal:  PLoS One       Date:  2012-07-26       Impact factor: 3.240

10.  Calculation of Tajima's D and other neutrality test statistics from low depth next-generation sequencing data.

Authors:  Thorfinn Sand Korneliussen; Ida Moltke; Anders Albrechtsen; Rasmus Nielsen
Journal:  BMC Bioinformatics       Date:  2013-10-02       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.