Literature DB >> 10954684

Accuracy of haplotype frequency estimation for biallelic loci, via the expectation-maximization algorithm for unphased diploid genotype data.

D Fallin1, N J Schork.   

Abstract

Haplotype analyses have become increasingly common in genetic studies of human disease because of their ability to identify unique chromosomal segments likely to harbor disease-predisposing genes. The study of haplotypes is also used to investigate many population processes, such as migration and immigration rates, linkage-disequilibrium strength, and the relatedness of populations. Unfortunately, many haplotype-analysis methods require phase information that can be difficult to obtain from samples of nonhaploid species. There are, however, strategies for estimating haplotype frequencies from unphased diploid genotype data collected on a sample of individuals that make use of the expectation-maximization (EM) algorithm to overcome the missing phase information. The accuracy of such strategies, compared with other phase-determination methods, must be assessed before their use can be advocated. In this study, we consider and explore sources of error between EM-derived haplotype frequency estimates and their population parameters, noting that much of this error is due to sampling error, which is inherent in all studies, even when phase can be determined. In light of this, we focus on the additional error between haplotype frequencies within a sample data set and EM-derived haplotype frequency estimates incurred by the estimation procedure. We assess the accuracy of haplotype frequency estimation as a function of a number of factors, including sample size, number of loci studied, allele frequencies, and locus-specific allelic departures from Hardy-Weinberg and linkage equilibrium. We point out the relative impacts of sampling error and estimation error, calling attention to the pronounced accuracy of EM estimates once sampling error has been accounted for. We also suggest that many factors that may influence accuracy can be assessed empirically within a data set-a fact that can be used to create "diagnostics" that a user can turn to for assessing potential inaccuracies in estimation.

Entities:  

Mesh:

Year:  2000        PMID: 10954684      PMCID: PMC1287896          DOI: 10.1086/303069

Source DB:  PubMed          Journal:  Am J Hum Genet        ISSN: 0002-9297            Impact factor:   11.025


  9 in total

1.  Genetic analysis of case/control data using estimated haplotype frequencies: application to APOE locus variation and Alzheimer's disease.

Authors:  D Fallin; A Cohen; L Essioux; I Chumakov; M Blumenfeld; D Cohen; N J Schork
Journal:  Genome Res       Date:  2001-01       Impact factor: 9.043

2.  Linkage disequilibrium at the ADH2 and ADH3 loci and risk of alcoholism.

Authors:  M Osier; A J Pakstis; J R Kidd; J F Lee; S J Yin; H C Ko; H J Edenberg; R B Lu; K K Kidd
Journal:  Am J Hum Genet       Date:  1999-04       Impact factor: 11.025

Review 3.  Inference of haplotypes from PCR-amplified samples of diploid populations.

Authors:  A G Clark
Journal:  Mol Biol Evol       Date:  1990-03       Impact factor: 16.240

4.  The Interaction of Selection and Linkage. I. General Considerations; Heterotic Models.

Authors:  R C Lewontin
Journal:  Genetics       Date:  1964-01       Impact factor: 4.562

5.  Detecting marker-disease association by testing for Hardy-Weinberg disequilibrium at a marker locus.

Authors:  D M Nielsen; M G Ehm; B S Weir
Journal:  Am J Hum Genet       Date:  1998-11       Impact factor: 11.025

6.  Molecular haplotyping of genetic markers 10 kb apart by allele-specific long-range PCR.

Authors:  S Michalatos-Beloin; S A Tishkoff; K L Bentley; K K Kidd; G Ruano
Journal:  Nucleic Acids Res       Date:  1996-12-01       Impact factor: 16.971

7.  HAPLO: a program using the EM algorithm to estimate the frequencies of multi-site haplotypes.

Authors:  M E Hawley; K K Kidd
Journal:  J Hered       Date:  1995 Sep-Oct       Impact factor: 2.645

8.  Maximum-likelihood estimation of molecular haplotype frequencies in a diploid population.

Authors:  L Excoffier; M Slatkin
Journal:  Mol Biol Evol       Date:  1995-09       Impact factor: 16.240

9.  An E-M algorithm and testing strategy for multiple-locus haplotypes.

Authors:  J C Long; R C Williams; M Urbanek
Journal:  Am J Hum Genet       Date:  1995-03       Impact factor: 11.025

  9 in total
  128 in total

1.  Genetic analysis of case/control data using estimated haplotype frequencies: application to APOE locus variation and Alzheimer's disease.

Authors:  D Fallin; A Cohen; L Essioux; I Chumakov; M Blumenfeld; D Cohen; N J Schork
Journal:  Genome Res       Date:  2001-01       Impact factor: 9.043

2.  A new statistical method for haplotype reconstruction from population data.

Authors:  M Stephens; N J Smith; P Donnelly
Journal:  Am J Hum Genet       Date:  2001-03-09       Impact factor: 11.025

3.  Score tests for association between traits and haplotypes when linkage phase is ambiguous.

Authors:  Daniel J Schaid; Charles M Rowland; David E Tines; Robert M Jacobson; Gregory A Poland
Journal:  Am J Hum Genet       Date:  2001-12-27       Impact factor: 11.025

4.  Spectrum of nonrandom associations between microsatellite loci on human chromosome 11p15.

Authors:  C Zapata; S Rodríguez; G Visedo; F Sacristán
Journal:  Genetics       Date:  2001-07       Impact factor: 4.562

5.  Molecular analysis of the beta-globin gene cluster in the Niokholo Mandenka population reveals a recent origin of the beta(S) Senegal mutation.

Authors:  Mathias Currat; Guy Trabuchet; David Rees; Pascale Perrin; Rosalind M Harding; John B Clegg; André Langaney; Laurent Excoffier
Journal:  Am J Hum Genet       Date:  2001-12-06       Impact factor: 11.025

6.  Bayesian haplotype inference for multiple linked single-nucleotide polymorphisms.

Authors:  Tianhua Niu; Zhaohui S Qin; Xiping Xu; Jun S Liu
Journal:  Am J Hum Genet       Date:  2001-11-26       Impact factor: 11.025

7.  A method for the assessment of disease associations with single-nucleotide polymorphism haplotypes and environmental variables in case-control studies.

Authors:  Lue Ping Zhao; Shuying Sue Li; Najma Khalid
Journal:  Am J Hum Genet       Date:  2003-04-16       Impact factor: 11.025

8.  On the identification of disease mutations by the analysis of haplotype similarity and goodness of fit.

Authors:  Jung-Ying Tzeng; B Devlin; Larry Wasserman; Kathryn Roeder
Journal:  Am J Hum Genet       Date:  2003-02-27       Impact factor: 11.025

9.  Caution on pedigree haplotype inference with software that assumes linkage equilibrium.

Authors:  Daniel J Schaid; Shannon K McDonnell; Liang Wang; Julie M Cunningham; Stephen N Thibodeau
Journal:  Am J Hum Genet       Date:  2002-10       Impact factor: 11.025

10.  Haplotype inference in random population samples.

Authors:  Shin Lin; David J Cutler; Michael E Zwick; Aravinda Chakravarti
Journal:  Am J Hum Genet       Date:  2002-10-17       Impact factor: 11.025

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.