Literature DB >> 17922479

Understanding the accuracy of statistical haplotype inference with sequence data of known phase.

Aida M Andrés1, Andrew G Clark, Lawrence Shimmin, Eric Boerwinkle, Charles F Sing, James E Hixson.   

Abstract

Statistical methods for haplotype inference from multi-site genotypes of unrelated individuals have important application in association studies and population genetics. Understanding the factors that affect the accuracy of this inference is important, but their assessment has been restricted by the limited availability of biological data with known phase. We created hybrid cell lines monosomic for human chromosome 19 and produced single-chromosome complete sequences of a 48 kb genomic region in 39 individuals of African American (AA) and European American (EA) origin. We employ these phase-known genotypes and coalescent simulations to assess the accuracy of statistical haplotype reconstruction by several algorithms. Accuracy of phase inference was considerably low in our biological data even for regions as short as 25-50 kb, suggesting that caution is needed when analyzing reconstructed haplotypes. Moreover, the reliability of estimated confidence in phase inference is not high enough to allow for a reliable incorporation of site-specific uncertainty information in subsequent analyses. We show that, in samples of certain mixed ancestry (AA and EA populations), the most accurate haplotypes are probably obtained when increasing sample size by considering the largest, pooled sample, despite the hypothetical problems associated with pooling across those heterogeneous samples. Strategies to improve confidence in reconstructed haplotypes, and realistic alternatives to the analysis of inferred haplotypes, are discussed.

Entities:  

Mesh:

Year:  2007        PMID: 17922479      PMCID: PMC2291540          DOI: 10.1002/gepi.20185

Source DB:  PubMed          Journal:  Genet Epidemiol        ISSN: 0741-0395            Impact factor:   2.135


  44 in total

1.  Haplotype inference in random population samples.

Authors:  Shin Lin; David J Cutler; Michael E Zwick; Aravinda Chakravarti
Journal:  Am J Hum Genet       Date:  2002-10-17       Impact factor: 11.025

2.  A comparison of bayesian methods for haplotype reconstruction from population genotype data.

Authors:  Matthew Stephens; Peter Donnelly
Journal:  Am J Hum Genet       Date:  2003-10-20       Impact factor: 11.025

3.  Detecting recent positive selection in the human genome from haplotype structure.

Authors:  Pardis C Sabeti; David E Reich; John M Higgins; Haninah Z P Levine; Daniel J Richter; Stephen F Schaffner; Stacey B Gabriel; Jill V Platko; Nick J Patterson; Gavin J McDonald; Hans C Ackerman; Sarah J Campbell; David Altshuler; Richard Cooper; Dominic Kwiatkowski; Ryk Ward; Eric S Lander
Journal:  Nature       Date:  2002-10-09       Impact factor: 49.962

4.  Selecting a maximally informative set of single-nucleotide polymorphisms for association analyses using linkage disequilibrium.

Authors:  Christopher S Carlson; Michael A Eberle; Mark J Rieder; Qian Yi; Leonid Kruglyak; Deborah A Nickerson
Journal:  Am J Hum Genet       Date:  2003-12-15       Impact factor: 11.025

Review 5.  Understanding human DNA sequence variation.

Authors:  K K Kidd; A J Pakstis; W C Speed; J R Kidd
Journal:  J Hered       Date:  2004 Sep-Oct       Impact factor: 2.645

6.  Linkage disequilibrium testing when linkage phase is unknown.

Authors:  Daniel J Schaid
Journal:  Genetics       Date:  2004-01       Impact factor: 4.562

7.  The allele frequency spectrum in genome-wide human variation data reveals signals of differential demographic history in three large world populations.

Authors:  Gabor T Marth; Eva Czabarka; Janos Murvai; Stephen T Sherry
Journal:  Genetics       Date:  2004-01       Impact factor: 4.562

8.  Little loss of information due to unknown phase for fine-scale linkage-disequilibrium mapping with single-nucleotide-polymorphism genotype data.

Authors:  A P Morris; J C Whittaker; D J Balding
Journal:  Am J Hum Genet       Date:  2004-04-07       Impact factor: 11.025

9.  Estimation of linkage disequilibrium in randomly mating populations.

Authors:  W G Hill
Journal:  Heredity (Edinb)       Date:  1974-10       Impact factor: 3.821

10.  Haplotype reconstruction from genotype data using Imperfect Phylogeny.

Authors:  Eran Halperin; Eleazar Eskin
Journal:  Bioinformatics       Date:  2004-02-26       Impact factor: 6.937

View more
  33 in total

1.  Adaptive clustering and adaptive weighting methods to detect disease associated rare variants.

Authors:  Qiuying Sha; Shuaicheng Wang; Shuanglin Zhang
Journal:  Eur J Hum Genet       Date:  2012-07-11       Impact factor: 4.246

2.  How frugal is Mother Nature with haplotypes?

Authors:  Sharlee Climer; Gerold Jäger; Alan R Templeton; Weixiong Zhang
Journal:  Bioinformatics       Date:  2008-11-04       Impact factor: 6.937

3.  Fraction of informative recombinations: a heuristic approach to analyze recombination rates.

Authors:  J-F Lefebvre; D Labuda
Journal:  Genetics       Date:  2008-04       Impact factor: 4.562

4.  Using population mixtures to optimize the utility of genomic databases: linkage disequilibrium and association study design in India.

Authors:  T J Pemberton; M Jakobsson; D F Conrad; G Coop; J D Wall; J K Pritchard; P I Patel; N A Rosenberg
Journal:  Ann Hum Genet       Date:  2007-05-30       Impact factor: 1.670

5.  Multiple rare variants as a cause of a common phenotype: several different lactase persistence associated alleles in a single ethnic group.

Authors:  Catherine J E Ingram; Tamiru Oljira Raga; Ayele Tarekegn; Sarah L Browning; Mohamed F Elamin; Endashaw Bekele; Mark G Thomas; Michael E Weale; Neil Bradman; Dallas M Swallow
Journal:  J Mol Evol       Date:  2009-11-24       Impact factor: 2.395

6.  A rare variant association test based on combinations of single-variant tests.

Authors:  Qiuying Sha; Shuanglin Zhang
Journal:  Genet Epidemiol       Date:  2014-07-25       Impact factor: 2.135

7.  Improved risk prediction for Crohn's disease with a multi-locus approach.

Authors:  Jia Kang; Subra Kugathasan; Michel Georges; Hongyu Zhao; Judy H Cho
Journal:  Hum Mol Genet       Date:  2011-03-22       Impact factor: 6.150

8.  Test of rare variant association based on affected sib-pairs.

Authors:  Qiuying Sha; Shuanglin Zhang
Journal:  Eur J Hum Genet       Date:  2014-03-26       Impact factor: 4.246

9.  Evaluation of haplotype inference using definitive haplotype data obtained from complete hydatidiform moles, and its significance for the analyses of positively selected regions.

Authors:  Koichiro Higasa; Yoji Kukita; Kiyoko Kato; Norio Wake; Tomoko Tahira; Kenshi Hayashi
Journal:  PLoS Genet       Date:  2009-05-08       Impact factor: 5.917

10.  A groupwise association test for rare mutations using a weighted sum statistic.

Authors:  Bo Eskerod Madsen; Sharon R Browning
Journal:  PLoS Genet       Date:  2009-02-13       Impact factor: 5.917

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.