Literature DB >> 35659928

Genotype error biases trio-based estimates of haplotype phase accuracy.

Brian L Browning1, Sharon R Browning2.   

Abstract

Haplotypes can be estimated from unphased genotype data via statistical methods. When parent-offspring trios are available for inferring the true phase from Mendelian inheritance rules, the accuracy of statistical phasing is usually measured by the switch error rate, which is the proportion of pairs of consecutive heterozygotes that are incorrectly phased. We present a method for estimating the genotype error rate from parent-offspring trios and a method for estimating the bias that occurs in the observed switch error rate as a result of genotype error. We apply these methods to 485,301 genotyped UK Biobank samples that include 898 White British trios and to 38,387 sequenced TOPMed samples that include 217 African Caribbean trios and 669 European American trios. We show that genotype error inflates the observed switch error rate and that the relative bias increases with sample size. For the UK Biobank White British trios, the observed switch error rate in the trio offspring is 2.4 times larger than the estimated true switch error rate (1.4 × 10-3 vs 5.8 × 10-4. We propose an alternate definition of phase error that counts two consecutive switch errors as a single error because back-to-back switch errors arise when a single heterozygote is incorrectly phased with respect to the surrounding heterozygotes. With this definition, we estimate that the average distance between phase errors is 64 megabases in the UK Biobank White British individuals.
Copyright © 2022 American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.

Entities:  

Keywords:  TOPMed; UK Biobank; genotype error rate; haplotype phase accuracy; switch error rate

Mesh:

Year:  2022        PMID: 35659928      PMCID: PMC9247820          DOI: 10.1016/j.ajhg.2022.04.019

Source DB:  PubMed          Journal:  Am J Hum Genet        ISSN: 0002-9297            Impact factor:   11.043


  24 in total

1.  Detection and integration of genotyping errors in statistical genetics.

Authors:  Eric Sobel; Jeanette C Papp; Kenneth Lange
Journal:  Am J Hum Genet       Date:  2002-01-08       Impact factor: 11.025

2.  Robust relationship inference in genome-wide association studies.

Authors:  Ani Manichaikul; Josyf C Mychaleckyj; Stephen S Rich; Kathy Daly; Michèle Sale; Wei-Min Chen
Journal:  Bioinformatics       Date:  2010-10-05       Impact factor: 6.937

3.  Estimating genotyping error rates from Mendelian errors in SNP array genotypes and their impact on inference.

Authors:  Ian W Saunders; Jesper Brohede; Garry N Hannan
Journal:  Genomics       Date:  2007-06-27       Impact factor: 5.736

4.  The NCBI dbGaP database of genotypes and phenotypes.

Authors:  Matthew D Mailman; Michael Feolo; Yumi Jin; Masato Kimura; Kimberly Tryka; Rinat Bagoutdinov; Luning Hao; Anne Kiang; Justin Paschall; Lon Phan; Natalia Popova; Stephanie Pretel; Lora Ziyabari; Moira Lee; Yu Shao; Zhen Y Wang; Karl Sirotkin; Minghong Ward; Michael Kholodov; Kerry Zbicz; Jeffrey Beck; Michael Kimelman; Sergey Shevelev; Don Preuss; Eugene Yaschenko; Alan Graeff; James Ostell; Stephen T Sherry
Journal:  Nat Genet       Date:  2007-10       Impact factor: 38.330

5.  A Fast and Simple Method for Detecting Identity-by-Descent Segments in Large-Scale Data.

Authors:  Ying Zhou; Sharon R Browning; Brian L Browning
Journal:  Am J Hum Genet       Date:  2020-03-12       Impact factor: 11.025

6.  Phasing of many thousands of genotyped samples.

Authors:  Amy L Williams; Nick Patterson; Joseph Glessner; Hakon Hakonarson; David Reich
Journal:  Am J Hum Genet       Date:  2012-08-10       Impact factor: 11.025

7.  Fast and accurate genotype imputation in genome-wide association studies through pre-phasing.

Authors:  Bryan Howie; Christian Fuchsberger; Matthew Stephens; Jonathan Marchini; Gonçalo R Abecasis
Journal:  Nat Genet       Date:  2012-07-22       Impact factor: 38.330

Review 8.  Computational methods for chromosome-scale haplotype reconstruction.

Authors:  Shilpa Garg
Journal:  Genome Biol       Date:  2021-04-12       Impact factor: 13.583

9.  The UK Biobank resource with deep phenotyping and genomic data.

Authors:  Clare Bycroft; Colin Freeman; Desislava Petkova; Gavin Band; Lloyd T Elliott; Kevin Sharp; Allan Motyer; Damjan Vukcevic; Olivier Delaneau; Jared O'Connell; Adrian Cortes; Samantha Welsh; Alan Young; Mark Effingham; Gil McVean; Stephen Leslie; Naomi Allen; Peter Donnelly; Jonathan Marchini
Journal:  Nature       Date:  2018-10-10       Impact factor: 49.962

10.  Probabilistic Estimation of Identity by Descent Segment Endpoints and Detection of Recent Selection.

Authors:  Sharon R Browning; Brian L Browning
Journal:  Am J Hum Genet       Date:  2020-10-13       Impact factor: 11.025

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.