Literature DB >> 15252420

Handling missing values in population data: consequences for maximum likelihood estimation of haplotype frequencies.

Pierre-Antoine Gourraud1, Emmanuelle Génin, Anne Cambon-Thomsen.   

Abstract

Haplotype frequency estimation in population data is an important problem in genetics and different methods including expectation maximisation (EM) methods have been proposed. The statistical properties of EM methods have been extensively assessed for data sets with no missing values. When numerous markers and/or individuals are tested, however, it is likely that some genotypes will be missing. Thus, it is of interest to investigate the behaviour of the method in the presence of incomplete genotype observations. We propose an extension of the EM method to handle missing genotypes, and we compare it with commonly used methods (such as ignoring individuals with incomplete genotype information or treating a missing allele as any other allele). Simulations were performed, starting from data sets of haematopoietic stem cell donors genotyped at three HLA loci. We deleted some data to create incomplete genotype observations in various proportions. We then compared the haplotype frequencies obtained on these incomplete data sets using the different methods to those obtained on the complete data. We found that the method proposed here provides better estimations, both qualitatively and quantitatively, but increases the computation time required. We discuss the influence of missing values on the algorithm's efficiency and the advantages and disadvantages of deleting incomplete genotypes. We propose guidelines for missing data handling in routine analysis.

Entities:  

Mesh:

Year:  2004        PMID: 15252420     DOI: 10.1038/sj.ejhg.5201233

Source DB:  PubMed          Journal:  Eur J Hum Genet        ISSN: 1018-4813            Impact factor:   4.246


  9 in total

1.  A comprehensive evaluation of SNP genotype imputation.

Authors:  Michael Nothnagel; David Ellinghaus; Stefan Schreiber; Michael Krawczak; Andre Franke
Journal:  Hum Genet       Date:  2008-12-17       Impact factor: 4.132

2.  Proceedings: human leukocyte antigen haplo-homozygous induced pluripotent stem cell haplobank modeled after the california population: evaluating matching in a multiethnic and admixed population.

Authors:  Derek James Pappas; Pierre-Antoine Gourraud; Caroline Le Gall; Julie Laurent; Alan Trounson; Natalie DeWitt; Sohel Talib
Journal:  Stem Cells Transl Med       Date:  2015-05       Impact factor: 6.940

3.  SNP-based analysis of the HLA locus in Japanese multiple sclerosis patients.

Authors:  J P McElroy; N Isobe; P A Gourraud; S J Caillier; T Matsushita; T Kohriyama; K Miyamoto; Y Nakatsuji; T Miki; S L Hauser; J R Oksenberg; J Kira
Journal:  Genes Immun       Date:  2011-06-09       Impact factor: 2.676

4.  APOBEC3H haplotypes and HIV-1 pro-viral vif DNA sequence diversity in early untreated human immunodeficiency virus-1 infection.

Authors:  P A Gourraud; A Karaouni; J M Woo; T Schmidt; J R Oksenberg; F M Hecht; T J Liegler; J D Barbour
Journal:  Hum Immunol       Date:  2010-12-15       Impact factor: 2.850

5.  A genome-wide meta-analysis of nodular sclerosing Hodgkin lymphoma identifies risk loci at 6p21.32.

Authors:  Wendy Cozen; Dalin Li; Timothy Best; David J Van Den Berg; Pierre-Antoine Gourraud; Victoria K Cortessis; Andrew D Skol; Thomas M Mack; Sally L Glaser; Lawrence M Weiss; Bharat N Nathwani; Smita Bhatia; Fredrick R Schumacher; Christopher K Edlund; Amie E Hwang; Susan L Slager; Zachary S Fredericksen; Louise C Strong; Thomas M Habermann; Brian K Link; James R Cerhan; Leslie L Robison; David V Conti; Kenan Onel
Journal:  Blood       Date:  2011-11-15       Impact factor: 22.113

6.  Refining the association of MHC with multiple sclerosis in African Americans.

Authors:  Joseph P McElroy; Bruce A C Cree; Stacy J Caillier; Peter K Gregersen; Joseph Herbert; Omar A Khan; Jan Freudenberg; Annette Lee; S Louis Bridges; Stephen L Hauser; Jorge R Oksenberg; Pierre-Antoine Gourraud
Journal:  Hum Mol Genet       Date:  2010-05-12       Impact factor: 6.150

Review 7.  A comprehensive literature review of haplotyping software and methods for use with unrelated individuals.

Authors:  Rany M Salem; Jennifer Wessel; Nicholas J Schork
Journal:  Hum Genomics       Date:  2005-03       Impact factor: 4.639

8.  Highlighting nonlinear patterns in population genetics datasets.

Authors:  Gregorio Alanis-Lobato; Carlo Vittorio Cannistraci; Anders Eriksson; Andrea Manica; Timothy Ravasi
Journal:  Sci Rep       Date:  2015-01-30       Impact factor: 4.379

9.  Statistical resolution of ambiguous HLA typing data.

Authors:  Jennifer Listgarten; Zabrina Brumme; Carl Kadie; Gao Xiaojiang; Bruce Walker; Mary Carrington; Philip Goulder; David Heckerman
Journal:  PLoS Comput Biol       Date:  2008-02-29       Impact factor: 4.475

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.