Literature DB >> 20068333

A Likelihood-Based Approach for Missing Genotype Data.

Gina M D'Angelo1, M Ilyas Kamboh, Eleanor Feingold.   

Abstract

Missing genotype data in a candidate gene association study can make it difficult to model the effects of multiple genetic variants simultaneously. In particular, when regression models are used to model phenotype as a function of SNP genotypes in several different genes, the most common approach is a complete case analysis, in which only individuals with no missing genotypes are included. But this can lead to substantial reduction in sample size and thus potential bias and loss in efficiency. A number of other methods for handling missing data are applicable, but have rarely been used in this context. The purpose of this paper is to describe how several standard methods for handling missing data can be applied or adapted to this problem, and to compare their performance using a simulation study. We demonstrate these techniques using an Alzheimer's disease association study. We show that the expectation-maximization algorithm and multiple imputation with a bootstrapped expectation-maximization sampling algorithm have the best properties of all the estimators studied.

Entities:  

Mesh:

Year:  2010        PMID: 20068333      PMCID: PMC7077088          DOI: 10.1159/000273732

Source DB:  PubMed          Journal:  Hum Hered        ISSN: 0001-5652            Impact factor:   0.444


  18 in total

1.  Maximum likelihood analysis of logistic regression models with incomplete covariate data and auxiliary information.

Authors:  N J Horton; N M Laird
Journal:  Biometrics       Date:  2001-03       Impact factor: 2.571

2.  Maximum likelihood estimation of two-level latent variable models with mixed continuous and polytomous data.

Authors:  S Y Lee; J Q Shi
Journal:  Biometrics       Date:  2001-09       Impact factor: 2.571

3.  Full Maximum Likelihood Estimation of Polychoric and Polyserial Correlations With Missing Data.

Authors:  Xin-Yuan Song; Sik-Yum Lee
Journal:  Multivariate Behav Res       Date:  2003-01-01       Impact factor: 5.923

4.  Genetic association of ubiquilin with Alzheimer's disease and related quantitative measures.

Authors:  M I Kamboh; R L Minster; E Feingold; S T DeKosky
Journal:  Mol Psychiatry       Date:  2006-03       Impact factor: 15.992

5.  Designs and analysis of two-stage studies.

Authors:  L P Zhao; S Lipsitz
Journal:  Stat Med       Date:  1992-04       Impact factor: 2.373

6.  Imputation methods to improve inference in SNP association studies.

Authors:  James Y Dai; Ingo Ruczinski; Michael LeBlanc; Charles Kooperberg
Journal:  Genet Epidemiol       Date:  2006-12       Impact factor: 2.135

7.  Testing untyped alleles (TUNA)-applications to genome-wide association studies.

Authors:  Dan L Nicolae
Journal:  Genet Epidemiol       Date:  2006-12       Impact factor: 2.135

8.  Much ado about nothing: A comparison of missing data methods and software to fit incomplete data regression models.

Authors:  Nicholas J Horton; Ken P Kleinman
Journal:  Am Stat       Date:  2007-02       Impact factor: 8.710

9.  Regression analysis with missing covariate data using estimating equations.

Authors:  L P Zhao; S Lipsitz; D Lew
Journal:  Biometrics       Date:  1996-12       Impact factor: 2.571

10.  TPH2 and TPH1: association of variants and interactions with heroin addiction.

Authors:  David A Nielsen; Sandra Barral; Dmitri Proudnikov; Scott Kellogg; Ann Ho; Jurg Ott; Mary Jeanne Kreek
Journal:  Behav Genet       Date:  2008-01-08       Impact factor: 2.805

View more
  1 in total

1.  Missing Data Methods for Partial Correlations.

Authors:  Gina M D'Angelo; Jingqin Luo; Chengjie Xiong
Journal:  J Biom Biostat       Date:  2012-12
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.