| Literature DB >> 20018064 |
Brooke L Fridley1, Shannon K McDonnell, Kari G Rabe, Rui Tang, Joanna M Biernacka, Jason P Sinnwell, David N Rider, Ellen L Goode.
Abstract
Due to the growing need to combine data across multiple studies and to impute untyped markers based on a reference sample, several analytical tools for imputation and analysis of missing genotypes have been developed. Current imputation methods rely on single imputation, which ignores the variation in estimation due to imputation. An alternative to single imputation is multiple imputation. In this paper, we assess the variation in imputation by completing both single and multiple imputations of genotypic data using MACH, a commonly used hidden Markov model imputation method. Using data from the North American Rheumatoid Arthritis Consortium genome-wide study, the use of single and multiple imputation was assessed in four regions of chromosome 1 with varying levels of linkage disequilibrium and association signals. Two scenarios for missing genotypic data were assessed: imputation of untyped markers and combination of genotypic data from two studies. This limited study involving four regions indicates that, contrary to expectations, multiple imputations may not be necessary.Entities:
Year: 2009 PMID: 20018064 PMCID: PMC2795971 DOI: 10.1186/1753-6561-3-s7-s7
Source DB: PubMed Journal: BMC Proc ISSN: 1753-6561
Figure 1Four genomic regions. (A) Null region 1 - low LD; (B) Null region 2 - high LD; (C) Associated region - PTPN22 (D) Associated region - PADI4. SNPs in boxes indicate those removed for imputation of completely untyped markers. For associated regions (A and B), boxes indicate SNPs which were imputed: blue boxes denote the set containing the imputed risk SNP (first SNP set) and green boxes denote the set including flanking markers to the risk SNP (second SNP set).
Figure 2(A) Odds ratios with 95% CI for each of the 5 imputation runs; (B) Odds ratios with 95% CI based on SI and MI from MACH Scenario I.
Figure 3Box plots of difference in individuals' dosage (observed or imputed) for each of the SNPs with missing data (Scenario II) from MACH.