Literature DB >> 14557108

Estimating haplotype frequencies and standard errors for multiple single nucleotide polymorphisms.

Shuying Sue Li1, Najma Khalid, Christopher Carlson, Lue Ping Zhao.   

Abstract

Estimating haplotype frequencies becomes increasingly important in the mapping of complex disease genes, as millions of single nucleotide polymorphisms (SNPs) are being identified and genotyped. When genotypes at multiple SNP loci are gathered from unrelated individuals, haplotype frequencies can be accurately estimated using expectation-maximization (EM) algorithms (Excoffier and Slatkin, 1995; Hawley and Kidd, 1995; Long et al., 1995), with standard errors estimated using bootstraps. However, because the number of possible haplotypes increases exponentially with the number of SNPs, handling data with a large number of SNPs poses a computational challenge for the EM methods and for other haplotype inference methods. To solve this problem, Niu and colleagues, in their Bayesian haplotype inference paper (Niu et al., 2002), introduced a computational algorithm called progressive ligation (PL). But their Bayesian method has a limitation on the number of subjects (no more than 100 subjects in the current implementation of the method). In this paper, we propose a new method in which we use the same likelihood formulation as in Excoffier and Slatkin's EM algorithm and apply the estimating equation idea and the PL computational algorithm with some modifications. Our proposed method can handle data sets with large number of SNPs as well as large numbers of subjects. Simultaneously, our method estimates standard errors efficiently, using the sandwich-estimate from the estimating equation, rather than the bootstrap method. Additionally, our method admits missing data and produces valid estimates of parameters and their standard errors under the assumption that the missing genotypes are missing at random in the sense defined by Rubin (1976).

Mesh:

Year:  2003        PMID: 14557108     DOI: 10.1093/biostatistics/4.4.513

Source DB:  PubMed          Journal:  Biostatistics        ISSN: 1465-4644            Impact factor:   5.899


  30 in total

1.  A method for the assessment of disease associations with single-nucleotide polymorphism haplotypes and environmental variables in case-control studies.

Authors:  Lue Ping Zhao; Shuying Sue Li; Najma Khalid
Journal:  Am J Hum Genet       Date:  2003-04-16       Impact factor: 11.025

2.  Polymorphisms in nucleotide excision repair genes and endometrial cancer risk.

Authors:  Jennifer A Doherty; Noel S Weiss; Sherianne Fish; Wenhong Fan; Melissa M Loomis; Lori C Sakoda; Mary Anne Rossing; Lue Ping Zhao; Chu Chen
Journal:  Cancer Epidemiol Biomarkers Prev       Date:  2011-07-12       Impact factor: 4.254

3.  A fine-scale linkage-disequilibrium measure based on length of haplotype sharing.

Authors:  Yan Wang; Lue Ping Zhao; Sandrine Dudoit
Journal:  Am J Hum Genet       Date:  2006-02-13       Impact factor: 11.025

4.  Retrospective analysis of haplotype-based case control studies under a flexible model for gene environment association.

Authors:  Yi-Hau Chen; Nilanjan Chatterjee; Raymond J Carroll
Journal:  Biostatistics       Date:  2007-05-08       Impact factor: 5.899

5.  Associations between NBS1 polymorphisms, haplotypes and smoking-related cancers.

Authors:  Sungshim L Park; Delara Bastani; Binh Y Goldstein; Shen-Chih Chang; Wendy Cozen; Lin Cai; Carlos Cordon-Cardo; Baoguo Ding; Sander Greenland; Na He; Shehnaz K Hussain; Qingwu Jiang; Yuan-Chin A Lee; Simin Liu; Ming-Lan Lu; Thomas M Mack; Jenny T Mao; Hal Morgenstern; Li-Na Mu; Sam S Oh; Allan Pantuck; Jeanette C Papp; Jianyu Rao; Victor E Reuter; Donald P Tashkin; Hua Wang; Nai-Chieh Y You; Shun-Zhang Yu; Jin-Kou Zhao; Zuo-Feng Zhang
Journal:  Carcinogenesis       Date:  2010-05-17       Impact factor: 4.944

6.  Predicting multiallelic genes using unphased and flanking single nucleotide polymorphisms.

Authors:  Shuying S Li; Hongwei Wang; Anajane Smith; Bo Zhang; Xinyi Cindy Zhang; Gary Schoch; Daniel Geraghty; John A Hansen; Lue Ping Zhao
Journal:  Genet Epidemiol       Date:  2010-12-31       Impact factor: 2.135

7.  Genetic variation in immune regulation and DNA repair pathways and stomach cancer in China.

Authors:  Shehnaz K Hussain; Li-Na Mu; Lin Cai; Shen-Chih Chang; Sungshim Lani Park; Sam S Oh; Yiren Wang; Binh Y Goldstein; Bao-Guo Ding; Qingwu Jiang; Jianyu Rao; Nai-Chieh Y You; Shun-Zhang Yu; Jeanette C Papp; Jin-Kou Zhao; Hua Wang; Zuo-Feng Zhang
Journal:  Cancer Epidemiol Biomarkers Prev       Date:  2009-08       Impact factor: 4.254

8.  Recursive organizer (ROR): an analytic framework for sequence-based association analysis.

Authors:  Lue Ping Zhao; Xin Huang
Journal:  Hum Genet       Date:  2013-03-14       Impact factor: 4.132

9.  Modeling Informatively Missing Genotypes in Haplotype Analysis.

Authors:  Nianjun Liu; Richard Bucala; Hongyu Zhao
Journal:  Commun Stat Theory Methods       Date:  2009       Impact factor: 0.893

10.  Genetic variation in CXCL12 and risk of cervical carcinoma: a population-based case-control study.

Authors:  S N Maley; S M Schwartz; L G Johnson; M Malkki; Q Du; J R Daling; S S Li; L P Zhao; E W Petersdorf; M M Madeleine
Journal:  Int J Immunogenet       Date:  2009-09-24       Impact factor: 1.466

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.