Literature DB >> 21572926

Bayesian Gaussian Mixture Models for High-Density Genotyping Arrays.

Chiara Sabatti1, Kenneth Lange.   

Abstract

Affymetrix's SNP (single-nucleotide polymorphism) genotyping chips have increased the scope and decreased the cost of gene-mapping studies. Because each SNP is queried by multiple DNA probes, the chips present interesting challenges in genotype calling. Traditional clustering methods distinguish the three genotypes of an SNP fairly well given a large enough sample of unrelated individuals or a training sample of known genotypes. This article describes our attempt to improve genotype calling by constructing Gaussian mixture models with empirically derived priors. The priors stabilize parameter estimation and borrow information collectively gathered on tens of thousands of SNPs. When data from related family members are available, our models capture the correlations in signals between relatives. With these advantages in mind, we apply the models to Affymetrix probe intensity data on 10,000 SNPs gathered on 63 genotyped individuals spread over eight pedigrees. We integrate the genotype-calling model with pedigree analysis and examine a sequence of symmetry hypotheses involving the correlated probe signals. The symmetry hypotheses raise novel mathematical issues of parameterization. Using the Bayesian information criterion, we select the best combination of symmetry assumptions. Compared to Affymetrix's software, our model leads to a reduction in no-calls with little sacrifice in overall calling accuracy.

Year:  2008        PMID: 21572926      PMCID: PMC3092390          DOI: 10.1198/016214507000000338.

Source DB:  PubMed          Journal:  J Am Stat Assoc        ISSN: 0162-1459            Impact factor:   5.033


  16 in total

1.  A system for specific, high-throughput genotyping by allele-specific primer extension on microarrays.

Authors:  T Pastinen; M Raitio; K Lindroos; P Tainola; L Peltonen; A C Syvänen
Journal:  Genome Res       Date:  2000-07       Impact factor: 9.043

2.  Normalization for cDNA microarray data: a robust composite method addressing single and multiple slide systematic variation.

Authors:  Yee Hwa Yang; Sandrine Dudoit; Percy Luu; David M Lin; Vivian Peng; John Ngai; Terence P Speed
Journal:  Nucleic Acids Res       Date:  2002-02-15       Impact factor: 16.971

3.  Algorithms for large-scale genotyping microarrays.

Authors:  Wei-mn Liu; Xiaojun Di; Geoffrey Yang; Hajime Matsuzaki; Jing Huang; Rui Mei; Thomas B Ryder; Teresa A Webster; Shoulian Dong; Guoying Liu; Keith W Jones; Giulia C Kennedy; David Kulp
Journal:  Bioinformatics       Date:  2003-12-12       Impact factor: 6.937

4.  SNP Chart: an integrated platform for visualization and interpretation of microarray genotyping data.

Authors:  Scott J Tebbutt; Igor V Opushnyev; Ben W Tripp; Ayaz M Kassamali; Wendy L Alexander; Marilyn I Andersen
Journal:  Bioinformatics       Date:  2004-08-12       Impact factor: 6.937

5.  Genomewide linkage analysis of bipolar disorder by use of a high-density single-nucleotide-polymorphism (SNP) genotyping assay: a comparison with microsatellite marker assays and finding of significant linkage to chromosome 6q22.

Authors:  F A Middleton; M T Pato; K L Gentile; C P Morley; X Zhao; A F Eisener; A Brown; T L Petryshen; A N Kirby; H Medeiros; C Carvalho; A Macedo; A Dourado; I Coelho; J Valente; M J Soares; C P Ferreira; M Lei; M H Azevedo; J L Kennedy; M J Daly; P Sklar; C N Pato
Journal:  Am J Hum Genet       Date:  2004-04-01       Impact factor: 11.025

6.  Estimation of genotype error rate using samples with pedigree information--an application on the GeneChip Mapping 10K array.

Authors:  Ke Hao; Cheng Li; Carsten Rosenow; Wing Hung Wong
Journal:  Genomics       Date:  2004-10       Impact factor: 5.736

7.  Dynamic model based algorithms for screening and genotyping over 100 K SNPs on oligonucleotide microarrays.

Authors:  Xiaojun Di; Hajime Matsuzaki; Teresa A Webster; Earl Hubbell; Guoying Liu; Shoulian Dong; Dan Bartell; Jing Huang; Richard Chiles; Geoffrey Yang; Mei-mei Shen; David Kulp; Giulia C Kennedy; Rui Mei; Keith W Jones; Simon Cawley
Journal:  Bioinformatics       Date:  2005-01-18       Impact factor: 6.937

8.  A genotype calling algorithm for affymetrix SNP arrays.

Authors:  Nusrat Rabbee; Terence P Speed
Journal:  Bioinformatics       Date:  2005-11-02       Impact factor: 6.937

9.  A dictionary model for haplotyping, genotype calling, and association testing.

Authors:  Kristin L Ayers; Chiara Sabatti; Kenneth Lange
Journal:  Genet Epidemiol       Date:  2007-11       Impact factor: 2.135

10.  Model-based analysis of oligonucleotide arrays: expression index computation and outlier detection.

Authors:  C Li; W H Wong
Journal:  Proc Natl Acad Sci U S A       Date:  2001-01-02       Impact factor: 11.205

View more
  3 in total

1.  Inferring genetic ancestry: opportunities, challenges, and implications.

Authors:  Charmaine D Royal; John Novembre; Stephanie M Fullerton; David B Goldstein; Jeffrey C Long; Michael J Bamshad; Andrew G Clark
Journal:  Am J Hum Genet       Date:  2010-05-14       Impact factor: 11.025

2.  Markov Models for inferring copy number variations from genotype data on Illumina platforms.

Authors:  Hui Wang; Jan H Veldink; Hylke Blauw; Leonard H van den Berg; Roel A Ophoff; Chiara Sabatti
Journal:  Hum Hered       Date:  2009-04-01       Impact factor: 0.444

3.  Smarter clustering methods for SNP genotype calling.

Authors:  Yan Lin; George C Tseng; Soo Yeon Cheong; Lora J H Bean; Stephanie L Sherman; Eleanor Feingold
Journal:  Bioinformatics       Date:  2008-09-29       Impact factor: 6.937

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.