Literature DB >> 18826959

Smarter clustering methods for SNP genotype calling.

Yan Lin1, George C Tseng, Soo Yeon Cheong, Lora J H Bean, Stephanie L Sherman, Eleanor Feingold.   

Abstract

MOTIVATION: Most genotyping technologies for single nucleotide polymorphism (SNP) markers use standard clustering methods to 'call' the SNP genotypes. These methods are not always optimal in distinguishing the genotype clusters of a SNP because they do not take advantage of specific features of the genotype calling problem. In particular, when family data are available, pedigree information is ignored. Furthermore, prior information about the distribution of the measurements for each cluster can be used to choose an appropriate model-based clustering method and can significantly improve the genotype calls. One special genotyping problem that has never been discussed in the literature is that of genotyping of trisomic individuals, such as individuals with Down syndrome. Calling trisomic genotypes is a more complicated problem, and the addition of external information becomes very important.
RESULTS: In this article, we discuss the impact of incorporating external information into clustering algorithms to call the genotypes for both disomic and trisomic data. We also propose two new methods to call genotypes using family data. One is a modification of the K-means method and uses the pedigree information by updating all members of a family together. The other is a likelihood-based method that combines the Gaussian or beta-mixture model with pedigree information. We compare the performance of these two methods and some other existing methods using simulation studies. We also compare the performance of these methods on a real dataset generated by the Illumina platform (www.illumina.com). AVAILABILITY: The R code for the family-based genotype calling methods (SNPCaller) is available to be downloaded from the following website: http://watson.hgen.pitt.edu/register.

Mesh:

Year:  2008        PMID: 18826959      PMCID: PMC2732271          DOI: 10.1093/bioinformatics/btn509

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  19 in total

1.  Algorithms for large-scale genotyping microarrays.

Authors:  Wei-mn Liu; Xiaojun Di; Geoffrey Yang; Hajime Matsuzaki; Jing Huang; Rui Mei; Thomas B Ryder; Teresa A Webster; Shoulian Dong; Guoying Liu; Keith W Jones; Giulia C Kennedy; David Kulp
Journal:  Bioinformatics       Date:  2003-12-12       Impact factor: 6.937

2.  Linkage disequilibrium mapping in trisomic populations: analytical approaches and an application to congenital heart defects in Down syndrome.

Authors:  Kimberly F Kerstann; Eleanor Feingold; Sallie B Freeman; Lora J H Bean; Robert Pyatt; Stuart Tinker; Amy H Jewel; George Capone; Stephanie L Sherman
Journal:  Genet Epidemiol       Date:  2004-11       Impact factor: 2.135

3.  Dynamic model based algorithms for screening and genotyping over 100 K SNPs on oligonucleotide microarrays.

Authors:  Xiaojun Di; Hajime Matsuzaki; Teresa A Webster; Earl Hubbell; Guoying Liu; Shoulian Dong; Dan Bartell; Jing Huang; Richard Chiles; Geoffrey Yang; Mei-mei Shen; David Kulp; Giulia C Kennedy; Rui Mei; Keith W Jones; Simon Cawley
Journal:  Bioinformatics       Date:  2005-01-18       Impact factor: 6.937

Review 4.  Genotyping errors: causes, consequences and solutions.

Authors:  François Pompanon; Aurélie Bonin; Eva Bellemain; Pierre Taberlet
Journal:  Nat Rev Genet       Date:  2005-11       Impact factor: 53.242

5.  GEL: a novel genotype calling algorithm using empirical likelihood.

Authors:  Dan L Nicolae; Xiaolin Wu; Kazuaki Miyake; Nancy J Cox
Journal:  Bioinformatics       Date:  2006-06-29       Impact factor: 6.937

6.  Effects of differential genotyping error rate on the type I error probability of case-control studies.

Authors:  Valentina Moskvina; Nick Craddock; Peter Holmans; Michael J Owen; Michael C O'Donovan
Journal:  Hum Hered       Date:  2006-04-06       Impact factor: 0.444

Review 7.  Factors affecting statistical power in the detection of genetic association.

Authors:  Derek Gordon; Stephen J Finch
Journal:  J Clin Invest       Date:  2005-06       Impact factor: 14.808

8.  A multi-array multi-SNP genotyping algorithm for Affymetrix SNP microarrays.

Authors:  Yuanyuan Xiao; Mark R Segal; Y H Yang; Ru-Fang Yeh
Journal:  Bioinformatics       Date:  2007-04-25       Impact factor: 6.937

9.  Bayesian Gaussian Mixture Models for High-Density Genotyping Arrays.

Authors:  Chiara Sabatti; Kenneth Lange
Journal:  J Am Stat Assoc       Date:  2008-03-01       Impact factor: 5.033

10.  A genotype calling algorithm for the Illumina BeadArray platform.

Authors:  Yik Y Teo; Michael Inouye; Kerrin S Small; Rhian Gwilliam; Panagiotis Deloukas; Dominic P Kwiatkowski; Taane G Clark
Journal:  Bioinformatics       Date:  2007-09-10       Impact factor: 6.937

View more
  9 in total

1.  TroX: a new method to learn about the genesis of aneuploidy from trisomic products of conception.

Authors:  Amir R Kermany; Laure Segurel; Tiffany R Oliver; Molly Przeworski
Journal:  Bioinformatics       Date:  2014-03-21       Impact factor: 6.937

2.  Brain network profiling defines functionally specialized cortical networks.

Authors:  Simone Di Plinio; Sjoerd J H Ebisch
Journal:  Hum Brain Mapp       Date:  2018-08-04       Impact factor: 5.038

3.  Variation in folate pathway genes contributes to risk of congenital heart defects among individuals with Down syndrome.

Authors:  Adam E Locke; Kenneth J Dooley; Stuart W Tinker; Soo Yeon Cheong; Eleanor Feingold; Emily G Allen; Sallie B Freeman; Claudine P Torfs; Clifford L Cua; Michael P Epstein; Michael C Wu; Xihong Lin; George Capone; Stephanie L Sherman; Lora J H Bean
Journal:  Genet Epidemiol       Date:  2010-09       Impact factor: 2.135

4.  Interpretation of custom designed Illumina genotype cluster plots for targeted association studies and next-generation sequence validation.

Authors:  Elizabeth A Tindall; Desiree C Petersen; Stina Nikolaysen; Webb Miller; Stephan C Schuster; Vanessa M Hayes
Journal:  BMC Res Notes       Date:  2010-02-22

5.  Prediction of thermostability from amino acid attributes by combination of clustering with attribute weighting: a new vista in engineering enzymes.

Authors:  Mansour Ebrahimi; Amir Lakizadeh; Parisa Agha-Golzadeh; Esmaeil Ebrahimie; Mahdi Ebrahimi
Journal:  PLoS One       Date:  2011-08-10       Impact factor: 3.240

6.  ALG: automated genotype calling of Luminex assays.

Authors:  Mathieu Bourgey; Mathieu Lariviere; Chantal Richer; Daniel Sinnett
Journal:  PLoS One       Date:  2011-05-06       Impact factor: 3.240

7.  Genome-Wide Association Study of Down Syndrome-Associated Atrioventricular Septal Defects.

Authors:  Dhanya Ramachandran; Zhen Zeng; Adam E Locke; Jennifer G Mulle; Lora J H Bean; Tracie C Rosser; Kenneth J Dooley; Clifford L Cua; George T Capone; Roger H Reeves; Cheryl L Maslen; David J Cutler; Eleanor Feingold; Stephanie L Sherman; Michael E Zwick
Journal:  G3 (Bethesda)       Date:  2015-07-20       Impact factor: 3.154

8.  Chromosome 21 scan in Down syndrome reveals DSCAM as a predisposing locus in Hirschsprung disease.

Authors:  Anne-Sophie Jannot; Anna Pelet; Alexandra Henrion-Caude; Asma Chaoui; Marine Masse-Morel; Stacey Arnold; Damien Sanlaville; Isabella Ceccherini; Salud Borrego; Robert M W Hofstra; Arnold Munnich; Nadège Bondurand; Aravinda Chakravarti; Françoise Clerget-Darpoux; Jeanne Amiel; Stanislas Lyonnet
Journal:  PLoS One       Date:  2013-05-06       Impact factor: 3.240

9.  A candidate gene analysis and GWAS for genes associated with maternal nondisjunction of chromosome 21.

Authors:  Jonathan M Chernus; Emily G Allen; Zhen Zeng; Eva R Hoffman; Terry J Hassold; Eleanor Feingold; Stephanie L Sherman
Journal:  PLoS Genet       Date:  2019-12-12       Impact factor: 5.917

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.